I work in the research group of a large company. We do a lot of work on a grid processing system with many nodes (more than 200; I'm not sure exactly how many) and a large number of hard drives, holding more than 1,000 TB of data.
Most of this data can be reproduced, but that takes time. A lot of the data is code stored in separate RCS repos, which can have their own backups, but the working copies are, of course, on the normal user drives.
Can someone point me at a best-practices document, or describe how most companies go about protecting this much data?
Thanks
There's a lot that goes into designing an effective backup system for your business needs. You might snapshot the data to other disks and then mirror it off-site (if you have another site), send the snapshots to tape, or write to tape directly from your nodes. There may also be consistency issues if data is backed up at different times - perhaps your application needs to export or quiesce first? We don't know; you haven't told us. There are a lot of technical questions and issues here.
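To make the first option a bit more concrete, here's a minimal sketch (not a production tool) of the snapshot-then-mirror idea, using rsync hard-link snapshots driven from Python. The paths, the off-site host, and the retention count are all placeholders you'd replace with your own, and you'd still need scheduling, monitoring, and restore testing around it:

```python
#!/usr/bin/env python3
"""Sketch: rotate hard-link snapshots locally, then mirror the newest
one to a second site. All paths/hosts below are placeholders."""

import datetime
import subprocess
from pathlib import Path

DATA_DIR = Path("/grid/data")              # hypothetical source volume
SNAP_ROOT = Path("/backup/snapshots")      # hypothetical snapshot area
OFFSITE = "backup@dr-site:/backup/mirror"  # hypothetical off-site target
KEEP = 14                                  # snapshots to retain locally

def take_snapshot() -> Path:
    """Create a new snapshot, hard-linking unchanged files against the
    previous snapshot so only changed data consumes extra space."""
    snaps = sorted(p for p in SNAP_ROOT.iterdir() if p.is_dir())
    new = SNAP_ROOT / datetime.datetime.now().strftime("%Y-%m-%d_%H%M")
    cmd = ["rsync", "-a", "--delete"]
    if snaps:
        cmd.append(f"--link-dest={snaps[-1]}")
    cmd += [f"{DATA_DIR}/", str(new)]
    subprocess.run(cmd, check=True)
    return new

def mirror_offsite(snapshot: Path) -> None:
    """Push the latest snapshot to the second site over ssh."""
    subprocess.run(["rsync", "-a", "--delete", f"{snapshot}/", OFFSITE],
                   check=True)

def prune() -> None:
    """Drop the oldest snapshots beyond the retention window."""
    snaps = sorted(p for p in SNAP_ROOT.iterdir() if p.is_dir())
    for old in snaps[:-KEEP]:
        subprocess.run(["rm", "-rf", str(old)], check=True)

if __name__ == "__main__":
    snap = take_snapshot()
    mirror_offsite(snap)
    prune()
```

At 1 PB you'd more likely lean on filesystem- or array-level snapshots and a proper backup product rather than rsync, but the structure (local snapshot, off-site copy, retention policy) is the same.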
And the first thing that needs to be addressed is your actual business needs: what are your RTO (how long you can be down until your data is restored) and RPO (how much data you can afford to lose between backup runs)? Does this need to be part of a disaster recovery or business continuity plan, or if the building burns down, do you just not care about the data anymore?