Ping a Specific Port

Question

imaginative

Asked: 2013-04-12 12:29:19 +0800 CST2013-04-12 12:29:19 +0800 CST 2013-04-12 12:29:19 +0800 CST

Taking Cassandra Backups

772

We currently have 12 nodes running in our Cassandra cluster. Ultimately even if a couple of the nodes go down, we're still up and running. The paranoia in me would like to do at least one backup a day and store it on Amazon S3. My question is the following:

When backing up Cassandra, is it sufficient to run the backup from one node, or do I have to run a backup script from each one of the 12 nodes and push its respective backup onto S3? If at one point a restore is required, do we have to backup from the individual nodes backup, or is there a way to "aggregate" the backups (assuming you need to take them from each node individually) into one large restore process?

Slightly confused by the documentation. Just want to get an efficient backup process rolling on my Cassandra cluster.

2 Answers

Voted

Lyuben Todorov · Answer 1 · 2013-05-04T07:51:39+08:00

Lyuben Todorov

2013-05-04T07:51:39+08:002013-05-04T07:51:39+08:00

You need to back each node up, unless every node stores 100% of the data, then you can back only one node up.

2

Jon Haddad · Answer 2 · 2014-12-18T12:39:03+08:00

Jon Haddad

2014-12-18T12:39:03+08:002014-12-18T12:39:03+08:00

The easiest way to back up Cassandra is to back up each node. I've used tablesnap before to do this automatically and it's pretty good. There's also Priam from Netflix but I haven't tried it personally. https://github.com/Netflix/Priam

1

Taking Cassandra Backups

Can you pass user/pass for HTTP Basic Authentication in URL parameters?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?