Ping a Specific Port

Question

MHS

Asked: 2010-05-15 14:07:55 +0800 CST2010-05-15 14:07:55 +0800 CST 2010-05-15 14:07:55 +0800 CST

Which Message Queue should I choose (must run on Linux)

772

There are many open source Message queues for Linux, and I need some help deciding what I should go for.

My problem is simple - I get sent a list of files that needs to be processed. Each job can't be split up, but they are self contained and can be spread to multiple computers.

I'm thinking of solving this using a message queue. Multiple clients send a message to a central queue. Each queue has a number of subscribers that will take jobs from that queue when they have finished processing the current job.

Ideally it should have the following qualities

Message queue must be able to store unprocessed messages in case of a shutdown/reboot
A job can only be processed by a single subscriber (don't want duplicate jobs)
The subscribers should be able to send jobs of their own, that will be processed by a different set of subscribers.

Can anyone suggest a simple to use message queue?

5 Answers

Voted

Marco Ramos · Answer 1 · 2010-05-15T14:12:55+08:00

Marco Ramos

2010-05-15T14:12:55+08:002010-05-15T14:12:55+08:00

You have RabbitMQ and ZeroMQ, but afaik ZeroMQ doesn't store unprocessed messages in case of a crash. They're both open source and use AMQP, an open messaging protocol.

1

Javier · Answer 2 · 2010-05-15T14:23:01+08:00

Javier

2010-05-15T14:23:01+08:002010-05-15T14:23:01+08:00

a very simple to use is memcacheq, which uses the same API as memcached, so you can use the same libraries. it uses a BDB backend, so it's not RAM-only like memcached

0

Alister Bulman · Answer 3 · 2010-11-20T09:28:43+08:00

Alister Bulman

2010-11-20T09:28:43+08:002010-11-20T09:28:43+08:00

Beanstalkd is a simple job-queue system that matches your basic needs. It can use a binary log to provide persistance if the queue itself fails and will only allow one worker to have a job at once, though jobs are also set with a timeout, so if they are not deleted, or returned to the queue before that, they are made available again (in case of worker problems).

I did a presentation on beanstalkd for a local user-group, which has some more information.

0

Arenstar · Answer 4 · 2010-11-20T10:45:53+08:00

I just went through this in my latest architecture planning..

Basically.. "the message queues".. all have problems that none of them guarantee both of the follow characteristics at the same time..

Guarantee recieving a message
Guarantee no duplicate messages

So what is currently offered as an open source solution cannot perform these two imperative tasks simultaneously.. (unless your want to spend 50K with IBM)

There is one great video which suggests that cassandra can handle this with quorum reads/writes, but is not taking into account concurrency on a high scale:/

In the end i settled on REDIS actually ( i avoided the clustered solution )

Simply and effectively single threaded.. (to avoid duplicity) Offers a atomic BlockonPop or even a multicast pubsubhubbub feature for queue workers..

a homegrown solution was developed to manage "lost jobs" that never arrived.. ( reliability )

Its quite a simple model actually.. seemingly easy to maintain aswell..

Hope this helps..

dkam · Answer 5 · 2011-02-23T02:44:04+08:00

dkam

2011-02-23T02:44:04+08:002011-02-23T02:44:04+08:00

I've used Beanstalkd for this type of task. It can be configured to persist jobs to disk between reboots. To help with removing duplicates, I pushed job identifier into memcached - if the job was in memcache already, delete it rather than queueing in Beanstalkd.

0

Which Message Queue should I choose (must run on Linux)

Ping a Specific Port

How do I tell Git for Windows where to find my private RSA key?

How do you restart php-fpm?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?