I have a large set of log files that I need to extract data from. Is it possible to use Flume to read these files and dump them into HDFS (or Cassandra, or another data store) that I can then query?
The documentation seems to suggest it's all live, event-based log processing. I'm wondering if I'm missing some obvious way to have Flume simply read and process static log files from a directory.
Yes, this is the standard use case for Flume.
The server with the log files will run a Flume node, and another (or potentially the same) server will run the Flume master. The nodes discover the master, and from the master you can execute commands like the following:
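For example, from the flume shell connected to the master, a one-node setup might be configured along these lines (the node name logserver-1, the file path, and the HDFS directory are all placeholders for illustration):

    exec config logserver-1 'text("/var/log/app/app.log")' 'collectorSink("hdfs://namenode/flume/applogs/", "applog")'

Here text(...) is a source that reads the whole file once, and collectorSink(...) writes the resulting events into an HDFS directory with the given file prefix.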
This creates a configuration that tells Flume how to access the file (it can tail the file or read it in its entirety; other source types are available) and where to put the data.
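If you instead want to follow a growing file, or sweep every file in a directory of static logs, you can swap out the source; a rough sketch using the same placeholder node and paths:

    exec config logserver-1 'tail("/var/log/app/app.log")' 'collectorSink("hdfs://namenode/flume/applogs/", "applog")'
    exec config logserver-1 'tailDir("/var/log/app/")' 'collectorSink("hdfs://namenode/flume/applogs/", "applog")'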
Then it is a matter of mapping the configuration onto a particular server:
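In the flume shell that amounts to mapping the logical node onto the physical host that holds the files, e.g. (the hostname is a placeholder):

    exec map host1.example.com logserver-1

Alternatively, if the node name you used in the config command is the machine's hostname itself, no explicit mapping step should be needed.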
There is more information in the Flume User Guide: http://archive.cloudera.com/cdh/3/flume/UserGuide/index.html