Ping a Specific Port

Question

Bang Dao

Asked: 2014-04-11 20:15:13 +0800 CST2014-04-11 20:15:13 +0800 CST 2014-04-11 20:15:13 +0800 CST

In Hadoop, how to show current process of -copyFromLocal

772

I am still a newbie learner of Hadoop, and this time I was trying to process a 106GB file. I used -copyFromLocal to copy that big file to my Hadoop DFS, but since the file is big I have to wait for a long time without a clue about the current copying status.

Is there any way to show the current file copying status with this command?

Thank you guys in advance for your help!

4 Answers

Voted

datarockz2 · Answer 1 · 2014-09-24T19:30:56+08:00

Best Answer

datarockz2

2014-09-24T19:30:56+08:002014-09-24T19:30:56+08:00

CopyFromLocal does not have the ability to display the file copy progress. Alternatively, you could open another shell and run the $ watch hadoop fs -ls <filenameyouarecopying>. This will display the file and its size once every 2.0 seconds.

15

Alexander Rodin · Answer 2 · 2016-11-09T12:14:16+08:00

Alexander Rodin

2016-11-09T12:14:16+08:002016-11-09T12:14:16+08:00

It is also possible to track the progress of reading of the local file using pv command and pipe the file content to hdfs dfs stdin:

pv mylargefile.txt | hdfs dfs -put - /path/to/file/on/hdfs/mylargefile.txt

4

Travis Campbell · Answer 3 · 2014-04-22T08:49:40+08:00

Travis Campbell

2014-04-22T08:49:40+08:002014-04-22T08:49:40+08:00

It doesn't look like there's a verbose option to any of the copy commands (copyFromLocal, copyToLocal, get, put). Your best bet is probably to look at the size of the file at it's destination on HDFS in order to gauge it's progress.

1

Anan · Answer 4 · 2015-03-16T00:47:29+08:00

Anan

2015-03-16T00:47:29+08:002015-03-16T00:47:29+08:00

You can use "nohup &" to execute the copying as a background process. nohup will make the process to execute even after you log out of the server. When ever you need, you can check the process using "hadoop fs -ls .

1

In Hadoop, how to show current process of -copyFromLocal

Can you pass user/pass for HTTP Basic Authentication in URL parameters?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?