I often deal with very large log files (>3 GB). I've noticed that the performance of less is terrible with these files. Often I want to jump to the middle of the file, but when I tell less to jump forward 15 million lines, it takes minutes.
The problem, I imagine, is that less needs to scan the file for '\n' characters, and that takes too long.
Is there a way to make it just seek to an explicit offset? e.g. seek to byte offset 1.5 billion in the file. This operation should be orders of magnitude faster. If less does not provide such an ability, is there another tool that does?
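As a rough illustration of why an explicit byte seek should be so much faster: a tool can jump straight to an offset without reading anything before it. A minimal sketch using dd (the tiny file and /tmp path here are stand-ins, not from the question):

```shell
# Seeking is O(1): dd can skip straight to a byte offset without scanning
# the preceding bytes for newlines.
printf 'abcdefghij' > /tmp/seekdemo.txt      # stand-in for a huge log
dd if=/tmp/seekdemo.txt bs=1 skip=6 2>/dev/null   # prints "ghij"
rm /tmp/seekdemo.txt
```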
You can stop less from counting lines like this:
less -n
To jump to a specific place, say 50% in:
less -n +50p /some/log
This was instant for me on a 1.5 GB log file.
Edit: For a specific byte offset:
less -n +500000000P ./blah.log
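If you'd rather compute the offset than hard-code it, you can derive it from the file size. A sketch assuming GNU stat (the demo file name is hypothetical):

```shell
# Compute the byte offset of the middle of a file, to feed to less's P command.
printf 'abcdefghij' > /tmp/demo.log        # stand-in for a huge log
size=$(stat -c %s /tmp/demo.log)           # file size in bytes (GNU stat)
echo $((size / 2))                          # midpoint; use as: less -n +5P /tmp/demo.log
rm /tmp/demo.log
```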
Less, being a pager, is inherently line-oriented. When you start it up on a large file, it says "counting line numbers", and you can hit ESC to stop that; but otherwise, it deals in lines. That's what it does.
If you want to jump straight into the middle of file and skip the beginning, you can always just seek past the beginning; I'd do something like
tail -c +15000000 /some/log | less
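To see what tail -c +N actually does, here is a tiny self-contained demo (the file and offset are illustrative, not from the answer above): it starts output at byte N, counting from 1.

```shell
# tail -c +N emits the file starting at byte N (1-indexed).
printf 'abcdefghij' > /tmp/taildemo.txt    # stand-in for a huge log
tail -c +4 /tmp/taildemo.txt               # prints "defghij"
rm /tmp/taildemo.txt
```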
less seems to have a small overhead from the locale settings. If you're using only ASCII characters, you can speed it up a bit by forcing the C locale, e.g.:
LC_ALL=C less /some/log
In my case, the throughput increased from ~30 MiB/s to ~50 MiB/s (the rate is CPU-bound).