I want to sort a file (removing duplicates) that contains a wordlist of almost 25 GB. I am using the sort
command in an Ubuntu terminal, but it takes hours to produce the sorted file. The command is:
sort -u input.txt > output.txt
Is there a more efficient way to do the same thing?
From my personal experience: if you want unique lines make sure you also use
I see speed improvements by a factor of 10, but it probably depends on the characters in the file (I often use it for translations, so my files contain accented characters).
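The exact setting this answer refers to is missing from the text. One option that fits the remark about accented characters, and this is only an assumption about what was meant, is to force the C locale so that sort compares raw bytes instead of applying locale collation rules. A minimal sketch, where the function name sort_unique_bytes and the file names are hypothetical:

```shell
# Hypothetical helper (not from the original answer): sort a file and drop
# duplicate lines using plain byte-wise comparison.
# Assumption: LC_ALL=C is the setting the answer had in mind.
sort_unique_bytes() {
    # $1 = input file, $2 = output file
    # LC_ALL=C applies only to this one sort invocation.
    LC_ALL=C sort -u -- "$1" > "$2"
}

# Example call (file names taken from the question):
# sort_unique_bytes input.txt output.txt
```

Note that byte order is not the same as dictionary order in most locales: uppercase ASCII letters sort before lowercase ones, and accented characters sort after all of ASCII. That is usually fine when the goal is only to remove duplicates.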
A really incredible tool is xsv (https://github.com/BurntSushi/xsv). Running it on a file with 3173959 rows,
I get the output in 3 seconds.