How can I use docker without sudo?

Question

Amin

Asked: 2019-08-27 04:29:19 +0800 CST2019-08-27 04:29:19 +0800 CST 2019-08-27 04:29:19 +0800 CST

Why does uniq seem to keep some consecutive identical lines?

772

Without uniq:

amin@ubuntu:~/Desktop$ cut -f 1 info.log | tail -n +2 | head -n -1 | sort
Abol
Abol 
Ahmad
Akbar
Arash
Hadi 
Hamed
Mahmood
Maryam
Maryam
Mohsen
NIma
Rasool
Sadegh
Sepide
Sepide

With uniq:

amin@ubuntu:~/Desktop$ cut -f 1 info.log | tail -n +2 | head -n -1 | sort | uniq
Abol
Abol 
Ahmad
Akbar
Arash
Hadi 
Hamed
Mahmood
Maryam
Mohsen
NIma
Rasool
Sadegh
Sepide
Sepide

As you see result are same in both, why?

1 Answers

Voted

Eliah Kagan · Answer 1 · 2019-08-27T04:49:56+08:00

TL;DR: The lines have different whitespace (possibly spaces) at the end.

This happens when you have lines that look the same but are are actually different due to characters that don't display in your terminal, usually at the end. Often these are are trailing spaces (as fkraiem suggested) or inconsistent line terminators.

You might expect that starting the pipeline, as you do, with cut, would prevent this. It doesn't, though. cut uses a tab as its default delimiter. (Readers who wish to verify this behavior--and its relevance to having unexpected duplicate lines after uniq--can try cut -f 1 <<<$'foo\nfoo ' | uniq, which prints two lines.)

The solution in your case is probably to use something other than cut -f 1 to select the fields. In particular, if the fields are separated by spaces instead of tabs--whether by a single space or multiple spaces, and even if the number of spaces is different in different records--then you can use cut -d' ' -f 1 instead, specifying space as the delimiter character. Or you might not want to use cut at all, but instead use awk '{ print $1 }', which prints the first field, taking any sequence of consecutive spaces and tabs as the delimiter.

You could alternatively strip the trailing whitespace, though this makes your command even more complicated. One way to do that would be by piping your text through sed -E 's/[[:space:]]+$//' before it goes to uniq.

As a side note, if whatever command you ultimately use still ends up piping the output of sort directly to uniq, you might consider just using sort -u for that instead.

Why does uniq seem to keep some consecutive identical lines?

TL;DR: The lines have different whitespace (possibly spaces) at the end.

How to install Google Chrome

Is there a command to list all users? Also to add, delete, modify users, in the terminal?

How to delete a non-empty directory in Terminal?

How to unzip a zip file from the Terminal?

How can I copy the contents of a folder to another folder in a different directory using terminal?

How do I install a .deb file via the command line?

How do I run .sh scripts?

How do I install a .tar.gz (or .tar.bz2) file?

How to list all installed packages

Unable to lock the administration directory (/var/lib/dpkg/) is another process using it?