How can I use docker without sudo?

Question

αғsнιη

Asked: 2015-01-05 09:13:10 +0800 CST2015-01-05 09:13:10 +0800 CST 2015-01-05 09:13:10 +0800 CST

How can I get lines where a specific word is repeated exactly N times?

772

For this given input:

How to get This line that this word repeated 3 times in THIS line?
But not this line which is THIS word repeated 2 times.
And I will get This line with this here and This one
A test line with four this and This another THIS and last this

I want this output:

How to get This line that this word repeated 3 times in THIS line?
And I will get This line with this here and This one

Getting whole lines contains only three repeated "this" words. (case insensitive match)

7 Answers

Voted

muru · Answer 1 · 2015-01-05T10:13:58+08:00

Best Answer

muru

2015-01-05T10:13:58+08:002015-01-05T10:13:58+08:00

In perl, replace this with itself case-insensitively and count the number of replacements:

$ perl -ne 's/(this)/$1/ig == 3 && print' <<EOF
How to get This line that this word repeated 3 times in THIS line?
But not this line which is THIS word repeated 2 times.
And I will get This line with this here and This one
A test line with four this and This another THIS and last this
EOF
How to get This line that this word repeated 3 times in THIS line?
And I will get This line with this here and This one

Using a count of matches instead:

perl -ne 'my $c = () = /this/ig; $c == 3 && print'

If you have GNU awk, a very simple way:

gawk -F'this' -v IGNORECASE=1 'NF == 4'

The number of fields will be one more than the number of separators.

13

Jacob Vlijm · Answer 2 · 2015-01-05T09:53:02+08:00

In python, this would do the job:

#!/usr/bin/env python3

s = """How to get This line that this word repeated 3 times in THIS line?
But not this line which is THIS word repeated 2 times.
And I will get This line with this here and This one
A test line with four this and This another THIS and last this"""

for line in s.splitlines():
    if line.lower().count("this") == 3:
        print(line)

outputs:

How to get This line that this word repeated 3 times in THIS line?
And I will get This line with this here and This one

Or to read in from a file, with the file as argument:

#!/usr/bin/env python3
import sys

file = sys.argv[1]

with open(file) as src:
    lines = [line.strip() for line in src.readlines()]

for line in lines:
    if line.lower().count("this") == 3:
        print(line)

Paste the script into an empty file, save it as find_3.py, run it by the command:
```
python3 /path/to/find_3.py <file_withlines>
```

Of course the word "this" can be replaced by any other word (or other string or line section), and the number of occurrences per line can be set to any other value in the line:

    if line.lower().count("this") == 3:

Edit

If the file would be large (hundreds of thousands / millions of lines), the code below would be faster; it reads the file per line instead of loading the file at once:

#!/usr/bin/env python3
import sys
file = sys.argv[1]

with open(file) as src:
    for line in src:
        if line.lower().count("this") == 3:
            print(line.strip())

Sri · Answer 3 · 2015-01-05T10:54:08+08:00

Sri

2015-01-05T10:54:08+08:002015-01-05T10:54:08+08:00

Assuming your source file is tmp.txt,

grep -iv '.*this.*this.*this.*this' tmp.txt | grep -i '.*this.*this.*this.*'

The left grep outputs all lines that do not have 4 or more case-insensitive occurrences of "this" in tmp.txt.

The result is piped to the right grep, which outputs all lines with 3 or more occurrences in the left grep result.

Update: Thanks to @Muru, here is the better version of this solution,

grep -Eiv '(.*this){4,}' tmp.txt | grep -Ei '(.*this){3}'

replace 4 with n+1 and 3 with n.

9

fedorqui · Answer 4 · 2015-01-06T06:15:48+08:00

fedorqui

2015-01-06T06:15:48+08:002015-01-06T06:15:48+08:00

You can play a bit with awk for this:

awk -F"this" 'BEGIN{IGNORECASE=1} NF==4' file

This returns:

How to get This line that this word repeated 3 times in THIS line?
And I will get This line with this here and This one

Explanation

What we do is to define the field separator to this itself. This way, the line will have as many fields +1 as times the word this appears.
To make it case insensitive, we use IGNORECASE = 1. See reference: Case Sensitivity in Matching.
Then, it is just a matter of saying NF==4 to get all those lines having this exactly three times. No more code is needed, since {print $0} (that is, print the current line) is the default behaviour of awk when an expression evaluates to True.

6

xyz · Answer 5 · 2015-01-05T10:03:38+08:00

xyz

2015-01-05T10:03:38+08:002015-01-05T10:03:38+08:00

Assuming the lines are stored in a file named FILE:

while read line; do 
    if [ $(grep -oi "this" <<< "$line" | wc -w)  = 3 ]; then 
        echo "$line"; 
    fi  
done  <FILE

5

Bohr · Answer 6 · 2015-01-05T21:44:28+08:00

Bohr

2015-01-05T21:44:28+08:002015-01-05T21:44:28+08:00

If you're in Vim:

g/./if len(split(getline('.'), 'this\c', 1)) == 4 | print | endif

This will just print matched lines.

4

Sergiy Kolodyazhnyy · Answer 7 · 2017-01-08T02:37:04+08:00

Sergiy Kolodyazhnyy

2017-01-08T02:37:04+08:002017-01-08T02:37:04+08:00

Ruby one-liner solution:

$ ruby -ne 'print $_ if $_.chomp.downcase.scan(/this/).count == 3' < input.txt                                    
How to get This line that this word repeated 3 times in THIS line?
And I will get This line with this here and This one

Works in a quite simple fashion: we redirect file into ruby's stdin, ruby gets line from stdin, cleans it up with chomp and downcase, and scan().count gives us number of occurrences of a substring.

0

How can I get lines where a specific word is repeated exactly N times?

Edit

Explanation

How to install Google Chrome

Is there a command to list all users? Also to add, delete, modify users, in the terminal?

How to delete a non-empty directory in Terminal?

How to unzip a zip file from the Terminal?

How can I copy the contents of a folder to another folder in a different directory using terminal?

How do I install a .deb file via the command line?

How do I run .sh scripts?

How do I install a .tar.gz (or .tar.bz2) file?

How to list all installed packages

Unable to lock the administration directory (/var/lib/dpkg/) is another process using it?