Ping a Specific Port

Question

Alex L

Asked: 2009-08-07 08:34:18 +0800 CST2009-08-07 08:34:18 +0800 CST 2009-08-07 08:34:18 +0800 CST

Don't need the whole line, just the match from regular expression

772

I simply need to get the match from a regular expression:

$ cat myfile.txt | SOMETHING_HERE "/(\w).+/"

The output has to be only what was matched, inside the parenthesis.

Don't think I can use grep because it matches the whole line.

Please let me know how to do this.

7 Answers

Voted

Amandasaurus · Answer 1 · 2009-08-07T08:36:48+08:00

Amandasaurus

2009-08-07T08:36:48+08:002009-08-07T08:36:48+08:00

Use the -o option in grep.

Eg:

$ echo "foobarbaz" | grep -o 'b[aeiou]r'
bar

24

DrYak · Answer 2 · 2015-01-21T05:21:08+08:00

Best Answer

DrYak

2015-01-21T05:21:08+08:002015-01-21T05:21:08+08:00

2 Things:

As stated by @Rory, you need the -o option, so only the match are printed (instead of whole line)
In addition, you neet the -P option, to use Perl regular expressions, which include useful elements like Look ahead (?= ) and Look behind (?<= ), those look for parts, but don't actually match and print them.

If you want only the part inside the parenthesis to be matched, do the following:

grep -oP '(?<=\/\()\w(?=\).+\/)' myfile.txt

If the file contains the sting /(a)5667/, grep will print 'a', because:

/( are found by \/\(, but because they are in a look-behind (?<= ) they are not reported
a is matched by \w and is thus printed (because of -o )
)5667/ are found by \).+\/, but because they are in a look-ahead (?= ) they are not reported

23

Joshua · Answer 3 · 2016-04-23T07:58:03+08:00

Joshua

2016-04-23T07:58:03+08:002016-04-23T07:58:03+08:00

    sed -n "s/^.*\(captureThis\).*$/\1/p"

-n      don't print lines
s       substitute
^.*     matches anything before the captureThis 
\( \)   capture everything between and assign it to \1 
.*$     matches anything after the captureThis 
\1      replace everything with captureThis 
p       print it

18

DrYak · Answer 4 · 2015-01-21T05:47:54+08:00

DrYak

2015-01-21T05:47:54+08:002015-01-21T05:47:54+08:00

Because you tagged your question as bash in addition to shell, there is another solution beside grep :

Bash has its own regular expression engine since version 3.0, using the =~ operator, just like Perl.

now, given the following code:

#!/bin/bash
DATA="test <Lane>8</Lane>"

if [[ "$DATA" =~ \<Lane\>([[:digit:]]+)\<\/Lane\> ]]; then
        echo $BASH_REMATCH
        echo ${BASH_REMATCH[1]}
fi

Note that you have to invoke it as bashand not just sh in order to get all extensions
$BASH_REMATCH will give the whole string as matched by the whole regular expression, so <Lane>8</Lane>
${BASH_REMATCH[1]} will give the part matched by the 1st group, thus only 8

8

Kyle Brandt · Answer 5 · 2009-08-07T09:38:10+08:00

Kyle Brandt

2009-08-07T09:38:10+08:002009-08-07T09:38:10+08:00

If you want only what is in the parenthesis, you need something that supports capturing sub matches (Named or Numbered Capturing Groups). I don't think grep or egrep can do this, perl and sed can. For example, with perl:

If a file called foo has a line in that is as follows:

/adsdds      /

And you do:

perl -nle 'print $1 if /\/(\w).+\//' foo

The letter a is returned. That might be not what you want though. If you tell us what you are trying to match, you might get better help. $1 is whatever was captured in the first set of parenthesis. $2 would be the second set etc.

4

user427450 · Answer 6 · 2017-07-23T00:01:51+08:00

user427450

2017-07-23T00:01:51+08:002017-07-23T00:01:51+08:00

Assuming the file contains:

$ cat file
Text-here>xyz</more text

And you want the character(s) between > and </ , you can use either:

grep grep -oP '.*\K(?<=>)\w+(?=<\/)' file
sed sed -nE 's:^.*>(\w+)</.*$:\1:p' file
awk awk '{print(gensub("^.*>(\\w+)</.*$","\\1","g"))}' file
perl perl -nle 'print $1 if />(\w+)<\//' file

All will print a string "xyz".

If you want to capture the digits of this line:

$ cat file
Text-<here>1234</text>-ends

grep grep -oP '.*\K(?<=>)[0-9]+(?=<\/)' file
sed sed -E 's:^.*>([0-9]+)</.*$:\1:' file
awk awk '{print(gensub(".*>([0-9]+)</.*","\\1","g"))}' file
perl perl -nle 'print $1 if />([0-9]+)<\//' file

4

Chad Huneycutt · Answer 7 · 2009-08-07T10:02:20+08:00

Chad Huneycutt

2009-08-07T10:02:20+08:002009-08-07T10:02:20+08:00

This will accomplish what you are requesting, but I don't think it is what you really want. I put the .* in the front of the regex to eat up anything before the match, but that is a greedy operation, so this only matches the penultimate \w character in the string.

Note that you need to escape the parens and the +.

sed 's/.*\(\w\).\+/\1/' myfile.txt

0

Don't need the whole line, just the match from regular expression

Ping a Specific Port

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What's the command-line utility in Windows to do a reverse DNS look-up?

How to check if a port is blocked on a Windows machine?

What port should I open to allow remote desktop?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?