[sf-lug] regex: how to match any one to four character word in a file
jim
jim at well.com
Wed Dec 10 10:12:57 PST 2008
from charles-henri (using an email address not
registered on the sf-lug mailing list):
jim wrote:
> i've given up on the online tutorials.
>
> i have a text file with over 100000 words (and lines,
> one word per line). i wanna grep out all words that
> are from one to four characters, e.g. 'a' or 'and'
> or "fact" but not "apple" or "zounds".
>
> $ grep '[.]{4}' words.txt
> got me a newline.
>
[.] will match a literal '.'
'{' needs to be escaped. \{$4\} will match exactly 4
Also, you need to anchor your regex (with ^ and $)
So:
grep '^.\{1,4\}$'
--
Charles-Henri
More information about the sf-lug
mailing list