A beginner’s guide to regular expressions with grep

A regular expression (also called a regex or regexp) is a rule that a computer can use to match characters or groups of characters within a larger body of text. For instance, using regular expressions, you could find all the instances of the word cat in a document, or all instances of a word that begins with c and ends with t.

Use of regular expressions in the real world can get much more complex—and powerful—than that. For example, imagine you need to write code verifying that all content in the body of an HTTP POST request is free of script injection attacks. Malicious code can appear in any number of ways, but you know that injected script code will always appear between <script></script> HTML tags. You can apply the regular expression <script>.*<\/script>, which matches any block of code text bracketed by <script> tags, to the HTTP request body as part of your search for script injection code.

This example is but one of many uses for regular expressions. In this series, you'll learn more about how the syntax for this and other regular expressions work.

As just demonstrated, a regex can be a powerful tool for finding text according to a particular pattern in a variety of situations. Once mastered, regular expressions provide developers with the ability to locate patterns of text in source code and documentation at design time. You can also apply regular expressions to text that is subject to algorithmic processing at runtime such as content in HTTP requests or event messages.

Regular expressions are supported by many programming languages, as well as classic command-line applications such as awk, sed, and grep, which were developed for Unix many decades ago and are now offered on GNU/Linux.

This article examines the basics of using regular expressions under grep. The article shows how you can use a regular expression to declare a pattern that you want to match, and outlines the essential building blocks of regular expressions, with many examples. This article assumes no prior knowledge of regular expressions, but you should understand how to with the Linux operating system at the command line.

What are regular expressions, and what is grep?

As we've noted, a regular expression is a rule used for matching characters in text. These rules are declarative, which means they are immutable: once declared, they do not change. But a single rule can be applied to any variety of situations.

Regular expressions are written in a special language. Although this language has been standardized, dialects vary from one regular expression engine to another. For example, JavaScript has a regex dialect, as do C++, Java, and Python.

This article uses the regular expression dialect that goes with the Linux grep command, with an extension to support more powerful features. grep is a binary executable that filters content in a file or output from other commands (stdout). Regular expressions are central to grep: The re in the middle of the name stands for "regular expression."

This article uses grep because it doesn't require that you set up a particular coding environment or write any code to work with the examples of regular expressions demonstrated in this article. All you need to do is copy and paste an example onto the command line of a Linux terminal and you'll see results immediately. The grep command can be used in any shell.

Because this article focuses on regular expressions as a language, and not on manipulating files, the examples use samples of text piped to grep instead of input files.

How to use grep against content in a file

To print lines in a file that match a regular expression, use the following syntax:

$ grep -options <regular_expression> /paths/to/files

A beginner’s guide to regular expressions with grep

Share:

What are regular expressions, and what is grep?

How to use grep against content in a file

How to pipe content to a regular expression

Regular characters, metacharacters, and patterns: The building blocks of regular expressions

Running basic regular expressions

How to declare an exact pattern match using regular characters

How to declare a case-insensitive exact pattern match

How to declare a logical pattern match

How to find a character at the beginning of a line

How to find a character at the end of a line

How to find multiple characters at the end of a line

How to find occurrences of a character using the metacharacters for matching numerals

How to find a string using metacharacters for a numeral and a space

How to combine metacharacters to create a complex regular expression

How to traverse a line of text to a stop point

Regular expressions uncover patterns in text

Learn more

Products

Build

Quicklinks

Communicate

RED HAT DEVELOPER

Red Hat legal and privacy links

Red Hat legal and privacy links

Report a website issue