PHP | Regular Expressions
Last Updated :
10 Feb, 2022
Regular expressions commonly known as a regex (regexes) are a sequence of characters describing a special search pattern in the form of text string. They are basically used in programming world algorithms for matching some loosely defined patterns to achieve some relevant tasks. Some times regexes are understood as a mini programming language with a pattern notation which allows the users to parse text strings. The exact sequence of characters are unpredictable beforehand, so the regex helps in fetching the required strings based on a pattern definition.
Regular Expression is a compact way of describing a string pattern that matches a particular amount of text. As you know, PHP is an open-source language commonly used for website creation, it provides regular expression functions as an important tool. Like PHP, many other programming languages have their own implementation of regular expressions. This is the same with other applications also, which have their own support of regexes having various syntaxes. Many available modern languages and tools apply regexes on very large files and strings. Let us look into some of the advantages and uses of regular expressions in our applications.
Advantages and uses of Regular expressions:
In many scenarios, developers face problems whenever data are collected in free text fields as most of the programming deals with data entries. Regular expressions are used almost everywhere in today’s application programming.
- Regular expressions help in validation of text strings which are of programmer’s interest.
- It offers a powerful tool for analyzing, searching a pattern and modifying the text data.
- It helps in searching specific string pattern and extracting matching results in a flexible manner.
- It helps in parsing text files looking for a defined sequence of characters for further analysis or data manipulation.
- With the help of in-built regexes functions, easy and simple solutions are provided for identifying patterns.
- It effectively saves a lot of development time, which are in search of specific string pattern.
- It helps in important user information validations like email address, phone numbers and IP address.
- It helps in highlighting special keywords in a file based on search result or input.
- It helps in identifying specific template tags and replacing those data with the actual data as per the requirement.
- Regexes are very useful for creation of HTML template system recognizing tags.
- Regexes are mostly used for browser detection, spam filtration, checking password strength and form validations.
We cannot cover everything under this topic, but let us look into some of the major regular expression concepts. The following table shows some regular expressions and the corresponding string which matches the regular expression pattern.
Regular Expression |
Matches |
geeks |
The string “geeks” |
^geeks |
The string which starts with “geeks” |
geeks$ |
The string which have “geeks” at the end. |
^geeks$ |
The string where “geeks” is alone on a string. |
[abc] |
a, b, or c |
[a-z] |
Any lowercase letter |
[^A-Z] |
Any letter which is NOT a uppercase letter |
(gif|png) |
Either “gif” or “png” |
[a-z]+ |
One or more lowercase letters |
^[a-zA-Z0-9]{1, }$ |
Any word with at least one number or one letter |
([ax])([by]) |
ab, ay, xb, xy |
[^A-Za-z0-9] |
Any symbol other than a letter or other than number |
([A-Z]{3}|[0-9]{5}) |
Matches three letters or five numbers |
Note: Complex search patterns can be created by applying some basic regular expression rules. Even many arithmetic operators like +, ^, – are used by regular expressions for creating little complex patterns.
Operators in Regular Expression: Let us look into some of the operators in PHP regular expressions.
Operator |
Description |
^ |
It denotes the start of string. |
$ |
It denotes the end of string. |
. |
It denotes almost any single character. |
() |
It denotes a group of expressions. |
[] |
It finds a range of characters for example [xyz] means x, y or z . |
[^] |
It finds the items which are not in range for example [^abc] means NOT a, b or c. |
– (dash) |
It finds for character range within the given item range for example [a-z] means a through z. |
| (pipe) |
It is the logical OR for example x | y means x OR y. |
? |
It denotes zero or one of preceding character or item range. |
* |
It denotes zero or more of preceding character or item range. |
+ |
It denotes one or more of preceding character or item range. |
{n} |
It denotes exactly n times of preceding character or item range for example n{2}. |
{n, } |
It denotes atleast n times of preceding character or item range for example n{2, }. |
{n, m} |
It denotes atleast n but not more than m times for example n{2, 4} means 2 to 4 of n. |
\ |
It denotes the escape character. |
Special Character Classes in Regular Expressions: Let us look into some of the special characters used in regular expressions.
Special Character |
Meaning |
\n |
It denotes a new line. |
\r |
It denotes a carriage return. |
\t |
It denotes a tab. |
\v |
It denotes a vertical tab. |
\f |
It denotes a form feed. |
\xxx |
It denotes octal character xxx. |
\xhh |
It denotes hex character hh. |
Shorthand Character Sets: Let us look into some shorthand character sets available.
Shorthand |
Meaning |
\s |
Matches space characters like space, newline or tab. |
\d |
Matches any digit from 0 to 9. |
\w |
Matches word characters including all lower and upper case letters, digits and underscore. |
Predefined functions or Regex library: Let us look into the quick cheat sheet of pre-defined functions for regular expressions in PHP. PHP provides the programmers to many useful functions to work with regular expressions.
The below listed in-built functions are case-sensitive.
Function |
Definition |
preg_match() |
This function searches for a specific pattern against some string. It returns true if pattern exists and false otherwise. |
preg_match_all() |
This function searches for all the occurrences of string pattern against the string. This function is very useful for search and replace. |
ereg_replace() |
This function searches for specific string pattern and replace the original string with the replacement string, if found. |
eregi_replace() |
The function behaves like ereg_replace() provided the search for pattern is not case sensitive. |
preg_replace() |
This function behaves like ereg_replace() function provided the regular expressions can be used in the pattern and replacement strings. |
preg_split() |
The function behaves like the PHP split() function. It splits the string by regular expressions as its parameters. |
preg_grep() |
This function searches all elements which matches the regular expression pattern and returns the output array. |
preg_quote() |
This function takes string and quotes in front of every character which matches the regular expression. |
ereg() |
This function searches for a string which is specified by a pattern and returns true if found, otherwise returns false. |
eregi() |
This function behaves like ereg() function provided the search is not case sensitive. |
Note:
- By default, regular expressions are case sensitive.
- There is a difference between strings inside single quotes and strings inside double quotes in PHP. The former are treated literally, whereas for the strings inside double-quotes means the content of the variable is printed instead of just printing their names.
Example 1:
<?php
$regex = '/^[a-zA-Z ]*$/' ;
$nameString = 'Sharukh khan' ;
if (preg_match( $regex , $nameString )) {
echo ( "Name string matching with"
. " regular expression" );
}
else {
echo ( "Only letters and white space"
. " allowed in name string" );
}
?>
|
Output:
Name string matching with regular expression
Example 2:
<?php
$regex = "/<b>(.*)<\/b>/U" ;
$inputString = "Name: <b>John</b> Position: <b>Developer</b>" ;
preg_match_all( $regex , $inputString , $output );
echo $output [0][0]. " <br> " . $output [0][1]. "\n" ;
?>
|
Output:
John
Developer
Example 3:
<?php
$regex = "([0-9]+)" ;
$original = "Completed graduation in 2004" ;
$replaceWith = "2002" ;
$original = ereg_replace ( $regex , $replaceWith , $original );
echo $original ;
?>
|
Output:
Completed graduation in 2002
Example 4:
<?php
<?php
$ip = "134.645.478.670" ;
$regex = "/\./" ;
$output = preg_split ( $regex , $ip );
echo "$output[0] <br>" ;
echo "$output[1] <br>" ;
echo "$output[2] <br>" ;
echo "$output[3] <br>" ;
?>
|
Output:
134
645
478
670
Metacharacters: There are two kinds of characters that are used in regular expressions these are: Regular characters and Metacharacters. Regular characters are those characters which have a ‘literal’ meaning and Metacharacters are those characters which have ‘special’ meaning in regular expression.
Metacharacter |
Description |
Example |
. |
It matches any single character other than a new line. |
/./ matches string which has a single character. |
^ |
It matches the beginning of string. |
/^geeks/ matches any string that starts with geeks. |
$ |
It matches the string pattern at the end of the string. |
/com$/ matches string ending with com for example google.com etc. |
* |
It matches zero or more characters. |
/com*/ matches commute, computer, compromise etc. |
+ |
It matches preceding character appear atleast once. |
For example /z+oom/ matches zoom. |
\ |
It is used to esacape metacharacters in regex. |
/google\.com/ will treat the dot as a literal value, not metacharacter. |
a-z |
It matches lower case letters. |
geeks |
A-Z |
It matches upper case letters. |
GEEKS |
0-9 |
It matches any number between 0 and 9. |
/0-5/ matches 0, 1, 2, 3, 4, 5 |
[…] |
It matches character class. |
/[pqr]/ matches pqr |
Other Examples:
Regular expression |
Meaning |
^[.-a-z0-9A-Z] |
It matches string with dot, dash and any lower case letters, numbers between 0 and 9 and upper case letters. |
+@[a-z0-9A-Z] |
It matches string with @ symbol in the beginning followed by any lower case letters, numbers between 0 and 9 and upper case letters. |
+\.[a-z]{2, 6}$/ |
It escapes the dot and then matches string with any lower case letters with string length between 2 and 6 at the end. |
Note:
- Metacharacters are very powerful in regular expressions pattern matching solutions. It handles a lot of complex pattern processing.
- Every character which is not a metacharacter is definitely a regular character.
- Every regular character matches the same character by itself.
POSIX Regular Expressions: Some regular expressions in PHP are like arithmetic expressions which are called POSIX regular expressions. Some times, complex expression are created by combining various elements or operators in regular expressions. The very basic regex is the one which matches a single character.
Lets look into some of the POSIX regular expressions.
Regex |
Meaning |
[0-9] |
It matches digit from 0 through 9. |
[a-z] |
It matches any lowercase letter from a through z. |
[A-Z] |
It matches any uppercase letter from A through Z. |
[a-Z] |
It matches any lowercase letter a through uppercase letter Z. |
[:lower:] |
It matches any lower case letters. |
[:upper:] |
It matches any upper case letters. |
[:alpha:] |
It matches all alphabetic characters or letters from a-z and A-Z. |
[[:alpha:]] |
It matches any string containing alphabetic characters or letters. |
[:alnum:] |
It matches all alphanumeric characters i.e all digits(0-9) and letters (a-z A-Z). |
[[:alnum:]] |
It matches any string containing alphanumeric characters and digits. |
[:digit:] |
It matches all the digits from 0 to 9. |
[[:digit:]] |
It matches any string containing digits from 0 to 9. |
[:xdigit:] |
It matches all the hexadecimal digits. |
[:punct:] |
It matches all the punctuation symbols. |
[:blank:] |
It matches blank characters like space and tab. |
[:space:] |
It matches all whitespace characters like line breaks. |
[[:space:]] |
It matches any string containing a space. |
[:cntrl:] |
It matches all control characters. |
[:graph:] |
It matches all visible or printed characters other than spaces and control characters. |
[:print:] |
It matches all printed characters and spaces other than control characters. |
[:word:] |
It matches all word characters like digits, letters and underscore. |
Quantifiers in Regular Expression: Quantifiers are special characters which tell the quantity, frequency or the number of instances or occurrence of bracketed character or group of characters. They are also called as greedy and lazy expressions. Let us look into some of the concepts and examples of quantifiers.
Quantifier |
Meaning |
a+ |
It matches the string containing at least one a. |
a* |
It matches the string containing zero or more a’s. |
a? |
It matches any string containing zero or one a’s. |
a{x} |
It matches letter ‘a’ exaclty x times . |
a{2, 3} |
It matches any string containing the occurrence of two or three a’s. |
a{2, } |
It matches any string containing the occurrence of at least two a’s. |
a{2} |
It matches any string containing the occurrence of exactly two a’s. |
a{, y} |
It matches any string containing the occurrence of not more than y number of a’s. |
a$ |
It matches any string with ‘a’ at the end of it. |
^a |
It matches any string with ‘a’ at the beginning of it. |
[^a-zA-Z] |
It matches any string pattern not having characters from a to z and A to Z. |
a.a |
It matches any string pattern containing a, then any character and then another a. |
^.{3}$ |
It matches any string having exactly three characters. |
Note:
- The $ character inside the expression will match the end of the target string.
- The *, ?, + symbols in a regular expression denotes the frequency of occurrence of a character. If it occurs zero or more time, zero or one time and one or more times.
- The ^ character inside the expression will match the beginning of the target string.
- The . metacharacter matches any single character other than the newline.
Commonly known regular expression engines:
Conclusion: A regular expression is a pattern that describes some string text in a particular pattern or it is defined as a pattern-matching algorithm expressed in a single line. Regular expressions are very useful in the programming world for validation checks and recognizing specific templates. PHP provides many in-built functions supporting regular expressions. Metacharacters helps in creating complex patterns.
Similar Reads
PHP Tutorial
PHP is a widely used, open-source server-side scripting language primarily designed for web development. It is embedded directly into HTML and generates dynamic content on web pages, making it essential for building data-driven websites. It allows developers to handle database interactions, session
10 min read
PHP Introduction
PHP stands for Hypertext Preprocessor. It is an open-source, widely used language for web development. Developers can create dynamic and interactive websites by embedding PHP code into HTML. PHP can handle data processing, session management, form handling, and database integration. The latest versi
9 min read
PHP Basics
How to install PHP in Windows 10?
PHP is a general-purpose scripting language geared towards web development. It is an open-source software (OSS), which is free to download and use. PHP stands for "Hypertext Preprocessor". PHP files are saved with an extension called .php. It supports many databases MySQL, Oracle, etc. Installation
2 min read
How to Install PHP on Linux?
PHP is a popular server-side scripting language that is especially used in web development. If you're working on a Linux environment, whether it's a personal development setup or a production server, you will likely need PHP installed. In this article, we will see the step-by-step guide to install P
2 min read
PHP Syntax
PHP, a powerful server-side scripting language used in web development. Itâs simplicity and ease of use makes it an ideal choice for beginners and experienced developers. This article provides an overview of PHP syntax. PHP scripts can be written anywhere in the document within PHP tags along with n
4 min read
How to write comments in PHP ?
Comments are non-executable lines of text in the code that are ignored by the PHP interpreter. Comments are an essential part of any programming language. It help developers to understand the code, provide explanations, and make the codebase more maintainable. Types of Comments in PHPPHP supports tw
1 min read
PHP Variables
A variable in PHP is a container used to store data such as numbers, strings, arrays, or objects. The value stored in a variable can be changed or updated during the execution of the script. All variable names start with a dollar sign ($).Variables can store different data types, like integers, stri
5 min read
PHP echo and print
PHP echo and print are two most language constructs used for output data on the screen. They are not functions but constructs, meaning they do not require parentheses (though parentheses can be used with print). Both are widely used for displaying strings, variables, and HTML content in PHP scripts.
5 min read
PHP Data Types
In PHP, data types define the kind of value a variable can hold. PHP is a loosely typed language, meaning you donât need to declare the data type of a variable. It is automatically assigned based on the value. But it is important to understand data types because it is important for writing reliable,
4 min read
PHP Strings
In PHP, strings are one of the most commonly used data types. A string is a sequence of characters used to represent text, such as words and sentences. Strings are enclosed in either single quotes (' ') or double quotes (" "). You can create a string using single quotes (' ') or double quotes (" ").
4 min read
PHP Array
PHP Arrays
Arrays are one of the most important data structures in PHP. They allow you to store multiple values in a single variable. PHP arrays can hold values of different types, such as strings, numbers, or even other arrays. Understanding how to use arrays in PHP is important for working with data efficien
5 min read
Associative Arrays in PHP
Associative arrays are used to store key value pairs. For example, to store the marks of different subject of a student in an array, a numerically indexed array would not be the best choice. Instead, we could use the respective subject's names as the keys in our associative array, and the value woul
3 min read
Multidimensional arrays in PHP
Multi-dimensional arrays in PHP are arrays that store other arrays as their elements. Each dimension adds complexity, requiring multiple indices to access elements. Common forms include two-dimensional arrays (like tables) and three-dimensional arrays, useful for organizing complex, structured data.
5 min read
Sorting Arrays in PHP
Sorting arrays is one of the most common operation in programming, and PHP provides a several functions to handle array sorting. Sorting arrays in PHP can be done by values or keys, in ascending or descending order. PHP also allows you to create custom sorting functions. Table of Content Sort Array
4 min read
PHP Loops
PHP Loops
In PHP, Loops are used to repeat a block of code multiple times based on a given condition. PHP provides several types of loops to handle different scenarios, including while loops, for loops, do...while loops, and foreach loops. In this article, we will discuss the different types of loops in PHP,
4 min read
PHP while Loop
The while loop is the simple loop that executes nested statements repeatedly while the expression value is true. The expression is checked every time at the beginning of the loop, and if the expression evaluates to true then the loop is executed otherwise loop is terminated. Flowchart of While Loop:
1 min read
PHP do-while Loop
The do-while loop is very similar to the while loop, the only difference is that the do-while loop checks the expression (condition) at the end of each iteration. In a do-while loop, the loop is executed at least once when the given expression is "false". The first iteration of the loop is executed
1 min read
PHP for Loop
The for loop is the most complex loop in PHP that is used when the user knows how many times the block needs to be executed. The for loop contains the initialization expression, test condition, and update expression (expression for increment or decrement). Flowchart of for Loop: Syntax: for (initial
2 min read
PHP foreach Loop
The foreach loop in PHP is a powerful and convenient way to iterate over arrays and objects. The foreach loop though iterates over an array of elements, the execution is simplified and finishes the loop in less time comparatively. In this article, we will explore the foreach loop in detail, includin
3 min read
PHP Advanced
PHP | Superglobals
We already have discussed about variables and global variables in PHP in the post PHP | Variables and Data Types. In this article, we will learn about superglobals in PHP. These are specially-defined array variables in PHP that make it easy for you to get information about a request or its context.
5 min read
HTTP GET and POST Methods in PHP
In this article, we will know what HTTP GET and POST methods are in PHP, how to implement these HTTP methods & their usage, by understanding them through the examples. HTTP: The Hypertext Transfer Protocol (HTTP) is designed to enable communications between clients and servers. HTTP works as a r
4 min read
PHP | Regular Expressions
Regular expressions commonly known as a regex (regexes) are a sequence of characters describing a special search pattern in the form of text string. They are basically used in programming world algorithms for matching some loosely defined patterns to achieve some relevant tasks. Some times regexes a
11 min read
PHP Form Handling
Form handling is the process of collecting and processing information that users submit through HTML forms. In PHP, we use special tools called $_POST and $_GET to gather the data from the form. Which tool to use depends on how the form sends the dataâeither through the POST method (more secure, hid
4 min read
PHP Date and Time
PHP provides functions to work with dates and times, allowing developers to display the current date/time, manipulate and format dates, and perform operations like date comparisons, time zone adjustments, and more. In this article, we'll discuss PHP date and time. Why are Date and Time Important in
5 min read
Describe PHP Include and Require
In this article, we will see what include() & the require() functions is, also will know how these functions affect the execution of the code, their differences & usage in PHP, along with understanding their implementation through the examples. As we know PHP allows us to create various func
3 min read
PHP File Handling
File handling allows the developers to interact with the serverâs file system, which is essential for tasks like saving data, processing user-uploaded files, and generating dynamic content. PHP File Handling enables developers to read from and write to files, making it a core feature for web applica
5 min read
PHP | Uploading File
Have you ever wondered how websites build their system of file uploading in PHP? Here we will come to know about the file uploading process. A question which you can come up with - 'Are we able to upload any kind of file with this system?'. The answer is yes, we can upload files with different types
3 min read
PHP Cookies
Cookies in PHP are used for maintaining state and storing user-specific information across multiple page visits. Since HTTP is a stateless protocol, every page request is independent, making it difficult to remember user preferences, authentication status, or other details between visits. Cookies al
10 min read
PHP | Sessions
PHP sessions are a tool used to store user-specific data across multiple pages during a single visit to a website. They solve the problem of maintaining a state in a stateless environment, where each page load is independent. Without sessions, a website would not be able to remember user information
7 min read
PHP Callback Functions
In PHP, the callback functions are related to the dynamic behavior and flexibility in code execution. They are used to pass custom logic or functions as arguments to other functions, letting developers change how a function behaves without changing its main structure. This makes the code more reusab
5 min read