PHP | Regular Expressions
Last Updated :
10 Feb, 2022
Regular expressions commonly known as a regex (regexes) are a sequence of characters describing a special search pattern in the form of text string. They are basically used in programming world algorithms for matching some loosely defined patterns to achieve some relevant tasks. Some times regexes are understood as a mini programming language with a pattern notation which allows the users to parse text strings. The exact sequence of characters are unpredictable beforehand, so the regex helps in fetching the required strings based on a pattern definition.
Regular Expression is a compact way of describing a string pattern that matches a particular amount of text. As you know, PHP is an open-source language commonly used for website creation, it provides regular expression functions as an important tool. Like PHP, many other programming languages have their own implementation of regular expressions. This is the same with other applications also, which have their own support of regexes having various syntaxes. Many available modern languages and tools apply regexes on very large files and strings. Let us look into some of the advantages and uses of regular expressions in our applications.
Advantages and uses of Regular expressions:
In many scenarios, developers face problems whenever data are collected in free text fields as most of the programming deals with data entries. Regular expressions are used almost everywhere in today's application programming.
- Regular expressions help in validation of text strings which are of programmer's interest.
- It offers a powerful tool for analyzing, searching a pattern and modifying the text data.
- It helps in searching specific string pattern and extracting matching results in a flexible manner.
- It helps in parsing text files looking for a defined sequence of characters for further analysis or data manipulation.
- With the help of in-built regexes functions, easy and simple solutions are provided for identifying patterns.
- It effectively saves a lot of development time, which are in search of specific string pattern.
- It helps in important user information validations like email address, phone numbers and IP address.
- It helps in highlighting special keywords in a file based on search result or input.
- It helps in identifying specific template tags and replacing those data with the actual data as per the requirement.
- Regexes are very useful for creation of HTML template system recognizing tags.
- Regexes are mostly used for browser detection, spam filtration, checking password strength and form validations.
We cannot cover everything under this topic, but let us look into some of the major regular expression concepts. The following table shows some regular expressions and the corresponding string which matches the regular expression pattern.
Regular Expression |
Matches |
geeks | The string "geeks" |
^geeks | The string which starts with "geeks" |
geeks$ | The string which have "geeks" at the end. |
^geeks$ | The string where "geeks" is alone on a string. |
[abc] | a, b, or c |
[a-z] | Any lowercase letter |
[^A-Z] | Any letter which is NOT a uppercase letter |
(gif|png) | Either "gif" or "png" |
[a-z]+ | One or more lowercase letters |
^[a-zA-Z0-9]{1, }$ | Any word with at least one number or one letter |
([ax])([by]) | ab, ay, xb, xy |
[^A-Za-z0-9] | Any symbol other than a letter or other than number |
([A-Z]{3}|[0-9]{5}) | Matches three letters or five numbers |
Note: Complex search patterns can be created by applying some basic regular expression rules. Even many arithmetic operators like +, ^, - are used by regular expressions for creating little complex patterns.
Operators in Regular Expression: Let us look into some of the operators in PHP regular expressions.
Operator | Description |
^ | It denotes the start of string. |
$ | It denotes the end of string. |
. | It denotes almost any single character. |
() | It denotes a group of expressions. |
[] | It finds a range of characters for example [xyz] means x, y or z . |
[^] | It finds the items which are not in range for example [^abc] means NOT a, b or c. |
– (dash) | It finds for character range within the given item range for example [a-z] means a through z. |
| (pipe) | It is the logical OR for example x | y means x OR y. |
? | It denotes zero or one of preceding character or item range. |
* | It denotes zero or more of preceding character or item range. |
+ | It denotes one or more of preceding character or item range. |
{n} | It denotes exactly n times of preceding character or item range for example n{2}. |
{n, } | It denotes atleast n times of preceding character or item range for example n{2, }. |
{n, m} | It denotes atleast n but not more than m times for example n{2, 4} means 2 to 4 of n. |
\ | It denotes the escape character. |
Special Character Classes in Regular Expressions: Let us look into some of the special characters used in regular expressions.
Special Character | Meaning |
\n | It denotes a new line. |
\r | It denotes a carriage return. |
\t | It denotes a tab. |
\v | It denotes a vertical tab. |
\f | It denotes a form feed. |
\xxx | It denotes octal character xxx. |
\xhh | It denotes hex character hh. |
Shorthand Character Sets: Let us look into some shorthand character sets available.
Shorthand | Meaning |
\s | Matches space characters like space, newline or tab. |
\d | Matches any digit from 0 to 9. |
\w | Matches word characters including all lower and upper case letters, digits and underscore. |
Predefined functions or Regex library: Let us look into the quick cheat sheet of pre-defined functions for regular expressions in PHP. PHP provides the programmers to many useful functions to work with regular expressions.
The below listed in-built functions are case-sensitive.
Function | Definition |
preg_match() | This function searches for a specific pattern against some string. It returns true if pattern exists and false otherwise. |
preg_match_all() | This function searches for all the occurrences of string pattern against the string. This function is very useful for search and replace. |
ereg_replace() | This function searches for specific string pattern and replace the original string with the replacement string, if found. |
eregi_replace() | The function behaves like ereg_replace() provided the search for pattern is not case sensitive. |
preg_replace() | This function behaves like ereg_replace() function provided the regular expressions can be used in the pattern and replacement strings. |
preg_split() | The function behaves like the PHP split() function. It splits the string by regular expressions as its parameters. |
preg_grep() | This function searches all elements which matches the regular expression pattern and returns the output array. |
preg_quote() | This function takes string and quotes in front of every character which matches the regular expression. |
ereg() | This function searches for a string which is specified by a pattern and returns true if found, otherwise returns false. |
eregi() | This function behaves like ereg() function provided the search is not case sensitive. |
Note:
- By default, regular expressions are case sensitive.
- There is a difference between strings inside single quotes and strings inside double quotes in PHP. The former are treated literally, whereas for the strings inside double-quotes means the content of the variable is printed instead of just printing their names.
Example 1:
php
<?php
// Declare a regular expression
$regex = '/^[a-zA-Z ]*$/';
// Declare a string
$nameString = 'Sharukh khan';
// Use preg_match() function to
// search string pattern
if(preg_match($regex, $nameString)) {
echo("Name string matching with"
. " regular expression");
}
else {
echo("Only letters and white space"
. " allowed in name string");
}
?>
Output:
Name string matching with regular expression
Example 2:
php
<?php
// Declare a regular expression
$regex = "/<b>(.*)<\/b>/U";
// Declare a string
$inputString = "Name: <b>John</b> Position: <b>Developer</b>";
// Use preg_match_all() function to perform
// a global regular expression match
preg_match_all($regex, $inputString, $output);
echo $output[0][0]." <br> ".$output[0][1]."\n";
?>
Output:
John
Developer
Example 3:
php
<?php
// Declare a regular expression
$regex = "([0-9]+)";
// Declare a string
$original = "Completed graduation in 2004";
$replaceWith = "2002";
// Use ereg_replace() function to search a
// string pattern in an other string
$original = ereg_replace($regex, $replaceWith, $original);
// Display result
echo $original;
?>
Output:
Completed graduation in 2002
Example 4:
php
<?php
<?php
// Declare a string
$ip = "134.645.478.670";
// Declare a regular expression
$regex = "/\./";
// Use preg_split() function to
// convert a given string into
// an array
$output = preg_split ($regex, $ip);
echo "$output[0] <br>";
echo "$output[1] <br>";
echo "$output[2] <br>";
echo "$output[3] <br>";
?>
Output:
134
645
478
670
Metacharacters: There are two kinds of characters that are used in regular expressions these are: Regular characters and Metacharacters. Regular characters are those characters which have a
'literal' meaning and Metacharacters are those characters which have
'special' meaning in regular expression.
Metacharacter | Description | Example |
. | It matches any single character other than a new line. | /./ matches string which has a single character. |
^ | It matches the beginning of string. | /^geeks/ matches any string that starts with geeks. |
$ | It matches the string pattern at the end of the string. | /com$/ matches string ending with com for example google.com etc. |
* | It matches zero or more characters. | /com*/ matches commute, computer, compromise etc. |
+ | It matches preceding character appear atleast once. | For example /z+oom/ matches zoom. |
\ | It is used to esacape metacharacters in regex. | /google\.com/ will treat the dot as a literal value, not metacharacter. |
a-z | It matches lower case letters. | geeks |
A-Z | It matches upper case letters. | GEEKS |
0-9 | It matches any number between 0 and 9. | /0-5/ matches 0, 1, 2, 3, 4, 5 |
[...] | It matches character class. | /[pqr]/ matches pqr |
Other Examples:
Regular expression | Meaning |
^[.-a-z0-9A-Z] | It matches string with dot, dash and any lower case letters, numbers between 0 and 9 and upper case letters. |
+@[a-z0-9A-Z] | It matches string with @ symbol in the beginning followed by any lower case letters, numbers between 0 and 9 and upper case letters. |
+\.[a-z]{2, 6}$/ | It escapes the dot and then matches string with any lower case letters with string length between 2 and 6 at the end. |
Note:
- Metacharacters are very powerful in regular expressions pattern matching solutions. It handles a lot of complex pattern processing.
- Every character which is not a metacharacter is definitely a regular character.
- Every regular character matches the same character by itself.
POSIX Regular Expressions: Some regular expressions in PHP are like arithmetic expressions which are called POSIX regular expressions. Some times, complex expression are created by combining various elements or operators in regular expressions. The very basic regex is the one which matches a single character.
Lets look into some of the POSIX regular expressions.
Regex | Meaning |
[0-9] | It matches digit from 0 through 9. |
[a-z] | It matches any lowercase letter from a through z. |
[A-Z] | It matches any uppercase letter from A through Z. |
[a-Z] | It matches any lowercase letter a through uppercase letter Z. |
[:lower:] | It matches any lower case letters. |
[:upper:] | It matches any upper case letters. |
[:alpha:] | It matches all alphabetic characters or letters from a-z and A-Z. |
[[:alpha:]] | It matches any string containing alphabetic characters or letters. |
[:alnum:] | It matches all alphanumeric characters i.e all digits(0-9) and letters (a-z A-Z). |
[[:alnum:]] | It matches any string containing alphanumeric characters and digits. |
[:digit:] | It matches all the digits from 0 to 9. |
[[:digit:]] | It matches any string containing digits from 0 to 9. |
[:xdigit:] | It matches all the hexadecimal digits. |
[:punct:] | It matches all the punctuation symbols. |
[:blank:] | It matches blank characters like space and tab. |
[:space:] | It matches all whitespace characters like line breaks. |
[[:space:]] | It matches any string containing a space. |
[:cntrl:] | It matches all control characters. |
[:graph:] | It matches all visible or printed characters other than spaces and control characters. |
[:print:] | It matches all printed characters and spaces other than control characters. |
[:word:] | It matches all word characters like digits, letters and underscore. |
Quantifiers in Regular Expression: Quantifiers are special characters which tell the quantity, frequency or the number of instances or occurrence of bracketed character or group of characters. They are also called as greedy and lazy expressions. Let us look into some of the concepts and examples of quantifiers.
Quantifier | Meaning |
a+ | It matches the string containing at least one a. |
a* | It matches the string containing zero or more a's. |
a? | It matches any string containing zero or one a's. |
a{x} | It matches letter 'a' exaclty x times . |
a{2, 3} | It matches any string containing the occurrence of two or three a's. |
a{2, } | It matches any string containing the occurrence of at least two a's. |
a{2} | It matches any string containing the occurrence of exactly two a's. |
a{, y} | It matches any string containing the occurrence of not more than y number of a's. |
a$ | It matches any string with 'a' at the end of it. |
^a | It matches any string with 'a' at the beginning of it. |
[^a-zA-Z] | It matches any string pattern not having characters from a to z and A to Z. |
a.a | It matches any string pattern containing a, then any character and then another a. |
^.{3}$ | It matches any string having exactly three characters. |
Note:
- The $ character inside the expression will match the end of the target string.
- The *, ?, + symbols in a regular expression denotes the frequency of occurrence of a character. If it occurs zero or more time, zero or one time and one or more times.
- The ^ character inside the expression will match the beginning of the target string.
- The . metacharacter matches any single character other than the newline.
Commonly known regular expression engines:
Conclusion: A regular expression is a pattern that describes some string text in a particular pattern or it is defined as a pattern-matching algorithm expressed in a single line. Regular expressions are very useful in the programming world for validation checks and recognizing specific templates. PHP provides many in-built functions supporting regular expressions. Metacharacters helps in creating complex patterns.
Similar Reads
PHP Tutorial PHP is a widely used, open-source server-side scripting language primarily designed for web development. It is embedded directly into HTML and generates dynamic content on web pages. It allows developers to handle database interactions, session management, and form handling tasks.PHP code is execute
9 min read
Basics
PHP SyntaxPHP, a powerful server-side scripting language used in web development. Itâs simplicity and ease of use makes it an ideal choice for beginners and experienced developers. This article provides an overview of PHP syntax. PHP scripts can be written anywhere in the document within PHP tags along with n
4 min read
PHP VariablesA variable in PHP is a container used to store data such as numbers, strings, arrays, or objects. The value stored in a variable can be changed or updated during the execution of the script.All variable names start with a dollar sign ($).Variables can store different data types, like integers, strin
5 min read
PHP | FunctionsA function in PHP is a self-contained block of code that performs a specific task. It can accept inputs (parameters), execute a set of statements, and optionally return a value. PHP functions allow code reusability by encapsulating a block of code to perform specific tasks.Functions can accept param
8 min read
PHP LoopsIn PHP, Loops are used to repeat a block of code multiple times based on a given condition. PHP provides several types of loops to handle different scenarios, including while loops, for loops, do...while loops, and foreach loops. In this article, we will discuss the different types of loops in PHP,
4 min read
Array
PHP ArraysArrays are one of the most important data structures in PHP. They allow you to store multiple values in a single variable. PHP arrays can hold values of different types, such as strings, numbers, or even other arrays. Understanding how to use arrays in PHP is important for working with data efficien
5 min read
PHP Associative ArraysAn associative array in PHP is a special array where each item has a name or label instead of just a number. Usually, arrays use numbers to find things. For example, the first item is at position 0, the second is 1, and so on. But in an associative array, we use words or names to find things. These
4 min read
Multidimensional arrays in PHPMulti-dimensional arrays in PHP are arrays that store other arrays as their elements. Each dimension adds complexity, requiring multiple indices to access elements. Common forms include two-dimensional arrays (like tables) and three-dimensional arrays, useful for organizing complex, structured data.
5 min read
Sorting Arrays in PHPSorting arrays is one of the most common operation in programming, and PHP provides a several functions to handle array sorting. Sorting arrays in PHP can be done by values or keys, in ascending or descending order. PHP also allows you to create custom sorting functions.Table of ContentSort Array in
4 min read
OOPs & Interfaces
MySQL Database
PHP | MySQL Database IntroductionWhat is MySQL? MySQL is an open-source relational database management system (RDBMS). It is the most popular database system used with PHP. MySQL is developed, distributed, and supported by Oracle Corporation. The data in a MySQL database are stored in tables which consists of columns and rows.MySQL
4 min read
PHP Database connectionThe collection of related data is called a database. XAMPP stands for cross-platform, Apache, MySQL, PHP, and Perl. It is among the simple light-weight local servers for website development. Requirements: XAMPP web server procedure: Start XAMPP server by starting Apache and MySQL. Write PHP script f
2 min read
PHP | MySQL ( Creating Database )What is a database? Database is a collection of inter-related data which helps in efficient retrieval, insertion and deletion of data from database and organizes the data in the form of tables, views, schemas, reports etc. For Example, university database organizes the data about students, faculty,
3 min read
PHP | MySQL ( Creating Table )What is a table? In relational databases, and flat file databases, a table is a set of data elements using a model of vertical columns and horizontal rows, the cell being the unit where a row and column intersect. A table has a specified number of columns, but can have any number of rows. Creating a
3 min read
PHP Advance
PHP SuperglobalsPHP superglobals are predefined variables that are globally available in all scopes. They are used to handle different types of data, such as input data, server data, session data, and more. These superglobal arrays allow developers to easily work with these global data structures without the need t
6 min read
PHP | Regular ExpressionsRegular expressions commonly known as a regex (regexes) are a sequence of characters describing a special search pattern in the form of text string. They are basically used in programming world algorithms for matching some loosely defined patterns to achieve some relevant tasks. Some times regexes a
12 min read
PHP Form HandlingForm handling is the process of collecting and processing information that users submit through HTML forms. In PHP, we use special tools called $_POST and $_GET to gather the data from the form. Which tool to use depends on how the form sends the dataâeither through the POST method (more secure, hid
4 min read
PHP File HandlingIn PHP, File handling is the process of interacting with files on the server, such as reading files, writing to a file, creating new files, or deleting existing ones. File handling is essential for applications that require the storage and retrieval of data, such as logging systems, user-generated c
4 min read
PHP | Uploading FileHave you ever wondered how websites build their system of file uploading in PHP? Here we will come to know about the file uploading process. A question which you can come up with - 'Are we able to upload any kind of file with this system?'. The answer is yes, we can upload files with different types
3 min read
PHP CookiesA cookie is a small text file that is stored in the user's browser. Cookies are used to store information that can be retrieved later, making them ideal for scenarios where you need to remember user preferences, such as:User login status (keeping users logged in between sessions)Language preferences
9 min read
PHP | SessionsA session in PHP is a mechanism that allows data to be stored and accessed across multiple pages on a website. When a user visits a website, PHP creates a unique session ID for that user. This session ID is then stored as a cookie in the user's browser (by default) or passed via the URL. The session
7 min read