Cara menggunakan python regex options

A special sequence is a \ followed by one of the characters in the list below, and has a special meaning:

CharacterDescriptionExampleTry it\AReturns a match if the specified characters are at the beginning of the string"\AThe"Try it »\bReturns a match where the specified characters are at the beginning or at the end of a word
(the "r" in the beginning is making sure that the string is being treated as a "raw string")r"\bain"
r"ain\b"Try it »
Try it »\BReturns a match where the specified characters are present, but NOT at the beginning (or at the end) of a word
(the "r" in the beginning is making sure that the string is being treated as a "raw string")r"\Bain"
r"ain\B"Try it »
Try it »\dReturns a match where the string contains digits (numbers from 0-9)"\d"Try it »\DReturns a match where the string DOES NOT contain digits"\D"Try it »\sReturns a match where the string contains a white space character"\s"Try it »\SReturns a match where the string DOES NOT contain a white space character"\S"Try it »\wReturns a match where the string contains any word characters (characters from a to Z, digits from 0-9, and the underscore _ character)"\w"Try it »\WReturns a match where the string DOES NOT contain any word characters"\W"Try it »\ZReturns a match if the specified characters are at the end of the string"Spain\Z"Try it »

Sets

A set is a set of characters inside a pair of square brackets [] with a special meaning:

SetDescriptionTry it[arn]Returns a match where one of the specified characters (a, r, or n) is presentTry it »[a-n]Returns a match for any lower case character, alphabetically between a and nTry it »[^arn]Returns a match for any character EXCEPT a, r, and nTry it »[0123]Returns a match where any of the specified digits (re4, re5, re6, or re7) are presentTry it »[0-9]Returns a match for any digit between re4 and re9Try it »[0-5][0-9]Returns a match for any two-digit numbers from re0 and re1Try it »[a-zA-Z]Returns a match for any character alphabetically between a and re3, lower case OR upper caseTry it »[+]In sets, re4, re5, re6, re7, re8, re9,re0 has no special meaning, so re1 means: return a match for any re4 character in the stringTry it »

The findall() Function

The re3 function returns a list containing all matches.

Example

Print a list of all matches:

import re

txt = "The rain in Spain"
x = re.findall("ai", txt)
print(x)

Try it Yourself »

The list contains the matches in the order they are found.

If no matches are found, an empty list is returned:

Example

Return an empty list if no match was found:

import re

txt = "The rain in Spain"
x = re.findall("Portugal", txt)
print(x)

Try it Yourself »


The search() Function

The re4 function searches the string for a match, and returns a if there is a match.

If there is more than one match, only the first occurrence of the match will be returned:

Example

Search for the first white-space character in the string:

import re

txt = "The rain in Spain"
x = re.search("\s", txt)

print("The first white-space character is located in position:", x.start())

Try it Yourself »

If no matches are found, the value re5 is returned:

Example

Make a search that returns no match:

import re

txt = "The rain in Spain"
x = re.search("Portugal", txt)
print(x)

Try it Yourself »


The split() Function

The re6 function returns a list where the string has been split at each match:

Example

Split at each white-space character:

import re

txt = "The rain in Spain"
x = re.split("\s", txt)
print(x)

Try it Yourself »

You can control the number of occurrences by specifying the re7 parameter:

Example

Split the string only at the first occurrence:

import re

txt = "The rain in Spain"
x = re.split("\s", txt, 1)
print(x)

Try it Yourself »


The sub() Function

The re8 function replaces the matches with the text of your choice:

Example

Replace every white-space character with the number 9:

import re

txt = "The rain in Spain"
x = re.sub("\s", "9", txt)
print(x)

Try it Yourself »

You can control the number of replacements by specifying the re9 parameter:

Example

Replace the first 2 occurrences:

import re

txt = "The rain in Spain"
x = re.sub("\s", "9", txt, 2)
print(x)

Try it Yourself »


Match Object

A Match Object is an object containing information about the search and the result.

Note: If there is no match, the value re5 will be returned, instead of the Match Object.

Example

Do a search that will return a Match Object:

import re

txt = "The rain in Spain"
x = re.search("ai", txt)
print(x) #this will print an object

Try it Yourself »

The Match object has properties and methods used to retrieve information about the search, and the result:

Modul apa dalam python untuk menjalankan regex?

Regex dalam python dapat diakses menggunakan modul re, dimana itu adalah bagian dari library standar. Setelah kita mendefiniskan regex, fungsi re.match dapat digunakan untuk menentukan apakah cocok dengan awal sebuah string.

Apa itu python regex?

Regex merupakan singkatan dari Regular Expression yang merupakan serangkaian karakter yang mendefinisikan sebuah pola pencarian. Beberapa bidang yang menggunakan metode ini adalah seperti Natural Language Processing (NLP), Text Mining, Data Validation, Finding Pattern, Anomaly Detection dan lainnya.

Apa itu regex Pattern?

Jadi intinya: Regex itu adalah sebuah teks dalam bentuk pola untuk pencarian dan banyak dipakai untuk pencocokan, pencarian, dan manipulasi teks.

Apa kepanjangan dari regexp?

Singkatan regex dan regexp (regular expression) menunjukkan Regular Expression yang digunakan dalam ilmu komputer teoritis, pemrograman, pengembangan perangkat lunak, pengolah kata dan optimisasi mesin pencari.