Regex 速查表

Q: 什麼是正則表達式（regex）？

正則表達式是定義搜尋模式的字元序列。用於字串搜尋、匹配和操作。Regex在JavaScript、Python、Java、PHP、Go和大多數現代程式語言中都有原生支援。

Q: 貪婪量詞和惰性量詞有什麼區別？

貪婪量詞（*, +, {n,m}）盡可能多地匹配字元。惰性量詞（*?, +?, {n,m}?）盡可能少地匹配字元。

Q: 什麼是lookahead和lookbehind？

Lookahead（?=...）和lookbehind（?<=...）是零寬度斷言——它們在不消耗字元的情況下檢查模式。正向lookahead（?=foo）斷言'foo'在後面跟隨。

Q: 正則表達式模式在所有語言中都相同嗎？

核心語法（字元類、量詞、錨點）在大多數語言中是一致的。但是，lookbehind、命名組和特定標誌等功能因引擎而異。

包含語法、標誌、分組、lookahead和實際模式範例的完整正則表達式參考。

在Regex測試器中嘗試 →

字元類

語法	描述	範例	匹配
`.`	Any character except newline	`h.t`	hat, hit, hot
`\d`	Digit [0-9]	`\d+`	123, 42, 7
`\D`	Non-digit [^0-9]	`\D+`	abc, foo, !
`\w`	Word char [a-zA-Z0-9_]	`\w+`	hello, foo_bar
`\W`	Non-word character	`\W+`	!@#, spaces
`\s`	Whitespace (space, tab, newline)	`a\sb`	'a b', 'a\tb'
`\S`	Non-whitespace	`\S+`	hello, 123
`[abc]`	One of a, b, or c	`[aeiou]`	'a', 'e', 'i'
`[^abc]`	Not a, b, or c	`[^aeiou]`	'b', 'c', '1'
`[a-z]`	Character in range a–z	`[a-f]`	'a', 'b', 'f'
`[a-zA-Z]`	Letter (any case)	`[a-zA-Z]+`	hello, World
`\p{L}`	Any Unicode letter (u flag)	`\p{L}+`	café, 你好

錨點與邊界

語法	描述	範例	匹配
`^`	Start of string (or line with m flag)	`^Hello`	Hello world
`$`	End of string (or line with m flag)	`world$`	Hello world
`\b`	Word boundary	`\bcat\b`	'cat' but not 'catfish'
`\B`	Non-word boundary	`\Bcat`	'catfish' (not standalone 'cat')
`\A`	Start of string (Python/Java)	`\AHello`	Hello world
`\Z`	End of string (Python/Java)	`world\Z`	Hello world

量詞

語法	描述	範例	匹配
`*`	0 or more (greedy)	`a*`	'', 'a', 'aaa'
`+`	1 or more (greedy)	`a+`	'a', 'aaa'
`?`	0 or 1 (optional)	`colou?r`	color, colour
`{n}`	Exactly n times	`\d{4}`	'2024', '1999'
`{n,}`	n or more times	`\d{2,}`	'12', '1234'
`{n,m}`	Between n and m times	`\d{2,4}`	'12', '123', '1234'
`*?`	0 or more (lazy)	`<.*?>`	'<b>' from '<b>text</b>'
`+?`	1 or more (lazy)	`<.+?>`	'<b>' from '<b>text</b>'
`??`	0 or 1 (lazy)	`a??b`	'b' or 'ab'

分組與引用

語法	描述	範例	匹配
`(abc)`	Capturing group	`(\d+)-(\d+)`	Captures both numbers in '2024-01'
`(?:abc)`	Non-capturing group	`(?:foo\|bar)baz`	'foobaz', 'barbaz'
`(?<name>abc)`	Named capturing group	`(?<year>\d{4})`	Named capture of year
`\1`	Backreference to group 1	`(\w+)\s\1`	'the the', 'word word'
`\k<name>`	Named backreference	`(?<tag>\w+).*\k<tag>`	Matching HTML tags
`(?\|...)`	Branch reset group (PCRE)	`(?\|(a)\|(b))`	Both captured in group 1

Lookahead與Lookbehind

語法	描述	範例	匹配
`(?=abc)`	Positive lookahead — must be followed by abc	`\d+(?= dollars)`	'100' in '100 dollars'
`(?!abc)`	Negative lookahead — must NOT be followed by abc	`\d+(?! dollars)`	'100' in '100 euros'
`(?<=abc)`	Positive lookbehind — must be preceded by abc	`(?<=\$)\d+`	'100' in '$100'
`(?<!abc)`	Negative lookbehind — must NOT be preceded by abc	`(?<!\$)\d+`	'100' in '100 items' but not '$100'

標誌（修飾符）

標誌	名稱	描述
`g`	Global	Find all matches, not just the first
`i`	Case-insensitive	Match regardless of case (A = a)
`m`	Multiline	^ and $ match start/end of each line
`s`	Dotall	. matches newline characters too
`u`	Unicode	Enable full Unicode support (\p{} classes)
`y`	Sticky	Match only at lastIndex position (JS)
`x`	Extended	Allow whitespace + comments in pattern (Python/PHP)

常用模式

模式	Regex	測試
Email	`[a-zA-Z0-9._%+\-]+@[a-zA-Z0-9.\-]+\.[a-zA-Z]{2,}`	在Regex測試器中嘗試 →
URL	`https?:\/\/[\w\-]+(\.[\w\-]+)+[\w\-._~:/?#[\]@!$&'()+,;=%]`	在Regex測試器中嘗試 →
IPv4 Address	`\b(?:(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)\.){3}(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)\b`	在Regex測試器中嘗試 →
Phone (US)	`\+?1?[\s.-]?$?\d{3}$?[\s.-]?\d{3}[\s.-]?\d{4}`	在Regex測試器中嘗試 →
Date (YYYY-MM-DD)	`\d{4}-(?:0[1-9]\|1[0-2])-(?:0[1-9]\|[12]\d\|3[01])`	在Regex測試器中嘗試 →
Time (HH:MM)	`(?:[01]\d\|2[0-3]):[0-5]\d`	在Regex測試器中嘗試 →
UUID	`[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}`	在Regex測試器中嘗試 →
Hex Color	`#(?:[0-9a-fA-F]{3}){1,2}\b`	在Regex測試器中嘗試 →
Slug	`[a-z0-9]+(?:-[a-z0-9]+)*`	在Regex測試器中嘗試 →
ZIP Code (US)	`\d{5}(?:-\d{4})?`	在Regex測試器中嘗試 →
Credit Card	`(?:4[0-9]{12}(?:[0-9]{3})?\|5[1-5][0-9]{14}\|3[47][0-9]{13})`	在Regex測試器中嘗試 →
HTML Tag	`<([a-zA-Z][a-zA-Z0-9])(?:\s[^>])?\/?>.*?<\/\1>`	在Regex測試器中嘗試 →
Markdown Bold	`\\([^]+)\\*`	在Regex測試器中嘗試 →
JWT Token	`[A-Za-z0-9-_]+\.[A-Za-z0-9-_]+\.[A-Za-z0-9-_]+`	在Regex測試器中嘗試 →
Semver	`\bv?(?:0\|[1-9]\d)\.(?:0\|[1-9]\d)\.(?:0\|[1-9]\d*)(?:-[\w.]+)?\b`	在Regex測試器中嘗試 →
GitHub Username	`(?<=github\.com\/)([a-zA-Z0-9](?:[a-zA-Z0-9-]{0,37}[a-zA-Z0-9])?)`	在Regex測試器中嘗試 →
Positive Integer	`^[1-9]\d*$`	在Regex測試器中嘗試 →
Float / Decimal	`-?\d+(?:\.\d+)?`	在Regex測試器中嘗試 →
Blank Lines	`^\s*$`	在Regex測試器中嘗試 →
Duplicate Words	`\b(\w+)\s+\1\b`	在Regex測試器中嘗試 →

常見問題

什麼是正則表達式（regex）？

正則表達式是定義搜尋模式的字元序列。用於字串搜尋、匹配和操作。Regex在JavaScript、Python、Java、PHP、Go和大多數現代程式語言中都有原生支援。

貪婪量詞和惰性量詞有什麼區別？

貪婪量詞（*, +, {n,m}）盡可能多地匹配字元。惰性量詞（*?, +?, {n,m}?）盡可能少地匹配字元。

什麼是lookahead和lookbehind？

Lookahead（?=...）和lookbehind（?<=...）是零寬度斷言——它們在不消耗字元的情況下檢查模式。正向lookahead（?=foo）斷言'foo'在後面跟隨。

正則表達式模式在所有語言中都相同嗎？

核心語法（字元類、量詞、錨點）在大多數語言中是一致的。但是，lookbehind、命名組和特定標誌等功能因引擎而異。

Regex 速查表

包含語法、標誌、分組、lookahead和實際模式範例的完整正則表達式參考。

在Regex測試器中嘗試 →

字元類

語法	描述	範例	匹配
`.`	Any character except newline	`h.t`	hat, hit, hot
`\d`	Digit [0-9]	`\d+`	123, 42, 7
`\D`	Non-digit [^0-9]	`\D+`	abc, foo, !
`\w`	Word char [a-zA-Z0-9_]	`\w+`	hello, foo_bar
`\W`	Non-word character	`\W+`	!@#, spaces
`\s`	Whitespace (space, tab, newline)	`a\sb`	'a b', 'a\tb'
`\S`	Non-whitespace	`\S+`	hello, 123
`[abc]`	One of a, b, or c	`[aeiou]`	'a', 'e', 'i'
`[^abc]`	Not a, b, or c	`[^aeiou]`	'b', 'c', '1'
`[a-z]`	Character in range a–z	`[a-f]`	'a', 'b', 'f'
`[a-zA-Z]`	Letter (any case)	`[a-zA-Z]+`	hello, World
`\p{L}`	Any Unicode letter (u flag)	`\p{L}+`	café, 你好

錨點與邊界

語法	描述	範例	匹配
`^`	Start of string (or line with m flag)	`^Hello`	Hello world
`$`	End of string (or line with m flag)	`world$`	Hello world
`\b`	Word boundary	`\bcat\b`	'cat' but not 'catfish'
`\B`	Non-word boundary	`\Bcat`	'catfish' (not standalone 'cat')
`\A`	Start of string (Python/Java)	`\AHello`	Hello world
`\Z`	End of string (Python/Java)	`world\Z`	Hello world

量詞

語法	描述	範例	匹配
`*`	0 or more (greedy)	`a*`	'', 'a', 'aaa'
`+`	1 or more (greedy)	`a+`	'a', 'aaa'
`?`	0 or 1 (optional)	`colou?r`	color, colour
`{n}`	Exactly n times	`\d{4}`	'2024', '1999'
`{n,}`	n or more times	`\d{2,}`	'12', '1234'
`{n,m}`	Between n and m times	`\d{2,4}`	'12', '123', '1234'
`*?`	0 or more (lazy)	`<.*?>`	'<b>' from '<b>text</b>'
`+?`	1 or more (lazy)	`<.+?>`	'<b>' from '<b>text</b>'
`??`	0 or 1 (lazy)	`a??b`	'b' or 'ab'

分組與引用

語法	描述	範例	匹配
`(abc)`	Capturing group	`(\d+)-(\d+)`	Captures both numbers in '2024-01'
`(?:abc)`	Non-capturing group	`(?:foo\|bar)baz`	'foobaz', 'barbaz'
`(?<name>abc)`	Named capturing group	`(?<year>\d{4})`	Named capture of year
`\1`	Backreference to group 1	`(\w+)\s\1`	'the the', 'word word'
`\k<name>`	Named backreference	`(?<tag>\w+).*\k<tag>`	Matching HTML tags
`(?\|...)`	Branch reset group (PCRE)	`(?\|(a)\|(b))`	Both captured in group 1

Lookahead與Lookbehind

語法	描述	範例	匹配
`(?=abc)`	Positive lookahead — must be followed by abc	`\d+(?= dollars)`	'100' in '100 dollars'
`(?!abc)`	Negative lookahead — must NOT be followed by abc	`\d+(?! dollars)`	'100' in '100 euros'
`(?<=abc)`	Positive lookbehind — must be preceded by abc	`(?<=\$)\d+`	'100' in '$100'
`(?<!abc)`	Negative lookbehind — must NOT be preceded by abc	`(?<!\$)\d+`	'100' in '100 items' but not '$100'

標誌（修飾符）

標誌	名稱	描述
`g`	Global	Find all matches, not just the first
`i`	Case-insensitive	Match regardless of case (A = a)
`m`	Multiline	^ and $ match start/end of each line
`s`	Dotall	. matches newline characters too
`u`	Unicode	Enable full Unicode support (\p{} classes)
`y`	Sticky	Match only at lastIndex position (JS)
`x`	Extended	Allow whitespace + comments in pattern (Python/PHP)

常用模式

模式	Regex	測試
Email	`[a-zA-Z0-9._%+\-]+@[a-zA-Z0-9.\-]+\.[a-zA-Z]{2,}`	在Regex測試器中嘗試 →
URL	`https?:\/\/[\w\-]+(\.[\w\-]+)+[\w\-._~:/?#[\]@!$&'()+,;=%]`	在Regex測試器中嘗試 →
IPv4 Address	`\b(?:(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)\.){3}(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)\b`	在Regex測試器中嘗試 →
Phone (US)	`\+?1?[\s.-]?$?\d{3}$?[\s.-]?\d{3}[\s.-]?\d{4}`	在Regex測試器中嘗試 →
Date (YYYY-MM-DD)	`\d{4}-(?:0[1-9]\|1[0-2])-(?:0[1-9]\|[12]\d\|3[01])`	在Regex測試器中嘗試 →
Time (HH:MM)	`(?:[01]\d\|2[0-3]):[0-5]\d`	在Regex測試器中嘗試 →
UUID	`[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}`	在Regex測試器中嘗試 →
Hex Color	`#(?:[0-9a-fA-F]{3}){1,2}\b`	在Regex測試器中嘗試 →
Slug	`[a-z0-9]+(?:-[a-z0-9]+)*`	在Regex測試器中嘗試 →
ZIP Code (US)	`\d{5}(?:-\d{4})?`	在Regex測試器中嘗試 →
Credit Card	`(?:4[0-9]{12}(?:[0-9]{3})?\|5[1-5][0-9]{14}\|3[47][0-9]{13})`	在Regex測試器中嘗試 →
HTML Tag	`<([a-zA-Z][a-zA-Z0-9])(?:\s[^>])?\/?>.*?<\/\1>`	在Regex測試器中嘗試 →
Markdown Bold	`\\([^]+)\\*`	在Regex測試器中嘗試 →
JWT Token	`[A-Za-z0-9-_]+\.[A-Za-z0-9-_]+\.[A-Za-z0-9-_]+`	在Regex測試器中嘗試 →
Semver	`\bv?(?:0\|[1-9]\d)\.(?:0\|[1-9]\d)\.(?:0\|[1-9]\d*)(?:-[\w.]+)?\b`	在Regex測試器中嘗試 →
GitHub Username	`(?<=github\.com\/)([a-zA-Z0-9](?:[a-zA-Z0-9-]{0,37}[a-zA-Z0-9])?)`	在Regex測試器中嘗試 →
Positive Integer	`^[1-9]\d*$`	在Regex測試器中嘗試 →
Float / Decimal	`-?\d+(?:\.\d+)?`	在Regex測試器中嘗試 →
Blank Lines	`^\s*$`	在Regex測試器中嘗試 →
Duplicate Words	`\b(\w+)\s+\1\b`	在Regex測試器中嘗試 →

常見問題

什麼是正則表達式（regex）？

正則表達式是定義搜尋模式的字元序列。用於字串搜尋、匹配和操作。Regex在JavaScript、Python、Java、PHP、Go和大多數現代程式語言中都有原生支援。

貪婪量詞和惰性量詞有什麼區別？

貪婪量詞（*, +, {n,m}）盡可能多地匹配字元。惰性量詞（*?, +?, {n,m}?）盡可能少地匹配字元。

什麼是lookahead和lookbehind？

Lookahead（?=...）和lookbehind（?<=...）是零寬度斷言——它們在不消耗字元的情況下檢查模式。正向lookahead（?=foo）斷言'foo'在後面跟隨。

正則表達式模式在所有語言中都相同嗎？

核心語法（字元類、量詞、錨點）在大多數語言中是一致的。但是，lookbehind、命名組和特定標誌等功能因引擎而異。

Regex 速查表

字元類

錨點與邊界

量詞

分組與引用

Lookahead與Lookbehind

標誌（修飾符）

常用模式

常見問題

什麼是正則表達式（regex）？

貪婪量詞和惰性量詞有什麼區別？

什麼是lookahead和lookbehind？

正則表達式模式在所有語言中都相同嗎？

References & cheat sheets

Regex 速查表

字元類

錨點與邊界

量詞

分組與引用

Lookahead與Lookbehind

標誌（修飾符）

常用模式

常見問題

什麼是正則表達式（regex）？

貪婪量詞和惰性量詞有什麼區別？

什麼是lookahead和lookbehind？

正則表達式模式在所有語言中都相同嗎？