Please explain this regex

Question

I have a regex which reads:

@"<img\s*[^>]*>(?:\s*?</img>)?"

Can someone please explain this part: (?:\s*?</img>)?

What is that?

Have you tried downloading one of the free tools (like Expresso 3.0) that explain regexes? — Mitch Wheat, Nov 16 '09 at 02:47
http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags/1732454#1732454 — Amarghosh, Nov 16 '09 at 12:17

score 9 · Accepted Answer · answered Nov 16 '09 at 02:48

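Match, but don't capture, any amount of whitespace followed by a closing image tag, zero or one times:

(?: = start a group that matches but doesn't capture

\s*? = any amount of whitespace, including none (lazy, i.e. not greedy)

</img> = the literal closing image tag

)? = end of the group; the group as a whole may occur zero or one times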
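
The breakdown above can be checked with a minimal sketch. The `@"..."` prefix in the question suggests C#, so using Python's `re` module here is an assumption made for illustration; this pattern uses only syntax that both engines share. The sample strings are made up.

```python
import re

# The pattern from the question, written as a Python raw string.
IMG = re.compile(r"<img\s*[^>]*>(?:\s*?</img>)?")

# The optional group is present: whitespace plus the closing tag join the match.
with_close = IMG.search("<img src='a.png'>  </img>")

# The optional group is absent: the opening tag still matches on its own.
without_close = IMG.search("<img src='a.png'> some text")

print(repr(with_close.group(0)))
print(repr(without_close.group(0)))
print(with_close.groups())  # empty tuple: (?:...) creates no numbered group
```

The first match should span the whole string and the second only the opening tag, which is what "zero or one times" means for the trailing group.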

:)

Luke Schafer

score 1 · Answer 2 · answered Nov 16 '09 at 02:48

(?:\s*?) matches any whitespace, if it exists, after the image tag. The ?: at the beginning tells the regex engine not to capture that group (meaning its text won't be returned as a separate group in the match results).
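
To see what "not capturing" changes in practice, here is a small sketch comparing the question's pattern with a capturing variant. It assumes Python's `re` module rather than the C# the question appears to use, and the capturing variant is a hypothetical added only for comparison.

```python
import re

text = "<img> </img>"

# Non-capturing group, as in the question: nothing is stored for the group.
non_capturing = re.search(r"<img\s*[^>]*>(?:\s*?</img>)?", text)

# Hypothetical variant with a plain capturing group, for comparison only.
capturing = re.search(r"<img\s*[^>]*>(\s*?</img>)?", text)

print(non_capturing.groups())  # no numbered groups
print(capturing.groups())      # the group's text is stored as group 1
print(non_capturing.group(0) == capturing.group(0))  # same overall match
```

Both versions match the same text overall; the only difference is whether the group's text is also made available as a numbered group.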

brianreavis

score 0 · Answer 3 · answered Nov 16 '09 at 02:49

A non-capturing group matching any number of whitespace characters followed by a closing img tag.

benPearce

score 0 · Answer 4 · answered Nov 16 '09 at 03:07

The entire expression will capture any <img> tags that have corresponding </img> tags (but it won't capture the close tags). It doesn't capture the close tags because the (?:) syntax means "match but don't capture".

Some restrictions that are part of this regex:

The \s* in the opening tag is redundant because [^>]* will capture this too
Only whitespace is allowed between the opening and closing tags
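
The redundancy of `\s*` can be checked with a short sketch. It assumes Python's `re` module; the variant without `\s*` and the sample strings are hypothetical and only serve the comparison.

```python
import re

with_ws = re.compile(r"<img\s*[^>]*>(?:\s*?</img>)?")
# Hypothetical simplified variant with the \s* removed.
without_ws = re.compile(r"<img[^>]*>(?:\s*?</img>)?")

samples = [
    "<img>",
    "<img   src='a.png'>",
    "<img\n alt='x'> </img>",
    "<imgabc></img>",
    "<p>no image</p>",
]

def matched_text(pattern, s):
    """Return the text of the overall match, or None if nothing matched."""
    m = pattern.search(s)
    return m.group(0) if m else None

# True for a sample when both patterns agree on it.
results = [matched_text(with_ws, s) == matched_text(without_ws, s) for s in samples]
print(results)
```

Whitespace characters are not `>`, so `[^>]*` already accepts them, and the two patterns should agree on every sample.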

Some examples:

<img> will not match
<img></img> will match, but only capture <img>
<img attr="123"></img> will match, but only capture <img attr="123">
<imgabc></img> will not match
<img> </img> will match, but only capture <img>
<img>ab</img> will not match
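
These examples are easy to test directly. The sketch below assumes Python's `re` module instead of the C# the question appears to use; the pattern relies only on constructs that mean the same thing in both. The helper name `full_match` is made up for illustration.

```python
import re

IMG = re.compile(r"<img\s*[^>]*>(?:\s*?</img>)?")

examples = [
    "<img>",
    "<img></img>",
    '<img attr="123"></img>',
    "<imgabc></img>",
    "<img> </img>",
    "<img>ab</img>",
]

def full_match(s):
    """Return the text of the overall match, or None if nothing matched."""
    m = IMG.search(s)
    return m.group(0) if m else None

matched = {s: full_match(s) for s in examples}

for s, text in matched.items():
    print(f"{s!r:28} -> {text!r}")
```

Run this way, every example produces a match. The trailing group is optional, so `<img>` alone matches; `[^>]*` accepts `abc`, so `<imgabc></img>` matches in full; and for `<img>ab</img>` only the opening tag is matched. Where a closing tag does follow, it is part of the overall match, because `(?:...)` only prevents a separate numbered group from being created.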

I highly recommend the Regular Expression Designer, available for free at www.radsoftware.com.au, for testing regexes.

Wrong: the ? after the final group makes the group optional, meaning things like <img> will match — Luke Schafer, Nov 16 '09 at 04:27
