Parse string to get an array of the URLs

Question

Imagine I have a string - something like this:

This is some really cool text about http://google.com/ and it also contains some urls like http://apple.com/ and its very nice! This is too long and I need to do some magic stuff to fix this very big problem. Oh no.

As you can see there are two URLs in the string and somehow, assuming I need some kind of REGEX I need to get an array of those URL's so I can manipulate them. Something like this...

Array()

- [0] = 'http://google.com/'
- [1] = 'http://apple.com/'

All help appreciated :) Thanks!

possible duplicate of [extract urls from text in PHP](http://stackoverflow.com/questions/910912/extract-urls-from-text-in-php) — Gordon, Jul 13 '10 at 21:30
And since you didnt specify a language, here is a couple others: http://stackoverflow.com/search?q=extract+urls+out+of+text — Gordon, Jul 13 '10 at 21:36

score 2 · Answer 1 · answered Jul 13 '10 at 21:27

2

https?:\/\/[^\s]+

Find something that starts with http:// or https://, then pull characters until we find whitespace.

answered Jul 13 '10 at 21:27

Donald Miner

37,251
7
89
116

For some reason php preg_match said i couldnt use \ as delimiter :( – tarnfeld Jul 13 '10 at 21:32
Try just // instead of \/\/, some implementations don't require you to escape forward slashes. – Donald Miner Jul 13 '10 at 21:37

Dylan · Accepted Answer · 2010-07-13T21:49:23.543

2

I think this should also work.

$string = 'ajskldfj http://google.ca jslfkjals s http://www.apple.com jalksf';
$pattern = '/[A-Za-z]+:\/\/[A-Za-z0-9-_]+\.[A-Za-z0-9-_:%&\?\/.=]+/';
preg_match_all($pattern, $string, $matches);

edited Jul 13 '10 at 21:49

answered Jul 13 '10 at 21:29

Dylan

849
1
8
12

score 0 · Answer 3 · answered Jul 13 '10 at 21:29

0

You'll want a regex pattern like the following:

'https?:\/\/(\w*?.)?(?P<domain>\w+).(\w+)/'

answered Jul 13 '10 at 21:29

Josiah

4,566
1
18
19

Parse string to get an array of the URLs

3 Answers3