0

I would like to scrape innertext from html block using regex.

Here is the block along with my best parsing effort which is far from perfect: http://www.regexr.com/3andc

What I would like to extract is the string "10 Extremely Popular Youtubers Who Are Now Millionaires"

Thanks.

Andy Lester
  • 86,927
  • 13
  • 98
  • 148
hvs
  • 478
  • 5
  • 18

1 Answers1

2

Please do not use regex to scrape HTML... RegEx match open tags except XHTML self-contained tags

maybe use HTMLAgilityPack? https://htmlagilitypack.codeplex.com/

Edit :

Then try this : http://regexr.com/3ane7

Community
  • 1
  • 1
Bruno
  • 4,650
  • 5
  • 53
  • 96
  • Not an option in this case. I am using a proprietary piece of software that will only accept regex input – hvs Mar 30 '15 at 18:18