1

Possible Duplicates:
What HTML parsing libraries do you recommend in Java
Need a good HTML parser on php
What are the pros and cons of the leading Java HTML parsers?

Hello, I have used http://simplehtmldom.sourceforge.net/ for parsing HTML in PHP. Is there anything similar to that for Java?

Community
    I think at least 5 relevant questions were shown in "Related questions" for this title. – Bozho Aug 02 '10 at 11:14

5 Answers

2

IMHO, jsoup is the best HTML parser.

chedine
0

There are a few listed at http://java-source.net/open-source/html-parsers; of these, JTidy is very popular among Java developers.

Behrang
0

The Mozilla Java Html Parser may be of interest to you.

Colin Pickard
0

You can use Jericho HTML Parser or Mozilla Java Html Parser.

Neel
0

The TagSoup parser was a really good choice for me because HTML is, depending on its author, not always well-formed. Since most parsers expect well-formed XML (or HTML, for that matter), I needed TagSoup to be able to parse all the HTML that was fed to my application.
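To illustrate the well-formedness problem, here is a minimal sketch that uses only the JDK's built-in XML parser, not TagSoup itself. The class name `StrictParseDemo` and the helper `isWellFormed` are made up for this example. It shows a strict parser accepting `<br/>` but rejecting a bare `<br>`, which is perfectly ordinary in real-world HTML.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class: checks whether markup survives a strict XML parse.
class StrictParseDemo {

    // Returns true only if the JDK XML parser accepts the markup as well-formed.
    static boolean isWellFormed(String markup) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors instead of printing them to stderr.
            builder.setErrorHandler(new DefaultHandler());
            builder.parse(new InputSource(new StringReader(markup)));
            return true;
        } catch (SAXException e) {
            // The parser rejected the input as not well-formed.
            return false;
        } catch (ParserConfigurationException | IOException e) {
            throw new IllegalStateException("Unexpected parser setup or I/O failure", e);
        }
    }

    public static void main(String[] args) {
        // Self-closing <br/> is well-formed XML.
        System.out.println(isWellFormed("<p>Hello<br/>world</p>"));
        // A bare <br> is common in HTML but is never closed, so a strict parser fails.
        System.out.println(isWellFormed("<p>Hello<br>world</p>"));
    }
}
```

A lenient parser such as TagSoup is meant for exactly the second kind of input, which is why a strict XML parser alone was not enough in my case.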

f1sh