Possible Duplicate:
Is there a validating HTML parser implemented in Java?
Hi,
Is there is any API which parse the HTML text using java.
All the function should in the format of Objects
e.g. In the following text i want to Parse the HTML file and parser should return me the list of tags , attribute ..
<开发者_Go百科HTML>
<BODY>
<INPUT TYPE="text" value="100">
</BODY>
</HTML>
Thanks
Comprehensive list here
Refer to HTML/XML Parser for Java and Is there a validating HTML parser implemented in Java? and finally Which HTML Parser is the best?
These should answer your question nicely.
Regex's should work just fine.... cough
精彩评论