How could I use HTMLUnit to get data from a webpage generated by a java servlet. I keep getting an error when I try to read the webpage. /getSurvey is the servlet that creates the webpage but how ca开发者_运维百科n I access the HTML generated from the servlet.
final WebClient webClient = new WebClient(); final HtmlPage page = webClient.getPage("http://survey-creator.appspot.com/getSurvey");
HtmlUnit is not really "just" a HTML parser. It's kind of a programmatic webbrowser. It's intented to surf through web pages and/or fill out web forms programmatically using Java language. If your sole purpose is to get the HTML as a String, use a real HTML parser. I can recommend Jsoup for this.
String html = Jsoup.connect("http://stackoverflow.com").get().html();
That's it. It can however do much more than that, such as selecting elements of interest.
See also:
- Pros and cons of leading HTML parsers in Java
精彩评论