开发者

crawler get external website search result

开发者 https://www.devze.com 2022-12-13 23:21 出处:网络
What is the best practice and library I can use to key in search textbox on external website and collect the search result?
  1. What is the best practice and library I can use to key in search textbox on external website and collect the search result?

  2. How do tackle website with different search box and checkbox and collect the result?

  3. Can Selenium be used to automate this?

  4. Should I us开发者_如何学JAVAe Heritrix or nutch? Which one is better? I heard nutch comes with plugins. Which one has a bigger community?


you can use:

  • The Selenium API
  • HtmlUnit
  • Htmlparser

etc.

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号