How to index my website [closed]

Devze (https://www.devze.com), 2023-03-11 06:35. Source: web
Closed. This question needs to be more focused. It is not currently accepting answers.

Want to improve this question? Update the question so it focuses on one problem only by editing this post.

Closed 4 years ago.

I just came across Elasticsearch. I have installed it and tried some sample indexing and searches. Now I want to use it for my website. How do I create indexes for my website? Do I need to store the entire content of each web page in the index?

Thanks, Nehatha


ES doesn't come with a built-in crawler, so you will need to write one (or reuse one) to index websites. There are plenty of F/OSS crawlers/spiders you can use, Nutch being one of them.

Here is a thread that might be of interest: http://elasticsearch-users.115913.n3.nabble.com/Suggestions-for-places-to-start-for-a-crawler-td900225.html

If your needs are lightweight, you can easily hack together a crawler in your fave language within a few hours.
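
As a rough illustration of what such a hacked-together crawler might look like, here is a minimal Python sketch using only the standard library. Everything specific in it is an assumption, not something from the question: an Elasticsearch node at http://localhost:9200, a hypothetical index named "website", the `/{index}/_doc/{id}` endpoint of recent ES versions (older releases used `/{index}/{type}/{id}`), and the helper names `PageParser`, `page_to_doc`, `index_doc` and `crawl`. It indexes only the extracted title and visible text of each page rather than the raw HTML, which is one possible answer to the "entire web page" part of the question.

```python
import hashlib
import json
import urllib.parse
import urllib.request
from collections import deque
from html.parser import HTMLParser

ES_URL = "http://localhost:9200"  # assumption: a local Elasticsearch node
INDEX = "website"                 # hypothetical index name


class PageParser(HTMLParser):
    """Collects the title, visible text and outgoing links of one page."""

    def __init__(self):
        super().__init__()
        self.title, self.text, self.links = "", [], []
        self._skip = 0            # nesting depth inside <script>/<style>
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data.strip()
        elif not self._skip and data.strip():
            self.text.append(data.strip())


def page_to_doc(url, html):
    """Return (document to index, absolute links on the same host)."""
    parser = PageParser()
    parser.feed(html)
    host = urllib.parse.urlparse(url).netloc
    links = []
    for href in parser.links:
        absolute, _fragment = urllib.parse.urldefrag(urllib.parse.urljoin(url, href))
        if urllib.parse.urlparse(absolute).netloc == host:
            links.append(absolute)
    doc = {"url": url, "title": parser.title, "body": " ".join(parser.text)}
    return doc, links


def index_doc(doc):
    """PUT one document into Elasticsearch, keyed by a hash of its URL."""
    doc_id = hashlib.sha1(doc["url"].encode("utf-8")).hexdigest()
    req = urllib.request.Request(
        f"{ES_URL}/{INDEX}/_doc/{doc_id}",
        data=json.dumps(doc).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
    urllib.request.urlopen(req, timeout=10).close()


def crawl(start_url, max_pages=50):
    """Breadth-first crawl of one site, indexing up to max_pages HTML pages."""
    seen, queue = {start_url}, deque([start_url])
    while queue and max_pages > 0:
        url = queue.popleft()
        try:
            with urllib.request.urlopen(url, timeout=10) as resp:
                if resp.headers.get_content_type() != "text/html":
                    continue
                html = resp.read().decode("utf-8", errors="replace")
        except OSError:           # unreachable page: skip it
            continue
        doc, links = page_to_doc(url, html)
        index_doc(doc)
        max_pages -= 1
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
```

Calling `crawl("https://example.com/")` with your own start URL would walk same-host links breadth-first and index each HTML page it finds. A real crawler would also want to respect robots.txt, rate-limit its requests and handle non-UTF-8 pages, which is where something like Nutch earns its keep.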
