
How do I implement a web search using Hadoop/HBase?

Developer https://www.devze.com 2023-02-18 06:54 Source: Internet

How do I implement a web search using Hadoop/HBase? Can anyone please provide some guidelines or a tutorial, just to get started?


This article is about MapReduce, and it gives some examples of implementing principles of information retrieval using MapReduce:
http://labs.google.com/papers/mapreduce.html
You might also want to check out Apache Lucene, which is an open-source information retrieval library, and Apache Solr, a search server built on top of it.
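
To make the MapReduce idea concrete, here is a minimal sketch of building an inverted index (the core data structure behind text search) as a map step, a shuffle step, and a reduce step. It is plain Python that only imitates what Hadoop would do across machines; the function names, the tokenizer, and the sample pages are all assumptions made for illustration, not Hadoop's API.

```python
import re
from collections import defaultdict

def map_phase(doc_id, text):
    """Map step: emit one (term, doc_id) pair per distinct term in a document."""
    terms = set(re.findall(r"[a-z0-9]+", text.lower()))
    for term in terms:
        yield term, doc_id

def shuffle(pairs):
    """Shuffle step: group emitted values by key, as the framework would do."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(term, doc_ids):
    """Reduce step: turn the grouped values into a sorted posting list."""
    return term, sorted(set(doc_ids))

def build_inverted_index(docs):
    """Run map, shuffle, and reduce in-process over a dict of doc_id -> text."""
    pairs = (pair for doc_id, text in docs.items()
             for pair in map_phase(doc_id, text))
    grouped = shuffle(pairs)
    return dict(reduce_phase(term, ids) for term, ids in grouped.items())

def search(index, query):
    """Return the documents that contain every query term (boolean AND)."""
    terms = re.findall(r"[a-z0-9]+", query.lower())
    if not terms:
        return []
    postings = [set(index.get(term, [])) for term in terms]
    return sorted(set.intersection(*postings))

# Hypothetical sample corpus standing in for crawled pages.
docs = {
    "page1": "Hadoop runs MapReduce jobs",
    "page2": "HBase stores tables on Hadoop",
    "page3": "Solr builds on Lucene",
}
index = build_inverted_index(docs)
```

With this sample data, `search(index, "hadoop hbase")` returns only `page2`. In a real Hadoop job the map and reduce functions would be Mapper and Reducer classes, and the shuffle would be handled by the framework.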


Try the Lily project. It deals with indexing HBase table data, and the indexed data can easily be used to implement search. Here is a link you can refer to:

http://www.lilyproject.org/lily/index.html

Lily unifies Apache HBase, Hadoop and Solr into a comprehensively integrated, interactive data platform with easy-to-use access APIs, a high-level data model and schema language, flexible, real-time indexing and the expressive search power of Apache Solr. Best of all, Lily is open source.


@Yash:

For HBase: read the chapter on HBase in the Definitive Guide, then read the Apache HBase guide. This will give you a clearer picture of HBase.

JMeter can help you send multiple queries to the server for testing purposes.

Nutch is an open-source, Hadoop-based web crawler. Solr is a search platform that uses Lucene internally. You might look into these to understand their design and implement your web search app.
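
If you end up putting Solr in front of your index, queries go over plain HTTP to its `select` handler. Below is a small sketch that only builds such a query URL, so it runs without a server. The host, the collection name `webpages`, and the field name `content` are assumptions for illustration; substitute whatever your own Solr setup uses.

```python
from urllib.parse import urlencode

def solr_select_url(base_url, collection, query, rows=10, start=0):
    """Build a URL for Solr's select handler using standard query parameters.

    base_url and collection are placeholders for your own deployment.
    """
    params = {
        "q": query,      # the search query, e.g. "field:term"
        "rows": rows,    # how many results to return
        "start": start,  # offset of the first result, for paging
        "wt": "json",    # ask for a JSON response
    }
    return f"{base_url.rstrip('/')}/solr/{collection}/select?{urlencode(params)}"

# Hypothetical local Solr instance and collection.
url = solr_select_url("http://localhost:8983", "webpages", "content:hadoop", rows=5)
```

You could then fetch that URL with `urllib.request.urlopen` (or point JMeter at it for load testing) once a Solr instance with a matching collection is actually running.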
