开发者

Setup Nutch 1.3 and Hadoop

开发者 https://www.devze.com 2023-04-01 20:39 出处:网络
I am a newbie to Nutch and Hadoop and trying to follow the tutorial here at http://wiki.apache.org/nutch/NutchHadoopTutorial.

I am a newbie to Nutch and Hadoop and trying to follow the tutorial here at http://wiki.apache.org/nutch/NutchHadoopTutorial.

So I started with Nutch 1.3 release.

Even though Hadoop is included in Nutch, I did not see any of these .sh or .xml files referred in the tutorial under /nutch/search/conf after the build.

I开发者_StackOverflow社区 was wondering if I have to setup hadoop first in the same directory structure or copy over hadoop config files before proceeding to Nutch setup.

Can anyone please put me in the right direction. I am pretty sure that I am lost :-(

Thanks very much in advance


Well hadoop is not included anymore in Nutch since 1.3 ... I have complained in the mailing list. But the goal of Nutch group seem to have changed to a crawler component only. To make use of it you need to install hadoop here is good tutorial & solr (for search).
Some people announced they are going to fix that but for Nutch1.4 only. Not sure where it will end up.

0

精彩评论

暂无评论...
验证码 换一张
取 消