I am new to Nutch and Hadoop and am trying to follow the tutorial at http://wiki.apache.org/nutch/NutchHadoopTutorial.
I started with the Nutch 1.3 release.
Even though Hadoop is supposed to be included in Nutch, after the build I did not see any of the .sh or .xml files that the tutorial refers to under /nutch/search/conf.
I was wondering whether I have to set up Hadoop first in the same directory structure, or copy over the Hadoop config files, before proceeding with the Nutch setup.
Can anyone please point me in the right direction? I am pretty sure that I am lost :-(
Thanks very much in advance
Well, Hadoop has not been included in Nutch since 1.3. I have complained on the mailing list, but the goal of the Nutch group seems to have changed to providing a crawler component only. To make use of it you need to install Hadoop (here is a good tutorial) and Solr (for search).
Some people have announced that they are going to fix that, but only for Nutch 1.4. I am not sure where it will end up.