How to configure Tika 0.9 with Solr 3.1

Developer https://www.devze.com 2023-02-27 00:08 Source: the web
Can you give me the steps to configure Tika 0.9 with Solr 3.1?

<requestHandler name="/update/extract"
                  startup="lazy"
                  class="solr.extraction.ExtractingRequestHandler" >
    <lst name="defaults">
      <!-- All the main content goes into "text"... if you need to return
           the extracted text or do highlighting, use a stored field. -->
      <str name="fmap.content">text</str>
      <str name="lowernames">true</str>
      <str name="uprefix">ignored_</str>

      <!-- capture link hrefs but ignore div attributes -->
      <str name="captureAttr">true</str>
      <str name="fmap.a">links</str>
      <str name="fmap.div">ignored_</str>
    </lst>
  </requestHandler>

This is the handler I use in solrconfig.xml to configure it. Please help me with this.

Thanks,


Assuming you have Tika (and its dependencies) installed in Solr, that ought to be all you need to do.
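
In case "installed" needs spelling out: as far as I recall, the example solrconfig.xml that ships with Solr 3.1 pulls in the extraction contrib with `<lib>` directives roughly like the ones below. The relative paths are an assumption based on the stock example directory layout, so adjust them to wherever your `contrib` and `dist` directories actually live.

```xml
<!-- Goes inside <config> in solrconfig.xml. -->
<!-- Paths are assumed to be relative to the Solr instance directory; adjust to your layout. -->

<!-- Tika and its dependencies -->
<lib dir="../../contrib/extraction/lib" />

<!-- The Solr Cell jar that provides ExtractingRequestHandler -->
<lib dir="../../dist/" regex="apache-solr-cell-\d.*\.jar" />
```

If the jars are not found, the lazy handler will typically only fail when `/update/extract` is first requested, so an error at that point is a hint to recheck these paths.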

Have you had a read through the ExtractingRequestHandler wiki page? It has quite a bit of information, and handily also has several recipes using curl to let you test that it's all working properly.
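
One thing worth checking while you test: the handler in the question sends the extracted body to `text`, link hrefs to `links`, and everything it does not recognise to fields prefixed with `ignored_`, so schema.xml presumably needs matching definitions. A minimal sketch follows; the field type names (`text`, `string`, `ignored`) are assumptions and should be swapped for whatever types your own schema defines.

```xml
<!-- schema.xml sketch; type names are assumptions, adapt them to your schema -->

<!-- Inside <types>: a type that neither indexes nor stores its input -->
<fieldType name="ignored" class="solr.StrField" indexed="false" stored="false" multiValued="true" />

<!-- Inside <fields>: targets for fmap.content, fmap.a, and uprefix / fmap.div -->
<field name="text" type="text" indexed="true" stored="false" multiValued="true" />
<field name="links" type="string" indexed="true" stored="true" multiValued="true" />
<dynamicField name="ignored_*" type="ignored" multiValued="true" />
```

As the comment in the handler config says, `text` would need `stored="true"` if you want to return the extracted text or highlight it.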
