apache-tika
Tika Solr integration
I am trying to index using a curl-based request. The request is: curl "http://localhost:8080/solr1/update/extract?literal.id=who.pdf&uprefix=attr_&fmap.content=attr_content&commit=true" -F [Details]
2023-03-09 05:18 Category: Q&A
Apache Tika and File access instead of Java Input Stream
I want to be able to create a new Tika parser to extract metadata from a file. We're already using Tika and the metadata extraction will be done consistently. [Details]
2023-03-06 09:53 Category: Q&A
How to integrate database search with PDF search in a web app?
I have a JSP web application with a custom search engine. The search engine is basically built on top of a 'documents' table of a SQL Server database. [Details]
2023-03-06 05:49 Category: Q&A
How to show filenames in search results using Solr's FileListEntityProcessor
I am trying to scan all PDF/DOC files in a directory. This works fine and I am able to scan all documents. [Details]
2023-03-05 10:00 Category: Q&A
Doesn't index or extract documents (.pdf, .doc) remotely
I am using Solr 3.1, Apache Tika 0.9 and SolrNet 0.3.1 to index documents such as .doc and .pdf files. [Details]
2023-03-03 16:59 Category: Q&A
Is it possible to extract text by page for Word/PDF files using Apache Tika?
All the documentation I can find seems to suggest I can only extract the entire file's content. But I need to extract pages individually. Do I need to write my own parser for that? Is [Details]
2023-03-01 04:56 Category: Q&A
Solr 3.1 doesn't index the file
I have configured Solr 3.1 with Apache Tika 0.9 successfully. I haven't changed the schema.xml (default schema) or solrconfig.xml files. [Details]
2023-02-28 22:52 Category: Q&A
How to configure Tika 0.9 with Solr 3.1
Can you give me the steps to configure Tika 0.9 with Solr 3.1? <requestHandler name="/update/extract" [Details]
2023-02-27 00:08 Category: Q&A
How to get file extension from content type?
I'm using Apache Tika, and I have files (without extension) of a particular content type that need to be renamed to have an extension that reflects the content type. [Details]
2023-02-22 03:46 Category: Q&A
Apache Tika compilation error
I'm getting this error when compiling the latest version of Apache Tika on Debian. Any help will be appreciated. [Details]
2023-02-18 04:22 Category: Q&A