tokenize
how to use solr.analysis.WordDelimiterFilter standalone
At this point I\'m pretty sure I\'ve searched enough and I can\'t find a nice example of how 开发者_StackOverflow中文版put to work this class.[详细]
2023-04-12 18:13 分类:问答Solr: exact phrase query with a EdgeNGramFilterFactory
In Solr (3.3), is it possible to make a field letter-by-letter searchable through a EdgeNGramFilterFactory and also sensitive to phrase queries?[详细]
2023-04-10 08:08 分类:问答Difference between StandardTokenizerFactory and KeywordTokenizerFactory in Solr?
I am new to Solr.I want to know when to use StandardTokenizerFactory and KeywordTokenizerFactory? I read the docs on Apache Wiki, but I am not getting it.[详细]
2023-04-10 05:34 分类:问答Tokenize a string in KornShell
I need to tokenize in string in KornShell (ksh). I have got the following script for bash; but it does not seem to work in ksh.[详细]
2023-04-09 06:58 分类:问答Lucene Tokenizer with LookAhead
can anyone point me in the right direction for implementing a Lucene Tokenizer with LookAhead? I\'m using a snowball stemmer and I want to be able to get phrases of city names and prevent them from b[详细]
2023-04-07 22:53 分类:问答How to extend WhitespaceTokenizer?
I need to use a tokenizer that splits words 开发者_开发知识库on whitespace but that doesn\'t split if the whitespace is whithin double parenthesis. Here an example:[详细]
2023-04-07 16:23 分类:问答Boost::Split using whole string as delimiter
I would like to know if there is a method using boost::split to split a string using whole strings as a delimiter. For example:[详细]
2023-04-05 17:56 分类:问答Solr Tokenizer Question
I have what I think is a simple solr exercise, but I\'m unsure what to use. I have a field of names, e.g. Joe Smith and Jack Daniels and Steve. They could each be one name or two names. I want to be[详细]
2023-04-05 17:47 分类:问答What's wrong with this Lucene TokenFilter?
Disclaimer: I\'ve been coding for 36 of the last 41 hours. I have a headache. And I can\'t figure out why this combining TokenFilter is returning 2 tokens, both the first token from the source stream.[详细]
2023-04-04 17:29 分类:问答Class hierarchy of tokens and checking their type in the parser
I\'m attempting to write a reusable parsing library (for fun). I wrote a Lexer class which generates a sequence of Tokens. Token is a base class for a hierarchy of subclas开发者_StackOverflowses, eac[详细]
2023-04-04 06:29 分类:问答