Does anyone here have experience writing custom FTS3 (the full-text-search extension) tokenizers? I'm looking for a tokenizer that will ignore HTML tags.
Thanks.
I have no direct experience, but a web search for "sqlite3 registerTokenizer" turned up two tokenizers that could serve as a starting point: a Snowball tokenizer and a MeCab tokenizer.
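For what it's worth, the tag-skipping part itself is fairly small. Here is a rough Python sketch of the logic only, not an actual FTS3 tokenizer: it yields each word with its start and end offsets, roughly the information a tokenizer's xNext callback has to return, and skips anything that looks like an HTML tag. The function name `tokenize_skipping_html` and the regex are my own assumptions; it uses character offsets rather than the byte offsets FTS3 expects, and it does not handle comments, script bodies, or a `>` inside an attribute value.

```python
import re

# Matches either something that looks like an HTML tag (group "tag")
# or a run of word characters (group "word"). The tag alternative is
# tried first, so words inside tags are never emitted.
_TOKEN_RE = re.compile(r"(?P<tag><[^>]*>)|(?P<word>\w+)")


def tokenize_skipping_html(text):
    """Yield (token, start, end) for each word that lies outside HTML tags.

    Hypothetical helper for illustration; offsets are character offsets.
    """
    for match in _TOKEN_RE.finditer(text):
        if match.group("word") is not None:
            # Lowercase the token, similar to what a simple tokenizer would do.
            yield match.group("word").lower(), match.start(), match.end()


tokens = list(tokenize_skipping_html('<p class="x">Hello <b>World</b></p>'))
for token, start, end in tokens:
    print(token, start, end)
```

A real tokenizer would do the same scan in C inside the tokenizer module, so one of the two tokenizers above could be used as the skeleton and only the scanning loop changed.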