fuzzy-search
Advice on how to improve a current fuzzy search implementation
I'm currently working on implementing a fuzzy search for a terminology web service and I'm looking for suggestions on how I might improve the current implementation. It's too much code to share, bu...
2023-01-20 15:34 Category: Q&A
Real world typo statistics? [closed]
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, a...
2023-01-10 05:20 Category: Q&A
Super fuzzy name checking?
I'm working on some stuff for an in-house CRM. The company's current frontend allows for lots of duplicates. I'm trying to stop end-users from putting in the same person because they searched for ...
2023-01-07 21:12 Category: Q&A
Fuzzy runtime search without using database\index
I need to filter a stream of text articles by checking every entry for fuzzy matches of a predefined string (I am searching for misspelled product names, sometimes they have different order...
2023-01-07 09:04 Category: Q&A
SOLR - how to do a fuzzy search on booleans
If my index contains three boolean fields: a, b and c... I would like to search for: "a=True, b=False, c=True" and SOLR should return all entries, and their score should represent how...
2023-01-06 23:46 Category: Q&A
"Go to file" feature in various editors
In TextMate there is a feature called "Go to file" that is used for file navigation. It is a box where you type the name of a file in your project and it will use fuzzy matching to generate a list of...
2022-12-29 07:32 Category: Q&A
Algorithm to detect repeating/similar strings in a corpus of data -- say email subjects, in Python [duplicate]
This question already has answers here: A better similarity ranking algorithm for variable length strings...
2022-12-28 16:32 Category: Q&A
Lucene query: bla~* (match words that start with something fuzzy), how?
In the Lucene query syntax I'd like to combine * and ~ in a valid query similar to: bla~* //invalid query...
2022-12-27 06:07 Category: Q&A
Lucene Fuzzy Match on Phrase instead of Single Word
I'm trying to do a fuzzy match on the phrase "Grand Prarie" (deliberately misspelled) using Apache Lucene. Part of my issue is that the ~ operator only does fuzzy matches on single w...
2022-12-25 16:46 Category: Q&A
Fuzzy Regular Expressions
In my work I have used approximate string matching algorithms such as Damerau–Levenshtein distance, with great results, to make my code less vulnerable to spelling mistakes...
2022-12-21 17:39 Category: Q&A
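
Several of the questions listed here rely on edit distance, and the last one names Damerau–Levenshtein distance specifically. As a rough illustration, the following is a minimal Python sketch of the optimal string alignment variant (sometimes called restricted Damerau–Levenshtein), which counts insertions, deletions, substitutions, and swaps of adjacent characters. The function name `osa_distance` is hypothetical and does not come from any of the listed questions; the unrestricted Damerau–Levenshtein distance can give smaller values for some inputs.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance (restricted Damerau-Levenshtein).

    Hypothetical helper for illustration only. Counts insertions, deletions,
    substitutions, and transpositions of two adjacent characters, where no
    substring is edited more than once.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # d[i][j] holds the distance between a[:i] and b[:j].
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all j characters of b[:j]

    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Adjacent transposition, e.g. "teh" -> "the".
            if (
                i > 1
                and j > 1
                and a[i - 1] == b[j - 2]
                and a[i - 2] == b[j - 1]
            ):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[rows - 1][cols - 1]
```

Under this sketch, a swapped pair such as "teh" versus "the" counts as a single edit, where plain Levenshtein distance would count two. A caller would typically compare the result against a small threshold chosen for the data at hand.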