information-retrieval
Python: intersection of lists/sets
def boolean_search_and(self, text): results = [] and_tokens = self.tokenize(text) tokencount = len(and_tokens)[详细]
2023-01-16 09:04 分类:问答Python: Storing a list value associated with a key in dictionary
I know how python dictionaries store key: value tuples. In the project I\'m working on, I\'m required to store key associated with a value that\'s a list.[详细]
2023-01-16 00:40 分类:问答Which information is stored by Google crawler?
.. and ho开发者_Go百科w the web crawler infers the semantics of information on the website? List out the ranking signal in separate answer.From http://www.google.com/corporate/tech.html:[详细]
2023-01-15 23:39 分类:问答Wikipedia: pages across multiple languages
I want to use wikipedia dump for my project. The below information is requ开发者_StackOverflow中文版ired for my project.[详细]
2023-01-15 20:23 分类:问答HTML\PHP - How to elicit user (visitor)'s info
How to elicit user\'s information when he/she is visiting your website? IP Address Mac Address User Profile Name[详细]
2023-01-14 06:37 分类:问答how to cluster evolving data streams
I want to incrementally cluster text documents reading them as data streams but there seems to be a problem. Most of the term weighting options are based on vector space model using TF-IDF开发者_如何学[详细]
2023-01-13 05:28 分类:问答Information retrieval (IR) vs data mining vs Machine Learning (ML)
People often throw around the terms IR, ML, and data mining, but I have noticed a lot of overlap between them.[详细]
2023-01-10 04:49 分类:问答Fast Sequence Alignment on Unicode Strings
I want to run something like the BLAST algorithm to query a large database of unicode strings.开发者_C百科Most of the alignment software like BLAST expects nucleotide or protein strings as input.But m[详细]
2023-01-10 00:11 分类:问答Good documentation on structure tcp_info [closed]
Closed. This question does not meet Stack Overf开发者_运维问答low guidelines. It is not currently accepting answers.[详细]
2023-01-09 11:20 分类:问答Cosine Similarity of Vectors, with < O(n^2) complexity
Having looked around this site for similar issues, I fou开发者_开发知识库nd this: http://math.nist.gov/javanumerics/jama/ and this: http://sujitpal.blogspot.com/2008/09/ir-math-with-java-similarity-me[详细]
2023-01-09 06:43 分类:问答