text-mining
Converting data into information:Where to start?
We (my company) runs a website which have lots of data recorded like user registration, visits, clicks, what the stuff they post etc etc but so far we don\'t have a tool to find out how to monitor ent[详细]
2023-02-14 14:10 分类:问答Find 3-8 word common phrases in body of text using PHP
I\'m looking for a way to find common phrases within a body of text using PHP. If it\'s not possible in php, I\'d be interested in other web languages that would help me complete this.[详细]
2023-02-07 05:51 分类:问答Is OpenNLP unable to identify dates of the format "January 10th, 2009"?
OpenNLP(in Java) is unable to identify dates of the format \"Januar开发者_JAVA技巧y 10th, 2010\" or \"January 10, 2010\". I replaced all \',\'s in the text with an empty string \"\" before using OpenN[详细]
2023-02-05 10:42 分类:问答Full Text PDFs for PubMed Articles
While working on a project I need to download and process full text articles for PubMed a开发者_开发问答bstracts, is there any implemented code or tool that allows the user to input a set of PubMed id[详细]
2023-02-05 04:58 分类:问答Paraphrase recognition using sentence level similarity
I\'m a new entrant to NLP (Natural Language Processing). As a start up project, I\'m developing a paraphrase recognizer (a system which can recognize two similar sentences). For that recognizer I\'m g[详细]
2023-02-03 08:53 分类:问答Runtime pompt for Rapidminer
I have been using Rapidminer and created 开发者_JAVA百科a series of processes which preform a standard set of tasks. Now, I want allow the user to dynamically set the parameters of a process at the st[详细]
2023-01-31 05:35 分类:问答Lucene Entity Extraction
Given a finite dictionary of entity terms, I\'m looking for a way to do Entity Extraction with intelligent tagging using Lucene. Currently I\'ve been able to use Lucene for:[详细]
2023-01-25 16:38 分类:问答How to extract words from text as per the context
I want to extract relevant words from a text statement provided by the user. eg. For a question \"How many sides are there in a rectangle?\"[详细]
2023-01-22 08:23 分类:问答C# algorithm for N-gram
I am intending to use the n-gram code from this article. The algorithm produces these tri-gram开发者_StackOverflow results:[详细]
2023-01-18 09:00 分类:问答Clustering text in MATLAB
I want to do hierarchical agglomerative clustering on texts in MATLAB. Say, I have four sentences, I have a pen.[详细]
2023-01-14 06:53 分类:问答