text-mining
Using Regexpr with $
Just a quick question: does anyone know how to use regexpr with "\$"? Essentially, I want to parse out strings and figure out what numeric value came after the \$ (for example "Get $50 off on p...
2023-04-13 10:04 Category: Q&A
Unsupervised named entity recognition (NER) with custom controlled vocabulary for crosslink-suggestions in Java
I'm looking for a Java library that can do named entity recognition (NER) with a custom controlled vocabulary, without needing labeled training data first. I searched some on SE, but most questions a...
2023-04-11 12:39 Category: Q&A
How to sum up the word count for each person in a dialogue?
I'm starting to learn Python and I'm trying to write a program that would import a text file, count the total number of words, count the number of words in a specific paragraph (said by each partici...
2023-04-04 21:13 Category: Q&A
Text Mining on huge list of strings
I have a list of strings (a pretty big list of IDs and strings scattered across 4-5 big files, around a GB each). These strings are formatted like this:...
2023-04-02 15:55 Category: Q&A
WEKA - Classifying New Data from Java - IDF Transform
We are trying to implement a WEKA classifier from inside a Java program. So far so good; everything works well. However, when building the classifier from the training set in the Weka GUI we use...
2023-04-01 11:19 Category: Q&A
Rattle loading String to Vector file from WEKA
I have been using WEKA to do some text classification work and I want to try out R. The problem is I cannot load the String to Vector ARFF files created by...
2023-03-25 17:55 Category: Q&A
RapidMiner Sentiment Analysis
I have a collection of short messages classified as positive or negative, saved in an ARFF file generated in WEKA. I would like to move this data to RapidMiner for classification and processi...
2023-03-25 08:04 Category: Q&A
How do I classify the words of a text into things like names, numbers, money, dates, etc.?
I asked some questions about text mining a week ago, when I was a bit confused. I still am, but now I know what I want to do.
2023-03-24 09:16 Category: Q&A
Java implementation of singular value decomposition for large sparse matrices
I'm just wondering if anyone out there knows of a Java implementation of singular value decomposition (SVD) for large sparse matrices? I need this implementation for latent semantic analysis (LSA).
2023-03-23 07:05 Category: Q&A
R text mining package DocumentTermMatrix with a dictionary in the control list takes way too much memory [closed]
Closed. This question needs details or clarity. It is not currently accepting answers. Want to improve this question? Add details and clarify the problem by editing this post.
2023-03-19 10:40 Category: Q&A