corpus
Which similarity function in nltk.corpus.wordnet is appropriate for finding the similarity of two words?
Which similarity function in nltk.corpus.wordnet is appropriate for finding the similarity of two words? path_similarity()? [Details]
2023-04-04 07:07 Category: Q&A
Where can I find texts that describe topic-specific events?
So, some background: I'm trying to train an ML system to answer questions about events, where both the event descriptions and questions are posed in natural language; the event descriptions are constr [Details]
2023-04-01 09:43 Category: Q&A
A corpus with semantic role tags for an NLP application
So, I've constructed an NLP program that learns to extract a semantic event description from a sentence, but right now my training set is limited to sentences I've parsed into semantic event componen [Details]
2023-03-24 13:55 Category: Q&A
Python - Search for items in hundreds of large, gzipped files
Unfortunately, I'm working with an extremely large corpus which is spread across hundreds of .gz files, 24 gigabytes (packed) worth, in fact. Python is really my native language (hah) but I was wonde [Details]
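One possible way to approach the question above, as a minimal sketch only: stream each archive with the standard-library `gzip` module so that nothing is fully unpacked into memory or onto disk. The function name `search_gz_files`, the `*.gz` glob, the UTF-8 encoding, and the plain substring match are all assumptions made for illustration; the original question does not say how the files are named or encoded, or what kind of "items" are being searched for.

```python
import gzip
from pathlib import Path
from typing import Iterator, Tuple


def search_gz_files(directory: str, needle: str) -> Iterator[Tuple[str, int, str]]:
    """Yield (file name, line number, line) for each line containing needle.

    Hypothetical helper: assumes the corpus is line-oriented UTF-8 text
    stored as *.gz files directly inside `directory`.
    """
    for path in sorted(Path(directory).glob("*.gz")):
        # Text mode decompresses incrementally, one buffered chunk at a time,
        # so memory use does not grow with the size of the archive.
        with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
            for line_number, line in enumerate(handle, start=1):
                if needle in line:
                    yield path.name, line_number, line.rstrip("\n")
```

Because the function is a generator, a caller can stop after the first few hits without scanning the remaining files. With 24 GB of packed data the run time will likely be dominated by decompression, so splitting the file list across processes may be worth trying if a single pass is too slow.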
2023-03-09 03:18 Category: Q&A
Programmatically install NLTK corpora / models, i.e. without the GUI downloader?
My project uses the NLTK. How can I list the project's corpus & model requirements so they can be automatically installed? I don't want to click through the nltk.download() GUI, installing packa [Details]
2023-03-02 13:17 Category: Q&A
Replacing synonyms in a corpus using WordNet and NLTK - python
I'm trying to write a simple Python script that uses NLTK to find and replace synonyms in a txt file. The following code gives me an error: [Details]
2023-02-14 12:43 Category: Q&A
How do I tag text files with hunpos in nltk?
Can someone help me with the syntax for hunpos tagging a corpus in nltk? What do I import for the hunpos.HunPosTagger module? [Details]
2023-02-12 15:31 Category: Q&A
How to build an IMS open source corpus workbench and NLTK readable corpus?
Currently I have a bunch of .txt files. Within each .txt file, each sentence is separated by a newline. How do I change them to the IMS CWB format so that they are readable by CWB, and also to NLTK format? [Details]
2023-02-12 07:25 Category: Q&A
Word frequency tally script is too slow
Background: I created a script to count the frequency of words in a plain text file. The script performs the following steps: [Details]
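The steps of the original script are cut off in this listing, so the following is not a reconstruction of it. It is only a rough sketch of one common way to tally word frequencies in a single pass using `collections.Counter`. The name `count_words`, the lowercasing, and the regular expression that defines a "word" are all assumptions.

```python
import re
from collections import Counter
from typing import Iterable

# Assumed definition of a word: a run of ASCII letters or apostrophes.
WORD_RE = re.compile(r"[a-z']+")


def count_words(lines: Iterable[str]) -> Counter:
    """Tally lowercased word frequencies over an iterable of text lines."""
    counts: Counter = Counter()
    for line in lines:
        # Counter.update adds one count per word found on this line.
        counts.update(WORD_RE.findall(line.lower()))
    return counts
```

An open file object can be passed directly, for example `count_words(open("corpus.txt", encoding="utf-8"))` with a hypothetical file name, and `counts.most_common(10)` then returns the ten most frequent words. Reading line by line keeps memory use proportional to the vocabulary rather than to the file size.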
2023-02-02 20:26 Category: Q&A
Understanding semcor corpus structure h
I'm learning NLP. I'm currently playing with Word Sense Disambiguation. I'm planning to use the semcor corpus as training data but I have trouble understanding the XML structure. I tried googling but did [Details]
2023-02-01 23:19 Category: Q&A