information-extraction
Script to find webmaster contact details
As part of a summer project I am currently undertaking, I am interested in writing a script to automate the retrieval of the contact email address for a particular site's webmaster. Is there any info...
2023-03-09 22:40 Category: Q&A
Extracting information from millions of simple but inconsistent text files
We have millions of simple txt documents containing various data structures that we extracted from PDF. The text is printed line by line, so all formatting is lost (because when we tried tools to maintain t...
2023-03-04 12:07 Category: Q&A
What techniques are there to extract a navigational menu from a web page?
I'm looking for a method to extract a menu used for navigation from a web page heavy with links (and probably text). The pages I'm interested in are quite plain, valid XHTML, and it's a safe assump...
2023-03-01 15:22 Category: Q&A
Forum Data Analysis
I'm working on an expert system that analyzes data from a forum to extract trustworthy information, which I then use to train the expert system.
2023-02-28 22:04 Category: Q&A
Are there libraries to assist in AutoCAD structure extraction?
I need to query AutoCAD models to extract structures and connections (e.g., power, data) between them, for storage in a database. I know from experience and research that handling native AutoCAD .dwg...
2023-02-27 08:16 Category: Q&A
Information extraction. Counting mentions to measure relevance
Is it possible to count how many times an entity has been mentioned in an article? For example...
2023-02-26 10:07 Category: Q&A
R: Data structure for an ontology and web extraction
I want to extract information from a large website and generate an ontology, something that can be processed with description logic.
2023-02-18 05:45 Category: Q&A
Image feature identification
I am looking for a solution to do the following (the focus of my question is step 2): a picture of a house including the front yard...
2023-02-08 18:59 Category: Q&A
Best turnkey relation detection library? [closed]
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely so...
2023-02-05 05:09 Category: Q&A
Understanding Relevance Score of OpenCalais
I am trying to understand the relevance score that OpenCalais returns for each entity. What does it signify, and how is it to be interpreted? I would be thankful for...
2023-02-02 23:40 Category: Q&A