Invention Grant
US09535910B2 Corpus generation based upon document attributes 有权
基于文档属性的语料库生成

Corpus generation based upon document attributes
Abstract:
The present disclosure provides an approach in which a domain corpus subset generator correlates documents from a document corpus to domain discernible attributes associated with domain corpus subsets. The domain corpus subset generator analyzes correlation results from the correlation and stores the documents into domain corpus subsets accordingly. In turn, a question-answer system utilizes documents included in a specific domain corpus subset to provide relevant and accurate answers to an input question.
Public/Granted literature
Information query
Patent Agency Ranking
0/0