Invention Grant
- Patent Title: Corpus generation based upon document attributes
- Patent Title (Chinese): 基于文档属性的语料库生成 (Corpus generation based upon document attributes)
- Application No.: US14292857
- Application Date: 2014-05-31
- Publication No.: US09535910B2
- Publication Date: 2017-01-03
- Inventor: Corville O. Allen, Paul R. Bastide, Matthew E. Broomhall, Robert E. Loredo, Fang Lu
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: VanLeeuwen & VanLeeuwen
- Agent: William J. Stock
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
The present disclosure provides an approach in which a domain corpus subset generator correlates documents from a document corpus to domain discernible attributes associated with domain corpus subsets. The domain corpus subset generator analyzes the correlation results and stores the documents into domain corpus subsets accordingly. In turn, a question-answer system utilizes documents included in a specific domain corpus subset to provide relevant and accurate answers to an input question.
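
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the scoring rule (fraction of a domain's attribute terms found in a document), the 0.5 threshold, the function names `tokenize`, `correlate`, and `build_domain_subsets`, and the sample documents and domains are all assumptions made for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def correlate(document, attributes):
    """Hypothetical correlation score: the fraction of a domain's
    attribute terms that appear in the document (0.0 to 1.0)."""
    if not attributes:
        return 0.0
    return len(tokenize(document) & attributes) / len(attributes)


def build_domain_subsets(corpus, domain_attributes, threshold=0.5):
    """Store each document ID in every domain subset whose correlation
    score meets the (assumed) threshold."""
    subsets = defaultdict(list)
    for doc_id, text in corpus.items():
        for domain, attributes in domain_attributes.items():
            if correlate(text, attributes) >= threshold:
                subsets[domain].append(doc_id)
    return dict(subsets)


# Illustrative data only; not taken from the patent.
corpus = {
    "doc1": "The patient received a diagnosis and a treatment plan.",
    "doc2": "The court issued a ruling on the contract dispute.",
    "doc3": "Treatment costs were disputed in court after the diagnosis.",
}
domain_attributes = {
    "medical": {"patient", "diagnosis", "treatment"},
    "legal": {"court", "contract", "ruling"},
}

subsets = build_domain_subsets(corpus, domain_attributes)
print(subsets)
```

A question-answer system could then restrict its search to `subsets["medical"]` when an input question is classified as medical, which is the role the abstract assigns to a specific domain corpus subset.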
Public/Granted literature
- US20150347557A1: Corpus Generation Based Upon Document Attributes (Public/Granted day: 2015-12-03)