Invention Grant
US08122032B2 Identifying and linking similar passages in a digital text corpus
In force
- Patent Title: Identifying and linking similar passages in a digital text corpus
- Application No.: US11781213
- Application Date: 2007-07-20
- Publication No.: US08122032B2
- Publication Date: 2012-02-21
- Inventor: William N. Schilit, Okan Kolak, Adam Mathes
- Applicant: William N. Schilit, Okan Kolak, Adam Mathes
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fenwick & West LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A corpus contains digital text from multiple documents. A passage mining engine identifies similar passages in the documents and stores data describing the similarities. The passage mining engine groups similar passages into groups based on degree of similarity or other criteria. The passage mining engine ranks the similar passages found in the text corpus based on quality or other criteria. A user interface is presented that includes hypertext links associated with the similar passages that allow a user to navigate the documents.
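
The abstract does not say how similarity is measured, how groups are formed, or what "quality" means for ranking. As a rough illustration only, the sketch below assumes word-trigram shingles compared by Jaccard similarity, groups passages whose similarity meets a threshold using union-find, and ranks groups by size. The function names, the threshold value, and the ranking criterion are all hypothetical choices made for this example and are not taken from the patent.

```python
from itertools import combinations


def shingles(text, n=3):
    """Return the set of lowercased word n-grams in a passage (assumed similarity feature)."""
    words = text.lower().split()
    if len(words) < n:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two shingle sets; 0.0 if either set is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def group_similar(passages, threshold=0.5):
    """Group (doc_id, text) passages whose pairwise similarity meets the threshold.

    Returns only groups with more than one passage, largest group first.
    Ranking by group size is an assumption standing in for "quality or other criteria".
    """
    sets = [shingles(text) for _, text in passages]
    parent = list(range(len(passages)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Brute-force pairwise comparison; fine for a sketch, too slow for a real corpus.
    for i, j in combinations(range(len(passages)), 2):
        if jaccard(sets[i], sets[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(passages)):
        groups.setdefault(find(i), []).append(passages[i])
    linked = [g for g in groups.values() if len(g) > 1]
    return sorted(linked, key=len, reverse=True)


passages = [
    ("doc_a", "to be or not to be that is the question"),
    ("doc_b", "to be or not to be that is the question indeed"),
    ("doc_c", "completely unrelated text about patents and filing"),
]
result = group_similar(passages)
```

Each returned group could back a set of hypertext links between the documents that share the passage. The all-pairs comparison is used here only for brevity.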
Public/Granted literature
- US20090024606A1 Identifying and Linking Similar Passages in a Digital Text Corpus, Public/Granted day: 2009-01-22