Invention Grant
- Patent Title: Method and apparatus for detecting a table of contents and reference determination
- Patent Title (Chinese): 用于检测目录和参考确定的方法和装置 (Method and apparatus for detecting a table of contents and reference determination)
- Application No.: US11032814
- Application Date: 2005-01-10
- Publication No.: US08706475B2
- Publication Date: 2014-04-22
- Inventor: Herve Dejean, Jean-Luc Meunier, Olivier Fambon
- Applicant: Herve Dejean, Jean-Luc Meunier, Olivier Fambon
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F17/20
- IPC: G06F17/20; G06F17/27; G06F3/00; G06F7/00

Abstract:
In a method for identifying a table of contents in a document, an ordered sequence of text fragments is derived from the document. A table of contents is selected as a contiguous sub-sequence of the ordered sequence of text fragments satisfying the criteria: (i) entries defined by text fragments of the table of contents each have a link to a target text fragment having textual similarity with the entry; (ii) no target text fragment lies within the table of contents; and (iii) the target text fragments have an ascending ordering corresponding to an ascending ordering of the entries defining the target text fragments.
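The three criteria in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the token-overlap (Jaccard) similarity measure, the 0.5 threshold, the greedy choice of the earliest admissible target, the preference for the longest qualifying range, and the names `tokens`, `similar`, `link_targets`, and `find_toc` are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple


def tokens(text: str) -> set:
    """Lower-cased words with surrounding punctuation stripped (assumed tokenizer)."""
    stripped = (w.strip(".,:;()").lower() for w in text.split())
    return {w for w in stripped if w}


def similar(a: str, b: str, threshold: float = 0.5) -> bool:
    """Assumed similarity test: Jaccard overlap of the two token sets."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return False
    return len(ta & tb) / len(ta | tb) >= threshold


def link_targets(fragments: List[str], start: int, end: int,
                 threshold: float = 0.5) -> Optional[List[int]]:
    """Try to link every entry in fragments[start:end] to a target fragment.

    Returns the target indices, or None if some entry cannot be linked.
    """
    targets: List[int] = []
    prev = -1
    for i in range(start, end):
        target = None
        # Criterion (iii): search only after the previous target, so the
        # targets ascend in the same order as the entries.
        for j in range(prev + 1, len(fragments)):
            if start <= j < end:
                continue  # criterion (ii): no target inside the candidate TOC
            if similar(fragments[i], fragments[j], threshold):
                target = j  # criterion (i): textually similar target found
                break
        if target is None:
            return None
        targets.append(target)
        prev = target
    return targets


def find_toc(fragments: List[str], min_entries: int = 2,
             threshold: float = 0.5) -> Optional[Tuple[int, int, List[int]]]:
    """Return (start, end, targets) for the longest qualifying range, or None."""
    best: Optional[Tuple[int, int, List[int]]] = None
    n = len(fragments)
    for start in range(n):
        for end in range(start + min_entries, n + 1):
            targets = link_targets(fragments, start, end, threshold)
            if targets is None:
                # A longer range only adds entries and removes candidate
                # targets, so it cannot succeed where this one failed.
                break
            if best is None or end - start > best[1] - best[0]:
                best = (start, end, targets)
    return best


fragments = [
    "My Report",
    "Introduction",
    "Methods",
    "Results",
    "Introduction",
    "This report describes a study.",
    "Methods",
    "We measured things.",
    "Results",
    "Everything worked.",
]
print(find_toc(fragments))  # (1, 4, [4, 6, 8])
```

In this toy document, fragments 1 to 3 form the table of contents and link to the headings at positions 4, 6, and 8. The exhaustive search over ranges is quadratic in the number of fragments and is meant only to make the criteria concrete.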
Public/Granted literature
- US20060155703A1 Method and apparatus for detecting a table of contents and reference determination, Public/Granted day: 2006-07-13