Invention Grant
- Patent Title: Segmenting printed media pages into articles
- Patent Title (中): 将印刷媒体页面分割成文章
- Application No.: US13612072
- Application Date: 2012-09-12
- Publication No.: US08693779B1
- Publication Date: 2014-04-08
- Inventor: Ankur Jain, Vivek Sahasranaman, Shobhit Saxena, Krishnendu Chaudhury
- Applicant: Ankur Jain, Vivek Sahasranaman, Shobhit Saxena, Krishnendu Chaudhury
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Sterne, Kessler, Goldstein & Fox P.L.L.C.
- Main IPC: G06K9/34
- IPC: G06K9/34; G06K9/46; G06K9/66

Abstract:
Methods and systems are provided for segmenting printed media pages into individual articles quickly and efficiently. A printed-media-based image, which may include a variety of columns, headlines, images, and text, is input into a system comprising a block segmenter and an article segmenter system. The block segmenter identifies and produces blocks of textual content from the printed media image, while the article segmenter system uses a classifier algorithm to determine which blocks of textual content belong to one or more articles in that image. A method for segmenting printed media pages into individual articles is also presented.
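
The two-stage flow described in the abstract (blocks first, then grouping blocks into articles) can be illustrated with a minimal sketch. Everything named here is hypothetical: the `Block` type, the `same_article` rule, and the `segment_articles` function are illustrative assumptions, not the patented implementation. In particular, the abstract does not say what the classifier looks at, so a simple hand-written geometric rule stands in for it, and the blocks are assumed to have already been produced by some block segmenter.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Block:
    """A rectangular block of text, as a block segmenter might emit it (assumed shape)."""
    x: int      # left edge, in pixels
    y: int      # top edge, in pixels
    w: int      # width, in pixels
    h: int      # height, in pixels
    text: str


def same_article(a: Block, b: Block, max_gap: int = 20) -> bool:
    """Stand-in for the classifier: a hand-written rule, not the patented one.

    Links two blocks when they overlap horizontally and are stacked
    vertically with at most max_gap pixels between them.
    """
    overlap = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    upper, lower = (a, b) if a.y <= b.y else (b, a)
    gap = lower.y - (upper.y + upper.h)
    return overlap > 0 and 0 <= gap <= max_gap


def segment_articles(
    blocks: List[Block],
    classifier: Callable[[Block, Block], bool] = same_article,
) -> List[List[Block]]:
    """Group blocks into articles by merging every pair the classifier links."""
    parent = list(range(len(blocks)))  # union-find forest over block indices

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(blocks)):
        for j in range(i + 1, len(blocks)):
            if classifier(blocks[i], blocks[j]):
                parent[find(i)] = find(j)

    groups: Dict[int, List[Block]] = {}
    for i, block in enumerate(blocks):
        groups.setdefault(find(i), []).append(block)
    return list(groups.values())


if __name__ == "__main__":
    page = [
        Block(0, 0, 200, 30, "Headline one"),
        Block(0, 40, 200, 100, "Body of article one"),
        Block(300, 0, 200, 30, "Headline two"),
        Block(300, 35, 200, 80, "Body of article two"),
    ]
    for number, article in enumerate(segment_articles(page), start=1):
        print(number, [block.text for block in article])
```

With the sample page above, the two left-column blocks form one article and the two right-column blocks form another. Passing a different `classifier` callable, for example one backed by a trained model, would change the linking decision without touching the grouping step.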