Invention Grant
- Patent Title: System and method for identification and extraction of data
- Patent Title (中): 用于识别和提取数据的系统和方法
- Application No.: US14552099
- Application Date: 2014-11-24
- Publication No.: US09589183B2
- Publication Date: 2017-03-07
- Inventor: Jason Brown
- Applicant: Parchment
- Applicant Address: US AZ Scottsdale
- Assignee: PARCHMENT, INC.
- Current Assignee: PARCHMENT, INC.
- Current Assignee Address: US AZ Scottsdale
- Agency: Sheridan Ross P.C.
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/18 ; G06F17/30

Abstract:
A system and method for describing target data as a sequence of pattern elements and pattern element groups that together make up an overall target pattern are described. Pattern elements may use regular expression syntax along with other metadata that describes the behavior of the element. A pattern element group may be a collection of fully defined pattern elements, at least one of which must match for the overall pattern to match. Patterns contain both pattern elements and pattern element groups. The general process first performs optical character recognition (OCR) on the document, which produces a sequence of text tokens representing the lines of text on each page of the document. The search algorithm may then apply each defined pattern to the entire document, capturing and/or extracting the data that match each pattern's required elements and element groups.
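
The pattern structure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the `name` and `required` metadata fields, the line-by-line matching strategy, and the GPA sample data are all assumptions made for illustration. The OCR step is assumed to have already produced a list of text lines.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple, Union


@dataclass
class PatternElement:
    """A regular expression plus metadata (name/required are hypothetical fields)."""
    name: str
    regex: str
    required: bool = True

    def find(self, line: str, pos: int = 0) -> Optional[Tuple[str, int]]:
        # Search from `pos` so that elements match in sequence, left to right.
        m = re.compile(self.regex).search(line, pos)
        return (m.group(0), m.end()) if m else None


@dataclass
class PatternElementGroup:
    """A collection of elements; at least one member must match."""
    name: str
    elements: List[PatternElement]

    def find(self, line: str, pos: int = 0) -> Optional[Tuple[str, int]]:
        for element in self.elements:
            hit = element.find(line, pos)
            if hit is not None:
                return hit  # first member (in list order) that matches
        return None


@dataclass
class Pattern:
    """An ordered sequence of elements and element groups."""
    name: str
    parts: List[Union[PatternElement, PatternElementGroup]]

    def match(self, line: str) -> Optional[Dict[str, str]]:
        captured: Dict[str, str] = {}
        pos = 0
        for part in self.parts:
            hit = part.find(line, pos)
            if hit is None:
                # Groups have no `required` attribute and are always required.
                if getattr(part, "required", True):
                    return None
                continue  # optional element that did not match is skipped
            text, pos = hit
            captured[part.name] = text
        return captured


def extract(lines: List[str], patterns: List[Pattern]) -> List[Tuple[str, Dict[str, str]]]:
    """Apply every pattern to every OCR text line and collect the captures."""
    results = []
    for line in lines:
        for pattern in patterns:
            captured = pattern.match(line)
            if captured is not None:
                results.append((pattern.name, captured))
    return results


# Hypothetical OCR output and a pattern with one group and one element.
ocr_lines = [
    "Student: Jane Doe",
    "Cumulative GPA: 3.75",
    "Grade Point Average 3.20",
    "3.90 GPA",  # value precedes label, so the sequence does not match
]
gpa_pattern = Pattern(
    name="gpa",
    parts=[
        PatternElementGroup("label", [
            PatternElement("short_label", r"GPA"),
            PatternElement("long_label", r"Grade Point Average"),
        ]),
        PatternElement("value", r"\d\.\d{1,2}"),
    ],
)
results = extract(ocr_lines, [gpa_pattern])
```

In this sketch the group is satisfied by either spelling of the label, while the value element must appear after it on the same line. A real system would likely match across lines and pages and carry richer metadata per element.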
Public/Granted literature
- US20150146984A1 SYSTEM AND METHOD FOR IDENTIFICATION AND EXTRACTION OF DATA Public/Granted day: 2015-05-28