Invention Grant
- Patent Title: Performing optical character recognition using spatial information of regions within a structured document
-
Application No.: US15219888Application Date: 2016-07-26
-
Publication No.: US10013643B2Publication Date: 2018-07-03
- Inventor: Vijay Yellapragada , Peijun Chiang , Sreeneel K. Maddika
- Applicant: INTUIT INC.
- Applicant Address: US CA Mountain View
- Assignee: INTUIT INC.
- Current Assignee: INTUIT INC.
- Current Assignee Address: US CA Mountain View
- Agency: Patterson + Sheridan, LLP
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/62 ; G06T7/00

Abstract:
Techniques are disclosed for facilitating optical character recognition (OCR) by identifying one or more regions in an electronic document to perform the OCR. For example a method for identifying information in an electronic document includes obtaining a set of training documents for each template of a plurality of templates for the electronic document, extracting spatial attributes for at least a first label region and at least a first corresponding value region from the set, and training a classifier model based on the extracted spatial attributes, wherein the classifier model is used to identify the information in the electronic document. The spatial attributes represent a position of at least the first label region and at least the first value region within the electronic document.
Public/Granted literature
Information query