Invention Grant
- Patent Title: Automating creation of accurate OCR training data using specialized UI application
-
Application No.: US16111121Application Date: 2018-08-23
-
Publication No.: US10282604B2Publication Date: 2019-05-07
- Inventor: Eugene Krivopaltsev , Sreeneel K. Maddika , Vijay S. Yellapragada
- Applicant: INTUIT INC.
- Applicant Address: US CA Mountain View
- Assignee: Intuit, Inc.
- Current Assignee: Intuit, Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Patterson + Sheridan, LLP
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/46 ; G06K9/52 ; G06K9/62 ; G06T7/60 ; G06T11/60 ; G06T7/73 ; G06T7/13 ; G06T7/70 ; G06F3/0481 ; G06F17/21

Abstract:
Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.
Public/Granted literature
- US20180365487A1 AUTOMATING CREATION OF ACCURATE OCR TRAINING DATA USING SPECIALIZED UI APPLICATION Public/Granted day:2018-12-20
Information query