- Patent Title: Online training data generation for optical character recognition
-
Application No.: US16843811Application Date: 2020-04-08
-
Publication No.: US11295155B2Publication Date: 2022-04-05
- Inventor: Ting Xu
- Applicant: KONICA MINOLTA BUSINESS SOLUTIONS U.S.A., INC.
- Applicant Address: US CA San Mateo
- Assignee: KONICA MINOLTA BUSINESS SOLUTIONS U.S.A., INC.
- Current Assignee: KONICA MINOLTA BUSINESS SOLUTIONS U.S.A., INC.
- Current Assignee Address: US CA San Mateo
- Agency: Squire Patton Boggs (US) LLP
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06K15/02 ; G06K9/34 ; G06T7/11 ; G06T3/00 ; G06T5/00 ; G06T1/20

Abstract:
A method and system to generate training data for a deep learning model in memory instead of loading pre-generated data from disk storage. A corpus may be stored as lines of text. The lines of text can be manipulated in the memory of a central processing unit (CPU) of a computing system, using asynchronous multi-processing, in parallel with a training process being conducted on the system's graphics processing unit (GPU). With such an approach, for a given line of text, it is possible to take advantage of different fonts and different types of image augmentation without having to put the images in disk storage for subsequent retrieval. Consequently, the same line of text can be used to generate different training images for use in different epochs, providing more variability in training data (no training sample is trained on more than once). A single training corpus may yield many different training data sets. In one aspect, the model being trained is a deep learning model, which may be one of several different types of neural networks. The training enables the deep learning model to perform OCR on line images.
Public/Granted literature
- US20210319246A1 ONLINE TRAINING DATA GENERATION FOR OPTICAL CHARACTER RECOGNITION Public/Granted day:2021-10-14
Information query