Invention Grant
- Patent Title: Layout-aware, scalable recognition system
-
Application No.: US16297388Application Date: 2019-03-08
-
Publication No.: US11928875B2Publication Date: 2024-03-12
- Inventor: Yan Wang , Ye Wu , Arun Sacheti
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee Address: US WA Redmond
- Agency: Calfee, Halter & Griswold LLP
- Main IPC: G06F16/387
- IPC: G06F16/387 ; G06F16/31 ; G06F16/35 ; G06F18/2411 ; G06N20/10 ; G06V10/70 ; G06V10/75 ; G06V10/80 ; G06V20/62 ; G06V30/144 ; G06V30/148 ; G06V30/18 ; G06V30/19 ; G06V30/413 ; G06V30/414 ; G06V30/10

Abstract:
Described herein is a mechanism for visual recognition of items or visual search using Optical Character Recognition (OCR) of text in images. Recognized OCR blocks in an image comprise position information and recognized text. The embodiments utilize a location-aware feature vector created using the position and recognized information in each recognized block. The location-aware features of the feature vector utilize position information associated with the block to calculate a weight for the block. The recognized text is used to construct a tri-character gram frequency, inverse document frequency (TGF-IDP) metric using tri-character grams extracted from the recognized text. Features in location-aware feature vector for the block are computed by multiplying the weight and the corresponding TGF-IDF metric. The location-aware feature vector for the image is the sum of the location-aware feature vectors for the individual blocks.
Public/Granted literature
- US20200285878A1 LAYOUT-AWARE, SCALABLE RECOGNITION SYSTEM Public/Granted day:2020-09-10
Information query