Invention Grant
- Patent Title: Method, apparatus, and computer-readable medium for determining a data domain associated with data
- Application No.: US15666065
- Application Date: 2017-08-01
- Publication No.: US11669574B2
- Publication Date: 2023-06-06
- Inventor: Igor Balabine
- Applicant: Informatica LLC
- Applicant Address: US CA Redwood City
- Assignee: Informatica LLC
- Current Assignee: Informatica LLC
- Current Assignee Address: US CA Redwood City
- Agency: Reed Smith LLP
- Agent: Amardeep S. Grewal
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F16/93 ; G06F16/31 ; G06F16/33 ; G06F40/30 ; G06F40/205 ; G06F40/211 ; G06F40/216 ; G06F40/284

Abstract:
A system, method and computer-readable medium for determining a data domain associated with data, including parsing a document to generate one or more document indexes corresponding to the document, the one or more document indexes comprising a plurality of index terms and location information, determining a syntactic confidence score corresponding to a non-dictionary term in the plurality of index terms based on a syntactic analysis of the non-dictionary term, determining a proximity confidence score corresponding to the non-dictionary term based on the location information and at least one proximity query associated with the non-dictionary term and one or more other terms in the document index, determining a semantic confidence score based on a plurality of dictionary terms in the plurality of index terms, and determining an overall confidence score corresponding to the non-dictionary term based on the syntactic confidence score, the proximity confidence score, and the semantic confidence score.
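
The abstract states that the overall confidence score is based on the syntactic, proximity, and semantic confidence scores, but it does not say how the three are combined. As a minimal sketch only, assuming each score lies in [0, 1] and assuming a weighted average as the combining rule, the final step might look like the following. The names `ConfidenceScores` and `overall_confidence` and the default weights are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class ConfidenceScores:
    """Per-term scores named in the abstract; the [0, 1] range is an assumption."""
    syntactic: float   # from syntactic analysis of the non-dictionary term
    proximity: float   # from location information and proximity queries
    semantic: float    # from dictionary terms found in the document index


def overall_confidence(scores, weights=(1.0, 1.0, 1.0)):
    """Combine the three scores with a weighted average.

    The weighted average is an illustrative assumption; the abstract only
    says the overall score is "based on" the three component scores.
    """
    parts = (scores.syntactic, scores.proximity, scores.semantic)
    for value in parts:
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"score out of range [0, 1]: {value}")
    if len(weights) != 3 or any(w < 0 for w in weights):
        raise ValueError("weights must be three non-negative numbers")
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be positive")
    return sum(w * v for w, v in zip(weights, parts)) / total


if __name__ == "__main__":
    # Hypothetical scores for one non-dictionary term.
    term_scores = ConfidenceScores(syntactic=0.9, proximity=0.6, semantic=0.3)
    print(overall_confidence(term_scores))
    # Give the syntactic score twice the influence of the other two.
    print(overall_confidence(term_scores, weights=(2.0, 1.0, 1.0)))
```

A weighted average keeps the result in the same range as its inputs and lets one signal be emphasized over the others. Other combining rules, such as a product or a maximum, would fit the abstract's wording equally well.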
Public/Granted literature
- US20190042568A1 METHOD, APPARATUS, AND COMPUTER-READABLE MEDIUM FOR DETERMINING A DATA DOMAIN ASSOCIATED WITH DATA
- Public/Granted day: 2019-02-07