- Patent Title: Discovery of related entities in a master data management system
-
Application No.: US13954155Application Date: 2013-07-30
-
Publication No.: US10042911B2Publication Date: 2018-08-07
- Inventor: Prasad M. Deshpande , Salil R. Joshi , Mukesh Kumar Mohania , Karin Murthy , Scott Schumacher , Bruhathi H. Sundarmurthy
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporations
- Current Assignee: International Business Machines Corporations
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates LLC
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Methods and arrangements for discovering entity types for a set of records. A set of records is input, with each record comprising attributes with associated attribute values. The records are grouped into candidate entity types in view of at least one of: the attribute values of the records, at least one domain ontology and at least one dimension hierarchy. An interestingness measure of each candidate entity type is calculated, via estimating interestingness based on at least one factor selected from the group consisting of: a correlation between attribute values of records, a number of attributes, a log of queries issued to a server, and an average group size for candidate entity types. At least one candidate entity type is validated based on the calculated interestingness measures. Other variants and embodiments are broadly contemplated herein.
Public/Granted literature
- US20150039611A1 DISCOVERY OF RELATED ENTITIES IN A MASTER DATA MANAGEMENT SYSTEM Public/Granted day:2015-02-05
Information query