Invention Grant
- Patent Title: Associating data records in multiple languages
- Patent Title (中): 以多种语言关联数据记录
-
Application No.: US12239380Application Date: 2008-09-26
-
Publication No.: US08417702B2Publication Date: 2013-04-09
- Inventor: Douglas Scott Harger , Scott Schumacher
- Applicant: Douglas Scott Harger , Scott Schumacher
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Edell, Shapiro & Finnan, LLC
- Agent Elissa Y. Wang
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
Embodiments disclosed herein provide a system and method for associating data records in multiple languages within a single hub. As a record comes in from an information source coupled to the hub, it is associated with a particular language at a core layer. The hub maps each language one-to-one to a member type. For each data record of a particular member type, unique derivation code is utilized to perform standardization and bucketing at a derived layer. A weight may be used to balance the richness of languages so that data records in different languages can have the same statistical meaning. Since attributes are standardized with respect to a language of a data record, appropriate languages or script can be passed along with the data record. The hub can then match the data record to the optimum algorithm(s) for entity processing at an entity layer.
Public/Granted literature
- US20090089332A1 METHOD AND SYSTEM FOR ASSOCIATING DATA RECORDS IN MULTIPLE LANGUAGES Public/Granted day:2009-04-02
Information query