Invention Grant
- Patent Title: Record linkage based on a trained blocking scheme
- Patent Title (中): 基于训练有素的阻塞方案记录链接
- Application No.: US13372360
- Application Date: 2012-02-13
- Publication No.: US08843492B2
- Publication Date: 2014-09-23
- Inventor: Yunbo Cao, Chin-Yew Lin, Pei Yue, Zhiyuan Chen
- Applicant: Yunbo Cao, Chin-Yew Lin, Pei Yue, Zhiyuan Chen
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agent: Dan Choi; Judy Yee; Micky Minhas
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Some implementations disclosed herein provide techniques and arrangements to train a blocking scheme using both labeled data and unlabeled data. For example, training the blocking scheme may include iteratively: learning a conjunction, identifying first matches in the labeled data and the unlabeled data that are uncovered by the conjunction, and identifying second matches in the labeled data and the unlabeled data that are covered by the conjunction. The conjunctions learned across the iterations may be combined using a disjunction. A search engine may use the blocking scheme when searching for records that match an entity.
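
The iterative training described in the abstract resembles a sequential-covering loop, which can be sketched roughly as follows. This is a minimal illustration and not the patented method: the predicate functions, the record fields (`name`, `zip`), the sample records, and in particular the scoring rule (covered labeled matches divided by one plus covered unlabeled pairs) are all assumptions introduced here, since the abstract does not specify how the unlabeled data is weighed.

```python
from itertools import combinations

# Hypothetical blocking predicates: each compares one aspect of two records.
def same_zip(a, b):
    return a["zip"] == b["zip"]

def same_first_letter(a, b):
    return a["name"][:1].lower() == b["name"][:1].lower()

def same_last_token(a, b):
    return a["name"].split()[-1].lower() == b["name"].split()[-1].lower()

PREDICATES = {
    "same_zip": same_zip,
    "same_first_letter": same_first_letter,
    "same_last_token": same_last_token,
}

def covers(conjunction, pair):
    """A conjunction covers a pair when every predicate in it holds."""
    a, b = pair
    return all(PREDICATES[name](a, b) for name in conjunction)

def learn_conjunction(matches, unlabeled_pairs, max_terms=2):
    """Pick the conjunction with the best score (assumed scoring rule):
    many labeled matches covered, few unlabeled pairs covered."""
    best, best_score = None, 0.0
    for k in range(1, max_terms + 1):
        for conj in combinations(sorted(PREDICATES), k):
            covered_matches = sum(covers(conj, p) for p in matches)
            if covered_matches == 0:
                continue
            covered_unlabeled = sum(covers(conj, p) for p in unlabeled_pairs)
            score = covered_matches / (1 + covered_unlabeled)
            if score > best_score:
                best, best_score = conj, score
    return best

def train_blocking_scheme(matches, unlabeled_pairs, max_iters=10):
    """Learn one conjunction per iteration; the returned list is their disjunction."""
    scheme, remaining = [], list(matches)
    for _ in range(max_iters):
        if not remaining:
            break
        conj = learn_conjunction(remaining, unlabeled_pairs)
        if conj is None:
            break
        scheme.append(conj)
        # Matches still uncovered are carried into the next iteration.
        remaining = [p for p in remaining if not covers(conj, p)]
    return scheme

def in_same_block(scheme, a, b):
    """Two records are candidate matches if any conjunction covers them."""
    return any(covers(conj, (a, b)) for conj in scheme)

# Made-up sample records, for illustration only.
r1 = {"name": "John Smith", "zip": "98052"}
r2 = {"name": "J. Smith", "zip": "98052"}
r3 = {"name": "John Smith", "zip": "98004"}
r4 = {"name": "Mary Jones", "zip": "10001"}
r5 = {"name": "M. Johns", "zip": "10001"}
r6 = {"name": "Mark Jensen", "zip": "98052"}

labeled_matches = [(r1, r2), (r1, r3), (r4, r5)]
unlabeled_pairs = [(r1, r6), (r2, r6), (r4, r6), (r3, r5), (r5, r6)]

scheme = train_blocking_scheme(labeled_matches, unlabeled_pairs)
print(scheme)  # [('same_last_token',), ('same_first_letter', 'same_zip')]
```

In this toy run the first iteration learns a single-predicate conjunction that covers two of the three labeled matches, and the second iteration learns a two-predicate conjunction for the match left uncovered. A search component could then call `in_same_block(scheme, a, b)` to restrict detailed comparison to candidate pairs.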
Public/Granted literature:
- US20130212103A1 RECORD LINKAGE BASED ON A TRAINED BLOCKING SCHEME Public/Granted day: 2013-08-15