Invention Grant
- Patent Title: Selection of domain-adapted translation subcorpora
- Patent Title (中): 选择领域适应翻译子公司
-
Application No.: US13022633Application Date: 2011-02-08
-
Publication No.: US08838433B2Publication Date: 2014-09-16
- Inventor: Amittai Axelrod , Jianfeng Gao , Xiaodong He
- Applicant: Amittai Axelrod , Jianfeng Gao , Xiaodong He
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agent Judy Yee; Sandy Swain; Micky Minhas
- Main IPC: G06F17/28
- IPC: G06F17/28

Abstract:
An architecture is discussed that provides the capability to subselect the most relevant data from an out-domain corpus to use either in isolation or in combination conjunction with in-domain data. The architecture is a domain adaptation for machine translation that selects the most relevant sentences from a larger general-domain corpus of parallel translated sentences. The methods for selecting the data include monolingual cross-entropy measure, monolingual cross-entropy difference, bilingual cross entropy, and bilingual cross-entropy difference. A translation model is trained on both the in-domain data and an out-domain subset, and the models can be interpolated together to boost performance on in-domain translation tasks.
Public/Granted literature
- US20120203539A1 SELECTION OF DOMAIN-ADAPTED TRANSLATION SUBCORPORA Public/Granted day:2012-08-09
Information query