Invention Grant
- Patent Title: Automated computation of semantic similarity of pairs of named entity phrases using electronic document corpora as background knowledge
- Patent Title (中): 使用电子文档语料库作为背景知识自动计算命名实体短语对的语义相似度
-
Application No.: US12246894Application Date: 2008-10-07
-
Publication No.: US08170969B2Publication Date: 2012-05-01
- Inventor: Hans Roettger , Cai-Nicolas Ziegler
- Applicant: Hans Roettger , Cai-Nicolas Ziegler
- Applicant Address: DE Munich
- Assignee: Siemens Aktiengesellschaft
- Current Assignee: Siemens Aktiengesellschaft
- Current Assignee Address: DE Munich
- Agency: Staas & Halsey LLP
- Priority: EP08014457 20080813
- Main IPC: G06F17/00
- IPC: G06F17/00

Abstract:
An overall semantic similarity score value between pairs of named entities in a text corpus is obtained by calculating for at least one pair of named entities a plurality of corresponding pair similarity score values according to a first and at least a second classifier using electronic information sources. Each pair similarity score value of the pair of named entities per classifier is normalized by calculating a rank list per classifier, for example, for each named entity. The rank list holds each pair of named entities of the text corpus, wherein a rank of each pair of named entities within the rank list reflects the respective pair similarity score value. Further an arithmetic mean of the normalized pair similarity score value of each pair of named entities is calculated to provide the overall semantic similarity score value.
Public/Granted literature
Information query