Invention Grant
- Patent Title: Fast text character set recognition
- Patent Title (中): 快速文本字符集识别
-
Application No.: US10909262Application Date: 2004-07-30
-
Publication No.: US07865355B2Publication Date: 2011-01-04
- Inventor: Ming Xu , Nobuyoshi Mori
- Applicant: Ming Xu , Nobuyoshi Mori
- Applicant Address: DE Walldorf
- Assignee: SAP Aktiengesellschaft
- Current Assignee: SAP Aktiengesellschaft
- Current Assignee Address: DE Walldorf
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/20
- IPC: G06F17/20 ; G06F17/27

Abstract:
Methods and apparatus, including computer program products, for identifying a language corresponding to a string of data include receiving a data string and dividing the data string into coded character sequences for each of a plurality of languages. A length of one or more coded character sequences varies among different languages for coded character sequences having a particular number of characters. The coded character sequences are analyzed to calculate, for each of the plurality of languages, a probability that the data string corresponds to language. The calculated probabilities are compared among the languages, and a language is identified as corresponding to the data string based on the comparison.
Public/Granted literature
- US20060025988A1 Fast text character set recognition Public/Granted day:2006-02-02
Information query