Invention Grant
US08666976B2 Methods and systems for implementing approximate string matching within a database (status: in force)

Methods and systems for implementing approximate string matching within a database
Abstract:
A computer-based method for matching a candidate character string against a plurality of character string records stored in a database is described. The method includes performing a clustering operation on at least a portion of the plurality of character string records. The clustering operation generates a plurality of clusters, each cluster comprising a plurality of character strings from the plurality of character string records, wherein the character strings in each cluster are determined to be similar to one another based on at least one characteristic of the character strings. The method also includes generating a set of reference character strings selected from the plurality of character strings in each cluster, generating an n-gram representation for one of the reference character strings in the set, and generating an n-gram representation for the candidate character string.
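The steps named in the abstract (clustering, reference selection, n-gram generation) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the space-padded character trigrams, the Jaccard similarity measure, the greedy single-pass clustering, the 0.4 threshold, and the choice of the first cluster member as the reference string are all assumptions made for illustration. The final comparison of the candidate against the reference strings is likewise an assumed use of the n-gram representations, since the abstract stops at generating them.

```python
def ngrams(text, n=3):
    """Return the set of character n-grams of `text` (lowercased, space-padded)."""
    padded = " " * (n - 1) + text.lower() + " " * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 1.0 when both are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster(records, threshold=0.4, n=3):
    """Greedy single-pass clustering (an assumed strategy): a record joins the
    first cluster whose first member is similar enough, else starts a new one."""
    clusters = []
    for record in records:
        grams = ngrams(record, n)
        for members in clusters:
            if jaccard(grams, ngrams(members[0], n)) >= threshold:
                members.append(record)
                break
        else:
            # No existing cluster was close enough.
            clusters.append([record])
    return clusters


def reference_strings(clusters):
    """Select one reference string per cluster (here simply the first member)."""
    return [members[0] for members in clusters]


def best_reference(candidate, references, n=3):
    """Return the reference string whose n-gram set is closest to the candidate's."""
    cand = ngrams(candidate, n)
    return max(references, key=lambda ref: jaccard(cand, ngrams(ref, n)))
```

In this sketch a candidate is compared only against one reference string per cluster rather than against every stored record, which is the usual motivation for clustering before matching.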