Efficient tagging of content items using multi-granular embeddings

Invention Grant

US11947571B2 Efficient tagging of content items using multi-granular embeddings 有权

Please log in to see more content

Patent Title: Efficient tagging of content items using multi-granular embeddings
Application No.: US17235325

Application Date: 2021-04-20
Publication No.: US11947571B2

Publication Date: 2024-04-02
Inventor: Fares Hedayati , Young Jin Yun , Sneha Chaudhari , Mahesh Subhash Joshi , Gungor Polatkan , Gautam Borooah
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: NICHOLSON DE VOS WEBSTER & ELLIOTT LLP
Main IPC: G06F16/28
IPC: G06F16/28 ; G06N20/00

Efficient tagging of content items using multi-granular embeddings

Abstract:

Efficient tagging of content items using content embeddings are provided. In one technique, multiple content items are stored a content embedding for content item is stored. Entity names are also stored along with an entity name embedding for each entity name. For each content item, (1) multiple content embeddings that are associated with the content item are identified; (2) a subset of the entity names is identified; and (3) for each entity name in the subset, (i) an embedding of the entity name is identified, (ii) similarity measures are generated based on the entity name embedding and the multiple content embeddings, (iii), a distribution of the similarity measures is generated, (iv) feature values are generated based on the distribution, (v) the feature values are input into a machine-learned classifier, and (vi) based on output from the classifier, it is determined whether to associate the entity name with the content item.

Public/Granted literature

US20220335066A1 EFFICIENT TAGGING OF CONTENT ITEMS USING MULTI-GRANULAR EMBEDDINGS Public/Granted day:2022-10-20

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F16/00	信息检索；数据库结构；文件系统结构
G06F16/20	.•结构化数据，例如关系型数据
G06F16/28	..••以数据库模型为特征的数据库，例如，关系或对象模型