- Patent Title: Efficient tagging of content items using multi-granular embeddings
-
Application No.: US17235325Application Date: 2021-04-20
-
Publication No.: US11947571B2Publication Date: 2024-04-02
- Inventor: Fares Hedayati , Young Jin Yun , Sneha Chaudhari , Mahesh Subhash Joshi , Gungor Polatkan , Gautam Borooah
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: NICHOLSON DE VOS WEBSTER & ELLIOTT LLP
- Main IPC: G06F16/28
- IPC: G06F16/28 ; G06N20/00

Abstract:
Efficient tagging of content items using content embeddings are provided. In one technique, multiple content items are stored a content embedding for content item is stored. Entity names are also stored along with an entity name embedding for each entity name. For each content item, (1) multiple content embeddings that are associated with the content item are identified; (2) a subset of the entity names is identified; and (3) for each entity name in the subset, (i) an embedding of the entity name is identified, (ii) similarity measures are generated based on the entity name embedding and the multiple content embeddings, (iii), a distribution of the similarity measures is generated, (iv) feature values are generated based on the distribution, (v) the feature values are input into a machine-learned classifier, and (vi) based on output from the classifier, it is determined whether to associate the entity name with the content item.
Public/Granted literature
- US20220335066A1 EFFICIENT TAGGING OF CONTENT ITEMS USING MULTI-GRANULAR EMBEDDINGS Public/Granted day:2022-10-20
Information query