Systems and methods for training an information extraction transformer model architecture

Invention Grant

US11861884B1 Systems and methods for training an information extraction transformer model architecture (Active)

Patent Title: Systems and methods for training an information extraction transformer model architecture
Application No.: US18297708

Application Date: 2023-04-10
Publication No.: US11861884B1

Publication Date: 2024-01-02
Inventor: Karelia Del Carmen Pena Pena, Tharathorn Rimchala, Peter Lee Frick, Tak Yiu Daniel Li
Applicant: Intuit, Inc.
Applicant Address: US CA Mountain View
Assignee: Intuit, Inc.
Current Assignee: Intuit, Inc.
Current Assignee Address: US CA Mountain View
Agency: Dinsmore & Shohl LLP
Main IPC: G06V10/80
IPC: G06V10/80; G06V30/413; G06V30/19

Abstract:

Certain aspects of the disclosure provide systems and methods for training an information extraction transformer model architecture directed to pre-training a first multimodal transformer model on an unlabeled dataset, training a second multimodal transformer model on a first labeled dataset to perform a key information extraction task, processing the unlabeled dataset with the second multimodal transformer model to generate pseudo-labels for the unlabeled dataset, training the first multimodal transformer model based on a second labeled dataset comprising one or more labels, the pseudo-labels generated, or combinations thereof to generate a third multimodal transformer model, generating updated pseudo-labels based on label completion predictions from the third multimodal transformer model, and training the third multimodal transformer model using a noise-aware loss function and the updated pseudo-labels to generate an updated third multimodal transformer model.
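
The abstract names two steps that are easy to illustrate in isolation: combining labels with pseudo-labels into one training set, and training with a noise-aware loss. The sketch below is only an illustration under stated assumptions. The abstract does not define the loss, so the version here simply down-weights pseudo-labeled tokens in a negative log-likelihood, which is one common way to account for label noise and not necessarily the patented formulation. The function names `merge_labels` and `noise_aware_loss`, the `pseudo_weight` parameter, and the dictionary-based data layout are all hypothetical.

```python
import math


def merge_labels(gold, pseudo):
    """Combine human labels with pseudo-labels (hypothetical helper).

    gold, pseudo: dict mapping token_id -> label string.
    Returns token_id -> (label, is_gold); a human label wins when both exist.
    """
    merged = {}
    for token_id, label in pseudo.items():
        merged[token_id] = (label, False)  # False: produced by a model
    for token_id, label in gold.items():
        merged[token_id] = (label, True)   # True: human-annotated
    return merged


def noise_aware_loss(probs, merged, pseudo_weight=0.5):
    """Weighted negative log-likelihood (an assumed form of noise awareness).

    probs: token_id -> {label: predicted probability}.
    Pseudo-labeled tokens contribute with weight `pseudo_weight` because
    they are more likely to be wrong than human labels.
    """
    total = 0.0
    weight_sum = 0.0
    for token_id, (label, is_gold) in merged.items():
        weight = 1.0 if is_gold else pseudo_weight
        # Clamp to avoid log(0) when the model assigns no mass to the label.
        p = max(probs[token_id].get(label, 0.0), 1e-12)
        total += -weight * math.log(p)
        weight_sum += weight
    return total / weight_sum if weight_sum else 0.0


if __name__ == "__main__":
    gold = {0: "TOTAL"}
    pseudo = {0: "DATE", 1: "VENDOR"}
    merged = merge_labels(gold, pseudo)
    probs = {0: {"TOTAL": 1.0}, 1: {"VENDOR": 0.5}}
    print(merged)
    print(noise_aware_loss(probs, merged))
```

In this toy layout, a wrong pseudo-label costs half as much as a wrong human label, so errors inherited from the model that generated the pseudo-labels pull less on the model being trained.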

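IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V10/00	Arrangements for image or video recognition or understanding (character recognition in images or video G06V30/10)
G06V10/70	.using pattern recognition or machine learning (optical pattern recognition or electronic computation G06V10/88)
G06V10/77	..processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA], independent component analysis [ICA] or self-organising maps [SOM]; blind source separation
G06V10/80	...fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level (multimodal speaker identification or verification G10L17/10)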