Methods and systems for generating mobile enabled extraction models

    公开(公告)号:US12210828B2

    公开(公告)日:2025-01-28

    申请号:US18630990

    申请日:2024-04-09

    Applicant: INTUIT INC.

    Abstract: A computing system generates a plurality of training data sets for generating the NLP model. The computing system trains a teacher network to extract and classify tokens from a document. The training includes a pre-training stage where the teacher network is trained to classify generic data in the plurality of training data sets and a fine-tuning stage where the teacher network is trained to classify targeted data in the plurality of training data sets. The computing system trains a student network to extract and classify tokens from a document by distilling knowledge learned by the teacher network during the fine-tuning stage from the teacher network to the student network. The computing system outputs the NLP model based on the training. The computing system causes the NLP model to be deployed in a remote computing environment.

    Methods and systems for generating mobile enabled extraction models

    公开(公告)号:US11977842B2

    公开(公告)日:2024-05-07

    申请号:US17246277

    申请日:2021-04-30

    Applicant: INTUIT INC.

    CPC classification number: G06F40/284 G06N3/045 G06N3/08

    Abstract: A computing system generates a plurality of training data sets for generating the NLP model. The computing system trains a teacher network to extract and classify tokens from a document. The training includes a pre-training stage where the teacher network is trained to classify generic data in the plurality of training data sets and a fine-tuning stage where the teacher network is trained to classify targeted data in the plurality of training data sets. The computing system trains a student network to extract and classify tokens from a document by distilling knowledge learned by the teacher network during the fine-tuning stage from the teacher network to the student network. The computing system outputs the NLP model based on the training. The computing system causes the NLP model to be deployed in a remote computing environment.

    MACHINE LEARNING MODEL-AGNOSTIC CONFIDENCE CALIBRATION SYSTEM AND METHOD

    公开(公告)号:US20220351088A1

    公开(公告)日:2022-11-03

    申请号:US17246383

    申请日:2021-04-30

    Applicant: Intuit Inc.

    Abstract: A method may include extracting, from a document, a first key-value pair including a key and a first value and corresponding to a first confidence score, extracting a second key-value pair including the key and a second value corresponding to a second confidence score, classifying a first match probability for the first key-value pair and a second match probability for the second key-value pair, generating a first calibrated confidence score for the first confidence score and a second calibrated confidence score for the second confidence score by transforming, using precision lookup tables constructed from training records, the first match probability to the first calibrated confidence score and the second match probability to second calibrated confidence score, selecting, using the first and second calibrated confidence scores, one of the first key-value pair and the second key-value pair, and presenting, in a graphical user interface (GUI), the selected key-value pair.

Patent Agency Ranking