Invention Grant
- Patent Title: Dataset adaptation for high-performance in specific natural language processing tasks
-
Application No.: US15852167Application Date: 2017-12-22
-
Publication No.: US10942954B2Publication Date: 2021-03-09
- Inventor: David Martinez Iraola , Sheng Hua Bao , Donna K. Byron , Priscilla Moraes
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Patterson + Sheridan, LLP
- Main IPC: G06F16/332
- IPC: G06F16/332 ; G06N20/00 ; G06F16/33

Abstract:
Systems, methods, and computer program products to perform an operation comprising identifying a first available dataset having a degree of similarity to a received input dataset that exceeds a similarity threshold, determining, based on a plurality of features of the first available dataset and a plurality of features of the input dataset, a set of recommendations for transforming the input dataset, and transforming a text of the input dataset based on the set of recommendations and to optimize the input dataset for processing by a natural language processing (NLP) algorithm.
Public/Granted literature
- US20190197128A1 DATASET ADAPTATION FOR HIGH-PERFORMANCE IN SPECIFIC NATURAL LANGUAGE PROCESSING TASKS Public/Granted day:2019-06-27
Information query