Invention Grant
- Patent Title: Method and system for long-form answer extraction based on combination of sentence index generation techniques
-
Application No.: US18470657Application Date: 2023-09-20
-
Publication No.: US12111856B2Publication Date: 2024-10-08
- Inventor: Anumita Dasguptabandyopadhyay , Prabir Mallick , Tapas Nayak , Indrajit Bhattacharya , Sangameshwar Suryakant Patil
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: Tata Consultancy Services Limited
- Current Assignee: Tata Consultancy Services Limited
- Current Assignee Address: IN Mumbai
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
- Priority: IN 2221058931 2022.10.14
- Main IPC: G06F16/30
- IPC: G06F16/30 ; G06F16/31 ; G06F16/332

Abstract:
This disclosure relates generally to long-form answer extraction and, more particularly, to long-form answer extraction based on combination of sentence index generation techniques. Existing answer extractions techniques have achieved significant progress for extractive short answers; however, less progress has been made for long form questions that require explanations. Further the state-of-art long-answer extractions techniques result in poorer long-form answers or not address sparsity which becomes an issue longer contexts. Additionally, pre-trained generative sequence-to-sequence models are gaining popularity for factoid answer extraction tasks. Hence the disclosure proposes a long-form answer extraction based on several steps including training a set of generative sequence-to-sequence models comprising a sentence indices generation model and a sentence index spans generation. The trained set of generative sequence-to-sequence models is further utilized for model long-form answer extraction based on a union of several sentence index generation techniques comprising a sentence indices and a sentence index spans.
Public/Granted literature
Information query