Invention Grant
- Patent Title: NLP-based context-aware log mining for troubleshooting
-
Application No.: US16437989Application Date: 2019-06-11
-
Publication No.: US11409754B2Publication Date: 2022-08-09
- Inventor: Giacomo Domeniconi , Eun Kyung Lee , Alessandro Morari
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: F. Chau & Associates, LLC
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F16/2458 ; G06F17/18 ; G06N3/04 ; G06F40/205 ; G06F40/284

Abstract:
A method for context-aware data mining of a text document includes receiving a list of words parsed and preprocessed from an input query; computing a related distributed embedding representation for each word in the list of words using a word embedding model of the text document being queried; aggregating the related distributed embedding representations of all words in the list of words to represent the input query with a single embedding, by using one of an average of all the related distributed embedding representations or a maximum of all the related distributed embedding representations; retrieving a ranked list of document segments of N lines that are similar to the aggregated word embedding representation of the query, where N is a positive integer provided by the user; and returning the list of retrieved segments to a user.
Information query