Invention Application
- Patent Title: INTERACTIVE IDENTIFICATION OF SIMILAR SQL QUERIES
- Application No.: US15495397
- Application Date: 2017-04-24
- Publication No.: US20170308592A1
- Publication Date: 2017-10-26
- Inventor: Rituparna Agrawal, Anupam Singh, Prithviraj Pandian
- Applicant: Cloudera, Inc.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Systems and methods for very fast grouping of “similar” SQL queries according to user-supplied similarity criteria are disclosed. The user-supplied similarity criteria include a threshold quantifying the degree of similarity between SQL queries and common artifacts included in the queries. A similarity-characterizing data structure is disclosed that allows for the very fast grouping of “similar” SQL queries. Because the computation is distributed among multiple compute nodes, a small cluster of compute nodes takes a short time to compute the similarity-characterizing data on a workload of tens of millions of queries. The user can supply the similarity criteria through a UI or a command line tool. Furthermore, in some embodiments, the user can adjust the degree of similarity by supplying new similarity criteria. Accordingly, the system can display, in real time or near real time, updated SQL groupings corresponding to the newly supplied similarity criteria using the originally computed similarity-characterizing data structure.
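
The two-phase idea in the abstract (compute similarity data once, then regroup quickly whenever the threshold changes) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the application: the use of Jaccard similarity over sets of table names as the "common artifacts", the sorted pair list standing in for the similarity-characterizing data structure, the transitive (single-linkage) grouping rule, and the names `jaccard`, `precompute_similarities`, and `group_queries`. The sketch runs on a single machine and does not attempt the distributed computation the abstract mentions.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets (defined as 1.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def precompute_similarities(artifacts):
    """Expensive step, done once: score every pair of queries.

    `artifacts` maps a query id to the set of artifacts (assumed here to be
    table names) that the query references. Returns a list of
    (score, id_a, id_b) tuples sorted from most to least similar.
    """
    pairs = [
        (jaccard(artifacts[a], artifacts[b]), a, b)
        for a, b in combinations(sorted(artifacts), 2)
    ]
    pairs.sort(key=lambda p: p[0], reverse=True)
    return pairs


def group_queries(query_ids, pairs, threshold):
    """Cheap step, repeated per threshold: union-find over qualifying pairs."""
    parent = {q: q for q in query_ids}

    def find(q):
        while parent[q] != q:
            parent[q] = parent[parent[q]]  # path halving
            q = parent[q]
        return q

    for score, a, b in pairs:
        if score < threshold:
            break  # pairs are sorted, so no later pair can qualify
        parent[find(a)] = find(b)

    groups = {}
    for q in query_ids:
        groups.setdefault(find(q), []).append(q)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical workload: query ids mapped to the tables they reference.
artifacts = {
    "q1": {"orders", "customers"},
    "q2": {"orders", "customers", "items"},
    "q3": {"orders"},
    "q4": {"logs"},
}
pairs = precompute_similarities(artifacts)      # computed once
strict = group_queries(artifacts, pairs, 0.6)   # user's first threshold
relaxed = group_queries(artifacts, pairs, 0.5)  # user relaxes the threshold
print(strict)
print(relaxed)
```

In this sketch, changing the threshold only repeats the inexpensive `group_queries` pass over the precomputed pairs, which is the property that makes interactive adjustment plausible. An all-pairs comparison would not scale to tens of millions of queries, so a real system would need a more compact structure and distributed computation.
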
Public/Granted literature
- US10599664B2 Interactive identification of similar SQL queries (Public/Granted day: 2020-03-24)