Invention Grant
US08135667B2 System, method, and computer-readable medium that facilitate in-database analytics with supervised data discretization
In force
- Patent Title: System, method, and computer-readable medium that facilitate in-database analytics with supervised data discretization
- Application No.: US12651086
- Application Date: 2009-12-31
- Publication No.: US08135667B2
- Publication Date: 2012-03-13
- Inventor: Congnan Luo
- Applicant: Congnan Luo
- Applicant Address: US OH Dayton
- Assignee: Teradata US, Inc.
- Current Assignee: Teradata US, Inc.
- Current Assignee Address: US OH Dayton
- Agent: Steve McDonald
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
A system, method, and computer-readable medium are provided that facilitate in-database supervised discretization mechanisms which improve data classification. The disclosed mechanisms provide an efficient, automatic, and repeatable way to perform data discretization without human intervention. Large, complex, and unknown data is processed efficiently, and advantageously the data being analyzed need not be processed outside the database. The disclosed mechanisms may use an External Stored Procedure to avoid multiple joins of large tables and to minimize the number of full table scans and, consequently, provide better performance than contemporary mechanisms. The disclosed system produces intermediate results in tables, which may be conveyed to a visualization subsystem, thereby giving users a better understanding of the data distribution in each category. Further, the disclosed system and method introduce a novel similarity-based solution to merge intervals when chi-square testing is not reliable, and thereby improve the quality of the interval merge process.
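
The interval-merge step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a ChiMerge-style setup in which each interval is summarized by its per-class record counts. The abstract does not define the similarity measure or say when chi-square testing counts as unreliable, so everything below is an illustrative assumption rather than the patented method: the function names, the thresholds, the rule of thumb that an expected count below 5 makes the test unreliable, and the proportion-overlap similarity used as the fallback.

```python
from typing import Sequence


def expected_counts(a: Sequence[int], b: Sequence[int]) -> list:
    """Expected cell counts for the 2 x k table of two non-empty adjacent intervals."""
    total = sum(a) + sum(b)
    return [sum(row) * (a[j] + b[j]) / total
            for row in (a, b) for j in range(len(a))]


def chi_square(a: Sequence[int], b: Sequence[int]) -> float:
    """Pearson chi-square statistic over the class counts of two adjacent intervals."""
    observed = list(a) + list(b)
    return sum((o - e) ** 2 / e
               for o, e in zip(observed, expected_counts(a, b)) if e > 0)


def class_similarity(a: Sequence[int], b: Sequence[int]) -> float:
    """Hypothetical fallback: 1 minus half the L1 distance between the
    class proportions of the two intervals (1.0 means identical)."""
    pa = [x / sum(a) for x in a]
    pb = [x / sum(b) for x in b]
    return 1.0 - 0.5 * sum(abs(x - y) for x, y in zip(pa, pb))


def should_merge(a: Sequence[int], b: Sequence[int],
                 chi_threshold: float = 3.841,  # 5% critical value, 1 degree of freedom
                 min_expected: float = 5.0,     # assumed reliability rule of thumb
                 sim_threshold: float = 0.9) -> bool:
    """Merge on a low chi-square when the test is usable, else on high similarity."""
    if min(expected_counts(a, b)) >= min_expected:
        return chi_square(a, b) < chi_threshold
    return class_similarity(a, b) >= sim_threshold
```

For example, `should_merge([2, 1], [2, 1])` takes the similarity branch because every expected count is below 5, while `should_merge([20, 0], [0, 20])` takes the chi-square branch and declines to merge. The default `chi_threshold` only suits two classes; with more classes the degrees of freedom change and a different critical value would be needed.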