System and Method for Efficiently Generating Association Rules

    公开(公告)号:US20180225574A1

    公开(公告)日:2018-08-09

    申请号:US15938654

    申请日:2018-03-28

    Inventor: David Franke

    CPC classification number: G06N5/02 G06F16/21 G06N5/025 G06Q30/02

    Abstract: A data processing system processes data sets (such as low-resolution transaction data) into high-resolution data sets by mapping generic information into attribute-based specific information that may be processed to identify frequent sets therein. When association rules are generated from such frequent sets, the complexity and/or quantity of such rules may be managed by removing redundancies from the rules, such as by removing rules providing only trivial associations, removing rules having only a part group as the consequent, modifying rules to remove redundant antecedent items and/or filtering subsumed rules from the generated rule set that do not provide sufficient lift to meet an adjustable specialization lift threshold requirement.

    System and Method for Efficiently Generating Association Rules
    4.
    发明申请
    System and Method for Efficiently Generating Association Rules 审中-公开
    有效生成关联规则的系统和方法

    公开(公告)号:US20130204830A1

    公开(公告)日:2013-08-08

    申请号:US13832920

    申请日:2013-03-15

    Inventor: David Franke

    CPC classification number: G06N5/02 G06F17/30289 G06N5/025 G06Q30/02

    Abstract: A data processing system processes data sets (such as low-resolution transaction data) into high-resolution data sets by mapping generic information into attribute-based specific information that may be processed to identify frequent sets therein. When association rules are generated from such frequent sets, the complexity and/or quantity of such rules may be managed by removing redundancies from the rules, such as by removing rules providing only trivial associations, removing rules having only a part group as the consequent, modifying rules to remove redundant antecedent items and/or filtering subsumed rules from the generated rule set that do not provide sufficient lift to meet an adjustable specialization lift threshold requirement.

    Abstract translation: 数据处理系统通过将通用信息映射到基于属性的特定信息中来处理数据集(例如低分辨率事务数据)到高分辨率数据集中,该信息可以被处理以识别其中的频繁集合。 当从这样的频繁集合生成关联规则时,可以通过从规则中去除冗余来管理这些规则的复杂性和/或数量,例如通过移除仅提供微不足道的关联的规则,去除仅具有部分组的规则作为后果, 修改规则以从生成的规则集中删除冗余先行项目和/或过滤归约规则,该规则集不提供足够的电梯以满足可调整的专业化提升阈值要求。

    System and method for efficiently generating association rules using scaled lift threshold values to subsume association rules

    公开(公告)号:US11501174B2

    公开(公告)日:2022-11-15

    申请号:US15938654

    申请日:2018-03-28

    Inventor: David Franke

    Abstract: A data processing system processes data sets (such as low-resolution transaction data) into high-resolution data sets by mapping generic information into attribute-based specific information that may be processed to identify frequent sets therein. When association rules are generated from such frequent sets, the complexity and/or quantity of such rules may be managed by removing redundancies from the rules, such as by filtering subsumed rules from the generated rule set that have a confidence metric value that does not exceed a first confidence metric value for a subsuming rule by more than a scaled lift threshold value that is calculated by determining a complement of the first confidence metric value, squaring the complement to obtain a squared value and multiplying the squared value by a scaling factor.

    System and method for efficiently generating association rules

    公开(公告)号:US09934464B2

    公开(公告)日:2018-04-03

    申请号:US13832920

    申请日:2013-03-15

    Inventor: David Franke

    CPC classification number: G06N5/02 G06F17/30289 G06N5/025 G06Q30/02

    Abstract: A data processing system processes data sets (such as low-resolution transaction data) into high-resolution data sets by mapping generic information into attribute-based specific information that may be processed to identify frequent sets therein. When association rules are generated from such frequent sets, the complexity and/or quantity of such rules may be managed by removing redundancies from the rules, such as by removing rules providing only trivial associations, removing rules having only a part group as the consequent, modifying rules to remove redundant antecedent items and/or filtering subsumed rules from the generated rule set that do not provide sufficient lift to meet an adjustable specialization lift threshold requirement.

Patent Agency Ranking