Machine learning modeling to identify sensitive data

Invention Grant

US11977660B2 Machine learning modeling to identify sensitive data (Active)

Patent Title: Machine learning modeling to identify sensitive data
Application No.: US17476388

Application Date: 2021-09-15
Publication No.: US11977660B2

Publication Date: 2024-05-07
Inventor: Shubhanshu Gupta, Ashish Awasthi, Amaruvi Devanathan, Mallapu Raghavulu Surya Prakash
Applicant: Citibank, N.A.
Applicant Address: US NY New York
Assignee: CITIBANK, N.A.
Current Assignee: CITIBANK, N.A.
Current Assignee Address: US NY New York
Agency: Foley & Lardner LLP
Main IPC: G06F21/62
IPC: G06F21/62; G06F16/22; G06F16/33; G06F16/335

Abstract:

Methods and systems identify and redact personally identifiable information (PII). A PII sensitivity detection framework includes multiple layers, where each layer corresponds to a model. The framework analyzes data stored within different data tables and predicts whether a data column includes PII. The first layer corresponds to an AI model that analyzes each column's metadata and predicts a first score indicative of a first likelihood of PII existence. The second layer corresponds to a rule-based model that uses various rules to determine a second score indicative of a second likelihood of PII existence for each column. The third layer corresponds to a column content model that analyzes the content of each column using various natural language processing techniques to generate a third score indicative of a third likelihood of PII existence. The framework masks data presented to a user based on the scores generated via execution of one or more of the layers.
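
The layered scoring described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the keyword list standing in for the trained metadata model, the regular expressions used as rules, the capitalized-name heuristic standing in for the natural language processing layer, the use of the maximum to combine the three scores, and the 0.5 masking threshold are all assumptions made for illustration. Every function and variable name is hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Dict, List

# Assumption: a keyword list stands in for the trained metadata model (layer 1).
PII_NAME_HINTS = ("ssn", "email", "phone", "name", "dob", "address")

# Assumption: a few illustrative regex rules for the rule-based model (layer 2).
RULES = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
    "phone": re.compile(r"^\+?\d[\d\- ]{8,}\d$"),
}

# Assumption: a crude "looks like a person name" pattern stands in for the
# NLP-based column content model (layer 3).
NAME_LIKE = re.compile(r"^[A-Z][a-z]+(?: [A-Z][a-z]+)+$")


@dataclass
class ColumnScores:
    metadata: float  # first score: likelihood from column metadata
    rules: float     # second score: likelihood from rule matches
    content: float   # third score: likelihood from column content

    def combined(self) -> float:
        # Assumption: the source does not say how scores are combined;
        # the maximum is used here as a conservative choice.
        return max(self.metadata, self.rules, self.content)


def metadata_score(column_name: str) -> float:
    """Return 1.0 if the column name contains a PII hint, else 0.0."""
    name = column_name.lower()
    return 1.0 if any(hint in name for hint in PII_NAME_HINTS) else 0.0


def rule_score(values: List[str]) -> float:
    """Return the fraction of values that match at least one rule."""
    if not values:
        return 0.0
    hits = sum(1 for v in values if any(p.match(v) for p in RULES.values()))
    return hits / len(values)


def content_score(values: List[str]) -> float:
    """Return the fraction of values that look like a person name."""
    if not values:
        return 0.0
    return sum(1 for v in values if NAME_LIKE.match(v)) / len(values)


def score_column(column_name: str, values: List[str]) -> ColumnScores:
    """Run all three layers on one column."""
    return ColumnScores(
        metadata=metadata_score(column_name),
        rules=rule_score(values),
        content=content_score(values),
    )


def mask_table(table: Dict[str, List[str]], threshold: float = 0.5) -> Dict[str, List[str]]:
    """Replace every value in columns whose combined score meets the threshold."""
    masked = {}
    for name, values in table.items():
        if score_column(name, values).combined() >= threshold:
            masked[name] = ["****"] * len(values)
        else:
            masked[name] = list(values)
    return masked
```

In this sketch, a column named `customer_email` would be masked by the metadata layer alone, while a column with an uninformative name such as `col_7` would be masked only if its values match a rule or the content heuristic. A real system would presumably replace each stand-in with a trained or tuned model.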

Public/Granted literature

US20230080686A1 MACHINE LEARNING MODELING TO IDENTIFY SENSITIVE DATA Public/Granted day: 2023-03-16

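IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: G06N)
G06F21/00	Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
G06F21/60	.Protecting data
G06F21/62	..Protecting access to data via a platform, e.g. using keys or access control rules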