Invention Grant
- Patent Title: Cognitive data pseudonymization
- Application No.: US16669894
- Application Date: 2019-10-31
- Publication No.: US11574186B2
- Publication Date: 2023-02-07
- Inventor: Ilyas Mohamed Iyoob, Krishna Teja Rekapalli, Aly Megahed
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Christopher M. Pignato
- Main IPC: G06N3/08
- IPC: G06N3/08; G06N3/04; G06F21/62; G06K9/62; G06F40/205

Abstract:
Computer systems, methods, and program products automate pseudonymization of personal identifying information (PII), using machine learning, metadata, and crowdsourced patterns to identify and replace PII. Machine learning models are trained, using metadata, to classify known column names or key names for processing. Each column or key name is classified as unprocessed, anonymized, or pseudonymized, so that a pseudonymizer can handle the data without revealing PII or scrubbing the data into a useless format. A library of crowdsourced patterns is used to match PII against the data values under each column or key name, and each PII match is mapped to a replacement method. Feedback from user annotations retrains the algorithms to improve classification accuracy. Deep learning algorithms automate the identification of PII by generating regular expressions that concisely articulate how pseudonymizers search for PII patterns within a data set. PII replacement is mapped consistently across entire data packages, and the crowdsourced pattern library is updated with the generated regular expressions.
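As a rough illustration of the pattern-matching and consistent-replacement steps the abstract describes, the following minimal Python sketch may help. It is not the patented implementation: the pattern library contents, the `Pseudonymizer` class, the `process_record` helper, the salted-hash token scheme, and the hand-written column actions (which stand in for the output of the trained classifier) are all assumptions made for this example.

```python
import hashlib
import re

# Hypothetical pattern library: PII label -> regular expression.
# In the abstract this library is crowdsourced and extended with generated
# regular expressions; here it is hard-coded for illustration only.
PATTERN_LIBRARY = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


class Pseudonymizer:
    """Replaces matched PII with tokens, reusing one token per distinct value."""

    def __init__(self, patterns, salt="demo-salt"):
        self.patterns = patterns
        self.salt = salt      # assumed secret; a fixed demo value here
        self.mapping = {}     # (label, original value) -> replacement token

    def _token(self, label, value):
        key = (label, value)
        if key not in self.mapping:
            raw = f"{self.salt}:{label}:{value}".encode("utf-8")
            digest = hashlib.sha256(raw).hexdigest()
            self.mapping[key] = f"<{label}:{digest[:8]}>"
        return self.mapping[key]

    def scrub(self, text):
        # Apply every pattern in the library to the text.
        for label, pattern in self.patterns.items():
            text = pattern.sub(
                lambda m, label=label: self._token(label, m.group(0)), text
            )
        return text


def process_record(record, column_actions, pseudonymizer):
    """Apply a per-column action; unlisted columns are left unprocessed."""
    out = {}
    for column, value in record.items():
        action = column_actions.get(column, "unprocessed")
        if action == "anonymized":
            out[column] = "REDACTED"
        elif action == "pseudonymized":
            out[column] = pseudonymizer.scrub(value)
        else:
            out[column] = value
    return out
```

Because the token is looked up in a shared mapping, the same e-mail address or number receives the same replacement wherever it appears, which is one simple way to keep replacements consistent across a whole data package. A real system would also need to protect the mapping and the salt, which this sketch does not attempt.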
Public/Granted literature
- US20210133557A1 COGNITIVE DATA PSEUDONYMIZATION Public/Granted day: 2021-05-06