Invention Grant
- Patent Title: Method, machine learning engines and file management platform systems for content and context aware data classification and security anomaly detection
-
Application No.: US17268381Application Date: 2018-08-14
-
Publication No.: US12033040B2Publication Date: 2024-07-09
- Inventor: Christopher Muffat
- Applicant: Dathena Science Pte. Ltd.
- Applicant Address: SG Singapore
- Assignee: Dathena Science Ptd. Ltd.
- Current Assignee: Dathena Science Ptd. Ltd.
- Current Assignee Address: SG Singapore
- Agency: Crowell & Moring LLP
- International Application: PCT/SG2018/050411 2018.08.14
- International Announcement: WO2019/035765A 2019.02.21
- Date entered country: 2021-02-12
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06F18/23213 ; G06F40/284 ; G06F40/30

Abstract:
Systems, methods and computer readable medium are provided for perform a method for content and context aware data classification or a method for content and context aware data security anomaly detection. The method for content and context aware data confidentiality classification includes scanning one or more documents in one or more network data repositories of a computer network and extracting content features and context features of the one or more documents into one or more term frequency-inverse document frequency (TF-IDF) vectors and one or more latent semantic indexing (LSI) vectors. The method further includes classifying the one or more documents into a number of category classifications by machine learning the extracted content features and context features of the one or more documents at a file management platform of the computer network, each of the category classifications being associated with one or more confidentiality classifications.
Public/Granted literature
Information query