Amazon Technologies, Inc. (20240256704). EFFICIENT STATISTICAL TECHNIQUES FOR DETECTING SENSITIVE DATA
Organization Name
Amazon Technologies, Inc.
Inventor(s)
Silviu Catalin Poede of Iasi RO
Marian-Razvan Udrea of Iasi RO
This abstract first appeared for US patent application 20240256704 titled 'EFFICIENT STATISTICAL TECHNIQUES FOR DETECTING SENSITIVE DATA'.
Original Abstract Submitted
A candidate attribute combination of a first data set is identified, such that the candidate attribute combination meets a data type similarity criterion with respect to a collection of data types of sensitive information for which the first data set is to be analyzed. A collection of input features is generated for a machine learning model from the candidate attribute combination, including at least one feature indicative of a statistical relationship between the values of the candidate attribute combination and a second data set. An indication of a predicted probability of a presence of sensitive information in the first data set is obtained using the machine learning model.
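The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application, only one possible reading of it under stated assumptions: the "data type similarity criterion" is taken to be a regex match ratio against a hypothetical name-like pattern, the "statistical relationship" feature is taken to be the fraction of rows that also appear in a reference (second) data set, and the machine learning model is replaced by a logistic function with made-up weights. All function names, thresholds, and weights are illustrative and do not come from the source.

```python
import math
import re
from itertools import combinations

# Assumption: the sensitive data type of interest is a name-like string.
NAME_PATTERN = re.compile(r"^[A-Za-z][A-Za-z'-]*$")


def type_match_ratio(values):
    """Fraction of values that look like the assumed sensitive data type."""
    if not values:
        return 0.0
    return sum(bool(NAME_PATTERN.match(v)) for v in values) / len(values)


def candidate_combinations(table, size=2, threshold=0.8):
    """Column tuples in which every column passes the similarity threshold."""
    eligible = [c for c, vals in table.items()
                if type_match_ratio(vals) >= threshold]
    return list(combinations(eligible, size))


def build_features(table, combo, reference):
    """Features for one combination, including overlap with a second data set."""
    rows = list(zip(*(table[c] for c in combo)))
    if not rows:
        return [0.0, 0.0]
    # Statistical relationship: share of row tuples found in the reference set.
    overlap = sum(row in reference for row in rows) / len(rows)
    mean_type = sum(type_match_ratio(table[c]) for c in combo) / len(combo)
    return [overlap, mean_type]


def predict_probability(features, weights=(4.0, 2.0), bias=-3.0):
    """Logistic scorer with made-up weights, standing in for a trained model."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical first data set (column name -> values) and reference data set.
table = {
    "first": ["Ana", "Ion", "Maria"],
    "last": ["Pop", "Rusu", "Ionescu"],
    "qty": ["3", "17", "8"],
}
reference = {("Ana", "Pop"), ("Maria", "Ionescu")}

for combo in candidate_combinations(table):
    features = build_features(table, combo, reference)
    print(combo, features, round(predict_probability(features), 3))
```

Filtering columns by type before forming combinations keeps the number of candidates small, which is one plausible reading of "efficient" in the title. In practice the weights would be learned from labeled data sets rather than fixed by hand.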