17541704. FINGERPRINT-BASED DATA CLASSIFICICATION simplified abstract (INTERNATIONAL BUSINESS MACHINES CORPORATION)

From WikiPatents
Jump to navigation Jump to search

FINGERPRINT-BASED DATA CLASSIFICICATION

Organization Name

INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor(s)

Xu Bin Cai of Beijing (CN)

Xiaobo Wang of Beijing (CN)

Chun Hua Sun of Beijing (CN)

Yi Wang of Beijing (CN)

Wei Wang of Beijing (CN)

FINGERPRINT-BASED DATA CLASSIFICICATION - A simplified explanation of the abstract

This abstract first appeared for US patent application 17541704 titled 'FINGERPRINT-BASED DATA CLASSIFICICATION

Simplified Explanation

The patent application describes a system and method for automatically classifying data using fingerprints. Here is a simplified explanation of the abstract:

  • The system generates a fingerprint of a data column in a dataset to be classified.
  • The fingerprint consists of dimensions, where each dimension represents a characteristic of the data in the column.
  • The system compares the generated fingerprint to target fingerprints associated with different classes.
  • If a match is found, the system assigns one or more classes to the data column, resulting in classified data.

Potential applications of this technology:

  • Data classification in various industries such as finance, healthcare, and marketing.
  • Automated sorting and categorization of large datasets.
  • Fraud detection and anomaly identification in financial transactions.
  • Customer segmentation and targeted marketing campaigns.

Problems solved by this technology:

  • Manual data classification can be time-consuming and prone to errors.
  • Traditional classification methods may not be efficient for large datasets.
  • Identifying patterns and similarities in data columns can be challenging without automated techniques.

Benefits of this technology:

  • Saves time and resources by automating the data classification process.
  • Improves accuracy and consistency in classifying data.
  • Enables faster decision-making based on classified data.
  • Provides insights and patterns that may not be easily identifiable through manual analysis.


Original Abstract Submitted

Systems and methods are provided for automated classification of data using fingerprints. In embodiments, a method includes: generating, by a computing device based on predetermined rules, a fingerprint of a data column in a data set to be classified, the fingerprint comprising dimensions, wherein each of the dimension is assigned an attribute representing a characteristic of data in the data column; determining, by the computing device, that the fingerprint matches one or more target fingerprints by comparing the fingerprint to the target fingerprints, wherein each target fingerprint is associated with a class and includes dimensions, and each dimension is assigned an attribute representing a characteristic of data in the class; and assigning, by the computing device, one or more classes to the data column based on the one or more target fingerprints, thereby generating classified data.