17809034. UNIFIED DATA CLASSIFICATION TECHNIQUES simplified abstract (INTERNATIONAL BUSINESS MACHINES CORPORATION)

From WikiPatents

UNIFIED DATA CLASSIFICATION TECHNIQUES

Organization Name

INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor(s)

Youngja Park of Princeton NJ (US)

Mohammed Fahd Alhamid of North York (CA)

Stefano Braghin of Dublin (IE)

Jing Xin Duan of Toronto (CA)

Mokhtar Kandil of Toronto (CA)

Michael Vu Le of Danbury CT (US)

Killian Levacher of Dublin (IE)

Micha Gideon Moffie of Zichron Yaakov (IL)

Ian Michael Molloy of Chappaqua NY (US)

Walid Rjaibi of Markham (CA)

Ariel Farkash of Shimshit (IL)

UNIFIED DATA CLASSIFICATION TECHNIQUES - A simplified explanation of the abstract

This abstract first appeared for US patent application 17809034 titled 'UNIFIED DATA CLASSIFICATION TECHNIQUES'.

Simplified Explanation

The patent application describes a method, computer system, and computer program product for data processing. Multiple files are obtained from a data source and analyzed for information about their content and structure.

  • Each file is analyzed to determine its structural information and content.
  • The information in each file is sorted and categorized based on common content.
  • Sensitive information is extracted and categorized separately.
  • The categorized information is then merged using the categories to create a single unified file.
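The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method claimed in the application: the abstract does not say how structure is detected, how categories are chosen, or how sensitive values are recognized. Here the file extension stands in for structural analysis, the field name stands in for "common content", and two regular expressions stand in for sensitive-data detection. The names `analyze`, `classify`, `unify`, and `SENSITIVE_PATTERNS` are hypothetical.

```python
import csv
import io
import json
import re
from collections import defaultdict

# Assumption: sensitive values are recognized by simple regex patterns.
# The application does not specify a detection technique.
SENSITIVE_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def analyze(name, text):
    """Return (structure, rows) for one file: its format and fields, plus its records."""
    if name.endswith(".csv"):
        rows = list(csv.DictReader(io.StringIO(text)))
        fields = list(rows[0].keys()) if rows else []
        return {"format": "csv", "fields": fields}, rows
    if name.endswith(".json"):
        data = json.loads(text)
        rows = data if isinstance(data, list) else [data]
        fields = sorted({key for row in rows for key in row})
        return {"format": "json", "fields": fields}, rows
    # Anything else is treated as one unstructured text record.
    return {"format": "text", "fields": ["text"]}, [{"text": text}]


def classify(value):
    """Return the label of the first sensitive pattern found in value, or None."""
    for label, pattern in SENSITIVE_PATTERNS.items():
        if pattern.search(value):
            return label
    return None


def unify(files):
    """Merge a {filename: text} mapping into one unified JSON document."""
    unified = {
        "structure": {},
        "categories": defaultdict(list),  # non-sensitive values, grouped by field name
        "sensitive": defaultdict(list),   # sensitive values, grouped by pattern label
    }
    for name, text in files.items():
        structure, rows = analyze(name, text)
        unified["structure"][name] = structure
        for row in rows:
            for field, value in row.items():
                if field is None or value is None:
                    continue  # skip ragged CSV cells
                value = str(value)
                label = classify(value)
                if label:
                    unified["sensitive"][label].append(
                        {"file": name, "field": field, "value": value}
                    )
                else:
                    # Assumption: the lower-cased field name is the category.
                    unified["categories"][field.lower()].append(
                        {"file": name, "value": value}
                    )
    return json.dumps(unified, indent=2, sort_keys=True)
```

Calling `unify` with a CSV file and a JSON file that both carry a name field places the two names under one shared category, while any e-mail address or SSN-shaped value lands in the separate sensitive section of the same output document.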

Potential applications of this technology:

  • Data management and organization in large-scale databases.
  • Content analysis and categorization in document management systems.
  • Information extraction and classification in data mining and analysis tasks.

Problems solved by this technology:

  • Efficiently organizing and categorizing large volumes of data.
  • Identifying and extracting sensitive information from files.
  • Streamlining the process of merging and unifying data from multiple sources.

Benefits of this technology:

  • Improved data organization and accessibility.
  • Enhanced efficiency in analyzing and processing large amounts of data.
  • Enhanced security and privacy by identifying and categorizing sensitive information.


Original Abstract Submitted

A method, computer system, and a computer program product for data processing, comprising obtaining a plurality of files from a data source. These files are analyzed for information about the content and in order to determine structural information of each file. Once the files have been analyzed, information in each file may be sorted and categorized by common content. Sensitive information may also be extracted and categorized separately. Information may then be merged using the categories to create a single unified file.