17528607. METHOD AND SYSTEM FOR IMPLEMENTING A FAST DATASET SEARCH USING A COMPRESSED REPRESENTATION OF A PLURALITY OF DATASETS simplified abstract (Capital One Services, LLC)

From WikiPatents

METHOD AND SYSTEM FOR IMPLEMENTING A FAST DATASET SEARCH USING A COMPRESSED REPRESENTATION OF A PLURALITY OF DATASETS

Organization Name

Capital One Services, LLC

Inventor(s)

Austin Walters of Savoy IL (US)

Mark Watson of Philadelphia PA (US)

Anh Truong of Champaign IL (US)

Reza Farivar of Champaign IL (US)

Vincent Pham of Champaign IL (US)

Kate Key of Powhatan VA (US)

Galen Rafferty of Mahomet IL (US)

Jeremy Goodsitt of Champaign IL (US)

METHOD AND SYSTEM FOR IMPLEMENTING A FAST DATASET SEARCH USING A COMPRESSED REPRESENTATION OF A PLURALITY OF DATASETS - A simplified explanation of the abstract

This abstract first appeared for US patent application 17528607, titled 'METHOD AND SYSTEM FOR IMPLEMENTING A FAST DATASET SEARCH USING A COMPRESSED REPRESENTATION OF A PLURALITY OF DATASETS'.

Simplified Explanation

The patent application describes a method for finding, among a collection of stored datasets, the dataset most similar to a given sample dataset by comparing compressed representations of the datasets. Here are the key points:

  • Multiple datasets are stored in non-transitory computer memory.
  • An index representation is generated for each stored dataset; each index representation includes a compressed representation of its dataset.
  • The index representations are stored in the same computer memory, alongside the datasets.
  • When a sample dataset is received, a compressed representation of the sample dataset is generated.
  • The method then determines which of the stored datasets (at least one) is most similar to the sample dataset, based on the sample dataset representation and the index representations.
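
The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method claimed in the application: the abstract does not say how the compressed representation is computed or how similarity is measured. Here the compression is assumed to be a normalized fixed-width histogram and the similarity measure is assumed to be cosine similarity; the names `compress`, `build_index`, and `most_similar` are hypothetical.

```python
from math import sqrt

def compress(dataset, bins=8, lo=0.0, hi=1.0):
    """Assumed compression: a normalized fixed-width histogram of the values."""
    counts = [0] * bins
    for x in dataset:
        # Map x to a bin; clamp so values at or beyond the range edges stay in bounds.
        idx = min(int((x - lo) / (hi - lo) * bins), bins - 1)
        counts[max(idx, 0)] += 1
    total = len(dataset) or 1
    return [c / total for c in counts]

def cosine(a, b):
    """Assumed similarity measure between two compressed representations."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def build_index(datasets):
    """Generate one index representation per stored dataset."""
    return {name: compress(values) for name, values in datasets.items()}

def most_similar(sample, index):
    """Compress the sample, then compare it against the index representations only."""
    sample_repr = compress(sample)
    return max(index, key=lambda name: cosine(sample_repr, index[name]))

# Stored datasets (toy numeric data, invented for illustration).
datasets = {
    "low": [0.05, 0.10, 0.12, 0.20, 0.22],
    "mid": [0.45, 0.50, 0.52, 0.55, 0.60],
    "high": [0.80, 0.85, 0.90, 0.95, 0.99],
}
index = build_index(datasets)
best = most_similar([0.48, 0.51, 0.58, 0.47], index)
print(best)
```

In this sketch the search never touches the full stored datasets after the index is built; only the eight-value histograms are compared, which is the kind of shortcut the compressed representations make possible.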

Potential applications of this technology:

  • Data analysis: The method can be used to efficiently compare and analyze large datasets, helping in various fields such as scientific research, market analysis, and machine learning.
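  • Data retrieval: Because each stored dataset has a compressed index representation, a search can be run against the compact representations. The abstract describes storing the index representations in addition to the datasets, so the gain is in retrieval efficiency rather than in storage space.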

Problems solved by this technology:

  • Searching large collections: Comparing a sample against many large datasets directly is expensive. The method addresses this by giving each dataset a compressed index representation that can stand in for it during comparison.
  • Fast data analysis: By generating compressed representations and using them for comparison, the method enables quick identification of similar datasets, saving time and computational resources.

Benefits of this technology:

  • Compact search index: The index representations are compressed, so the data consulted during a search is smaller than the datasets it describes.
  • Time-saving analysis: The method enables rapid identification of similar datasets, facilitating faster data analysis and decision-making processes.


Original Abstract Submitted

A method includes storing, by one or more processors of one or more computing devices, a plurality of datasets in a non-transitory computer memory associated with the one or more computing devices. A plurality of index representations is generated where each one of the plurality of index representations includes a compressed representation of a respective one of the plurality of datasets. The plurality of index representations is stored in the non-transitory computer memory. A sample dataset is received by the one or more processors. A sample dataset representation is generated that includes a compressed representation of the sample dataset. A determination that at least one of the plurality of datasets is most similar to the sample dataset based on the sample dataset representation and the plurality of index representations is performed.
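
The determination step in the abstract can be written compactly. The notation below is an assumption introduced for illustration and does not appear in the application: $D_1, \dots, D_n$ are the stored datasets, $S$ is the sample dataset, $c$ is whatever compression function produces the representations, and $\operatorname{sim}$ is an unspecified similarity measure.

```latex
r_i = c(D_i) \quad (i = 1, \dots, n), \qquad
s = c(S), \qquad
i^{*} \in \arg\max_{1 \le i \le n} \operatorname{sim}(s, r_i)
```

The set membership reflects the abstract's wording "at least one": more than one stored dataset may attain the maximum similarity.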