Snowflake inc. (20240273417). MULTI-PARTY MACHINE LEARNING USING A DATABASE CLEANROOM simplified abstract

From WikiPatents
Jump to navigation Jump to search

MULTI-PARTY MACHINE LEARNING USING A DATABASE CLEANROOM

Organization Name

snowflake inc.

Inventor(s)

Orestis Kostakis of Redmond WA (US)

Justin Langseth of Kailua HI (US)

MULTI-PARTY MACHINE LEARNING USING A DATABASE CLEANROOM - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240273417 titled 'MULTI-PARTY MACHINE LEARNING USING A DATABASE CLEANROOM

The abstract describes a data sharing system implemented as a local application in a consumer database of a distributed database. The system includes a training function and a scoring function to train a machine learning model on provider and consumer data, generating output data by applying the trained model on input data, which consists of data portions from both consumer and provider databases joined to create a joined dataset for scoring.

  • The system is implemented as a local application in a consumer database of a distributed database.
  • It includes a training function and a scoring function for machine learning model training and data output generation.
  • Input data is created by joining data portions from consumer and provider databases to form a joined dataset for scoring.

Potential Applications: - Data sharing and analysis in consumer databases - Machine learning model training and scoring in a distributed database environment

Problems Solved: - Efficient data sharing and analysis between providers and consumers - Automated machine learning model training and scoring process

Benefits: - Improved data analysis capabilities - Streamlined machine learning model training process - Enhanced collaboration between data providers and consumers

Commercial Applications: Title: Enhanced Data Sharing and Analysis System for Consumer Databases Description: This technology can be used in various industries such as healthcare, finance, and marketing for efficient data sharing and analysis, leading to improved decision-making processes and insights.

Questions about the technology: 1. How does this system improve data sharing between providers and consumers? - This system streamlines the process by training machine learning models on provider and consumer data, generating output data for analysis. 2. What are the potential applications of this technology in different industries? - This technology can be applied in healthcare, finance, and marketing for data analysis and decision-making processes.


Original Abstract Submitted

embodiments of the present disclosure may provide a data sharing system implemented as a local application in a consumer database of a distributed database. the local application can include a training function and a scoring function to train a machine learning model on provider and consumer data, and generate output data by applying the trained machine learning model on input data. the input data can include data portions from a consumer database and a provider database that are joined to create a joined dataset for scoring.