US Patent Application 18333301. DATA VALUATION USING REINFORCEMENT LEARNING simplified abstract

From WikiPatents

DATA VALUATION USING REINFORCEMENT LEARNING

Organization Name

Google LLC


Inventor(s)

Sercan Omer Arik of San Francisco CA (US)


Jinsung Yoon of San Jose CA (US)


Tomas Pfister of Foster City CA (US)


DATA VALUATION USING REINFORCEMENT LEARNING - A simplified explanation of the abstract

  • This abstract appeared for US patent application number 18333301, titled 'DATA VALUATION USING REINFORCEMENT LEARNING'

Simplified Explanation

This abstract describes a method for estimating how valuable each training sample is when used to train a machine learning model. The method obtains a batch of training samples and uses a data value estimator model to generate a predicted value for each sample in the batch. Based on these predicted values, a subset of the batch is selected. For each sample in the subset, the machine learning model is used to determine a prediction performance measurement. Finally, the parameter values of the data value estimator model are adjusted based on these performance measurements.
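
The steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation claimed in the application: the abstract does not say how the estimator is built, how the subset is chosen, or how the update is computed. Here the estimator is assumed to be a small logistic model, selection is assumed to be a random draw weighted by the predicted values, the machine learning model is a hypothetical one-weight linear predictor, and the update is a policy-gradient style step with a moving-average baseline. All function and parameter names are hypothetical.

```python
import math
import random


def estimate_value(theta, sample):
    """Data value estimator (assumed logistic): sample -> value in (0, 1)."""
    x, y = sample
    z = theta[0] + theta[1] * x + theta[2] * y
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def select_subset(values, rng):
    """Assumed selection rule: keep each sample with probability = its value."""
    return [rng.random() < v for v in values]


def performance(weight, sample):
    """Prediction performance of the model y = weight * x (higher is better)."""
    x, y = sample
    return -(weight * x - y) ** 2


def update_estimator(theta, batch, weight, rng, lr=0.05, baseline=0.0):
    """One estimator update; returns (new_theta, reward, selection_mask)."""
    values = [estimate_value(theta, s) for s in batch]
    mask = select_subset(values, rng)
    subset = [s for s, keep in zip(batch, mask) if keep]
    if not subset:  # nothing selected, so there is nothing to measure
        return list(theta), baseline, mask

    # Performance measurement for each selected sample, averaged into a reward.
    perfs = [performance(weight, s) for s in subset]
    reward = sum(perfs) / len(perfs)
    advantage = reward - baseline

    # Gradient of the log-probability of the selection mask w.r.t. theta.
    grad = [0.0, 0.0, 0.0]
    for (x, y), v, keep in zip(batch, values, mask):
        coeff = (1.0 if keep else 0.0) - v
        for i, feature in enumerate((1.0, x, y)):
            grad[i] += coeff * feature

    new_theta = [t + lr * advantage * g for t, g in zip(theta, grad)]
    return new_theta, reward, mask


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy batch: two samples that fit y = 2x and one mislabeled sample.
    batch = [(1.0, 2.0), (2.0, 4.0), (1.0, -5.0)]
    theta, baseline = [0.0, 0.0, 0.0], 0.0
    for _ in range(200):
        theta, reward, _ = update_estimator(
            theta, batch, weight=2.0, rng=rng, baseline=baseline
        )
        baseline = 0.9 * baseline + 0.1 * reward  # moving-average baseline
    print([round(estimate_value(theta, s), 3) for s in batch])
```

In this sketch the machine learning model is held fixed so that only the estimator update is visible. The abstract leaves open whether and how the model itself is trained on the selected subset.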


Original Abstract Submitted

A method includes obtaining a batch of training samples. For each particular training sample in the batch of training samples, the method includes generating, using a data value estimator model and the particular training sample, a corresponding predicted value of the particular training sample when used to train a machine learning model. The method includes selecting, based on the corresponding predicted values, a subset of the batch of training samples. For each particular training sample in the subset of the batch of training samples, the method includes determining, using the machine learning model and the particular training sample, a corresponding prediction performance measurement. The method includes adjusting one or more estimator parameter values of the data value estimator model based on the corresponding prediction performance measurements.