DATA VALUATION USING REINFORCEMENT LEARNING: abstract simplified (18333301)

From WikiPatents
Jump to navigation Jump to search
  • This abstract for appeared for patent application number 18333301 Titled 'DATA VALUATION USING REINFORCEMENT LEARNING'

Simplified Explanation

The abstract describes a method for training a machine learning model using a batch of training samples. The method involves using a data value estimator model to generate predicted values for each training sample. Based on these predicted values, a subset of the training samples is selected. The machine learning model is then used to determine a prediction performance measurement for each sample in the subset. Finally, the estimator parameter values of the data value estimator model are adjusted based on these prediction performance measurements.


Original Abstract Submitted

A method includes obtaining a batch of training samples. For each particular training sample in the batch of training samples, the method includes generating, using a data value estimator model and the particular training sample, a corresponding predicted value of the particular training sample when used to train a machine learning model. The method includes selecting, based on the corresponding predicted values, a subset of the batch of training samples. For each particular training sample in the subset of the batch of training samples, the method includes determining, using the machine learning model and the particular training sample, a corresponding prediction performance measurement. The method includes adjusting one or more estimator parameter values of the data value estimator model based on the corresponding prediction performance measurements.