Schlumberger technology corporation (20240428138). GENERATION AND USE OF CLASSIFICATION MODEL FROM SYNTHETICALLY GENERATED DATA
Contents
GENERATION AND USE OF CLASSIFICATION MODEL FROM SYNTHETICALLY GENERATED DATA
Organization Name
schlumberger technology corporation
Inventor(s)
GENERATION AND USE OF CLASSIFICATION MODEL FROM SYNTHETICALLY GENERATED DATA
This abstract first appeared for US patent application 20240428138 titled 'GENERATION AND USE OF CLASSIFICATION MODEL FROM SYNTHETICALLY GENERATED DATA
Original Abstract Submitted
a method for training and using a field machine learning (ml) model to classify emission data is presented. the method includes generating synthetic data by a large language model (llm) by prompting the llm with emission classes and few shot examples. the synthetic data includes multiple synthetic data instances and corresponding instance labels. a training dataset is obtained from the synthetic data. the method further includes training the field ml model with training instances which are synthetic data instances from the training dataset and corresponding training labels. the field ml model generates a predicted probability distribution of a training output class corresponding to a training instance. the method further includes adjusting a model parameter weight of the field ml model to minimize a categorical cross-entropy loss function calculated based on the generated predicted probability distribution. the trained field ml model is used to classify emission data.