Jump to content

18835230. ACCURACY-PRESERVING DEEP MODEL COMPRESSION (Nokia Technologies Oy)

From WikiPatents


ACCURACY-PRESERVING DEEP MODEL COMPRESSION

Organization Name

Nokia Technologies Oy

Inventor(s)

Yash Garg of Murray Hill NJ US

Ahmet Akyamac of Murray Hill NJ US

ACCURACY-PRESERVING DEEP MODEL COMPRESSION

This abstract first appeared for US patent application 18835230 titled 'ACCURACY-PRESERVING DEEP MODEL COMPRESSION

Original Abstract Submitted

Techniques described herein provide for compression of machine learning models without significant loss in model accuracy and without requiring model re-training. Compressed machine learning models may then be deployed by resource-constrained devices to improve operational efficiency and throughput. An example method includes providing input data for one or more deep learning tasks to a machine learning model having a plurality of neuronal units. The neuronal units are associated with respective parameters. The method further includes determination of respective confidence scores for the plurality of neuronal units responsive to the input data. A confidence score represents a contribution, significant, or impact of a neuronal unit with respect to the overall model output. The method further includes generating a compressed machine learning model based at least in part on removing a subset of neuronal units according to their respective confidence scores and redistributing their parameters to another subset of neuronal units.

Cookies help us deliver our services. By using our services, you agree to our use of cookies.