Low-Rank Compression of Neural Networks

Organization Name

samsung electronics co., ltd.

Inventor(s)

Shangqian Gao of Mountain View CA US

Ting Hua of Cupertino CA US

Yen-Chang Hsu of Fremont CA US

Yilin Shen of Mountain View CA US

Hongxia Jin of San Jose CA US

Low-Rank Compression of Neural Networks

This abstract first appeared for US patent application 20250021826 titled 'Low-Rank Compression of Neural Networks

Original Abstract Submitted

in one embodiment, a method includes accessing at least a portion of a training dataset for a trained neural network that includes multiple layers, where each layer includes a number of parameters, and where the training dataset includes multiple training samples that each include an input and a ground-truth output used to train the trained neural network. the method further includes training a hypernetwork to generate a layer-specific compression mask for each of one or more of the multiple layers of the trained neural network. the method further includes generating, by the trained hypernetwork, a final layer-specific compression mask for the trained neural network and compressing the trained neural network by reducing, for each of the one or more layers of the neural network, the number of parameters of that layer according to the final layer-specific compression mask.