Jump to content

Cisco technology, inc. (20250095348). COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS

From WikiPatents

COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS

Organization Name

cisco technology, inc.

Inventor(s)

Myungjin Lee of Bellevue WA US

Gustav Adrian Baumgart of Seattle WA US

Jaemin Shin of Seattle WA US

Ramana Rao V.R. Kompella of Foster City CA US

COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS

This abstract first appeared for US patent application 20250095348 titled 'COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS

Original Abstract Submitted

in one implementation, a device generates outputs of nodes in a upstream layer of a partitioned neural network. the device assigns priorities to each of the outputs of the nodes. the device selects, based on the priorities, a subset of the outputs to send to a remote device. the device sends, via a computer network, the subset of the outputs to the remote device for input to a downstream layer of the partitioned neural network.