18368790. COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS (Cisco Technology, Inc.)

From WikiPatents
Revision as of 05:06, 24 March 2025 by Unknown user (talk) (Creating a new page)
COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS

Organization Name

Cisco Technology, Inc.

Inventor(s)

Myungjin Lee of Bellevue WA US

Gustav Adrian Baumgart of Seattle WA US

Jaemin Shin of Seattle WA US

Ramana Rao V.R. Kompella of Foster City CA US

This abstract first appeared for US patent application 18368790, titled 'COMMUNICATION-AWARE INFERENCE SERVING FOR PARTITIONED NEURAL NETWORKS'.

Original Abstract Submitted

In one implementation, a device generates outputs of nodes in an upstream layer of a partitioned neural network. The device assigns a priority to each of the outputs of the nodes. The device selects, based on the priorities, a subset of the outputs to send to a remote device. The device sends, via a computer network, the subset of the outputs to the remote device for input to a downstream layer of the partitioned neural network.
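
The four steps in the abstract (generate, prioritize, select, send) can be illustrated with a minimal sketch. It is not the method claimed in the application. The abstract does not say how priorities are computed, how the subset size is chosen, or how the outputs are encoded, so everything specific here is an assumption made for illustration: priority is taken to be the absolute value of each output, the subset is a fixed-size top-k controlled by a hypothetical `budget` parameter, the message is JSON, and the receiving side fills unsent positions with zero. All function names are hypothetical, and the network transfer is stood in for by passing the encoded bytes directly.

```python
import json


def assign_priorities(outputs):
    # Assumption: larger-magnitude outputs matter more to the downstream layer.
    return [abs(value) for value in outputs]


def select_subset(outputs, priorities, budget):
    # Rank node indices by descending priority and keep the top `budget`,
    # then restore index order so the message is easy to read.
    ranked = sorted(range(len(outputs)), key=lambda i: priorities[i], reverse=True)
    kept = sorted(ranked[:budget])
    return [(i, outputs[i]) for i in kept]


def encode_message(subset, width):
    # Assumption: a sparse (index, value) encoding plus the full layer width.
    return json.dumps({"width": width, "entries": subset}).encode("utf-8")


def decode_message(payload, fill=0.0):
    # Remote side: rebuild a dense input for the downstream layer,
    # using `fill` for every output that was not sent (an assumption).
    message = json.loads(payload.decode("utf-8"))
    dense = [fill] * message["width"]
    for index, value in message["entries"]:
        dense[index] = value
    return dense


# Outputs of five nodes in the upstream layer (made-up numbers).
upstream_outputs = [0.1, -2.0, 0.5, 3.0, -0.05]
priorities = assign_priorities(upstream_outputs)
subset = select_subset(upstream_outputs, priorities, budget=2)

# In a real deployment the payload would cross the network here.
payload = encode_message(subset, width=len(upstream_outputs))
downstream_input = decode_message(payload)

print(subset)
print(downstream_input)
```

In this sketch a smaller `budget` reduces the bytes sent at the cost of dropping more upstream outputs, which is the kind of communication trade-off the title refers to. How the application actually balances that trade-off is not stated in the abstract.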
