Zoom Video Communications, Inc. (20240212666). Word Replacement During Poor Network Connectivity Or Network Congestion simplified abstract

Word Replacement During Poor Network Connectivity Or Network Congestion

Organization Name

Zoom Video Communications, Inc.

Inventor(s)

Nick Swerdlow of Santa Clara CA (US)

Word Replacement During Poor Network Connectivity Or Network Congestion - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240212666, titled 'Word Replacement During Poor Network Connectivity Or Network Congestion'.

Simplified Explanation

During periods of poor network connectivity or network congestion, a server keeps the audio stream of a real-time communication session continuous by predicting words that are missing from the speech and synthesizing them.

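  • The server receives an audio stream from a user device connected to a real-time communication session and detects speech in it.
  • It converts the speech data to text containing one or more words.
  • From the context of those words, it determines that a word is missing and predicts a replacement.
  • The server synthesizes the predicted word in the user's voice and combines it with the original audio stream.
  • The resulting continuous audio stream is transmitted to the other user devices in the session.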
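
The word-prediction step in the list above can be illustrated with a minimal sketch. It is not the method described in the application: the `<gap>` marker, the bigram model, and every function name here are assumptions made for illustration, and the speech-to-text and voice-synthesis stages are left out entirely.

```python
from collections import Counter, defaultdict
from typing import Optional

# Hypothetical marker standing in for a word lost during transmission.
GAP = "<gap>"


def build_bigram_model(corpus: list[str]) -> dict[str, Counter]:
    """Count which word follows which across the example sentences."""
    model: dict[str, Counter] = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            model[prev][nxt] += 1
    return model


def predict_missing(words: list[str], index: int,
                    model: dict[str, Counter]) -> Optional[str]:
    """Guess the word at `index` from the word just before it."""
    if index == 0:
        return None  # no preceding context to predict from
    candidates = model.get(words[index - 1].lower())
    if not candidates:
        return None  # preceding word never seen in the corpus
    return candidates.most_common(1)[0][0]


def fill_gaps(transcript: str, model: dict[str, Counter]) -> str:
    """Replace each gap marker with a predicted word when one is available."""
    words = transcript.split()
    for i, word in enumerate(words):
        if word == GAP:
            guess = predict_missing(words, i, model)
            if guess is not None:
                words[i] = guess
    return " ".join(words)


if __name__ == "__main__":
    model = build_bigram_model(["please share your screen",
                                "share your screen now"])
    print(fill_gaps("please share <gap> screen", model))
```

A production system would presumably rely on a much stronger language model and would have to detect the gap itself rather than receive a marker; the sketch only shows how surrounding context can drive the choice of a replacement word.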

Key Features and Innovation

  • Real-time prediction of missing words in speech data.
  • Synthesizing missing words in the user's voice.
  • Seamless integration of predicted words into the audio stream.
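
As a rough illustration of integrating a synthesized word into the stream, the sketch below overwrites the samples of a known gap with synthesized samples and applies a short linear fade at both edges of the patch. The abstract does not say how the combination is done; the fade, the sample representation as a list of floats, and all names are assumptions.

```python
def fit_length(samples: list[float], length: int) -> list[float]:
    """Truncate or zero-pad `samples` to exactly `length` items."""
    return samples[:length] + [0.0] * max(0, length - len(samples))


def apply_fade(samples: list[float], fade: int) -> list[float]:
    """Linearly ramp the first and last `fade` samples to soften the edges."""
    n = len(samples)
    fade = min(fade, n // 2)  # keep the two ramps from overlapping
    out = list(samples)
    for i in range(fade):
        gain = (i + 1) / (fade + 1)
        out[i] *= gain
        out[n - 1 - i] *= gain
    return out


def splice(stream: list[float], gap_start: int, gap_end: int,
           synthesized: list[float], fade: int = 2) -> list[float]:
    """Return a copy of `stream` with [gap_start, gap_end) replaced by the patch."""
    if not 0 <= gap_start <= gap_end <= len(stream):
        raise ValueError("gap must lie inside the stream")
    patch = apply_fade(fit_length(synthesized, gap_end - gap_start), fade)
    return stream[:gap_start] + patch + stream[gap_end:]


if __name__ == "__main__":
    stream = [0.2] * 3 + [0.0] * 4 + [0.2] * 3  # four silent samples = the gap
    print(splice(stream, 3, 7, [1.0, 1.0, 1.0, 1.0], fade=1))
```

Because the patch is fitted to the gap's length, the output has the same number of samples as the input, so the timing of the rest of the stream is unchanged.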

Potential Applications

This technology can be used in:

  • Real-time communication applications.
  • Transcription services.
  • Language learning tools.

Problems Solved

  • Maintains a continuous audio stream during poor network connectivity or network congestion.
  • Enhances user experience in real-time communication.
  • Reduces miscommunication due to missing words.

Benefits

  • Enhanced audio streaming quality.
  • Improved user experience in communication.
  • Increased accuracy in transcriptions.

Commercial Applications

  • Real-time communication platforms.
  • Transcription services for meetings and interviews.
  • Language learning applications for pronunciation practice.

Prior Art

Researchers can explore prior art related to speech-to-text technologies, real-time communication systems, and audio stream optimization.

Frequently Updated Research

Stay updated on advancements in speech recognition technology, real-time communication protocols, and audio processing algorithms.

Questions about the Technology

How does this technology improve user experience in real-time communication sessions?

This technology enhances user experience by predicting and synthesizing missing words in speech data, ensuring seamless communication flow.

What are the potential applications of this technology beyond real-time communication?

Apart from real-time communication, this technology can be applied in transcription services, language learning tools, and other speech-to-text applications.


Original Abstract Submitted

A server generates a continuous audio stream during periods of poor network connectivity or network congestion. The server obtains a first audio stream from a user device connected to a real-time communication session and detects speech data in the first audio stream. The server converts the speech data to text data that includes one or more words. The server determines that the text data is missing a word based on a context of the one or more words. The server synthesizes a predicted word for replacing the missing word in a voice of a user of the user device and combines the synthesized word with the first audio stream to generate the continuous audio stream. The server transmits the continuous audio stream to other user devices connected to the real-time communication session.