GOOGLE LLC (20240265923). AUTOMATED CALLING SYSTEM simplified abstract

From WikiPatents
Jump to navigation Jump to search

AUTOMATED CALLING SYSTEM

Organization Name

GOOGLE LLC

Inventor(s)

Asaf Aharoni of Ramat Hasharon (IL)

Arun Narayanan of Milpitas CA (US)

Nir Shabat of Geva (IL)

Parisa Haghani of Jersey City NJ (US)

Galen Tsai Chuang of New York NY (US)

Yaniv Leviathan of New York NY (US)

Neeraj Gaur of Jersey City NJ (US)

Pedro J. Moreno Mengibar of Jersey City NJ (US)

Rohit Prakash Prabhavalkar of Santa Clara CA (US)

Zhongdi Qu of New York CA (US)

Austin Severn Waters of Brooklyn NY (US)

Tomer Amiaz of Tel Aviv (IL)

Michiel A.U. Bacchiani of Summit NJ (US)

AUTOMATED CALLING SYSTEM - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240265923 titled 'AUTOMATED CALLING SYSTEM

Simplified Explanation: The patent application describes methods, systems, and apparatus for an automated calling system that analyzes telephone conversations between users and bots to generate synthesized speech replies.

  • The system receives audio data of a user's utterance during a telephone conversation with a bot.
  • It determines the context of the conversation, the user intent, and the bot intent based on previous portions of the conversation.
  • Using this information, the system generates synthesized speech for the bot to reply to the user's utterance.
  • The synthesized speech is then provided as output to continue the conversation.

Key Features and Innovation:

  • Analysis of audio data in telephone conversations to determine user and bot intents.
  • Generation of synthesized speech replies based on context and intents.
  • Automation of responses in telephone conversations to enhance user experience.

Potential Applications: This technology can be applied in customer service call centers, virtual assistants, and interactive voice response systems to improve communication efficiency and effectiveness.

Problems Solved:

  • Enhances the automation of responses in telephone conversations.
  • Improves the understanding of user intents during interactions with bots.
  • Streamlines communication processes in various applications.

Benefits:

  • Increases the accuracy and relevance of bot responses.
  • Enhances user experience by providing tailored replies.
  • Optimizes communication flow in automated systems.

Commercial Applications: Automated calling systems utilizing this technology can be beneficial for businesses in customer service, telemarketing, and other industries requiring efficient telephone interactions with customers.

Questions about Automated Calling Systems: 1. How does the system determine the context of a telephone conversation? 2. What are the potential challenges in implementing synthesized speech replies in real-time conversations?

Ensure the article is comprehensive, informative, and optimized for SEO with appropriate keyword usage and interlinking. Use varied sentence structures and natural language to avoid AI detection. Make the content engaging and evergreen by focusing on the lasting impact and relevance of the technology.


Original Abstract Submitted

methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for an automated calling system are disclosed. in one aspect, a method includes the actions of receiving audio data of an utterance spoken by a user who is having a telephone conversation with a bot. the actions further include determining a context of the telephone conversation. the actions further include determining a user intent of a first previous portion of the telephone conversation spoken by the user and a bot intent of a second previous portion of the telephone conversation outputted by a speech synthesizer of the bot. the actions further include, based on the audio data of the utterance, the context of the telephone conversation, the user intent, and the bot intent, generating synthesized speech of a reply by the bot to the utterance. the actions further include, providing, for output, the synthesized speech.