18197595. COMPUTERIZED INTELLIGENT ASSISTANT FOR CONFERENCES simplified abstract (Microsoft Technology Licensing, LLC)

From WikiPatents
COMPUTERIZED INTELLIGENT ASSISTANT FOR CONFERENCES

Organization Name

Microsoft Technology Licensing, LLC

Inventor(s)

Adi Diamant of Tel-Aviv (IL)

Xuedong Huang of Bellevue WA (US)

Karen Master Ben-dor of Kfar-Saba (IL)

Eyal Krupka of Redmond WA (US)

Raz Halaly of Ness Ziona (IL)

Yoni Smolin of Shimshit (IL)

Ilya Gurvich of Haifa (IL)

Aviv Hurvitz of Tel-Aviv (IL)

Lijuan Qin of Redmond WA (US)

Wei Xiong of Bellevue WA (US)

Shixiong Zhang of Beijing (CN)

Lingfeng Wu of Bothell WA (US)

Xiong Xiao of Bothell WA (US)

Ido Leichter of Haifa (IL)

Moshe David of Bney Braq (IL)

Amit Kumar Agarwal of Bellevue WA (US)

COMPUTERIZED INTELLIGENT ASSISTANT FOR CONFERENCES - A simplified explanation of the abstract

This abstract first appeared for US patent application 18197595, titled 'COMPUTERIZED INTELLIGENT ASSISTANT FOR CONFERENCES'.

Simplified Explanation

The patent application describes a method for facilitating a remote conference in which dedicated processing components ("machines") analyze digital video and audio signals to produce a transcript attributed to individual speakers. Here are the key points:

  • The method involves receiving a digital video and a computer-readable audio signal.
  • A face recognition machine is used to identify the face of a first conference participant in the video.
  • A speech recognition machine is used to convert the audio signal into a first text.
  • An attribution machine attributes the first text to the first conference participant.
  • A second audio signal is processed in the same way, resulting in a second text attributed to a second conference participant.
  • A transcription machine automatically creates a transcript that includes the attributed texts of both participants.
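
The steps above can be sketched in a few lines of Python. This is only an illustration of the data flow, not the implementation described in the application: the names AttributedText, attribute_segment, create_transcript, and the two stub recognizers are hypothetical, and the sketch assumes that each audio signal arrives paired with the video in which its speaker is visible, which the abstract does not specify. Real face and speech recognition models would replace the lookup tables.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class AttributedText:
    """A recognized text together with the participant it is attributed to."""
    participant: str
    text: str


def attribute_segment(
    video: str,
    audio: str,
    face_recognizer: Callable[[str], str],
    speech_recognizer: Callable[[str], str],
) -> AttributedText:
    """Attribution step: pair the recognized face with the recognized text."""
    participant = face_recognizer(video)   # face recognition machine
    text = speech_recognizer(audio)        # speech recognition machine
    return AttributedText(participant, text)


def create_transcript(
    segments: List[Tuple[str, str]],
    face_recognizer: Callable[[str], str],
    speech_recognizer: Callable[[str], str],
) -> str:
    """Transcription step: one 'participant: text' line per segment."""
    lines = []
    for video, audio in segments:
        item = attribute_segment(video, audio, face_recognizer, speech_recognizer)
        lines.append(f"{item.participant}: {item.text}")
    return "\n".join(lines)


# Hypothetical stand-ins for real recognition models (assumption: lookup tables).
FACES = {"frame-1": "Participant A", "frame-2": "Participant B"}
SPEECH = {"audio-1": "Shall we start?", "audio-2": "Yes, go ahead."}


def stub_face_recognizer(video: str) -> str:
    return FACES.get(video, "Unknown participant")


def stub_speech_recognizer(audio: str) -> str:
    return SPEECH.get(audio, "")


transcript = create_transcript(
    [("frame-1", "audio-1"), ("frame-2", "audio-2")],
    stub_face_recognizer,
    stub_speech_recognizer,
)
print(transcript)
```

Passing the recognizers in as functions keeps the attribution and transcription steps independent of any particular recognition model, which mirrors the separation into distinct machines in the abstract.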

Potential applications of this technology:

  • Remote conferencing platforms can use this method to automatically transcribe speech and attribute it to speakers in real time.
  • Video conferencing tools can use it to provide meeting transcripts that show who said what.
  • Educational platforms can utilize this method to automatically generate transcripts of online lectures or discussions.

Problems solved by this technology:

  • Manual transcription of conference calls or meetings can be time-consuming and prone to errors.
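  • Participants otherwise need to take detailed notes during a conference; an automatically generated transcript reduces that need.
  • Plain recordings do not show who said what; attributing each text to a recognized participant addresses this. (The word "translate" in the abstract refers to converting speech into text, not to translating between languages.)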

Benefits of this technology:

  • Saves time and effort by automating the transcription process.
  • Provides transcripts of conference calls or meetings in which each passage is attributed to its speaker.
  • Enhances accessibility by providing text-based records of audio content.
  • Facilitates communication and collaboration in remote settings.


Original Abstract Submitted

A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.