Spotify AB (20240233776). SYSTEMS AND METHODS FOR LYRICS ALIGNMENT simplified abstract

From WikiPatents

SYSTEMS AND METHODS FOR LYRICS ALIGNMENT

Organization Name

Spotify AB

Inventor(s)

Simon René Georges Durand of Paris (FR)

Daniel Stoller of Bonn (DE)

SYSTEMS AND METHODS FOR LYRICS ALIGNMENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240233776 titled 'SYSTEMS AND METHODS FOR LYRICS ALIGNMENT'.

Simplified Explanation: The patent application describes a method for aligning lyrics text with the audio of a media item. A first encoder generates embeddings for the symbols in the lyrics text, and a second encoder generates embeddings for an acoustic representation of the audio. The method determines similarities between the two sets of embeddings, aligns the lyrics text with the audio based on those similarities, and displays the aligned lyrics text while the audio is streamed.

  • **Key Features and Innovation:**
   - Obtaining lyrics text and audio for a media item
   - Generating embeddings for symbols in the lyrics text
   - Generating embeddings for the acoustic representation of the audio
   - Aligning the lyrics text and audio based on similarities between embeddings
   - Providing aligned lyrics text with streamed audio
  • **Potential Applications:**
   - Music streaming platforms
   - Karaoke applications
   - Language learning tools
  • **Problems Solved:**
   - Aligning lyrics with audio in media items
   - Enhancing user experience while streaming audio
  • **Benefits:**
   - Improved synchronization of lyrics and audio
   - Enhanced user engagement
   - Better understanding of the content being streamed
  • **Commercial Applications:**
   - "Enhanced Media Synchronization Method for Streaming Platforms"
  • **Questions about the Technology:**
   * **How does this method improve user experience while streaming audio?**
       - This method enhances user experience by providing synchronized lyrics text with the streamed audio, making it more engaging for the users.
   * **What are the potential commercial uses of this technology?**
       - This technology can be utilized in music streaming platforms, karaoke applications, and language learning tools to enhance the user experience and engagement.
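
To illustrate how synchronized lyrics might be shown during playback, the sketch below looks up the lyric line that is active at a given playback position. The abstract does not describe how the display step works; the `(start_time, text)` data layout, the `current_line` function, and the placeholder lines are all assumptions made for this example.

```python
from bisect import bisect_right

def current_line(aligned, position_s):
    """Return the lyric line active at playback position `position_s` (seconds).

    `aligned` is a hypothetical list of (start_time_s, text) pairs sorted by
    start time. Returns None if playback has not reached the first line yet.
    """
    starts = [start for start, _ in aligned]
    # Index of the last line whose start time is <= the playback position.
    idx = bisect_right(starts, position_s) - 1
    return aligned[idx][1] if idx >= 0 else None

# Placeholder alignment output (illustrative values, not real data).
aligned_lyrics = [
    (0.0, "first line"),
    (4.2, "second line"),
    (9.5, "third line"),
]

print(current_line(aligned_lyrics, 5.0))
```

A player could call such a function on each playback-position update and highlight the returned line.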


Original Abstract Submitted

A method includes obtaining lyrics text and audio for a media item and generating, using a first encoder, a first plurality of embeddings representing symbols that appear in the lyrics text for the media item. The method includes generating, using a second encoder, a second plurality of embeddings representing an acoustic representation of the audio for the media item. The method includes determining respective similarities between embeddings of the first plurality of embeddings and embeddings of the second plurality of embeddings and aligning the lyrics text and the audio for the media item based on the respective similarities. The method includes, while streaming the audio for the media item, providing, for display, the aligned lyrics text with the streamed audio.
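
The similarity and alignment steps in the abstract can be illustrated with a minimal sketch. The abstract does not specify the encoders, the similarity measure, or the alignment procedure, so everything here is an assumption: hand-written toy vectors stand in for the outputs of the two encoders, cosine similarity is used as the similarity measure, and a simple monotonic dynamic-programming pass (each symbol covers at least one audio frame, in text order) stands in for the alignment step.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def similarity_matrix(text_emb, audio_emb):
    """Rows are lyric symbols, columns are audio frames."""
    return [[cosine(t, a) for a in audio_emb] for t in text_emb]

def align_monotonic(sim):
    """Assign every audio frame to one symbol, maximizing total similarity.

    Assumption (not from the abstract): symbols occur in text order and
    each symbol covers at least one frame. Returns frame -> symbol index.
    """
    n_sym, n_frm = len(sim), len(sim[0])
    if n_frm < n_sym:
        raise ValueError("need at least one frame per symbol")
    neg = float("-inf")
    score = [[neg] * n_frm for _ in range(n_sym)]
    # back[i][j] is 1 if frame j is the first frame of symbol i, else 0.
    back = [[0] * n_frm for _ in range(n_sym)]
    score[0][0] = sim[0][0]
    for j in range(1, n_frm):
        for i in range(min(j + 1, n_sym)):
            stay = score[i][j - 1]
            advance = score[i - 1][j - 1] if i > 0 else neg
            if advance > stay:
                score[i][j] = advance + sim[i][j]
                back[i][j] = 1
            else:
                score[i][j] = stay + sim[i][j]
    # Backtrack from the last symbol at the last frame.
    path = [0] * n_frm
    i = n_sym - 1
    for j in range(n_frm - 1, -1, -1):
        path[j] = i
        i -= back[i][j]
    return path

# Toy embeddings (illustrative only): 3 lyric symbols, 6 audio frames.
text_emb = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
audio_emb = [[1, 0, 0], [0.9, 0.1, 0], [0, 1, 0],
             [0, 1, 0.1], [0, 0, 1], [0.1, 0, 1]]

path = align_monotonic(similarity_matrix(text_emb, audio_emb))
print(path)  # [0, 0, 1, 1, 2, 2]
```

Multiplying the first frame index of each symbol by the frame duration (a parameter of whatever acoustic representation is used) would turn this path into per-symbol start times for display.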