20240022767. DYNAMIC COMPRESSION OF AUDIO-VISUAL DATA simplified abstract (International Business Machines Corporation)

From WikiPatents
Jump to navigation Jump to search

DYNAMIC COMPRESSION OF AUDIO-VISUAL DATA

Organization Name

International Business Machines Corporation

Inventor(s)

Fang Lu of Billerica MA (US)

Jeremy R. Fox of Georgetown TX (US)

Tushar Agrawal of West Fargo ND (US)

Sarbajit K. Rakshit of Kolkata (IN)

DYNAMIC COMPRESSION OF AUDIO-VISUAL DATA - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240022767 titled 'DYNAMIC COMPRESSION OF AUDIO-VISUAL DATA

Simplified Explanation

The disclosed patent application describes techniques for dynamically compressing audio-visual data using a digital encoding method, digital compression method, and a system. The digital encoding method involves identifying a scene captured by a sensor, which includes multiple objects. The method further involves analyzing the objects to determine which ones can be redeveloped by a GAN (Generative Adversarial Network) regeneration module to a certain quality level, and which ones cannot. An optimized capture plan is then generated based on this analysis. The scene is encoded according to the optimized capture plan to produce a digital media file.

  • Techniques for dynamic compression of audio-visual data
  • Digital encoding method for identifying scenes and objects
  • Analysis of objects to determine redevelopability using GAN regeneration module
  • Generation of optimized capture plan based on analysis
  • Encoding of scene according to optimized capture plan to produce digital media file

Potential Applications:

  • Video streaming platforms that require efficient compression techniques
  • Video surveillance systems that need to store large amounts of footage
  • Virtual reality and augmented reality applications that require real-time compression and decompression of audio-visual data

Problems Solved:

  • Efficient compression of audio-visual data to reduce storage and bandwidth requirements
  • Identification and redevelopment of objects in a scene to improve overall quality
  • Optimization of capture plan to prioritize important objects and reduce redundancy

Benefits:

  • Reduced storage and bandwidth requirements for audio-visual data
  • Improved quality of redeveloped objects using GAN regeneration module
  • Real-time optimization of capture plan for efficient encoding
  • Enhanced user experience in video streaming, surveillance, and virtual reality applications


Original Abstract Submitted

disclosed are techniques for dynamic compression of audio-visual data, including a digital encoding method, digital compression method, and a system. one embodiment of the digital encoding method may comprise identifying a scene projected onto a sensor, wherein the scene comprises a plurality of objects. some embodiments may further comprise analyzing the plurality of objects to determine which of the plurality objects can be redeveloped by a gan regeneration module to a threshold quality level and which of the plurality of objects cannot be redeveloped with the gan regeneration module to the threshold quality level. some embodiments may further comprise generating an optimized capture plan based on the analyzing. some embodiments may further comprise encoding the scene pursuant to the optimized capture plan to produce a digital media file.