US Patent Application 17829012. OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS simplified abstract

From WikiPatents
Jump to navigation Jump to search

OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS

Organization Name

Microsoft Technology Licensing, LLC==Inventor(s)==

[[Category:Rajsekhar Das of Sammamish WA (US)]]

[[Category:Gregory David Garbern of Seattle WA (US)]]

OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS - A simplified explanation of the abstract

This abstract first appeared for US patent application 17829012 titled 'OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS

Simplified Explanation

The patent application describes techniques for efficient data deduplication operations using stability tags.

  • Stability tags are introduced to enable efficient data deduplication.
  • A stability tag is set for a block of data before analysis by the deduplication process.
  • The block of data is considered to be in a known state while the stability tag is set.
  • An identifier is generated based on the content of the block of data.
  • Later, if a second block of data is found with a matching identifier, it is a potential candidate for deduplication.
  • The deduplication process checks the stability tags of both blocks of data.
  • If the stability tags are still set, indicating no modification to the blocks of data, one of them can be scheduled for deduplication.


Original Abstract Submitted

The techniques disclosed herein enable systems to efficiently perform data deduplication operations through the introduction of stability tags. To achieve this, a stability tag for an associated block of data is set prior to analysis by a data deduplication process. While the stability tag is set, the block of data is understood by the system to be in a known state. An identifier is then generated based on the content within the block of data. At a later time, the deduplication process detects a second block of data that results in a matching identifier, indicating a potential candidate for deduplication. In response, the deduplication process returns to the stability tag of the first block of data and the second block of data. If the stability tags are still set, indicating no modification to the blocks of data, one of the blocks of data is eligible to be scheduled for deduplication.