US Patent Application 17829012. OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS simplified abstract
OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS
Organization Name
Microsoft Technology Licensing, LLC
Inventor(s)
Rajsekhar Das of Sammamish WA (US)
Gregory David Garbern of Seattle WA (US)
OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS - A simplified explanation of the abstract
This abstract first appeared for US patent application 17829012, titled 'OPTIMIZATIONS FOR DATA DEDUPLICATION OPERATIONS'.
Simplified Explanation
The patent application describes techniques that use stability tags to make data deduplication operations efficient:
- A stability tag is set for a block of data before analysis by the deduplication process.
- The block of data is considered to be in a known state while the stability tag is set.
- An identifier is generated based on the content of the block of data.
- If the deduplication process later finds a second block of data that produces a matching identifier, the two blocks are a potential candidate for deduplication.
- The deduplication process then re-checks the stability tags of both blocks of data.
- If the stability tags are still set, indicating no modification to the blocks of data, one of them can be scheduled for deduplication.
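The steps above can be illustrated with a minimal, single-threaded sketch. It is not the implementation claimed in the application: the `Block` and `Deduplicator` classes, the `stable` flag, the rule that a write clears the tag, and the use of SHA-256 as the content-based identifier are all assumptions made for illustration.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical block of data carrying a stability tag."""
    data: bytes
    stable: bool = False  # stability tag; set means "in a known state"

    def write(self, new_data: bytes) -> None:
        # Assumption: any modification clears the stability tag.
        self.data = new_data
        self.stable = False


class Deduplicator:
    """Hypothetical deduplication process keyed on content identifiers."""

    def __init__(self) -> None:
        self.index = {}      # identifier -> first block seen with that content
        self.scheduled = []  # (duplicate, keeper) pairs queued for deduplication

    def analyze(self, block: Block) -> bool:
        """Return True if the block was scheduled for deduplication."""
        block.stable = True  # set the tag before analysis
        # Assumption: SHA-256 of the content serves as the identifier.
        ident = hashlib.sha256(block.data).hexdigest()

        first = self.index.get(ident)
        if first is None or first is block:
            self.index[ident] = block
            return False

        # Matching identifier: return to the tags of both blocks.
        if first.stable and block.stable:
            self.scheduled.append((block, first))
            return True

        # The earlier block changed after it was hashed, so its index
        # entry is stale; track the current block instead.
        self.index[ident] = block
        return False
```

In this sketch, if block `a` is analyzed and then modified through `write`, a later block `b` with the same original content is not scheduled, because `a`'s tag is no longer set. A third identical block `c` would then be scheduled against `b`, since both tags are still set.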
Original Abstract Submitted
The techniques disclosed herein enable systems to efficiently perform data deduplication operations through the introduction of stability tags. To achieve this, a stability tag for an associated block of data is set prior to analysis by a data deduplication process. While the stability tag is set, the block of data is understood by the system to be in a known state. An identifier is then generated based on the content within the block of data. At a later time, the deduplication process detects a second block of data that results in a matching identifier, indicating a potential candidate for deduplication. In response, the deduplication process returns to the stability tag of the first block of data and the second block of data. If the stability tags are still set, indicating no modification to the blocks of data, one of the blocks of data is eligible to be scheduled for deduplication.