18464188. SYSTEMS AND METHODS FOR SEARCHING DEDUPLICATED DATA simplified abstract (Dell Products L.P.)

From WikiPatents
Jump to navigation Jump to search

SYSTEMS AND METHODS FOR SEARCHING DEDUPLICATED DATA

Organization Name

Dell Products L.P.

Inventor(s)

Philip Shilane of Newtown PA (US)

SYSTEMS AND METHODS FOR SEARCHING DEDUPLICATED DATA - A simplified explanation of the abstract

This abstract first appeared for US patent application 18464188 titled 'SYSTEMS AND METHODS FOR SEARCHING DEDUPLICATED DATA

Simplified Explanation

The abstract describes a deduplicated storage system that receives a search term and uses metadata and segmented objects to find matches. The system identifies partial matches by finding the search term's prefix or suffix within a segment. These partial matches are then merged to form full matches. Each full match includes the fingerprints of the segments where the prefix and suffix were found. The system checks if these fingerprints have been recorded in the database and reports the object associated with them if a match is found.

  • The system stores objects in a deduplicated storage system.
  • Metadata and segmented objects are used to search for matches.
  • Partial matches are identified by finding the prefix or suffix of the search term within a segment.
  • Partial matches are merged to form full matches, including the fingerprints of the segments.
  • The system checks if the fingerprints have been recorded in the database.
  • If a match is found, the system reports the object associated with the fingerprints.

Potential Applications

  • Data storage and retrieval systems
  • Content management systems
  • Search engines
  • Document management systems

Problems Solved

  • Efficient storage and retrieval of deduplicated objects
  • Effective search and retrieval of objects based on search terms
  • Reducing storage space by eliminating duplicate objects

Benefits

  • Improved storage efficiency by deduplicating objects
  • Faster search and retrieval of objects
  • Reduced storage costs by eliminating duplicate objects


Original Abstract Submitted

A deduplicated storage system storing objects receives a search term. Storage includes metadata and segments into which the objects have been split and deduplicated. Metadata includes fingerprint sequences according to which the segments should be assembled. A partial match is found when a prefix of the term is found at an end of a segment or a suffix is found at a beginning of the segment. The partial matches are recorded and merged to form full matches. Each full match includes a first fingerprint of a first segment in which the prefix was found and a second fingerprint of a second segment in which the suffix was found. A check is made of whether a fingerprint and a next fingerprint from a first sequence of fingerprints associated with a first object have been recorded in the database. If so, the first object is reported as having the search term.