Invention Grant
- Patent Title: Hierarchical identification and mapping of duplicate data in a storage system
- Patent Title (中): 存储系统中重复数据的分层识别和映射
- Application No.: US13160474
- Application Date: 2011-06-14
- Publication No.: US09043292B2
- Publication Date: 2015-05-26
- Inventor: Giridhar Appaji Nag Yasa, Nagesh Panyam Chandrasekarasastry
- Applicant: Giridhar Appaji Nag Yasa, Nagesh Panyam Chandrasekarasastry
- Applicant Address: US CA Sunnyvale
- Assignee: NetApp, Inc.
- Current Assignee: NetApp, Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Perkins Coie LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/00 ; G06F17/30

Abstract:
The technique introduced here includes a system and method for identifying and mapping duplicate data objects referenced by data objects. The technique illustratively utilizes a hierarchical tree of fingerprints for each data object to compare the data objects and identify duplicate data blocks referenced by the data objects. A progressive comparison of the hierarchical trees starts from a top layer of the hierarchical trees and proceeds toward a base layer. Between the compared data objects (i.e., the compared hierarchical trees), the technique maps matching fingerprints only at the top-most layer of the hierarchical trees at which the fingerprints match. Lower layer matching fingerprints are neither compared nor mapped. Data blocks corresponding to the matching fingerprints are then deleted. Such an identification and mapping technique substantially reduces the amount of mapping metadata stored in data objects that have been subject to deduplication.
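The top-down comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The SHA-256 fingerprints, the fixed `fanout` grouping, the `Node` fields, and the function names `build_tree`, `index_tree`, and `map_duplicates` are all assumptions made for illustration. The sketch also assumes each data object has at least one block, and it only records mappings; deleting the duplicate blocks is not modeled.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Node:
    fingerprint: str
    level: int   # 0 is the base layer (individual data blocks)
    start: int   # index of the first block covered by this node
    end: int     # one past the last block covered by this node
    children: list = field(default_factory=list)


def _fp(data: bytes) -> str:
    # Assumption: SHA-256 stands in for whatever fingerprint function is used.
    return hashlib.sha256(data).hexdigest()


def build_tree(blocks, fanout=2):
    """Build a fingerprint tree bottom-up over a non-empty list of blocks."""
    layer = [Node(_fp(b), 0, i, i + 1) for i, b in enumerate(blocks)]
    level = 0
    while len(layer) > 1:
        level += 1
        parents = []
        for i in range(0, len(layer), fanout):
            group = layer[i:i + fanout]
            # A parent's fingerprint is derived from its children's fingerprints.
            fp = _fp("".join(c.fingerprint for c in group).encode())
            parents.append(Node(fp, level, group[0].start, group[-1].end, group))
        layer = parents
    return layer[0]


def index_tree(root):
    """Index every node of a tree by (level, fingerprint)."""
    index = {}
    stack = [root]
    while stack:
        node = stack.pop()
        index.setdefault((node.level, node.fingerprint), node)
        stack.extend(node.children)
    return index


def map_duplicates(root_a, root_b):
    """Walk tree B from the top layer down and record a mapping at the
    top-most layer where a fingerprint matches one in tree A.
    Nodes below a match are neither compared nor mapped."""
    index_a = index_tree(root_a)
    mappings = []

    def visit(node_b):
        match = index_a.get((node_b.level, node_b.fingerprint))
        if match is not None:
            mappings.append(((node_b.start, node_b.end), (match.start, match.end)))
            return  # stop here: lower layers are skipped
        for child in node_b.children:
            visit(child)

    visit(root_b)
    return mappings


object_a = [b"a", b"b", b"c", b"d"]
object_b = [b"a", b"b", b"x", b"d"]
# Each mapping is (block range in B, block range in A).
print(map_duplicates(build_tree(object_a), build_tree(object_b)))
```

In this example three of the four blocks are duplicates, but only two mappings are recorded: one at the middle layer covering blocks 0 and 1, and one at the base layer for block 3. That is the reduction in mapping metadata the abstract refers to.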
Public/Granted literature
- US20120323859A1 HIERARCHICAL IDENTIFICATION AND MAPPING OF DUPLICATE DATA IN A STORAGE SYSTEM; Public/Granted day: 2012-12-20