Patent Quick Search - fast search of global patents, free patent database for commercial use - IPRDB

1. Patent application

US20070043757A1 Storage reports duplicate file detection [in force]
Publication (announcement) No.: US20070043757A1
Publication (announcement) date: 2007-02-22
Application No.: US11206710
Filing date: 2005-08-17
Applicant: James Benton, Ran Kalach, Paul Oltean, Georgi Matev
Inventor: James Benton, Ran Kalach, Paul Oltean, Georgi Matev
IPC classification: G06F7/00
CPC classification: G06F17/30097, Y10S707/99937
Abstract: Described is a storage reports duplicate file detector that operates by receiving file records during a first scan of file system metadata. The detector computes a hash based on attributes in the record, and maintains the hash value in association with information that indicates whether a hash value corresponds to more than one file. In one implementation, the information corresponds to the amount of space wasted by duplication. The information is used to determine which hash values correspond to groups of potentially duplicate files, and eliminate non-duplicates. A second scan locates file information for each of the potentially duplicate files, and the file information is then used to determine which groups of potentially duplicate files are actually duplicate files.
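The two-scan idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the record shape (`FileRecord`), the choice of hashed attributes (case-folded name plus size), and the final confirmation step (comparing those same attributes rather than file contents) are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict, namedtuple

# Hypothetical record shape; the abstract does not say which attributes are used.
FileRecord = namedtuple("FileRecord", ["path", "name", "size"])


def attribute_hash(rec):
    # Assumption: the hash covers the case-folded file name and the size.
    key = "{}|{}".format(rec.name.lower(), rec.size).encode("utf-8")
    return hashlib.sha1(key).hexdigest()


def find_duplicates(records):
    # First scan: keep only a small tally per hash, not the file paths.
    tally = {}  # hash -> [file count, bytes wasted by extra copies]
    for rec in records:
        entry = tally.setdefault(attribute_hash(rec), [0, 0])
        if entry[0] > 0:
            entry[1] += rec.size
        entry[0] += 1

    # Eliminate hashes seen only once; the rest are potential duplicates.
    candidates = {h for h, (count, _) in tally.items() if count > 1}

    # Second scan: collect file information only for the candidates.
    groups = defaultdict(list)
    for rec in records:
        h = attribute_hash(rec)
        if h in candidates:
            # Group on the real attributes so hash collisions are split apart.
            groups[(h, rec.name.lower(), rec.size)].append(rec.path)

    # A group counts as a duplicate set only if it still has 2+ members.
    return sorted(sorted(paths) for paths in groups.values() if len(paths) > 1)
```

The first pass stores only a counter and a wasted-bytes total per hash, which is what keeps memory use low when most files turn out to be unique.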

2. Patent application

US20060236069A1 Method and system for efficient generation of storage reports [lapsed]
Publication (announcement) No.: US20060236069A1
Publication (announcement) date: 2006-10-19
Application No.: US11107977
Filing date: 2005-04-15
Applicant: Ran Kalach, James Benton, Paul Oltean, Georgi Matev
Inventor: Ran Kalach, James Benton, Paul Oltean, Georgi Matev
IPC classification: G06F12/00
CPC classification: G06F3/0653, G06F3/0605, G06F3/0689, G06F11/3485, G06F2201/88, Y10S707/99933, Y10S707/99934, Y10S707/99945
Abstract: Described is a method and system by which reports of storage usage in computer systems are generated in an efficient manner by consolidating multiple requests for reports into a minimal number of volume scans, including by intelligently selecting a scanning method (e.g., of file system metadata versus find-first/find-next) and by performing parallel scans on different volumes. Namespace consolidation scans namespaces together, so as to generate multiple reports from the same set of files, reducing the number of volume scans required to collect the data. Each volume scan may be a find-first/find-next directory-based scan, or a volume metadata database scan. Time consolidation groups independent storage report generations together, such as storage report requests received within an administrator-specified interval. Parallel scans of different volumes may be performed, subject to I/O and processing resource limitations, and so that volumes partitioned on the same spindle are not scanned in parallel.
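The time and namespace consolidation described in this abstract can be sketched roughly as follows. The request tuple layout, the `consolidate` function, and the rule that a window starts at the first request of a batch are assumptions for illustration; the abstract does not specify them, and scan-method selection and parallel scanning are left out.

```python
from collections import OrderedDict


def consolidate(requests, interval):
    """Group report requests into batches, then into one scan per volume.

    requests: iterable of (arrival_time, volume, namespace, report_name)
    interval: window length in the same unit as arrival_time
    """
    batches = []
    window_start = None
    for arrival, volume, namespace, report in sorted(requests):
        # Time consolidation: open a new batch once the window has elapsed.
        if window_start is None or arrival - window_start > interval:
            batches.append(OrderedDict())
            window_start = arrival
        # Namespace consolidation: one scan per volume serves every report.
        scan = batches[-1].setdefault(
            volume, {"namespaces": set(), "reports": []})
        scan["namespaces"].add(namespace)
        scan["reports"].append(report)
    return batches
```

Each batch maps a volume to the set of namespaces and the list of reports that a single scan of that volume would have to cover.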

3. Patent application

US20070043747A1 Storage reports file system scanner [in force]
Publication (announcement) No.: US20070043747A1
Publication (announcement) date: 2007-02-22
Application No.: US11206425
Filing date: 2005-08-17
Applicant: James Benton, Ran Kalach, Paul Oltean, Sarosh Havewala
Inventor: James Benton, Ran Kalach, Paul Oltean, Sarosh Havewala
IPC classification: G06F7/00
CPC classification: G06F17/30067, Y10S707/99942
Abstract: Described is a storage reports scanner that works to generate reports of storage usage in computer systems in an efficient manner. The scanner receives a set of namespaces for a file system volume from a storage reports engine. The scanner scans file system metadata to construct a directory table of entries corresponding to a directory tree of nodes representative of the hierarchy of directories of the file system volume. Each node corresponding to a namespace in the namespace set is marked as included. A second scan of the file system metadata determines, for each file, whether that file is in or under an included directory by accessing the directory table. For each file that is in or is under an included directory, file information is returned to the engine. The engine may request the scanner to provide full path information, which the scanner determines via the directory table.
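A minimal Python sketch of the directory-table approach in this abstract is given below. The record layouts `(dir_id, parent_id, name)` and `(parent_dir_id, file_name)`, the `/` path separator, and all function names are hypothetical; real file system metadata records look different.

```python
class DirEntry:
    def __init__(self, parent, name):
        self.parent = parent    # id of the parent directory, None for the root
        self.name = name
        self.included = False   # set when the directory is a requested namespace


def full_path(table, dir_id):
    # Walk parent links up to the root and join the names.
    parts = []
    while dir_id is not None:
        entry = table[dir_id]
        parts.append(entry.name)
        dir_id = entry.parent
    return "/".join(reversed(parts))


def build_directory_table(dir_records, namespaces):
    # First scan: one entry per directory record (dir_id, parent_id, name).
    table = {d: DirEntry(parent, name) for d, parent, name in dir_records}
    for dir_id, entry in table.items():
        entry.included = full_path(table, dir_id) in namespaces
    return table


def under_included(table, dir_id):
    # A file qualifies if its directory or any ancestor is marked included.
    while dir_id is not None:
        if table[dir_id].included:
            return True
        dir_id = table[dir_id].parent
    return False


def scan_files(table, file_records):
    # Second scan: file records are (parent_dir_id, file_name) pairs.
    return [full_path(table, parent) + "/" + name
            for parent, name in file_records
            if under_included(table, parent)]
```

Because inclusion is decided by walking parent links in the table, the second scan can process file records in whatever order the metadata yields them, without traversing the directory tree.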

4. Patent application

US20060235892A1 Generating storage reports using volume snapshots [lapsed]
Publication (announcement) No.: US20060235892A1
Publication (announcement) date: 2006-10-19
Application No.: US11107119
Filing date: 2005-04-15
Applicant: Ran Kalach, James Benton, Paul Oltean
Inventor: Ran Kalach, James Benton, Paul Oltean
IPC classification: G06F17/30
CPC classification: G06F11/3409, G06F11/3485, Y10S707/99953, Y10S707/99954
Abstract: Described is a method and system by which storage reports are generated from a volume snapshot set rather than the live volume or volumes, wherein a volume snapshot set comprises a representation or copy of one or more volumes at a single point-in-time. By scanning the snapshot, a consistent file system image is obtained. Scanning may take place by enumerating a volume's directories of files, or, when available, by accessing a file system metadata of file information (e.g., a master file table) separately maintained on the volume. With some (e.g., hardware-based) snapshot technologies, the snapshot can be transported to another computing system for scanning by that other computing system, thereby avoiding burdening a live system's resources when scanning. Accurate and consistent storage reports are thus obtained at a single point in time, independent of the number of volumes being scanned.

5. Patent application

US20060053259A1 Framework for taking shadow copies and performing backups in a networked environment [in force]
Publication (announcement) No.: US20060053259A1
Publication (announcement) date: 2006-03-09
Application No.: US10939189
Filing date: 2004-09-09
Applicant: Brian Berkowitz, Catharine Ingen, Paul Oltean, Ran Kalach, Reuven Lax
Inventor: Brian Berkowitz, Catharine Ingen, Paul Oltean, Ran Kalach, Reuven Lax
IPC classification: G06F12/16
CPC classification: G06F11/1458, G06F11/1464, G06F11/1466, G06F2201/84
Abstract: A framework for taking shadow copies and performing backups in systems that may have data spread across multiple machines. A requester communicates names to a primary coordinator and requests the creation of shadow copies of all the volumes associated with the names. The primary coordinator communicates with one or more writers and one or more secondary coordinators to create the shadow copies of the volumes. The primary and one or more secondary coordinators create shadow copies of one or more of the volumes that reside on the machines upon which they execute. After the shadow copies of the volumes have been created, the requester may obtain data from the shadow copies and create a consistent backup.

6. Patent application

US20060117070A1 Auto quota [lapsed]
Publication (announcement) No.: US20060117070A1
Publication (announcement) date: 2006-06-01
Application No.: US11000294
Filing date: 2004-11-30
Applicant: Ravinder Thind, Neal Christiansen, Ran Kalach, James Benton, Rajeev Nagar
Inventor: Ravinder Thind, Neal Christiansen, Ran Kalach, James Benton, Rajeev Nagar
IPC classification: G06F17/30
CPC classification: G06F17/30067, Y10S707/99942
Abstract: Method and system for establishing and maintaining quotas. An auto quota is defined and applied to a directory. Input and output is monitored to detect a successful operation that involves a subdirectory of the directory. A determination is made as to whether to apply a quota associated with the auto quota to the subdirectory. If the determination is that the quota is to be applied to the subdirectory, it is automatically applied.
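The auto-quota flow in this abstract can be illustrated with a small Python sketch. The `AutoQuotaManager` class, its method names, and the decision rule (apply the quota only to direct subdirectories that do not already have one) are assumptions; the abstract does not state how the determination is made, and a real system would observe I/O through a file system filter rather than an explicit callback.

```python
import posixpath


class AutoQuotaManager:
    def __init__(self):
        self.auto_quotas = {}  # directory -> limit to give new subdirectories
        self.quotas = {}       # directory -> limit actually applied

    def define_auto_quota(self, directory, limit_bytes):
        self.auto_quotas[directory.rstrip("/")] = limit_bytes

    def on_directory_created(self, path, succeeded):
        """Called for each monitored create operation; returns True if a
        quota was applied to the new subdirectory."""
        if not succeeded:
            return False  # only successful operations are considered
        path = path.rstrip("/")
        parent = posixpath.dirname(path)
        limit = self.auto_quotas.get(parent)
        # Assumed rule: direct children only, and never overwrite a quota.
        if limit is None or path in self.quotas:
            return False
        self.quotas[path] = limit
        return True
```

With this sketch, defining an auto quota on `/home` means that a later successful creation of `/home/alice` receives the limit automatically, while `/home/alice/docs` does not.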

7. Patent application

US20060117056A1 Method and system of detecting file system namespace changes and restoring consistency [lapsed]
Publication (announcement) No.: US20060117056A1
Publication (announcement) date: 2006-06-01
Application No.: US11000180
Filing date: 2004-11-30
Applicant: Sarosh Havewala, Ravinder Thind, Neal Christiansen, Ran Kalach, James Benton
Inventor: Sarosh Havewala, Ravinder Thind, Neal Christiansen, Ran Kalach, James Benton
IPC classification: G06F17/00
CPC classification: G06F17/30067, Y10S707/99943, Y10S707/99945, Y10S707/99948
Abstract: Method and system for maintaining namespace consistency between selected objects maintained by a file system and a filter associated therewith. Metadata regarding selected objects of a file system is maintained by a filter while the filter is attached to the file system and persisted in non-volatile storage. The namespace of the file system may be changed while the filter is unattached from the file system. Afterwards, when the filter is attached to the file system, the namespace of the filter is synchronized with the namespace of the file system for the selected objects.

8. Granted patent

US09823981B2 Backup and restore strategies for data deduplication [in force]
Publication (announcement) No.: US09823981B2
Publication (announcement) date: 2017-11-21
Application No.: US13045692
Filing date: 2011-03-11
Applicant: Ran Kalach, Chun Ho (Ian) Cheung, Paul Adrian Oltean, Mathew James Dickson
Inventor: Ran Kalach, Chun Ho (Ian) Cheung, Paul Adrian Oltean, Mathew James Dickson
IPC classification: G06F11/14, G06F3/06
CPC classification: G06F11/1469, G06F3/0641, G06F11/1451, G06F11/1453
Abstract: Techniques for backup and restore of optimized data streams are described. A chunk store includes each optimized data stream as a plurality of chunks including at least one data chunk and corresponding optimized stream metadata. The chunk store includes data chunks in a deduplicated manner. Optimized data streams stored in the chunk store are identified for backup. At least a portion of the chunk store is stored in backup storage according to an optimized backup technique, an un-optimized backup technique, an item level backup technique, or a data chunk identifier backup technique. Optimized data streams stored in the backup storage may be restored. A file reconstructor includes a callback module that generates calls to a restore application to request optimized stream metadata and any referenced data chunks from the backup storage. The file reconstructor reconstructs the data streams from the referenced data chunks.
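The relationship between a deduplicated chunk store, stream metadata, and callback-driven reconstruction can be sketched in Python as follows. Fixed-size chunking, SHA-256 chunk identifiers, and the `ChunkStore` and `reconstruct` names are assumptions for illustration; the abstract does not specify any of them, and the four backup techniques it lists are not modelled here.

```python
import hashlib


class ChunkStore:
    def __init__(self, chunk_size=4):
        self.chunk_size = chunk_size  # fixed-size chunking is an assumption
        self.chunks = {}    # chunk id -> bytes; each distinct chunk stored once
        self.streams = {}   # stream name -> ordered chunk ids (stream metadata)

    def add_stream(self, name, data):
        ids = []
        for i in range(0, len(data), self.chunk_size):
            chunk = data[i:i + self.chunk_size]
            chunk_id = hashlib.sha256(chunk).hexdigest()
            self.chunks.setdefault(chunk_id, chunk)  # skip chunks already held
            ids.append(chunk_id)
        self.streams[name] = ids


def reconstruct(name, get_metadata, get_chunk):
    # The two callbacks stand in for requests made to a restore application:
    # one returns the stream metadata, the other a referenced data chunk.
    return b"".join(get_chunk(chunk_id) for chunk_id in get_metadata(name))
```

Passing the lookups in as callbacks keeps the reconstructor independent of where the backup lives; in a test they can simply be `store.streams.__getitem__` and `store.chunks.__getitem__`.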

9. Granted patent

US08805837B2 Alternate data stream cache for file classification [in force]
Publication (announcement) No.: US08805837B2
Publication (announcement) date: 2014-08-12
Application No.: US12605451
Filing date: 2009-10-26
Applicant: Clyde Law, Paul Adrian Oltean, Ran Kalach, Nir Ben-Zvi, Matthias H. Wollnik
Inventor: Clyde Law, Paul Adrian Oltean, Ran Kalach, Nir Ben-Zvi, Matthias H. Wollnik
IPC classification: G06F17/30
CPC classification: G06F17/30115, G06F17/30598
Abstract: Described is caching classification-related metadata for a file in an alternate data stream of that file. When a file is classified (e.g., for data management), the classification properties are cached in association with the file, along with classification-related metadata that indicates the state of the file at the time of caching. The classification-related metadata in the alternate data stream is then useable in determining whether the classification properties are valid and up-to-date when next accessed, or whether the file needs to be reclassified. If the properties are valid and up-to-date, they may be used without requiring the computationally costly steps of reclassification. Also described is using more than one alternate data stream for the cache, and extending the classification-related metadata through a defined extension mechanism.
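The cache-validation logic in this abstract can be sketched in Python. To stay portable, the sketch keeps the cache in an in-memory dictionary as a stand-in for the file's alternate data stream, and it assumes that "the state of the file at the time of caching" means file size plus modification time; both choices are illustrative assumptions, as are the function names.

```python
import os

# Stand-in for the per-file alternate data stream: path -> cached entry.
_cache = {}


def file_state(path):
    # Assumption: size and modification time are enough to detect changes.
    st = os.stat(path)
    return (st.st_size, st.st_mtime_ns)


def get_classification(path, classify):
    """Return cached properties if still valid, otherwise reclassify.

    classify: the expensive function that computes properties for a path.
    """
    state = file_state(path)
    entry = _cache.get(path)
    if entry is not None and entry["state"] == state:
        return entry["properties"]   # valid and up to date: skip the work
    properties = classify(path)      # stale or missing: reclassify
    _cache[path] = {"state": state, "properties": properties}
    return properties
```

Storing the file state next to the properties is what lets the reader decide, without rerunning the classifier, whether the cached result still describes the current file.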

10. Patent application

US20120158672A1 Extensible Pipeline for Data Deduplication [in force]
Publication (announcement) No.: US20120158672A1
Publication (announcement) date: 2012-06-21
Application No.: US12970839
Filing date: 2010-12-16
Applicant: Paul Adrian Oltean, Ran Kalach, Ahmed M. El-Shimi, James Robert Benton
Inventor: Paul Adrian Oltean, Ran Kalach, Ahmed M. El-Shimi, James Robert Benton
IPC classification: G06F17/30
CPC classification: G06F17/30091, G06F17/3007
Abstract: The subject disclosure is directed towards data deduplication (optimization) performed by phases/modules of a modular data deduplication pipeline. At each phase, the pipeline allows modules to be replaced, selected or extended, e.g., different algorithms can be used for chunking or compression based upon the type of data being processed. The pipeline facilitates secure data processing, batch processing, and parallel processing. The pipeline is tunable based upon feedback, e.g., by selecting modules to increase deduplication quality, performance and/or throughput. Also described is selecting, filtering, ranking, sorting and/or grouping the files to deduplicate, e.g., based upon properties and/or statistical properties of the files and/or a file dataset and/or internal or external feedback.
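The idea of swapping chunking and compression modules per data type can be sketched in Python. The `DedupPipeline` class, the two example modules for each phase, and the selection policy in `pipeline_for` are hypothetical; the abstract names the phases but not any concrete algorithm, and feedback-based tuning is not modelled.

```python
import hashlib
import zlib


def fixed_chunker(size):
    # Chunking module: split data into fixed-size pieces.
    def chunk(data):
        return [data[i:i + size] for i in range(0, len(data), size)]
    return chunk


def no_compression(chunk):
    return chunk


def zlib_compression(chunk):
    return zlib.compress(chunk)


class DedupPipeline:
    def __init__(self, chunker, compressor):
        self.chunker = chunker        # replaceable chunking phase
        self.compressor = compressor  # replaceable compression phase
        self.store = {}               # chunk id -> stored (maybe compressed) bytes

    def process(self, data):
        ids = []
        for chunk in self.chunker(data):
            chunk_id = hashlib.sha256(chunk).hexdigest()
            if chunk_id not in self.store:  # store each distinct chunk once
                self.store[chunk_id] = self.compressor(chunk)
            ids.append(chunk_id)
        return ids


def pipeline_for(file_type):
    # Hypothetical policy: compress text, leave other data uncompressed.
    if file_type == "text":
        return DedupPipeline(fixed_chunker(8), zlib_compression)
    return DedupPipeline(fixed_chunker(16), no_compression)
```

Because each phase is just a callable passed to the constructor, a different chunker or compressor can be substituted without touching `process`.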
