Skip to content

checksums for chunks #392

@ttung

Description

@ttung

Problem description

Having checksums for individual chunks is good for verifying the integrity of the data we're loading. The existing mechanisms for checksumming data are inadequate for various reasons:

  1. Checksum of the entire array's data: This does not work for loading a subset of the data.
  2. Checksum of each individual chunk recorded by a filter as part of the chunk: This does not protect against chunks being swapped, and does not help for building a persistent cache for previously read chunks.

Recording the checksums in the .zarray file could work, but may be problematic for larger data sets.


see also:

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew features or improvements

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions