self-addressing-data

AI-Generated Content

This comprehensive explanation has been generated from 178 GitHub source documents. All source documents are searchable here.

Last updated: October 7, 2025

This content is meant to be consumed by AI agents via MCP. Click here to get the MCP configuration.
Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

Self-addressing data (SAD) is a data structure where a Self-Addressing Identifier (SAID) is cryptographically derived from and embedded within the data content itself, creating a mutually tamper-evident relationship where the identifier both addresses and verifies the integrity of its containing data.

No related concepts available

Comprehensive Explanation

self-addressing-data

Technical Definition

Self-addressing data (SAD) represents a fundamental cryptographic primitive in the KERI ecosystem where data content and its identifier are cryptographically bound through a self-referential mechanism. A SAD is formally defined as a representation of data content from which a SAID (Self-Addressing Identifier) is derived, where the SAID is both content-addressable and encapsulated by (self-referential to) its SAD.

The core innovation lies in the circular relationship: the SAID is computed from the serialized data that includes the SAID itself. This creates an immutable cryptographic binding where any modification to either the data content or the embedded SAID breaks the verifiable relationship, making tampering immediately evident.

Purpose in KERI/ACDC

SADs serve multiple critical functions in the KERI protocol suite:

Content Integrity: Provide cryptographic proof that data has not been modified
Self-Certification: Enable data to prove its own authenticity without external verification infrastructure
Compact References: Allow data structures to reference other data using SAIDs without embedding full content
Graduated Disclosure: Support privacy-preserving credential presentations by revealing SAIDs before full content
Verifiable Data Graphs: Enable construction of cryptographically linked data structures (ACDCs) forming directed acyclic graphs

Implementation Notes

Critical Implementation Requirements

Canonicalization

Field Ordering: Implementations MUST preserve insertion order in JSON objects. Use Python 3.7+ dict, JavaScript ES2015+ Object, or equivalent ordered map structures. Do NOT sort fields lexicographically.

Compact Serialization: Remove all unnecessary whitespace before SAID computation. Use json.dumps(separators=(',', ':')) in Python or JSON.stringify() without space parameters in JavaScript.

Consistent Encoding: Use UTF-8 encoding consistently. Normalize Unicode strings to NFC form before serialization.

Placeholder Mechanics

Length Matching: The placeholder MUST be exactly the same length as the final SAID. For Blake3-256 text encoding, use 44 # characters.

Field Position: The SAID field (typically d) must be in the correct position according to the schema. For ACDCs, d appears after v (version) in top-level objects.

Hash Algorithm Selection

Default Algorithm: Use Blake3-256 (derivation code E) for new implementations unless specific requirements dictate otherwise.

Algorithm Support: Implement support for multiple algorithms to enable verification of existing SADs and future algorithm transitions. Minimum support: Blake3-256, Blake2b-256, SHA3-256.

Derivation Code Validation: Always validate the derivation code before attempting verification. Reject SAIDs with unsupported or invalid codes.

Verification Process

Step-by-Step Verification:

Extract SAID from d field
Parse derivation code to determine hash algorithm
Serialize data structure in canonical form
Replace SAID field with placeholder of matching length
Compute hash using indicated algorithm
Encode hash with CESR derivation code
Compare computed SAID with extracted SAID

Nested SAD Verification: Verify nested SADs recursively from innermost to outermost. Each nested SAD must verify independently before verifying the parent SAD.

Performance Optimization

Caching: Cache computed SAIDs for frequently accessed data structures (schemas, credential templates). Invalidate cache only when structure changes.

Loading vLEI.wiki

Short Definition

Related Concepts

Comprehensive Explanation

self-addressing-data

Technical Definition

Purpose in KERI/ACDC

Implementation Notes

Critical Implementation Requirements

Canonicalization

Placeholder Mechanics

Hash Algorithm Selection

Verification Process

Performance Optimization

Type Classification

Cryptographic Properties

SAID Generation Algorithm

Underlying Algorithms

Security Properties

Key/Output Sizes

Data Format & Encoding

CESR Encoding Format

Text and Binary Representations

Derivation Codes

Usage in KERI/ACDC

In Which Event Types

Common Usage Patterns

Verification Procedures

Related Primitives

SAID (Self-Addressing Identifier)

Diger (Digest Primitive)

ACDC (Authentic Chained Data Container)

Composition Patterns

Implementation Considerations

Canonicalization Requirements

Placeholder Mechanics

Performance Optimization

Security Considerations

Interoperability

Advanced Topics

SAD Path Language

Transposable Signatures

Blinded SADs

SAID-Based Merkle Trees

Security Considerations

Common Pitfalls

Testing Requirements

Library Recommendations

Interoperability