SAID

AI-Generated Content

This comprehensive explanation has been generated from 180 GitHub source documents. All source documents are searchable here.

Last updated: October 7, 2025

This content is meant to be consumed by AI agents via MCP. Click here to get the MCP configuration.
Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

A Self-Addressing Identifier (SAID) is a cryptographic identifier that is deterministically generated from the content it identifies and then embedded within that content, creating a self-referential, content-addressable identifier with tamper-evident properties.

No related concepts available

Comprehensive Explanation

SAID (Self-Addressing Identifier)

Technical Definition

A Self-Addressing Identifier (SAID) is a specialized cryptographic primitive that serves as both an identifier and an integrity proof for data structures. According to the IETF SAID specification, a SAID is "an identifier which is deterministically generated out of the content, digest of the content" and is simultaneously content-addressable (cryptographically bound to the data) and self-referential (embedded within the data it identifies).

The fundamental innovation of SAIDs is resolving the inherent tension between traditional content-addressable identifiers (which cannot be self-referential because including the identifier changes the content) and self-referential identifiers (which traditionally lack cryptographic binding). SAIDs achieve both properties through a special derivation protocol that makes the identifier both embedded in and derived from the serialized data structure.

Purpose in KERI/ACDC

SAIDs serve multiple critical functions in the KERI ecosystem:

Integrity Verification: Any modification to the data structure invalidates the SAID, providing tamper-evidence
Content Addressing: SAIDs enable universal, unique retrieval of specific data versions
Compact References: In ACDC credentials, SAIDs allow compact disclosure where only the identifier is revealed initially
Cryptographic Commitment: SAIDs create verifiable commitments to data without revealing the data itself
Schema Identification: Schemas themselves are identified by SAIDs, ensuring schema immutability

Type Classification

SAIDs are classified as qualified cryptographic primitives in encoding, meaning they include a prepended derivation code that indicates the cryptographic algorithm used for digest computation. This qualification enables self-describing data structures that can be parsed without external schema information.

Implementation Notes

Critical Implementation Requirements

Canonical Serialization

Field Ordering: KERI/ACDC mandates insertion-ordered field maps. The serialization must preserve the order in which fields were added to the data structure. Lexicographic (alphabetical) ordering is explicitly NOT used.

Compact JSON: SAID computation requires compact JSON serialization with:

No whitespace between elements
No newlines
Consistent UTF-8 encoding
No trailing commas

Example canonical form:

{"d":"############################################","i":"EpDA1n-WiBA0A8YOqnKrB-wWQYYC49i5zY_qrIZIicQg","name":"Alice"}

Dummy String Requirements

The dummy string must:

Use the # character (ASCII 35)
Match the exact length of the final SAID (44 characters for Blake3-256 text encoding)
Be placed in the d field before digest computation

Recursive Computation for Nested Structures

For ACDCs with multiple sections:

Compute innermost SAIDs first: Start with leaf-level sections (individual attributes, rules)
Embed child SAIDs: Replace dummy strings with computed SAIDs in parent structures
Compute parent SAIDs: Hash the parent structure containing embedded child SAIDs
Proceed to root: Continue upward until the top-level SAID is computed

Algorithm Selection

Recommended: Blake3-256 (CESR code E)

Required for vLEI credentials per GLEIF governance framework
Provides 128-bit collision resistance
High performance
Post-quantum resistant

Supported Alternatives: Blake2b-256 (F), Blake2s-256 (G), SHA3-256 (H)

Deprecated: SHA2-256 (I) - not recommended for new implementations

Verification Implementation

# Pseudocode for SAID verification
def verify_said(data_structure):
    # Extract SAID
    said = data_structure['d']
    
    # Parse derivation code to identify algorithm
    algorithm = parse_cesr_code(said[0])
    
    # Create copy with dummy string
    verification_copy = data_structure.copy()
    verification_copy['d'] = '#' * len(said)
    
    # Canonicalize (compact JSON, insertion order)
    canonical = canonicalize(verification_copy)
    
    # Compute digest
    computed_digest = algorithm.hash(canonical)
    
    # CESR encode with derivation code
    computed_said = cesr_encode(algorithm.code, computed_digest)
    
    # Compare
    return computed_said == said

Loading vLEI.wiki

Short Definition

Related Concepts

Comprehensive Explanation

SAID (Self-Addressing Identifier)

Technical Definition

Purpose in KERI/ACDC

Type Classification

Implementation Notes

Critical Implementation Requirements

Canonical Serialization

Dummy String Requirements

Recursive Computation for Nested Structures

Algorithm Selection

Verification Implementation

Cryptographic Properties

Underlying Algorithms

Security Properties

Key/Output Sizes

Data Format & Encoding

CESR Encoding Format

Text and Binary Representations

Derivation Codes

SAID Generation Protocol

Step 1: Reserve Field Space

Step 2: Insert Dummy String

Step 3: Compute Digest

Step 4: CESR Encode

Step 5: Replace Dummy

Verification Process

Usage in KERI/ACDC

In Key Event Logs (KELs)

In ACDC Credentials

In Transaction Event Logs (TELs)

Common Usage Patterns

Verification Procedures

Related Primitives

Self-Addressed Data (SAD)

Cryptographic Digests

CESR Primitives

Composition Patterns

Implementation Considerations

Canonical Serialization

Algorithm Selection

Dummy String Length

Recursive SAID Computation

Performance Optimization

Security Considerations

Performance Considerations

Common Pitfalls

Testing Recommendations