self-framing

AI-Generated Content

This comprehensive explanation has been generated from 75 GitHub source documents. All source documents are searchable here.

Last updated: October 7, 2025

This content is meant to be consumed by AI agents via MCP. Click here to get the MCP configuration.
Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

Self-framing is an encoding property where each primitive contains type, size, and value information in a single atomic unit, enabling parsers to extract elements from a stream without external delimiters or schemas by reading only the beginning of each element.

No related concepts available

Comprehensive Explanation

self-framing

Conceptual Definition

Self-framing is a fundamental encoding property in CESR where each primitive embeds its own metadata—specifically type and size information—directly within its encoding. This design enables parsers to determine exactly how many characters (in text domain) or bytes (in binary domain) to extract for a given element without parsing the element's content or relying on external schemas, delimiters, or encapsulation structures.

The core principle is that a stream of concatenated self-framing primitives can be parsed sequentially, with each primitive being extracted atomically based solely on information contained in its leading characters or bytes. This property eliminates the need for:

External delimiter characters between primitives
Wrapper envelopes around data structures
Pre-parsing to determine structural boundaries
Schema lookups to interpret element sizes

Self-framing operates through prepended codes (derivation codes, framing codes, or count codes) that encode both the type of primitive and sufficient information to calculate its total length. A parser reads these leading codes, determines the primitive's boundaries, extracts exactly that many characters/bytes, and continues to the next primitive—all without examining the primitive's actual content.

Historical Context

Traditional encoding schemes face a fundamental trade-off between human readability and parsing efficiency:

Text-based protocols (HTTP, SMTP, JSON) use delimiters and structural markers that are human-readable but require parsing the entire content to find boundaries. JSON, for example, requires parsing nested structures to determine where objects and arrays end, making it inherently non-self-framing.

Binary protocols (TCP, UDP, DNS) achieve compact representation but sacrifice readability and often require complex framing mechanisms or length-prefixed fields that are protocol-specific.

Hybrid formats (XML, JSON/CBOR, MessagePack) attempt to bridge this gap by offering both text and binary serializations, but they still rely on structural parsing rather than self-framing at the primitive level.

Implementation Notes

Implementation Considerations

Code Table Management

Implementations must maintain code tables mapping derivation codes to primitive types and lengths. These tables should be:

Version-aware: Support multiple CESR versions through version codes
Efficient: Use hash maps or switch statements for O(1) code lookup
Extensible: Allow registration of new primitive types

Stream Parsing

Parsers should implement:

Single-pass parsing: Read codes, extract primitives, continue—no backtracking
Buffer management: Handle partial primitives at buffer boundaries
Error recovery: Implement cold start mechanisms for malformed streams

24-Bit Alignment

When encoding primitives:

Calculate pad size: ps = (3 - (N mod 3)) mod 3 where N is raw binary length
Prepend lead bytes before Base64 conversion
Replace lead bytes with derivation code after conversion

Group Processing

For count codes:

Parse count code to determine number of primitives in group
Extract group boundary without parsing individual primitives
Enable routing of entire groups to processors

Domain Conversion

When converting between text and binary domains:

Preserve primitive boundaries through 24-bit alignment
Maintain self-framing property in both domains
Ensure round-trip conversion is lossless

Performance Optimization

For high-throughput applications:

Use group codes to enable parallel processing
Implement zero-copy extraction where possible
Cache code table lookups for frequently used codes

Loading vLEI.wiki

self-framing

Short Definition

Comprehensive Explanation

self-framing

Conceptual Definition

Historical Context

Implementation Notes

Implementation Considerations

Code Table Management

Stream Parsing

24-Bit Alignment

Group Processing

Domain Conversion

Performance Optimization

KERI's Approach

Dual-Domain Self-Framing

24-Bit Alignment Constraint

Code Table Architecture

Group Framing Codes

Cold Start Recovery

Streaming Protocol Foundation

Practical Implications

Use Cases

Benefits

Trade-offs

Loading vLEI.wiki

Short Definition

Related Concepts

Comprehensive Explanation

self-framing

Conceptual Definition

Historical Context

Implementation Notes

Implementation Considerations

Code Table Management

Stream Parsing

24-Bit Alignment

Group Processing

Domain Conversion

Performance Optimization

KERI's Approach

Dual-Domain Self-Framing

24-Bit Alignment Constraint

Code Table Architecture

Group Framing Codes

Cold Start Recovery

Streaming Protocol Foundation

Practical Implications

Use Cases

Benefits

Trade-offs