sniffable

AI-Generated Content

This comprehensive explanation has been generated from 4 GitHub source documents. All source documents are searchable here.

Last updated: September 21, 2025

This content is meant to be consumed by AI agents via MCP. Click here to get the MCP configuration.
Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

A stream property in CESR where data begins with a group code or field map, enabling parsers to immediately identify and process the stream format without prior context, solving the cold start problem in stream parsing.

No related concepts available

Comprehensive Explanation

sniffable

Technical Definition

A sniffable stream is a fundamental property in the CESR (Composable Event Streaming Representation) protocol ecosystem where a data stream begins with either a group code or field map that provides immediate format identification. This property enables parsers to determine the stream's structure and content type without requiring external context or pre-negotiated format agreements.

Formally, a stream S is sniffable if and only if:

S[0:n] ∈ {GroupCode ∪ FieldMap}

where n represents the length of the identifying prefix and the prefix belongs to the union of valid group codes and field maps defined in the CESR specification.

Core Architecture

Data Structures

Sniffable streams are built upon CESR's self-framing primitive architecture, where each stream begins with specific identifying markers:

Group Code Structure

GroupCode := {'-' + TypeCode + CountCode}
TypeCode := [A-Z, a-z, 0-9, -, _]
CountCode := Base64Count(2-5 chars)

Field Map Structure

FieldMap := {ObjectCode + FieldDefinitions}
ObjectCode := [A-Z, a-z] (single character)
FieldDefinitions := SerializedFieldMap

Three-Bit Combination Identification

The Parside parser utilizes unique three-bit combinations at the stream beginning to distinguish between formats:

Implementation Notes

Critical Implementation Details

Parser State Management

Thread Safety: Sniffable parsers must be thread-safe for concurrent stream processing:

class ThreadSafeSniffableParser:
    def __init__(self):
        self._lock = threading.RLock()
        self._format_cache = {}
    
    def sniff(self, stream: bytes) -> SniffResult:
        with self._lock:
            # Thread-safe format detection
            return self._internal_sniff(stream)

Performance Optimization

Format Detection Caching: Cache format detection results for repeated stream prefixes:

@lru_cache(maxsize=1024)
def cached_format_detection(prefix: bytes) -> FormatType:
    return detect_format_internal(prefix)

Lazy Parsing: Only parse version strings when needed:

class LazySniffResult:
    def __init__(self, stream: bytes, format_type: FormatType):
        self._stream = stream
        self._format_type = format_type
        self._version_info = None  # Lazy loaded
    
    @property
    def version_info(self) -> VersionInfo:
        if self._version_info is None:
            self._version_info = self._extract_version()
        return self._version_info

Security Best Practices

Input Validation: Always validate stream length before accessing bytes:

def safe_sniff(stream: bytes) -> SniffResult:
    if len(stream) < MIN_SNIFFABLE_LENGTH:
        raise InsufficientDataError("Stream too short for sniffing")
    
    # Validate first 8 bytes are within expected ranges
    if not all(0 <= b <= 255 for b in stream[:8]):
        raise InvalidStreamError("Invalid byte values in stream prefix")

Resource Limits: Prevent memory exhaustion from large version strings:

MAX_VERSION_STRING_LENGTH = 1024

def extract_version_safely(stream: bytes) -> str:
    # Limit search scope to prevent DoS
    search_window = stream[:MAX_VERSION_STRING_LENGTH]
    match = VERSION_PATTERN.search(search_window)
    if match:
        return match.group(1).decode('utf-8')
    raise VersionNotFoundError("Version string not found in safe window")

Testing Strategies

Property-Based Testing: Use hypothesis to generate edge cases:

from hypothesis import given, strategies as st

@given(st.binary(min_size=4, max_size=1024))
def test_sniffable_property(stream_data):
    # Test that sniffable detection is deterministic
    result1 = sniffer.is_sniffable(stream_data)
    result2 = sniffer.is_sniffable(stream_data)
    assert result1 == result2

Feature	CESR v1.0	KERI v1.1	ACDC v1.0
Group Code Detection	✓	✓	✓
Field Map Support	✓	✓	✓
Multi-format Sniffing	✓	✓	✓
24-bit Alignment	✓	✓	✓
Composability	✓	✓	✓

Loading vLEI.wiki

Short Definition

Related Concepts

Comprehensive Explanation

sniffable

Technical Definition

Core Architecture

Data Structures

Group Code Structure

Field Map Structure

Three-Bit Combination Identification

Implementation Notes

Critical Implementation Details

Parser State Management

Performance Optimization

Security Best Practices

Testing Strategies

Protocol Mechanics

Stream Detection Algorithm

Cold Start Problem Resolution

Cryptographic Foundation

Security Properties

Threat Model

Implementation Specifications

API Design

Parside Parser Interface

SniffResult Structure

Code Architecture

Format Detection State Machine

Version String Extraction

Integration Requirements

CESR Primitive Compatibility

Parser Integration Points

Technical Relationships

CESR Integration

KERI Protocol Integration

ACDC Integration

Performance Analysis

Computational Complexity

Memory Footprint

Network Overhead

Benchmarks

Edge Cases & Error Handling

Failure Modes

Recovery Procedures

Input Validation

Standards & Specifications

CESR Specification Compliance

Version History

Compliance Matrix

Production Considerations

Deployment Architectures

Monitoring & Observability

Operational Procedures

Security Operations

Common Pitfalls