sniffer

AI-Generated Content

This comprehensive explanation has been generated from 14 GitHub source documents.

Last updated: October 7, 2025

Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

The sniffer is a format detection component within KERI's Parside parser that automatically identifies serialization formats (CESR binary, CESR text, JSON, CBOR, MessagePack) in streaming data by examining initial codes and markers, enabling proper parsing dispatch without prior configuration.

Comprehensive Explanation

Implementation Overview

The sniffer is a specialized detection component implemented within KERI's Parside parser infrastructure. It serves as the first-stage analyzer in the CESR (Composable Event Streaming Representation) stream processing pipeline, responsible for identifying serialization formats before parsing begins.

Purpose and Scope

The sniffer addresses the fundamental cold start problem in stream parsing: determining how to begin processing structured data in an incoming stream without prior knowledge of its format. In KERI's multi-format ecosystem, streams may contain:

CESR binary - Binary-encoded CESR primitives
CESR text - Text-encoded CESR primitives
JSON - JavaScript Object Notation
CBOR - Concise Binary Object Representation
MGPK (MessagePack) - Binary serialization format

The sniffer enables automatic format detection by examining recognizable markers at the beginning of streams, allowing the parser to dispatch processing to the appropriate handler without manual configuration or extensive buffering.

Relationship to Protocol Specifications

The sniffer implements CESR's core design principle of self-describing streams. CESR streams are designed to be sniffable: they contain sufficient information at their beginning to enable automatic format detection. This capability is essential for mixed-format streams, where the parser must handle transitions between different serialization formats within a single stream.

Implementation Notes

Critical Implementation Details

Format Detection Strategy

Implementations must examine the initial bytes/characters of the stream to identify format markers:

CESR formats: Look for group codes or object codes with unique three-bit signatures
JSON: Detect opening brace { and locate version string field via regex
CBOR: Identify CBOR major type markers in initial bytes
MessagePack: Detect MessagePack format markers
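
The detection step above can be sketched as a lookup on the top three bits of the first byte. This is a minimal illustration, assuming the starting-tritet assignments from the CESR specification's cold start table; the names `TRITETS` and `sniff` and the returned labels are hypothetical and are not taken from Parside.

```python
# Assumed mapping from the first three bits of a stream to its format.
# The value 0b000 is deliberately left out and treated as not sniffable here.
TRITETS = {
    0b001: "cesr-text",    # text-domain count code, starts with '-'
    0b010: "cesr-text",    # text-domain op code, starts with '_'
    0b011: "json",         # JSON map, starts with '{'
    0b100: "mgpk",         # MessagePack fixmap
    0b101: "cbor",         # CBOR map (major type 5)
    0b110: "mgpk",         # MessagePack map16 / map32
    0b111: "cesr-binary",  # binary-domain count or op code
}


def sniff(stream: bytes) -> str:
    """Return a format label for the stream based on its first byte."""
    if not stream:
        raise ValueError("need at least one byte to sniff")
    tritet = stream[0] >> 5  # keep only the top three bits
    try:
        return TRITETS[tritet]
    except KeyError:
        raise ValueError(f"stream is not sniffable (tritet {tritet:03b})") from None


print(sniff(b'{"v":"KERI10JSON00012b_"}'))  # json
print(sniff(b"-AAB"))                       # cesr-text
```

Because the decision uses a single byte and a dictionary lookup, it needs no look-ahead and no backtracking.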

Version String Extraction

For non-CESR formats (JSON, CBOR, MessagePack):

Apply format-specific regex patterns to locate the version string field
Parse the version string to extract the total length of the serialized structure
Use this length to determine the exact boundary where the current format section ends
Resume sniffer operation after the bounded section

This approach is necessary because, unlike CESR primitives, these formats are not self-framing.
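
The four steps above can be sketched as a loop that reads the declared size from each version string and advances by that amount. The sketch assumes the KERI version 1 version-string layout (four-character protocol, one hex digit each for major and minor version, four-character kind, six hex digits of total size, then `_`); the names `VERSION_RE`, `SEARCH_WINDOW`, `split_frames`, and `with_size` are hypothetical, and the 64-byte search window is an arbitrary choice for illustration.

```python
import re

# Assumed KERI version 1 layout: protocol, major, minor, kind, size, "_".
VERSION_RE = re.compile(
    rb"(?P<proto>[A-Z]{4})(?P<major>[0-9a-f])(?P<minor>[0-9a-f])"
    rb"(?P<kind>[A-Z]{4})(?P<size>[0-9a-f]{6})_"
)
SEARCH_WINDOW = 64  # hypothetical limit on how far into a frame to look


def split_frames(stream: bytes) -> list[tuple[str, bytes]]:
    """Split concatenated version-string-framed messages into (kind, frame)."""
    frames = []
    offset = 0
    while offset < len(stream):
        match = VERSION_RE.search(stream, offset, offset + SEARCH_WINDOW)
        if match is None:
            raise ValueError(f"no version string near offset {offset}")
        size = int(match["size"], 16)  # total frame length in bytes
        if size < match.end() - offset or offset + size > len(stream):
            raise ValueError(f"declared size {size} does not fit the stream")
        frames.append((match["kind"].decode("ascii"), stream[offset:offset + size]))
        offset += size  # sniffing would resume at this boundary
    return frames


def with_size(raw: bytes) -> bytes:
    """Demo helper: replace the six-zero size placeholder with the real length."""
    return raw.replace(b"000000", b"%06x" % len(raw), 1)


first = with_size(b'{"v":"KERI10JSON000000_","t":"icp"}')
second = with_size(b'{"v":"KERI10JSON000000_","t":"rot","s":"1"}')
frames = split_frames(first + second)
print([kind for kind, _ in frames])
```

The sketch operates on a complete buffer; a streaming implementation would wait for more bytes instead of raising when the declared size exceeds the data received so far.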

Version Code Handling

When detecting CESR format:

Search for the CESR version count code at the top level
Extract the version identifier to determine which CESR code table to load
Maintain a default version for cold start scenarios where no explicit version code appears
Ensure version count codes are processed at top level (parser must "elevate" if nested)

Error Conditions

Implementations should handle:

Unrecognizable initial markers: Stream is not sniffable, requires alternative parsing strategy
Format/content mismatch: Detected format must match actual serialized content
Missing version information: Fall back to default version for CESR, reject for non-CESR
Malformed length information: Version string length extraction fails
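
One way to keep these conditions distinguishable is to give each its own exception type. The sketch below does this for a JSON frame only; every class and function name is hypothetical, the version-string pattern assumes the KERI version 1 layout, and the final `json.loads` check is just one possible way to confirm that the content matches the detected format.

```python
import json
import re


class SniffError(Exception):
    """Base class for the hypothetical sniffer errors below."""

class NotSniffable(SniffError): ...
class MissingVersion(SniffError): ...
class MalformedLength(SniffError): ...
class FormatMismatch(SniffError): ...


# Assumed version 1 JSON version-string field with a six-hex-digit size.
JSON_VERSION_RE = re.compile(
    rb'"v"\s*:\s*"[A-Z]{4}[0-9a-f]{2}JSON(?P<size>[0-9a-f]{6})_"'
)


def frame_json(buf: bytes) -> bytes:
    """Return the first JSON frame in buf or raise a specific SniffError."""
    if not buf or buf[0] >> 5 != 0b011:  # '{' has top bits 011
        raise NotSniffable("initial marker is not a JSON map start")
    match = JSON_VERSION_RE.search(buf, 0, 64)
    if match is None:
        raise MissingVersion("no version string; non-CESR input is rejected")
    size = int(match["size"], 16)
    # A streaming parser would wait for more data when size > len(buf);
    # this sketch treats any size that does not fit as malformed.
    if size < match.end() or size > len(buf):
        raise MalformedLength(f"declared size {size} is inconsistent")
    frame = buf[:size]
    try:
        json.loads(frame)
    except ValueError as exc:
        raise FormatMismatch("frame is not valid JSON") from exc
    return frame
```

A caller can catch `SniffError` to handle all four cases uniformly or catch an individual subclass to apply a specific recovery strategy.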

Performance Optimization

The sniffer should be optimized for:

Minimal buffering: Detection from initial bytes without extensive look-ahead
Single-pass processing: No backtracking required
Constant-time detection: Format identification should be an O(1) operation
