byzantine-fault-tolerance

AI-Generated Content

This comprehensive explanation has been generated from 62 GitHub source documents. All source documents are searchable here.

Last updated: October 7, 2025

This content is meant to be consumed by AI agents via MCP. Click here to get the MCP configuration.
Note: In rare cases it may contain LLM hallucinations.
For authoritative documentation, please consult the official GLEIF vLEI trainings and the ToIP Glossary.

Short Definition

Byzantine Fault Tolerance (BFT) is a property of distributed computing systems that enables them to reach consensus and maintain correct operation despite the presence of Byzantine faults—failures where components may behave arbitrarily, provide inconsistent information to different observers, or act maliciously. A BFT system can continue functioning correctly as long as at least two-thirds of the network reaches consensus, tolerating up to one-third faulty or malicious nodes.

No related concepts available

Comprehensive Explanation

byzantine-fault-tolerance

Conceptual Definition

Byzantine Fault Tolerance (BFT) represents a fundamental property of distributed computing systems that enables them to achieve reliable consensus despite the presence of Byzantine faults. A Byzantine fault occurs when a system component fails in a way that presents different symptoms to different observers, making it difficult for other components to determine whether the component has actually failed and what corrective action to take.

The term derives from the Byzantine Generals Problem, a classical thought experiment in distributed computing that illustrates the challenge of achieving agreement when some participants may be unreliable or malicious. In this allegory, several Byzantine generals must coordinate an attack on a city, but they can only communicate via messengers who may be intercepted or corrupted. The generals must reach consensus on whether to attack or retreat, despite knowing that some generals may be traitors attempting to prevent agreement.

A system achieves Byzantine Fault Tolerance when it can maintain correct operation as long as two-thirds of the network reaches consensus. This two-thirds threshold is mathematically significant: it ensures that honest nodes can always outvote malicious nodes, even when up to one-third of participants are faulty or adversarial. The system resists:

Independent node failures: Components failing in unpredictable ways
Manipulated messages: Messages altered or fabricated by specific nodes
Inconsistent behavior: Nodes presenting different information to different observers
Malicious actors: Participants deliberately attempting to disrupt consensus

Historical Context

The Byzantine Generals Problem was formalized by Leslie Lamport, Robert Shostak, and Marshall Pease in their 1982 paper "The Byzantine Generals Problem." This foundational work established the theoretical framework for understanding consensus in the presence of arbitrary failures.

Traditional distributed systems often assumed fail-stop behavior, where components either work correctly or stop completely. Byzantine faults are more challenging because faulty components continue operating while providing incorrect or inconsistent information. This makes Byzantine faults particularly relevant for:

Implementation Notes

Witness Pool Configuration

Implementing Byzantine fault tolerance in KERI requires careful witness pool configuration:

Minimum Witness Count: For meaningful BFT properties, deploy at least n = 3*f + 1 witnesses where f is the expected maximum faults. For example:

To tolerate 1 faulty witness: minimum 4 witnesses
To tolerate 2 faulty witnesses: minimum 7 witnesses
To tolerate 3 faulty witnesses: minimum 10 witnesses

Threshold Selection: Set the witness threshold M according to:

Conservative: M = N - F (requires all honest witnesses)
Balanced: M = (N + F + 1) / 2 (requires supermajority)
Aggressive: M = F + 1 (minimum for safety, reduces availability)

Geographic Distribution: Deploy witnesses across multiple geographic regions to prevent common-mode failures from network partitions, natural disasters, or regional infrastructure outages.

Organizational Diversity: Select witnesses from different organizations to prevent collusion and reduce single points of failure in governance.

TOAD Configuration

The Threshold of Accountable Duplicity should be set considering:

Security Requirements: Higher-stakes identifiers (legal entities, financial institutions) should use higher TOAD values approaching N - F.

Availability Requirements: Systems requiring high availability may use lower TOAD values, accepting slightly reduced security for faster confirmation.

Dynamic Adjustment: TOAD can be modified through rotation events, allowing controllers to adjust security/availability trade-offs as requirements evolve.

Watcher Network Deployment

Enhance BFT properties through watcher networks:

Independent Monitoring: Deploy watchers operated by different entities than witnesses to provide independent verification.

Promiscuous Mode: Watchers run in promiscuous mode, collecting and verifying all key events they observe without requiring controller designation.

Duplicity Detection: Watchers compare key event logs from multiple witnesses, detecting inconsistencies that indicate Byzantine behavior.

Loading vLEI.wiki

byzantine-fault-tolerance

Short Definition

Comprehensive Explanation

byzantine-fault-tolerance

Conceptual Definition

Historical Context

Implementation Notes

Witness Pool Configuration

TOAD Configuration

Watcher Network Deployment

KERI's Approach

Witness-Based BFT Architecture

KAACE: Simplified BFT Consensus

Distinctive Properties

Threshold of Accountable Duplicity (TOAD)

Supermajority and Ample

Practical Implications

Security Benefits

Operational Trade-offs

Use Cases

Eclipse Attack Mitigation

Performance Characteristics

Conclusion

Performance Optimization

Security Considerations

Loading vLEI.wiki

Short Definition

Related Concepts

Comprehensive Explanation

byzantine-fault-tolerance

Conceptual Definition

Historical Context

Implementation Notes

Witness Pool Configuration

TOAD Configuration

Watcher Network Deployment

KERI's Approach

Witness-Based BFT Architecture

KAACE: Simplified BFT Consensus

Distinctive Properties

Threshold of Accountable Duplicity (TOAD)

Supermajority and Ample

Practical Implications

Security Benefits

Operational Trade-offs

Use Cases

Eclipse Attack Mitigation

Performance Characteristics

Conclusion

Performance Optimization

Security Considerations