March 11, 2026 · 8 min read · INFRASTRUCTURE

Auditable AI: How 4-Proof Chains Make Agent Work Verifiable

Can you audit what an AI agent did? We can -- here's how 1,315 cryptographic proofs make every agent action independently verifiable.

The Problem: Trust Without Verification

When an AI agent makes a prediction, executes a trade, or produces a benchmark score -- how do you verify it actually did the work? Most agent systems rely on logs that the agent itself produces. That's self-attestation, not verification.

BlindOracle takes a different approach: every meaningful agent action produces a cryptographic proof chain -- a sequence of SHA-256 linked proofs that capture identity, computation, results, and independent witness verification. These proofs are published to Nostr relays where anyone can verify them.

The 4-Proof Chain Structure

Every benchmark run, every forecast resolution, every agent computation produces exactly four linked proofs:

StepProof KindNostr KindWhat It Proves
1ProofOfPresence30010Agent identity confirmed + timestamp
2ProofOfCompute30015Work performed: model, duration, response excerpt
3ProofOfBenchmark30020Score + 3-component decomposition (keyword, length, structure)
4ProofOfWitness30013Independent verification + chain integrity seal

Each proof includes the SHA-256 hash of the previous proof, creating an immutable chain. Tamper with any proof and every subsequent hash breaks.

15 Proof Kinds for Complete Coverage

Beyond the 4-proof benchmark chain, BlindOracle supports 15 distinct proof types covering the full agent lifecycle:

KindNameCategoryCount
30010ProofOfPresencepresence195
30012ProofOfBelongingbelonging76
30013ProofOfWitnesswitnessing229
30014ProofOfDelegationdelegation129
30015ProofOfComputecompute248
30016ProofOfServiceservice--
30017ProofOfReputationreputation--
30018ProofOfAuditaudit--
30019ProofOfDeploymentcompute55
30020ProofOfBenchmarkcompute128
30021ReputationAttestationfleet55
30022ProofOfResearchresearch--
30023ProofOfConsensusresearch55
30099EncryptedProofbackup--
1AttestationReplyintegrity55

The Numbers

1,315
Proofs in SQLite
3,690
Q&A Pairs Extracted
248
Proof Chains
15
Proof Kinds
80+
Unique Agents

Score Verification: Fully Auditable

The benchmark scoring formula is captured redundantly across three proof kinds:

final_score = (keyword_score * 0.4) + (length_score * 0.3) + (structure_score * 0.3)
threshold = 0.5  # pass/fail boundary

An auditor can reconstruct any benchmark score from the proof data alone -- no access to the original agent or its logs required. The ProofOfCompute captures the model and response, ProofOfBenchmark captures the score decomposition, and ProofOfWitness validates the chain.

Cross-Agent Witness Co-Signing

Self-attestation is not enough. In v1.2.1, we added cross-agent witness co-signing: a completely independent agent can verify and co-sign any proof chain.

The same agent that ran the benchmark should not be the one vouching for it. Cross-witness co-signing ensures independent verification by a different agent with its own keypair.

Co-signatures are stored in a dedicated cross_witness_co_signs table with SHA-256 co-sign hashes and can be queried via the proof_query.py co-sign-stats CLI.

ProofDB: SQL-Queryable Proof Storage

All proofs are ingested into a SQLite database (data/proof_qa.db) for fast querying. The ProofDB system automatically extracts Q&A pairs from proof content, making agent work searchable:

# Query proofs by agent
python3 scripts/proof_query.py query --agent benchmark-L9 --limit 5

# Get database statistics
python3 scripts/proof_query.py stats

# Cross-agent witness co-sign
python3 scripts/proof_query.py co-sign <chain_hash> \
  --witness audit-trail-agent \
  --verdict valid \
  --finding "Score formula verified"

Nostr Publication: Decentralized Verification

All proofs are signed with BIP-340 Schnorr signatures and published to three Nostr relays. This means:

What's Next

Three audit gaps remain open (MEDIUM severity) that will further strengthen the proof chain:

These additions will close the remaining audit gaps, making BlindOracle's proof system fully self-contained for independent audit without any access to the original infrastructure.


Resources


Copyright © 2026 Craig M. Brown. All rights reserved. | Back to Blog | BlindOracle Home