RFP: Living Benchmark Dashboard

Why It Matters

  • Replaces "trust the vendor" with "run this yourself"
  • Provides credible data for institutional architecture decisions
  • Creates accountability: vendors can't make unverifiable claims
  • Supports ongoing comparison as systems mature

Scope

In-Scope

  • Automated benchmark pipeline (CI/CD) that runs weekly on latest builds
  • Benchmark harnesses for institutional workloads:
    • Private transfer (shielded send with nullifier update)
    • DvP settlement pattern
    • Batch operations (10, 100, 1000 txs)
    • Compliance proof generation
  • Target systems (public testnets where available):
    • Privacy L2s (see L2 Privacy Evaluation Pattern):
      • Public L2: Aztec, Miden, Intmax
      • AppChain SDK: Prividium, EY Nightfall, Scroll Cloak
    • Privacy App Layers:
      • Zama fhEVM (coprocessor)
      • Kaleido/Paladin (L1 privacy)
      • Railgun (L1 shielded pool)
  • Metrics per system (see L2 Privacy Evaluation Pattern for full criteria):
    • Performance: Throughput (TPSPublic/TPSPrivate), latency, finality
    • Cost: Gas usage, bridging costs, forced exit costs
    • Privacy: What is hidden, from whom, trust model
    • Security: Sequencer decentralization, censorship resistance, upgrade process
    • Trust dimensions (from Trust Assurance)
  • Dockerized reproduction: anyone can run docker-compose up to verify

Out-of-Scope

  • Proprietary/closed systems without public access
  • Legal analysis of results
  • Optimization recommendations (that's vendor work)

Deliverables

  • Benchmark harness repo (open source, Docker-based)
  • CI pipeline that runs weekly and publishes results
  • Comparison dashboard (static site or markdown in iptf-map)
  • Methodology document (how to interpret results, caveats)
  • Integration with iptf-map vendor pages (automated updates)

Dependencies

Requires:

  • Public testnet access for target systems
  • Clear workload definitions (coordinate with IPTF team)
  • Trust schema from Trust Assurance for complete picture

Enables:

See Also