Corpus & leaderboard
Beyond your own runs, Provenova maintains a public calibration corpus: a cross-vendor, longitudinal
record of how real quantum hardware behaves over time. It powers the State of Quantum Hardware
leaderboard at /leaderboard.
The corpus
The corpus is a collection of calibration snapshots normalized to the open qlprov/calibration/1.0
schema, so an IBM device and an IonQ device can be compared on the same axes. Snapshots are
content-addressed, so identical captures are stored once and referenced many times.
The crawler
The quantumledger-crawler package collects vendor calibration data, normalizes it, applies each vendor's
terms-of-service redistribution policy, deduplicates, and ingests it into the corpus. It has two sources:
FixtureSource(default, offline) — reads representative payloads fromfixtures/{vendor}/*.json. This is what the demo and tests use; no network required.LiveSource(optional) — a skeleton for calling real vendor APIs (IBMBackendProperties, IonQ characterization, Amazon Braket device properties).
A ToS gate governs what may be redistributed, so the public corpus respects each vendor's terms.
Sources & licensing
The corpus is genuinely multi-vendor and every row is labelled by source and licence so mixed provenance is transparent:
- IBM — real device calibration snapshots from Qiskit's
fake_provider(Apache-2.0). - Metriq — community benchmark submissions (Algorithmic Qubits, CLOPS, Quantum Volume, 2Q fidelity) from metriq.info, CC-BY-4.0.
- Zenodo — a ready slot for openly-licensed raw calibration datasets, loaded only when a record's licence is verified CC-BY / CC0 (none published for the tracked devices yet; IQM hardware currently appears via the Metriq and vendor-reported rows).
- Vendor-reported — headline specifications published by manufacturers, shown as claims with a distinct badge and the source URL recorded in each snapshot's provenance — never presented as independently reproduced.
Nothing is fabricated: each value is copied from its cited source. The redistributable_raw flag and
license_ref travel with every snapshot.
The leaderboard
/leaderboard ranks devices across vendors and switches the ranking metric. Alongside the calibration
metrics (median two-qubit gate error, T1 / T2 coherence, best 2Q fidelity) it also ranks the
cross-vendor benchmark metrics where published:
- Algorithmic qubits (#AQ), Quantum Volume, CLOPS (higher is better),
- 2-qubit gate fidelity (higher is better), EPLG (error per layered gate — lower is better).
Devices that don't report the selected metric are dropped from that ranking, and each remaining row carries its source / licence badge.
Because snapshots are time-stamped, the corpus also exposes trends per device:
GET /api/v1/backends/<provider>/<backend_id>/trend
— a time series of a backend's calibration quality, so you can see a device improving or degrading over weeks and months.
Why it exists
The corpus turns one-off calibration captures into a shared, comparable, historical view of the quantum hardware fleet — the empirical backdrop against which your own runs and reproductions can be judged.
Next: Product tour.