The benchmark methodology is prepared. First compliant measurements are pending. We publish numbers only when they meet the full methodology — hardware, model, power source, and prompt suite all documented.
Every published number will meet these requirements. If a number does not come from a compliant run, it will not appear on this page.
Token counts are taken from the usage response field. Character or word counts are not used.
Power is measured at the wall (wall_power_W) from an external clamp meter or smart plug. Apple SMC package power is recorded separately as an internal reference and is clearly labelled as such if shown. A run without a wall meter is marked smc_only and is not published.
A benchmark harness (saturn-bench) is in development as a SwiftPM executable. It dispatches each prompt in a fixed suite to Saturn-Control's /v1/chat/completions endpoint, records per-request timing and token usage, and writes a versioned JSON result file. Wall-power sampling is recorded separately by the operator during the run.
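The token-counting rule can be illustrated with a minimal sketch. It is written in Python for brevity and is not the saturn-bench implementation. It assumes the endpoint returns an OpenAI-style usage object with a completion_tokens field, and the record layout (response, elapsed_s) and function names are hypothetical.

```python
def tokens_from_usage(response: dict) -> int:
    """Return the generated-token count reported by the server.

    Assumption: the response carries an OpenAI-style "usage" object with a
    "completion_tokens" field. There is deliberately no fallback to
    character or word counts: a response without usage data is an error.
    """
    usage = response.get("usage")
    if not usage or "completion_tokens" not in usage:
        raise ValueError("no usage.completion_tokens; refusing to estimate from text")
    return int(usage["completion_tokens"])


def tokens_per_second(records: list) -> float:
    """Aggregate throughput over a suite: total tokens / total request time.

    Each record is a hypothetical dict with keys "response" (the parsed
    JSON body) and "elapsed_s" (wall-clock seconds for that request).
    """
    total_tokens = sum(tokens_from_usage(r["response"]) for r in records)
    total_seconds = sum(r["elapsed_s"] for r in records)
    if total_seconds <= 0:
        raise ValueError("total elapsed time must be positive")
    return total_tokens / total_seconds
```

Raising on a missing usage field, rather than estimating from the response text, keeps a non-compliant request from silently contributing to a result.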
{
  "schemaVersion": 1,
  "status": "dry_run",
  "hardware": { "model": "<hardware model>", "chip": "<chip>", "ramGB": null },
  "software": { "modelID": "<model-id-and-quantization>", "saturnControlSHA": "<sha>" },
  "promptSuite": { "name": "<suite-name-and-version>", "promptCount": null },
  "conditions": { "powerSource": "wall_power_W", "batchSize": 1 },
  "results": {
    "tokensPerSecond": null,
    "meanWatts": null,
    "joulesPerToken": null,
    "tokensPerWattHour": null
  },
  "notes": "Pending first compliant run."
}
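The two energy fields in results can be derived from the two measured ones. The sketch below assumes the field names carry their usual physical meaning (mean wall power in watts and throughput in tokens per second, both taken over the same interval). The page does not state how saturn-bench computes them, so the function name and return shape are hypothetical.

```python
def derive_energy_metrics(tokens_per_second: float, mean_watts: float) -> dict:
    """Derive energy-per-token figures from throughput and mean power.

    Assumption: both inputs cover the same measurement interval.
    1 W = 1 J/s, so J/token = W / (tokens/s); 1 Wh = 3600 J.
    """
    if tokens_per_second <= 0 or mean_watts <= 0:
        raise ValueError("throughput and power must be positive")
    joules_per_token = mean_watts / tokens_per_second
    tokens_per_watt_hour = 3600.0 / joules_per_token
    return {
        "joulesPerToken": joules_per_token,
        "tokensPerWattHour": tokens_per_watt_hour,
    }
```

With purely illustrative inputs (not measurements) of 40 tokens per second at 20 W, this gives 0.5 joules per token and 7200 tokens per watt-hour.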