API Documentation

Real-time carbon intensity, cost, and speed data for AI inference across providers and regions. All endpoints return JSON. No authentication required for the public API.

Base URL

https://carbonbench.ai

Endpoints

GET/api/leaderboard

Returns a ranked list of model + provider + region combinations, sorted by carbon intensity by default. Each entry includes cost, carbon emissions, speed, and rank.

PARAMETER	TYPE	DESCRIPTION
model	string	Filter by model family: llama, gpt, claude, mistral, gemma, qwen, deepseek, phi
provider	string	Filter by provider (comma-separated): aws, gcp, azure, together, groq, fireworks
sortBy	string	Sort order: carbon (default), cost, or speed
limit	number	Max results (default 50, max 200)

Example request

curl "https://carbonbench.ai/api/leaderboard?model=llama&sortBy=carbon&limit=5"

Response

{
  "data": [
    {
      "modelId": "meta-llama/llama-3.1-8b",
      "displayName": "Llama 3.1 8B",
      "family": "llama",
      "provider": "gcp",
      "region": "GCP Europe West (Netherlands)",
      "regionCode": "europe-west4",
      "costPerMTokens": 0.20,
      "carbonPerMTokens": 4,
      "tokensPerSec": 100,
      "calculatedAt": "2026-04-17T15:00:01.000Z",
      "rank": { "byCost": 2, "byCarbon": 1 }
    }
  ],
  "meta": {
    "lastUpdated": "2026-04-17T15:00:01.000Z",
    "totalModels": 85,
    "totalProviders": 6
  }
}

GET/api/recommend

Returns the lowest-carbon option for a model family, with up to 4 alternatives and a human-readable insight string explaining the recommendation.

PARAMETER	TYPE	DESCRIPTION
modelrequired	string	Model family to find the lowest-carbon option for
maxCostPerMTokens	number	Maximum cost constraint in USD per million tokens
maxLatencyMs	number	Maximum latency constraint in milliseconds

Example request

curl "https://carbonbench.ai/api/recommend?model=llama&maxCostPerMTokens=1"

Response

{
  "recommendation": {
    "modelId": "meta-llama/llama-3.1-8b",
    "displayName": "Llama 3.1 8B",
    "provider": "gcp",
    "region": "GCP Europe West (Netherlands)",
    "costPerMTokens": 0.20,
    "carbonPerMTokens": 4,
    ...
  },
  "alternatives": [ ... ],
  "insight": "Gcp (Netherlands) is the lowest-carbon option for Llama 3.1 8B right now — 4 gCO2/M tokens, 67% less than the highest option, at $0.20/M tokens.",
  "constraints": {
    "model": "llama",
    "maxLatencyMs": null,
    "maxCostPerMTokens": 1
  }
}

GET/api/carbon/{region}

Returns carbon intensity history for a specific region. Use this to build time-of-day charts showing when carbon is lowest.

PARAMETER	TYPE	DESCRIPTION
regionrequired	string	Region code: us-east-1, us-west-2, eu-west-1, europe-west4, us-central1, etc.

Example request

curl "https://carbonbench.ai/api/carbon/us-west-2"

Response

{
  "data": [
    { "time": 1713340800, "value": 193 },
    { "time": 1713344400, "value": 187 },
    { "time": 1713348000, "value": 210 }
  ],
  "meta": {
    "region": "AWS US West (Oregon)",
    "regionCode": "us-west-2",
    "zone": "US-NW-PACW",
    "latestValue": 193,
    "latestAt": "2026-04-17T14:00:00.000Z"
  }
}

GET/api/carbon/all

Returns carbon intensity data for all regions. Used for multi-series comparison charts.

Example request

curl "https://carbonbench.ai/api/carbon/all"

GET/api/health

Returns database row counts for each table. Use this to verify data is flowing correctly.

Example request

curl "https://carbonbench.ai/api/health"

Response

{
  "data": {
    "regions": 9,
    "models": 85,
    "benchmarks": 85,
    "pricing": 92,
    "scores": 184,
    "readings": 9
  },
  "meta": { "timestamp": "...", "status": "healthy" }
}

Data

Carbon intensity — sourced from Electricity Maps, updated daily. Measured in gCO2eq per kWh for each electricity grid zone.

Energy benchmarks — GPU energy consumption per model from the AI Energy Score project (measured on NVIDIA A100) and ML.Energy leaderboard.

Provider pricing — collected from published pricing pages for AWS Bedrock, GCP Vertex AI, Azure OpenAI, Together AI, Groq, and Fireworks AI.

Carbon per million tokens is calculated as: energy_per_token × grid_carbon_intensity. This means the same model produces different carbon in different regions at different times.

Coverage

Models

10 families

Providers

AWS, GCP, Azure, Together, Groq, Fireworks

Regions

US, EU, APAC

Updates

Daily

Carbon + score recalculation

Built by Ned Karlovich. Data sourced from Electricity Maps, AI Energy Score, and ML.Energy.