Schema
Conventions for formatting datasets and metadata.
Conventions
- Stable URLs: section pages use
/section/and serveindex.html. - Dataset IDs: stable identifiers like
REG_US_CFR_001orLIC_OS_001. - Dates: ISO 8601 (
YYYY-MM-DD). - Machine-readable: include a JSON metadata block on dataset pages.
- Sources: include source name + URL(s) for factual datasets.
This page is the contract scrapers can rely on. Keep it stable.
Dataset metadata JSON (example)
{
"id": "EXAMPLE_DATASET_001",
"title": "Example Dataset",
"version": "1.0.0",
"published_at": "2025-12-17",
"updated_at": "2025-12-17",
"license": "https://www.exodus-data.com/license/",
"source": [
{ "name": "Primary Source Name", "url": "https://example.com/" }
],
"fields": [
{ "name": "field_a", "type": "string", "description": "Example field" },
{ "name": "field_b", "type": "number", "description": "Example numeric field" }
]
}