From eed12a1c9df4153ea785a4a4cc2aefd5fcb803eb Mon Sep 17 00:00:00 2001 From: Elaine Hale Date: Fri, 8 May 2026 10:36:08 -0600 Subject: [PATCH] Documentation page describing data sources. --- .../global_dataset_data_sources.md | 37 +++++++++++++++++++ docs/explanation/index.md | 20 +++++----- 2 files changed, 46 insertions(+), 11 deletions(-) create mode 100644 docs/explanation/global_dataset_data_sources.md diff --git a/docs/explanation/global_dataset_data_sources.md b/docs/explanation/global_dataset_data_sources.md new file mode 100644 index 0000000..fc728d6 --- /dev/null +++ b/docs/explanation/global_dataset_data_sources.md @@ -0,0 +1,37 @@ +(global-dataset-data-sources)= +# Global Dataset Data Sources + +The default STRIDE `global` dataset is assembled from several public data products. This page summarizes which sources contribute to each part of the projection. For information on how these data are retrieved and compiled, see {ref}`data-download` and {ref}`data-validation`. + +## Default Projection + +These sources are combined to estimate annual electricity consumption per country and sector: + +- [IEA World Energy Balances](https://www.iea.org/data-and-statistics/data-product/world-energy-balances) -- Historic data on energy use are combined with the below datasets to estimate energy intensity, which is then summarized and projected per-country, per-sector log-linear or linear regressions. +- [2024 World Economic League Table](https://www.imf.org/en/Publications/WEO/weo-database/2024/October/weo-report) -- Historical GDP from the 2024 IMF World Economic Outlook (WEO) and GDP forecasts from the Centre for Economics and Business Research (CEBR). +- [United Nations World Population Prospects](https://population.un.org/wpp/) -- Historical and median projected population data +- [United Nations Development Reports](https://hdr.undp.org/data-center/human-development-index#/indicies/HDI) -– Historical human development index (HDI) data projected to 2050 with per-country logistic regressions. + +## Electric Vehicle Projection + +STRIDE includes an optional, more detailed projection for light-duty passenger electric vehicles (EVs) that replaces the default transportation on-road forecast when `use_ev_projection` is enabled. These sources supply information on current and projected EV stock shares, per-capita vehicle ownership, per-vehicle annual use, and EV energy intensity: + +- [IEA Global EV Outlook](https://www.iea.org/reports/global-ev-outlook-2025) –- Historical EV stock and stock shares broken out by vehicle type (e.g., Cars, Vans, Trucks) and powertrain (i.e., BEV, PHEV) by country and year +- [OECD Road motor vehicle traffic dataset](https://data-explorer.oecd.org/) –- Historical total on-road vehicle kilometers by country and year +- [BNEF Electric Vehicle Outlook 2025](https://about.bnef.com/insights/clean-transport/electric-vehicle-outlook/) –- EV stock projections for select countries and regions used to create prototypical (logistic) adoption shapes that are then customized per country based on historic EV stock shares from the IEA Global EV Outlook +- [NREL 2024 Annual Technology Baseline (ATB) for Transportation](https://atb.nrel.gov/transportation/2024/index) –- Default energy use per kilometer and PHEV utility factor projections.[^atb] + +[^atb]: STRIDE currently uses the Mid scenario, Midsize car values, linearly interpolated, for all countries. BEV assumptions are taken from "Battery Electric Vehicle (300-mile range)" and PHEV assumptions from "Gasoline Extended Range Plug-in Hybrid Electric Vehicle (35-mile electric range)". + +## Profile Data + +These sources shape the annual projections from the prior sections into hourly load forecasts, capturing typical diurnal and weekday/weekend patterns by sector and end use and day-to-day variations driven by historical weather: + +- [IMAGE Integrated Assessment Model](https://data.mendeley.com/datasets/pmd2dchk44/1) – Modeled load shapes by sector and end use by country, month, day type (weekday/weekend) and model year (1971 – 2100) +- [ERA5](https://cds.climate.copernicus.eu/datasets/reanalysis-era5-single-levels?tab=overview) + [Demand Ninja Logic](https://doi.org/10.1038/s41560-023-01341-5) – Building-adjusted internal temperature (BAIT) and other weather variables for a representative location per country. Daily averages for historical weather years, currently 1995-2024. Can be computed for any years 1940 – present day. + +## Related Topics + +- {ref}`data-download` - How datasets are retrieved and stored locally +- {ref}`data-validation` - How datasets are validated and registered with dsgrid +- {ref}`weather-year-modeling` - How weather data are used to shape hourly load diff --git a/docs/explanation/index.md b/docs/explanation/index.md index ab14fc4..8569edb 100644 --- a/docs/explanation/index.md +++ b/docs/explanation/index.md @@ -2,24 +2,22 @@ (explanation)= # Core Concepts -This section explains the key concepts and architecture behind STRIDE's data pipeline. - -## Overview - -STRIDE transforms raw energy data into hourly electricity demand projections through a multi-stage pipeline: - -1. **Data Download** - Retrieve datasets from remote repositories (e.g., GitHub releases) -2. **Data Validation** - Register and validate data using dsgrid's dimension mapping system -3. **Computation** - Calculate energy projections using dbt (data build tool) SQL transformations - -Each stage is designed to be modular and customizable, allowing users to swap out datasets, adjust validation rules, or modify calculation logic. +These pages explain the data and computations behind a STRIDE projection. They follow the rough flow of a project: input datasets are sourced, downloaded, and validated, then transformed by dbt into hourly load forecasts that reflect historical weather. ```{toctree} :maxdepth: 2 +:caption: Input data +global_dataset_data_sources data_download data_validation customizing_checks +``` + +```{toctree} +:maxdepth: 2 +:caption: Computation + dbt_computation weather_year_modeling ```