ERA5 Data Downloader

A tool for downloading ERA5 climate data from the Copernicus Climate Data Store (CDS) using multiple API keys concurrently to improve download speeds.

Features

Dynamic task assignment system that automatically balances workload among multiple API keys
- Pre-flight API key validation with automatic removal of invalid keys
- Configurable concurrent request threads and parallel download threads per key
- Keeps the CDS server queue occupied while downloads proceed concurrently
Robust download mechanism:
- Automatic fallback download if the CDS API method fails
- Exponential backoff retry strategy for failed downloads
- Resumable for interrupted downloads
Automatic file name handling:
- Optional automatic variable short name extraction from NetCDF files
- Skip existing files when provided with short names
Supports both ERA5 single-level and pressure-level datasets

Installation

Clone or download this repository:

git clone https://github.com/Hem-W/ERA5_toolbox.git
cd ERA5_toolbox

Installation Method 1: Using Conda (Recommended)

Create a conda environment using the provided environment.yml file:
```
conda env create -f environment.yml
conda activate era5_toolbox
```

Installation Method 2: Manual Installation

Install the required dependencies manually:

pip install cdsapi json5 tqdm urllib3 netcdf4 xarray

Configuration

Configure your API keys by creating or modifying the cdsapi_keys.json file:

{
    "keys": [
        "your-first-api-key",
        "your-second-api-key",
        "your-third-api-key"
    ]
}

You can obtain CDS API keys by registering at https://cds.climate.copernicus.eu/

Make sure the cdsapi_keys.json file is in the same directory as the script, or specify a different location in the api_keys_file parameter.

Usage

Option 1: YAML Configuration File (Recommended)

Create a YAML configuration file based on template_request.yaml and run:

python -u downloader_ERA5.py --file my_config.yaml

Or run in the background:

nohup python -u downloader_ERA5.py --file my_config.yaml &

See template_request.yaml for a commented example with all available parameters.

Option 2: Hardcoded in Script

If no --file is provided, the script uses the hardcoded configuration in the main section of downloader_ERA5.py:

python -u downloader_ERA5.py

Configuration Parameters

The following parameters can be set via the YAML file or by editing the hardcoded configuration in the script:

# User Specification
years = range(2019, 2025)
variables = ["10m_u_component_of_wind", "2m_temperature"]
dataset = "reanalysis-era5-single-levels"
pressure_levels = None  # List of pressure levels (hPa)
api_keys_file = None  # Use default 'cdsapi_keys.json'
concurrent_requests = 4  # Number of concurrent request threads per key
download_workers = 1  # Number of parallel download threads per key
skip_existing = True  # Whether to skip downloading existing files

# Optional: Provide short names for variables (recommended when skip_existing=True)
short_names = {
    '10m_u_component_of_wind': 'u10', 
    '2m_temperature': 't2m'
}

Single-Level Data Example

To download single-level ERA5 data:

years = range(1940, 2025)
variables = ["toa_incident_solar_radiation", "2m_temperature", "total_precipitation"]
dataset = "reanalysis-era5-single-levels"
# Define short names for better file naming and skipping existing files
short_names = {
    "toa_incident_solar_radiation": "tisr", 
    "2m_temperature": "t2m", 
    "total_precipitation": "tp"
}

Pressure-Level Data Example

To download pressure-level ERA5 data:

years = range(1940, 2025)
variables = ["geopotential", "u_component_of_wind", "v_component_of_wind"]
dataset = "reanalysis-era5-pressure-levels"
pressure_levels = ["500", "700"]  # Pressure levels in hPa
short_names = {
    "geopotential": "z", 
    "u_component_of_wind": "u", 
    "v_component_of_wind": "v"
}

Output Path Layout

Downloaded files are written to <folder_pattern>/<name_pattern>, both of which are configurable via the YAML file (folder_pattern, name_pattern). Placeholders available in either pattern:

{short_name} — variable short name (falls back to the API {variable} name when no entry is provided in short_names)
{variable} — CDS API variable name (e.g. surface_pressure)
{year} — year being downloaded
{pressure_level} — pressure level in hPa (empty for single-level data)
{dataset} — CDS dataset name

When a pattern is omitted, dataset-aware defaults are used:

folder_pattern: hour/{short_name}
name_pattern (single-level): era5.reanalysis.{short_name}.1hr.0p25deg.global.{year}.nc
name_pattern (pressure-level): era5.reanalysis.{short_name}.{pressure_level}hpa.1hr.0p25deg.global.{year}.nc

The variable short name can be:

Provided by the user via the short_names dictionary (recommended)
Automatically extracted from the downloaded NetCDF file (when short_name is not provided)

API Key Security

The script loads API keys from a separate JSON file, which:

Keeps sensitive credentials out of source code
Makes it easier to maintain and update keys

Experimental Helper Utilities

Resampling ERA5 data: utils/resampler_ERA5.py
Calculate relative humidity from temperature and specific humidity: utils/humid-helper_ERA5.py

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
utils		utils
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
README.md		README.md
ROADMAP.md		ROADMAP.md
downloader_ERA5.py		downloader_ERA5.py
environment.yml		environment.yml
template_request.yaml		template_request.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ERA5 Data Downloader

Features

Installation

Installation Method 1: Using Conda (Recommended)

Installation Method 2: Manual Installation

Configuration

Usage

Option 1: YAML Configuration File (Recommended)

Option 2: Hardcoded in Script

Configuration Parameters

Single-Level Data Example

Pressure-Level Data Example

Output Path Layout

API Key Security

Experimental Helper Utilities

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ERA5 Data Downloader

Features

Installation

Installation Method 1: Using Conda (Recommended)

Installation Method 2: Manual Installation

Configuration

Usage

Option 1: YAML Configuration File (Recommended)

Option 2: Hardcoded in Script

Configuration Parameters

Single-Level Data Example

Pressure-Level Data Example

Output Path Layout

API Key Security

Experimental Helper Utilities

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages