
PyHazards: A Python framework for AI-powered hazard prediction

Datasets · Models · Benchmarks · Training Pipelines · Evaluation


Documentation · GitHub · Slack

Overview

PyHazards is built for hazard-AI work that needs more than a single model or paper reproduction. It unifies dataset discovery, model construction, benchmark-aligned evaluation, and experiment plumbing so the same library can support first-run baselines, comparative studies, and contributor extensions.

Intended users:

  • Researchers: run benchmark-aligned experiments and compare baselines across hazard tasks.
  • Practitioners: reuse hazard-specific workflows for data inspection, model building, and evaluation.
  • Contributors: extend datasets, models, and benchmarks through registry and catalog patterns already used in the repo.

Why PyHazards

  • Unified datasets: public hazard datasets, forcing sources, and inspection entrypoints live in one curated catalog.
  • Benchmark-aligned evaluation: shared benchmark families, smoke configs, and reports keep experiments comparable.
  • Registry-based models: published baselines and adapters are built through a consistent model-registry surface.
  • Shared training and inference pipelines: one engine layer supports fit, evaluate, predict, and benchmark execution workflows.
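The registry idea behind these bullets can be illustrated with a minimal stand-alone sketch. This shows the general pattern (a name-to-class mapping plus a builder), not PyHazards' actual internals; `MODEL_REGISTRY`, `register_model`, and `ToyBaseline` are illustrative names.

```python
# Minimal sketch of a model registry: classes register under a string name,
# and a builder constructs them from that name plus keyword arguments.
MODEL_REGISTRY = {}

def register_model(name):
    """Decorator that records a model class under `name`."""
    def decorator(cls):
        MODEL_REGISTRY[name] = cls
        return cls
    return decorator

def build_model(name, **kwargs):
    """Look up a registered class and instantiate it."""
    return MODEL_REGISTRY[name](**kwargs)

@register_model("toy_baseline")
class ToyBaseline:
    def __init__(self, out_dim=1):
        self.out_dim = out_dim

model = build_model("toy_baseline", out_dim=3)
print(type(model).__name__)  # → ToyBaseline
```

The benefit of this pattern is that new baselines become available to configs and scripts by name alone, without callers importing the class directly.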

Hazard Coverage

  • Wildfire: danger forecasting, weekly forecasting, spread baselines, fuels, burn products, and active-fire sources.
  • Earthquake: waveform picking, dense-grid forecasting adapters, and linked benchmark ecosystems for picking and forecasting.
  • Flood: streamflow and inundation baselines with benchmark-backed evaluation paths.
  • Tropical Cyclone: track-and-intensity forecasting baselines plus shared benchmark ecosystems and adapters.

Installation

Install PyHazards from PyPI:

pip install pyhazards

If you need GPU execution, install a compatible PyTorch build first and then select the device as needed:

export PYHAZARDS_DEVICE=cuda:0
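A minimal sketch of how such an environment variable can be consumed from Python, assuming PyHazards honors `PYHAZARDS_DEVICE` and falling back to CPU when it is unset (the fallback logic here is illustrative, not the library's exact behavior):

```python
import os

# Read the device string the library is expected to honor; default to CPU
# so the same code runs on machines without a GPU.
device = os.environ.get("PYHAZARDS_DEVICE", "cpu")
print(device)
```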

Quick Start

Use this as the shortest benchmark-aware starter path: verify the package, build one registered model, and run one smoke benchmark config.

  1. Verify the installation:
python -c "import pyhazards; print(pyhazards.__version__)"
  2. Build a registered model:
from pyhazards.models import build_model

model = build_model(
    name="hydrographnet",
    task="regression",
    node_in_dim=2,
    edge_in_dim=3,
    out_dim=1,
)
print(type(model).__name__)
  3. Run a benchmark-aligned smoke configuration:
python scripts/run_benchmark.py --config pyhazards/configs/flood/hydrographnet_smoke.yaml
  4. Continue with the full docs for dataset inspection, benchmark pages, and training workflows.

Project Structure

  • pyhazards.datasets - dataset catalog, registry surfaces, and inspection entrypoints.
  • pyhazards.models - model registry, builders, and reusable baseline implementations.
  • pyhazards.benchmarks - benchmark families, ecosystem mappings, and evaluation contracts.
  • pyhazards.engine - shared training, inference, runner, and experiment utilities.
  • pyhazards.configs - smoke and example benchmark configurations.
  • docs/ and docs/source/ - published documentation, generated catalogs, and contributor guides.
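The smoke configurations under pyhazards.configs are referenced by path in the Quick Start. Purely as an illustration of the shape such a YAML file could take, here is a hypothetical sketch: the model fields mirror the build_model arguments shown above, while the benchmark and trainer sections are invented field names, not the actual schema.

```yaml
# Hypothetical layout of a smoke benchmark config; the real schema may differ.
model:
  name: hydrographnet
  task: regression
  node_in_dim: 2
  edge_in_dim: 3
  out_dim: 1
benchmark:
  family: flood        # illustrative field
  split: smoke         # illustrative field
trainer:
  max_epochs: 1        # smoke runs typically keep training minimal
```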

Supported Workflows

  • inspect hazard datasets and forcing sources before training,
  • build baseline and adapter models through the unified registry,
  • run smoke tests and benchmark configs for hazard-specific tasks,
  • export benchmark reports and compare metrics across models,
  • extend the library with new datasets, models, benchmarks, and catalog entries.

Documentation

Full documentation: https://labrai.github.io/PyHazards

Recommended reading order:

  1. Installation
  2. Quick Start
  3. Datasets
  4. Models
  5. Benchmarks
  6. Implementation Guide

Contributing

Contributions are welcome. The contributor guides in docs/ cover the registry and catalog patterns used to add new datasets, models, and benchmarks.

Roadmap themes:

  • more benchmark ecosystems and external data adapters,
  • more hazard-specific baselines and evaluation coverage,
  • expanded reproducibility, report tooling, and smoke-test coverage,
  • stronger examples, tutorials, and contributor automation.

Citation

If you use PyHazards in your research, please cite:

@misc{pyhazards2025,
  title        = {PyHazards: An Open-Source Library for AI-Powered Hazard Prediction},
  author       = {Cheng et al.},
  year         = {2025},
  howpublished = {\url{https://github.com/LabRAI/PyHazards}},
  note         = {GitHub repository}
}

License

MIT License
