EOVLMs: A Family of Multi-Sensor, Multi-Granularity Vision-Language Models for Earth Observation Understanding
- [2026/03/19] π₯ TerraScope is released.
- [2026/02/21] π TerraScope is accepted by CVPR 2026.
- [2025/06/01] π₯ The technical report of EarthMind is released.
- [2025/05/29] π₯ EarthMind is released, including data, model weight, training and evaluation code.
If you find this repository useful, please consider giving a star β and citation
@article{shu2025earthmind,
title={EarthMind: Leveraging Cross-Sensor Data for Advanced Earth Observation Interpretation with a Unified Multimodal LLM},
author={Shu, Yan and Ren, Bin and Xiong, Zhitong and Paudel, Danda Pani and Van Gool, Luc and Demir, Beg{\"u}m and Sebe, Nicu and Rota, Paolo},
journal={arXiv preprint arXiv:2506.01667},
year={2025}
}
@article{shu2026terrascope,
title={TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation},
author={Shu, Yan and Ren, Bin and Xiong, Zhitong and Zhu, Xiao Xiang and Demir, Beg{\"u}m and Sebe, Nicu and Rota, Paolo},
journal={arXiv preprint arXiv:2603.19039},
year={2026}
}
This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.