Interactive 3D World Map Visualization

Project Overview

This project is an interactive 3D world map in which countries are colored according to statistical indicators from a dataset. Users can rotate the globe and click a country to view detailed information and related plots.

How to Launch the Website Locally

Linux

curl -o- https://githubusercontent.com | bash

Restart your terminal

nvm install --lts

macOS

# Install Node.js if not installed
brew install node

# Clone the repo
git clone <REPOSITORY-URL>

Windows

Download: Go to the official Node.js website and download the LTS (Long-Term Support) version for Windows.
Run Setup: Open the downloaded .msi file and follow the installation wizard. Keep the default settings, specifically ensuring the "Add to PATH" option is checked.
Finish: Click "Install", then "Finish" once the process completes.

The remaining steps are the same on every OS

Launch two terminals

Terminal 1

cd <PATH-TO-PROJECT> 
cd client
npm install 
npm start

Terminal 2

cd <PATH-TO-PROJECT> 
cd server
npm install 
npm start

Team Roles

Zamir Safin: Frontend Visualization (React, D3.js)
Denis Beliaev: Backend Database (Node.js, PostgreSQL)
Rustem Gilmetdinov: Data Pipelining & GenAI Agents (Python, LLMs)

Technical Stack

Frontend: React
Backend: Node.js
Database: PostgreSQL, SQLite
DevOps: Docker

Data Strategy

Structured Data: Scraped from worldometers.info
Unstructured Data: Extracted from PDFs (Demographic Yearbooks)
Processing: GenAI agents used for data extraction and cleaning

Data Extraction Pipeline

Tools Used:

PyPDF - Page pre-extraction from PDF reports
Docling - PDF to Markdown conversion
Qwen2.5-VL - Table extraction to CSV

Process:

For unstructured data

Identify the pages that contain tables using a PDF inspector
Extract specific pages with PyPDF (reduces context by ~90%)
Convert extracted pages to Markdown with Docling
Send Markdown to Qwen2.5-VL for structured CSV extraction
Validate output with pandas
Linearly interpolate missing values
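
The validation step above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual code: the function name `validate_csv`, the column layout (a country column followed by year columns), and the sample rows are all assumptions made for the example. It checks the header, rejects rows with the wrong number of fields or non-numeric values, and keeps empty cells as `None` so they can be interpolated later.

```python
import csv
import io


def validate_csv(text, expected_columns):
    """Parse CSV text and return (rows, problems).

    rows: dicts whose numeric fields are floats (None for empty cells).
    problems: human-readable descriptions of the rejected rows.
    The first column is treated as a label, the rest as numbers (assumption).
    """
    reader = csv.reader(io.StringIO(text.strip()))
    header = next(reader, None)
    if header != expected_columns:
        return [], [f"unexpected header: {header}"]

    rows, problems = [], []
    for line_no, fields in enumerate(reader, start=2):
        if len(fields) != len(expected_columns):
            problems.append(
                f"line {line_no}: expected {len(expected_columns)} fields, "
                f"got {len(fields)}"
            )
            continue
        record = {expected_columns[0]: fields[0].strip()}
        try:
            for name, raw in zip(expected_columns[1:], fields[1:]):
                raw = raw.strip()
                record[name] = float(raw) if raw else None
        except ValueError:
            problems.append(f"line {line_no}: non-numeric value {raw!r}")
            continue
        rows.append(record)
    return rows, problems


# Hypothetical model output with one good row and two bad ones.
sample = """country,1990,2000
Aland,10.5,
Borduria,n/a,12
Syldavia,7,8,9
"""
rows, problems = validate_csv(sample, ["country", "1990", "2000"])
print(rows)
print(problems)
```

Collecting problems instead of raising on the first one makes it easier to see how much of a model response is usable before deciding whether to re-run the extraction.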

For structured data

Scrape the data from the website (worldometers.info) and save it to CSV
Clean the CSV by removing redundant symbols and columns
Linearly interpolate missing values
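
The cleaning and interpolation steps can be illustrated in plain Python. This is a sketch under assumptions: the helper names `clean_number` and `interpolate_linear` and the sample values are hypothetical, and the project itself may rely on pandas, which offers `Series.interpolate(method="linear")` for the same purpose. The sketch strips thousands separators and other symbols, then fills interior gaps on a straight line between the nearest known neighbours.

```python
import re


def clean_number(raw):
    """Drop every character except digits, dot and minus; return float or None."""
    stripped = re.sub(r"[^0-9.\-]", "", raw)
    try:
        return float(stripped)
    except ValueError:
        return None  # empty cell or a marker such as "N.A."


def interpolate_linear(values):
    """Fill interior None gaps linearly between the nearest known values.

    Leading and trailing gaps stay None because they have only one neighbour.
    """
    filled = list(values)
    known = [i for i, v in enumerate(filled) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (filled[right] - filled[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = filled[left] + step * (i - left)
    return filled


# Hypothetical scraped column: separators, an empty cell and a text marker.
raw_series = ["1,000", "", "N.A.", "1,600", "1 700"]
cleaned = [clean_number(cell) for cell in raw_series]
result = interpolate_linear(cleaned)
print(cleaned)
print(result)
```

Leaving edge gaps untouched is a deliberate choice in this sketch: extending a trend beyond the first or last observation would be extrapolation rather than interpolation.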

Construction of Database

Merge the resulting CSV tables with an INNER JOIN
Write the merged pd.DataFrame to a SQLite database file (.db)
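
To show what the INNER JOIN keeps and drops, here is a small sketch using Python's built-in `sqlite3` module. Note that it performs the join inside SQLite rather than on a pd.DataFrame as described above, and that the table names, column names, and rows are invented for the example.

```python
import sqlite3

# In-memory database for the demo; a real run would pass a path to a .db file.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE population (country TEXT, year INTEGER, population REAL);
    CREATE TABLE fertility  (country TEXT, year INTEGER, fertility_rate REAL);
""")
conn.executemany(
    "INSERT INTO population VALUES (?, ?, ?)",
    [("Aland", 2000, 1.0e6), ("Aland", 2001, 1.1e6), ("Borduria", 2000, 5.0e6)],
)
conn.executemany(
    "INSERT INTO fertility VALUES (?, ?, ?)",
    [("Aland", 2000, 1.9), ("Borduria", 2000, 2.4), ("Syldavia", 2000, 2.1)],
)

# INNER JOIN keeps only (country, year) pairs present in both tables.
conn.execute("""
    CREATE TABLE indicators AS
    SELECT p.country, p.year, p.population, f.fertility_rate
    FROM population AS p
    INNER JOIN fertility AS f
      ON p.country = f.country AND p.year = f.year
""")
conn.commit()

merged = conn.execute(
    "SELECT * FROM indicators ORDER BY country, year"
).fetchall()
for row in merged:
    print(row)
```

In this example the 2001 row for Aland and the Syldavia row disappear, because each exists in only one table. An inner join therefore silently shrinks the dataset when sources cover different countries or years, which is worth checking after the merge.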

Justification:

Page pre-extraction reduces processing time and improves accuracy
Qwen2.5-VL was chosen for its ability to read text in various scenarios (multiple orientations) and to interpret tables, charts, and diagrams

How to Launch the Data Pipeline

Linux/macOS

# Clone the repository
git clone <REPOSITORY_URL>
cd "DWaV Project"

# Create a user-specific `.env` file
chmod +x create_env_unix.sh
./create_env_unix.sh

# Create and activate a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Launch the data pipeline
python3 data_pipeline/src/main.py

Windows

REM Clone the repository
git clone <REPOSITORY_URL>
cd "DWaV Project"

REM Create a user-specific `.env` file
create_env_windows.bat

REM Create and activate a virtual environment
python -m venv .venv
.venv\Scripts\activate

REM Install dependencies
pip install -r requirements.txt

REM Launch the data pipeline
python data_pipeline/src/main.py

[!warn] Do not forget to replace DB_USER and DB_PASSWORD with your own values
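
A quick way to catch a forgotten credential before launching the pipeline is a small check like the one below. It is only a sketch: it assumes the generated `.env` file uses plain `KEY=VALUE` lines, and the helper names `parse_env` and `missing_settings` are hypothetical rather than part of the project.

```python
def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


def missing_settings(settings, required=("DB_USER", "DB_PASSWORD")):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not settings.get(key)]


# Hypothetical file content with the password left blank.
example = """
# database credentials
DB_USER=alice
DB_PASSWORD=
"""
missing = missing_settings(parse_env(example))
print(missing)
```

To check a real file, read it with `open(".env").read()` and pass the text to `parse_env`.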
