📚 Research Digest

Automated daily research paper digest from arXiv with smart filtering, mobile-friendly interface, and AI-powered summaries.

Fetch, filter, and browse the latest research papers tailored to your interests. Desktop grid view for deep reading, mobile feed for quick scrolling.

✨ Features

🎯 Smart Filtering - Keyword-based relevance scoring across custom research interests
📱 Mobile Feed - Swipeable, full-screen card interface optimized for phones
🖥️ Desktop Grid - Multi-column layout with rich metadata and difficulty badges
🧠 AI Summaries - Auto-generated layman explanations using transformers
🔄 Deduplication - Never see the same paper twice with built-in tracking
⚙️ Configurable - JSON-based settings for interests, filters, and preferences
📦 Archive - Auto-saves daily digests with browsable index

🖼️ Screenshots

Desktop View

Mobile Feed

🚀 Quick Start

Windows

Clone & Run

git clone https://github.com/usr-wwelsh/research-digest.git
cd research-digest
run_digest.bat

First run automatically:
- Creates virtual environment
- Installs dependencies
- Fetches papers
- Generates HTML digests
Open in browser:
- latest.html - Most recent digest
- index.html - Browse all archives
- tiktok_feed.html - Mobile-optimized feed

Linux/macOS

git clone https://github.com/usr-wwelsh/research-digest.git
cd research-digest
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
python main.py
python generate_index.py

⚙️ Configuration

Edit config.json to customize:

{
  "interests": {
    "Your Research Area": {
      "query": "cat:cs.LG OR cat:cs.AI",
      "keywords": ["keyword1", "keyword2", "keyword3"]
    }
  },
  "settings": {
    "papers_per_interest": 10,
    "recent_days": 7,
    "summary_max_length": 160
  }
}

Available Settings

Setting	Default	Description
`papers_per_interest`	10	Papers to fetch per category
`recent_days`	7	Look back window (0 = all time)
`fallback_days`	90	Extended search if few results
`summary_max_length`	160	Max characters for summaries
`fetch_multiplier`	5	Over-fetch for better filtering

📖 arXiv Query Syntax

Use arXiv category codes in queries:

cat:cs.LG - Machine Learning
cat:cs.CV - Computer Vision
cat:cs.CL - Computation & Language (NLP)
cat:cs.AI - Artificial Intelligence
cat:cs.CR - Cryptography & Security
cat:cs.DC - Distributed Computing

Combine with OR/AND: cat:cs.LG OR cat:cs.AI

Full category list

🔧 Advanced Usage

Proxmox LXC Deployment (One-Liner)

Want a self-hosted, always-on instance with Cloudflare Tunnel?

From your Proxmox host:

bash <(curl -sL https://raw.githubusercontent.com/usr-wwelsh/Research-Digest/main/create-lxc.sh)

This automatically:

Creates a Debian 12 LXC container (4GB RAM, 4 cores, 20GB disk)
Installs Python, Caddy web server, and cloudflared
Sets up the venv and all dependencies
Configures a weekly cron job (Monday 8am) with CPU/memory limits
Starts Caddy to serve digests on port 8080

After the script finishes:

Enter the container: pct enter <CTID>
Edit /opt/research-digest/config.json with your research interests
Set up Cloudflare Tunnel to expose it publicly
Run a test: /opt/research-digest/run.sh

Idle footprint is ~50-80MB RAM (Caddy + cloudflared). The weekly digest run spikes to ~4GB briefly for torch inference, then drops back down.

Automated Daily Digests & Mobile Sync

Want automatic daily updates synced to your phone? See the 📱 Complete Setup Guide for:

Windows Task Scheduler configuration
Linux/macOS cron jobs
Syncthing mobile sync setup
Troubleshooting tips

Reset Seen Papers

python reset_seen_papers.py

📂 Project Structure

research-digest/
├── config.json              # Configuration (edit this!)
├── main.py                  # Core paper fetcher
├── generate_index.py        # Archive browser generator
├── generate_tiktok_feed.py  # Mobile feed generator
├── run_digest.bat           # Windows launcher
├── run.sh                   # Linux pipeline runner (used by cron)
├── requirements.txt         # Python dependencies
├── create-lxc.sh            # Proxmox LXC creator (run on host)
├── setup.sh                 # Container bootstrap script
├── Caddyfile                # Caddy web server config
├── research-digest-caddy.service  # Systemd unit for Caddy
├── latest.html              # Latest digest (auto-generated)
├── index.html               # Archive browser (auto-generated)
├── tiktok_feed.html         # Mobile feed (auto-generated)
├── seen_papers.json         # Deduplication tracker
└── arxiv_archive/           # Daily archives
    ├── arxiv_digest_20251101.html
    └── ...

🛠️ Requirements

Python 3.8+
Dependencies: transformers, torch, requests
Disk Space: ~2GB for model, ~10MB per digest
Internet: Required for arXiv API and first-time model download

📝 License

MIT License - see LICENSE file for details

🤝 Contributing

Contributions welcome! Ideas:

Additional paper sources (bioRxiv, SSRN, etc.)
Browser extension for direct syncing
Custom ML models for better summaries
Export to Notion/Obsidian/Roam

🙏 Acknowledgments

arXiv for the open research repository
Hugging Face for transformer models
Inspired by modern feed UIs and research workflows

Built with ❤️ for researchers who want to stay current without drowning in papers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📚 Research Digest

✨ Features

🖼️ Screenshots

Desktop View

Mobile Feed

🚀 Quick Start

Windows

Linux/macOS

⚙️ Configuration

Available Settings

📖 arXiv Query Syntax

🔧 Advanced Usage

Proxmox LXC Deployment (One-Liner)

Automated Daily Digests & Mobile Sync

Reset Seen Papers

📂 Project Structure

🛠️ Requirements

📝 License

🤝 Contributing

🙏 Acknowledgments

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
Caddyfile		Caddyfile
LICENSE		LICENSE
PLEASE_READ.md		PLEASE_READ.md
README.md		README.md
SETUP_GUIDE.md		SETUP_GUIDE.md
arxiv_digest_20251101.html		arxiv_digest_20251101.html
config.json		config.json
create-lxc.sh		create-lxc.sh
desktop_demo.png		desktop_demo.png
generate_index.py		generate_index.py
generate_tiktok_feed.py		generate_tiktok_feed.py
main.py		main.py
mobile_demo.png		mobile_demo.png
requirements.txt		requirements.txt
research-digest-caddy.service		research-digest-caddy.service
reset_seen_papers.py		reset_seen_papers.py
run.sh		run.sh
run_digest.bat		run_digest.bat
setup.sh		setup.sh
tiktok_feed.html		tiktok_feed.html

Folders and files

Latest commit

History

Repository files navigation

📚 Research Digest

✨ Features

🖼️ Screenshots

Desktop View

Mobile Feed

🚀 Quick Start

Windows

Linux/macOS

⚙️ Configuration

Available Settings

📖 arXiv Query Syntax

🔧 Advanced Usage

Proxmox LXC Deployment (One-Liner)

Automated Daily Digests & Mobile Sync

Reset Seen Papers

📂 Project Structure

🛠️ Requirements

📝 License

🤝 Contributing

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages