Save arXiv papers from the abstract page as clean Markdown, title-based PDFs, or MinerU result packages.
Chrome Web Store · 中文说明 · FAQ · Architecture · Development · Privacy · Contributing · Changelog
arXiv to Markdown is a Manifest V3 browser extension for Chrome and Edge.
On an arXiv abstract page, it injects two buttons below the submission history:
Markdown: convert the paper throughar5ivor submit a MinerU task, depending on your default modePDF: download the original paper with a title-based filename
The current codebase supports two real workflows:
Standard Modear5iv HTML -> local Markdown conversion in the page -> PDF fallback if ar5iv is unavailable or failsMinerU Modesubmit the paper PDF URL to MinerU -> poll in the background -> download a ZIP package when the task finishes
- Injects controls on
https://arxiv.org/abs/* - Extracts paper metadata from the abstract page, with arXiv export API as fallback
- Converts ar5iv HTML to Markdown in the browser with custom Turndown rules
- Preserves formulas as LaTeX where possible
- Keeps complex tables as raw HTML tables to avoid losing merged cells
- Leaves images as remote ar5iv asset links
- Adds optional YAML frontmatter to Markdown exports
- Tracks MinerU background tasks in the popup
- Supports English and Chinese in the popup, settings page, and content UI
Install from the Chrome Web Store.
git clone https://github.com/Tendo33/arxiv-md.git
cd arxiv-md
npm install
npm run buildThen open chrome://extensions, enable Developer mode, click Load unpacked, and choose dist/.
- Open an arXiv abstract page such as
https://arxiv.org/abs/1706.03762. - Find the injected
MarkdownandPDFbuttons belowSubmission history. - Click
Markdownfor the default conversion route configured in settings. - Click
PDFif you only want the original paper with a readable filename. - Open the popup if you want to inspect, retry, delete, or download MinerU task results.
- Source: ar5iv HTML
- Conversion runtime: browser content script
- Metadata: optional YAML frontmatter with
title,arxiv_id,source,authors, andyear - Tables: preserved as HTML when necessary
- Images: stored as remote links, not bundled locally
- Triggered by the dedicated
PDFbutton - Downloads the original arXiv PDF directly from the page context
- Uses a title-based filename
- Available only when
MinerU Modeis enabled and a token is configured - Runs as an async background task
- Downloads a ZIP package, not a Markdown file
- Appears in the popup task center
The popup is a MinerU task center, not a universal conversion history.
It shows:
- pending, processing, completed, and failed MinerU tasks
- progress for background parsing jobs
- actions to retry, delete, copy result links, or re-download ZIP files
The settings page lets you:
- switch between
Standard ModeandMinerU Mode - save and test a MinerU API token
- enable or disable desktop notifications
- show the auto-convert prompt when a paper page loads
- include or exclude Markdown metadata
- switch UI language between English and Chinese
- reset usage statistics
- README_CN.md: Chinese product overview
- docs/FAQ.md: usage questions and troubleshooting
- docs/ARCHITECTURE.md: current runtime and module design
- docs/DEVELOPMENT.md: local setup, debugging, and release flow
- docs/mentor/README.md: guided codebase reading pack for maintainers
npm run dev
npm run build
npm run lint
npm test
npm run packageThe webpack build emits dist/, and npm run package creates build/arxiv-md-v<version>.zip.
- Buttons are injected on arXiv abstract pages, not on unrelated sites
- Standard mode does not auto-fallback to MinerU; it falls back to PDF
- Popup task management is only for MinerU jobs
- Images in Markdown stay remote
- MinerU is optional and depends on a third-party service
MIT.