WordPress plugin that automatically extracts content from external URLs and publishes posts via scheduled cron jobs. Perfect for content creators who need to aggregate and republish content from multiple sources.
- Automated Content Extraction: Fetches content from specified URLs on a configurable schedule
- Batch Processing: Processes multiple posts per batch to avoid server overload
- WordPress Integration: Seamlessly publishes extracted content as WordPress posts
- Featured Image Support: Automatically downloads and sets featured images for posts
- Admin Settings Page: Easy-to-use settings interface in WordPress admin
- Error Handling: Comprehensive error logging and handling
- Security First: Uses WordPress best practices for remote requests and data sanitization
# 1) Download the plugin
git clone https://github.com/VoxHash/WPContent-extractor.git
cd WPContent-extractor
# 2) Copy to WordPress plugins directory
cp -r . /path/to/wordpress/wp-content/plugins/wp-content-extractor
# 3) Activate in WordPress admin
# Navigate to Plugins → Installed Plugins → Activate "Content Extractor"See docs/installation.md for detailed installation steps.
- Download or clone this repository
- Copy the plugin files to your WordPress
wp-content/plugins/directory - Activate the plugin through the WordPress admin panel
- Navigate to Settings → Content Extractor to configure
Basic usage here. Advanced usage in docs/usage.md.
- Configure Source URL: Go to Settings → Content Extractor and enter your source URL
- Automatic Processing: The plugin runs every 30 minutes via WordPress cron
- Manual Trigger: You can trigger extraction manually by calling
ce_extract_content()function
The plugin automatically:
- Fetches content from the configured URL
- Parses and extracts post data
- Publishes posts to WordPress
- Downloads and sets featured images
- Handles pagination for large content sets
| Setting | Description | Default |
|---|---|---|
| Source URL | The URL to extract content from | - |
| Posts per batch | Number of posts processed per run | 5 |
| Cron interval | How often to run extraction | Every 30 minutes |
Full configuration reference: docs/configuration.md
- Start here: docs/examples/example-01.md
- More: docs/examples/
High-level overview:
- Main Plugin File (
content-extractor.php): Handles plugin initialization, settings, and cron scheduling - Functions File (
functions.php): Contains content parsing and post publishing logic - WordPress Cron: Runs extraction every 30 minutes automatically
- Settings API: Uses WordPress Settings API for secure configuration
See docs/architecture.md for detailed architecture documentation.
Planned milestones live in ROADMAP.md. For changes, see CHANGELOG.md.
We welcome PRs! Please read CONTRIBUTING.md and follow the PR template.
Please report vulnerabilities via SECURITY.md.
This project is licensed under the MIT License - see LICENSE file for details.