Autonomous AI agent that scrapes Amazon India, analyzes market trends using LLaMA AI, and delivers actionable business intelligence on a React dashboard.
- Frontend: https://trade-hawk.vercel.app
- GitHub: https://github.com/Prateek022/TradeHawk
TradeHawk is an autonomous market intelligence platform for Indian e-commerce sellers and traders. Enter any product keyword — TradeHawk scrapes Amazon India in real time, processes the data through an AI pipeline, and delivers:
- 📊 Live price comparison charts
- 🤖 AI-generated market summary and insights
- 💰 Best deals ranked by price vs rating
- 🔍 Market opportunity analysis
- ⏰ Autonomous 24-hour re-scraping via scheduler
| Layer | Technology |
|---|---|
| Scraping | Playwright, BeautifulSoup, Scrapy |
| Backend | Python, FastAPI, SQLAlchemy, SQLite |
| AI | Groq API (LLaMA 3.1-8b-instant) |
| Frontend | React.js, Recharts, Vite |
| Automation | APScheduler (24hr autonomous pipeline) |
| Deployment | GitHub, Vercel |
- User enters a product keyword on the React dashboard
- FastAPI triggers concurrent scrapers via ThreadPoolExecutor
- Playwright launches real Chromium browser → scrapes Amazon India
- BeautifulSoup parses HTML → extracts name, price, discount, rating
- Scrapy pipeline processes and stores data in SQLite
- Groq LLaMA AI analyzes data → generates market intelligence
- React dashboard displays results with interactive charts
- APScheduler autonomously re-scrapes top keywords every 24 hours
# Clone repo
git clone https://github.com/Prateek022/TradeHawk.git
cd TradeHawk
# Backend setup
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
python -m playwright install chromium
# Add environment variables
# Create .env file and add:
# GROQ_API_KEY=your_groq_api_key_here
# Start backend
python -m uvicorn backend.main:app --reload
# Frontend setup (new terminal)
cd frontend
npm install
npm run devtradehawk/
├── backend/
│ ├── scraper/
│ │ ├── amazon.py
│ │ ├── flipkart.py
│ │ └── pipeline.py
│ ├── database/
│ │ └── models.py
│ ├── ai/
│ │ └── analyst.py
│ ├── scheduler.py
│ └── main.py
├── frontend/
│ └── src/
│ └── App.jsx
├── requirements.txt
└── README.md
Prateek Soni — MCA Final Year | Python Developer | Full Stack
- Email: prateek.ps988@gmail.com
- LinkedIn: linkedin.com/in/prateek0210
- GitHub: github.com/Prateek022