This project focuses on extracting data from multiple websites using Python and performing basic data analysis.
- Python
- BeautifulSoup
- Requests
- Pandas
- Scraped quotes (text and author) from Quotes website
- Scraped job data (title, company, location) from Fake Jobs website
- Stored data in structured format using pandas
- Exported datasets to Excel files
- Performed filtering and basic analysis
- Filtered jobs based on location
- Filtered jobs based on domain (Python, Teaching)
- Found most common job locations
- Basic exploration of quotes dataset
fake_jobs.xlsx→ job datasetquotes.xlsx→ quotes dataset
-
Install required libraries:
pip install requests beautifulsoup4 pandas -
Run the script:
python scraper1.py python scraper2.py
Waheed Mujtaba