This simple project demonstrates how to scrape table data from websites using Python.
website url = https://en.wikipedia.org/wiki/List_of_largest_companies_in_the_United_States_by_revenue
- Python 3.x
requests
librarybeautifulsoup4
librarypandas
-
Clone the repository:
git clone https://github.com/kingstondoesit/scrape-test.git
-
Navigate to the project directory:
cd scrape-test
-
Install the required libraries:
pip install -r requirements.txt
-
Run the scraper script:
python scraping_table.py
-
The scraped data will be saved in the specified output file.
Contributions are welcome! Please fork the repository and submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.