Client:
Start Date:
End Date:
Technologies:
- Python programming language
- Requests library
- Pandas library
Project Description:
GitHub Scraper was developed to extract information from over 5000 public repositories on GitHub, including commit counts, star counts, pull request counts, and fork counts. The extracted data needed to be sorted and stored in an Excel file.
Our Tasks:
- Calling the GitHub platform API
- Utilizing Google BigQuery to handle a large number of internet requests
- Collecting selected data
- Sorting the data
- Creating and inserting data into an Excel file