This project is designed to streamline the process for outgoing exchange students from the University of Hong Kong (HKU) by providing a unified querying system for credit transfer data. Using Selenium, multiple scrapers have been developed to gather course-related information from various publicly available HKU resources. The data is then consolidated into a consistent format in CSV files, enabling students to easily access and query the information they need. Currently, the database contains equivalence information on 25,000+ courses from 150+ universities.
Live Demo: HKU Credit Transfer Database
To run this project, you will need:
- Python 3.x (currently using Python 3.12.8)
- Selenium
- PyPDF2
- Camelot
- Pandas
- time
- A compatible web driver (e.g., Safari, Chrome) — (I used Safari)
-
Clone the repository:
git clone https://github.com/aatmiyasilwal/HKU-Credit-Transfer-Database.git cd hku-credit-transfer-scraper
-
Install the required dependencies:
pip install -r requirements.txt
-
Ensure you have the appropriate web driver installed and accessible in your system's PATH.
- Run the scrapers to collect data.
- The scraped data will be saved in CSV format in the
backend/
directory, and eventually compiled in themerged-credit_transfer_database.csv
. - To launch the website locally, use:
python -m http.server <port number>
- Improve the user interface, which is currently barebones.
- Incorporating the Law faculty database
Contributions are welcome! If you have suggestions for improvements or new features, please open an issue or submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for more details.
For any questions or feedback, please reach out to [email protected].
This project is not an official HKU product, and it simply serves as a tool to assist students in their search for credit transfer information. The final decision regarding course equivalencies and credit transfers rests with the home faculty of the student. Always consult with your faculty for official guidance.