GSoC 2018 Guide
Gensim will participate in GSoC 2018 under the NumFOCUS umbrella; useful information for students is available here.
The main part of GSoC is 3 months of coding, which is a really long time, so it's very important that you feel comfortable and dedicated. First of all, have a look at the GSoC 2018 ideas and the Gensim Roadmap to understand our OSS plans. If you don't find a project that fits exactly, don't worry: feel free to suggest any NLP-related project that you feel would be a successful addition to Gensim. At the same time, projects that are mindful of our roadmap and broad goals are preferred.
Base skills:
- Git
- Python
- GitHub (how to submit a PR, how to merge upstream changes from the repository, etc.)
Additional skills (depending on the specific project):
- Cython, or C or C++ with Python bindings (for implementing new models from scratch or for optimization projects)
- PyTorch/Keras (for NN projects)
Experience shows that the best way to help your application is to contact the project you want to work with early. To do this, you can do several things:
- Introduce yourself on our mailing list, and answer questions from Gensim users.
- Fix bugs. Feel free to ping @menshikh-iv and the authors of the issue if you are not sure about your solution or the bug isn't clear to you.
- Help us with docstrings; check out the Documentation project and its progress.
Proposal tips (in addition to the great NumFOCUS intro):
- Have a look at good proposals from previous years: Parul, Prakhar
- Write your own proposal, mentioning:
  - Project name
  - Your personal information (name, GitHub account, email)
  - A detailed description, with motivating examples of which users this project will benefit and why
  - Implementation plan
  - Timeline
- Send to [email protected] with the subject "GSoC 2018 <NAME_SURNAME>":
  - Your proposal (PDF)
  - Your CV
  - Cover letter
- Submit your proposal to the NumFOCUS repo and to Google