Skip to content

GSOC 2018 Guide

Menshikh Ivan edited this page Feb 13, 2018 · 7 revisions

Into

Gensim will be participating in GSoC 2018 under NumFOCUS umbrella, useful information for students available here.

How to choose project

The main part of GSoC is 3 months of coding, which is really long time and it’s very important that you feel comfortable and dedicated. First of all, have a look at GSoC 2018 ideas and the Gensim Roadmap for an understanding of our OSS plans. If you didn’t find a project that fits exactly, don't worry - feel free to suggest any project related to NLP that you feel would be a successful addition to Gensim. At the same time, projects that are mindful of our roadmap and broad goals are preferred.

What're skills you need

Base skills:

  • Git
  • Python
  • GitHub ("How to submit PR", "How to merge upstream from repository", etc)
  • Additional skills (depends on concrete project):
    • Cython, or C or C++ with Python bindings (for implementing new models from scratch or optimization projects)
    • PyTorch/Keras for NN projects.

Getting Started Early

Experience shows that the best thing to help your application is to contact the project you want to work with early. For this, you can make several things:

Proposal tips (in addition to great NumFOCUS intro)

  1. Have a look at good proposals from the previous years: Parul, Prakhar
  2. Write you own proposal, mentioning:
    • Project name
    • You personal information (name, github account, email)
    • A detailed description, with motivating examples of what users this project will benefit and why
    • Implementation plan
    • Timeline
  3. Sent to [email protected] with subject "GSoC 2018 <NAME_SURNAME>":
    • Your proposal (PDF)
    • Your CV
    • Cover letter
  4. Submit your proposal to NumFOCUS-repo and to Google
Clone this wiki locally