Text mining NCBI Pubmed using Entrez

In this session, we will cover some basic features from the Entrez module embedded within the BioPython package. The session will introduce you to scripting automated searches of the NCBI Pubmed database as well as some approaches to exploring rudimentary ways of analysing text data. Anyone who has attended both of our previous Python workshops will have all the necessary background to complete this session. If you have not been able to make our previous Python sessions, all the Jupyter notebooks from them are posted on repositories within the IC-Computational-Biology-Society organisation.

NB: This session does not cover natural language processing or topics in machine learning. Nevertheless, it should give you the foundation to begin an investigation that culminates in the use of dedicated Python packages, such as NLTK. By the end of the session, you should be able to construct your own dataset of NCBI Pubmed text data on which to (potentially) start training machine learning models.

If you are attending our virtual interactive session on Microsoft Teams, please make sure you can run Anaconda, which can be easily obtained from Imperial College's AppsAnywhere platform or from the offical Anaconda website (only recommended if you cannot access AppsAnywhere or are completing the tutorial outside of the scheduled session).

Details of use

This tutorial is intended for educational use. If you would like to use any material herein for teaching or ulterior purposes outside the remit of the Imperial College Computational Biology Society, please contact the referenced authors.

Author

Joseph I. J. Ellaway

Email: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
text_mining_tutorial_using_Entrez.ipynb		text_mining_tutorial_using_Entrez.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text mining NCBI Pubmed using Entrez

Details of use

Author

About

Releases

Packages

Languages

License

IC-Computational-Biology-Society/NCBI_text_mining_session

Folders and files

Latest commit

History

Repository files navigation

Text mining NCBI Pubmed using Entrez

Details of use

Author

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages