Skip to content

Latest commit

 

History

History
144 lines (108 loc) · 7.21 KB

README.md

File metadata and controls

144 lines (108 loc) · 7.21 KB

Data-Science-Crate

Contributors Forks Issues Pull Request

A one-stop repository for new-comers in Data-Science.

Contents

  1. Description
  2. Project structure
  3. Project roadmap
  4. Getting started
  5. Preview Notebooks
  6. Built with
  7. Contributing
  8. Authors
  9. License
  10. Acknowledgments

Description

What are the projects?

This repository has two projects -

  • Classification based project on Cancer prediction Cancer_prediction.ipynb
  • Regression based project on Stock price prediction L&T_Stock_Price_prediction.ipynb

How can this project help?

  • Cancer Prediction

    Machine learning is not new to cancer research. Artificial neural networks (ANNs) and decision trees (DTs) have been used in cancer detection and diagnosis for nearly 20 years.The fundamental goals of cancer prediction and prognosis are distinct from the goals of cancer detection and diagnosis.

  • Stock price Prediction

    Stock market prediction aims to determine the future movement of the stock value of a financial exchange. The accurate prediction of share price movement will lead to more profit investors can make.

The idea

  • Cancer Prediction

    The idea is to predict whether a cell is cancerous or non-cancerous based on different features of cell using different Machine learning algorithms or Deep learning techniques

  • Stock Prediction

    The idea is to predict the future stock pricing based on different dependencies of a stock using different Machine learning algorithms or Deep learning techniques

Project structure

.
├── Classification
│   ├── Cancer_prediction.ipynb                   Jupyter notebook for Cancer prediction
│   ├── Datasets                                  Dataset for Cancer prediction
│   │   ├── cancer_data.csv
│   │   └── dataset.txt
│   └── classification.txt                        Basic information about Classification
├── Regression
│   ├── Datasets                                  Dataset for L&T stock price prediction
│   │   ├── LT.csv
│   │   └── dataset.txt
│   ├── L&T_Stock_Price_prediction.ipynb          Jupyter notebook for Stock price prediction
│   └── regression.txt                            Basic information about Regression
├── LICENSE
├── code_of_conduct.md
├── contributing.md
└── readme.md

Project roadmap

The project currently does the following things-

  • Data cleaning
  • Data preprocessing
  • Already implemented a very few machine learning algorithms or deep learning techniques

Following things can be implemented -

  • Data augmentation or manipulation
  • Better data visualization
  • Implementation of different Machine learning algorithms or deep learning techniques to achieve better prediction results

Getting started

Prerequisites

  • Very basic understanding of git and github:

    1. What are repositories (local - remote - upstream), issues, pull requests
    2. How to clone a repository, how to fork a repository, how to set upstreams
    3. Adding, committing, pulling, pushing changes to remote repositories
  • For EDA and Visualisation

    1. Basic syntax and working of python.(This is a must)
    2. Basic knowledge of pandas library. Reading this blog might help.
    3. Basic knowledge of matplotlib library. Reading this blog might help.
    4. Basic knowledge of seaborn library. Reading this blog might help.
    5. Basic knowledge of scikit learn library. Reading this blog might help.
    6. Basic knowledge of tensorflow library. Reading this blog might help.

    However the code is well explained, so anyone knowing the basics of Python can get a idea of what's happenning and contribute to this.

Installing

A step by step series of examples that tell you how to get a development env running.

There are two ways of running the code.

  1. Running the code on web browser.(Google Colab) [Recommended]

    • Head on to Google colab
    • Then click on Upload Notebook Tab.
    • Upload the notebook that you got from this repo. Colab-1
    • Connect with the runtime. Colab-2
    • Upload your dataset. Colab-3
    • Then Click on Run All. Colab-4
    • Start Editing.
  2. You can also run the code locally in your computer by installing Anaconda.

Preview Notebooks

Notebook will be opened in Google Colab

Built with

Contributing

Please read contributing.md for details on our code of conduct, and the process for submitting pull requests to us.

Authors

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments