Skip to content

Latest commit

 

History

History
57 lines (42 loc) · 950 Bytes

README.md

File metadata and controls

57 lines (42 loc) · 950 Bytes

data_science_algorithms

A codebase to implement basic ML algorthims from scratch. Feel free to add your suggestions to the TODO lists.

TODO lists

Data processing

  • MinMax Scaler
  • Standard Scaler
  • PCA
  • Kernel PCA

ALgorithms

  • Linear regression
  • Logistic regression
  • Classification
  • Clustering
  • Feature Engineering
  • Random Forests
  • Decision Tree
  • Naive Bayes
  • Support Vector Machine
  • K-Nearest Neighbors
  • Gradient Boosting Machine​
  • Gaussian Process

Statistical techniques

  • Hypothesis testing
  • Confidence Interval
  • Regression Analysis
  • Dimensionality Reduction
  • ANOVA
  • f-test
  • chi-squre test
  • t-test

Data analysis libraries

  • pandas
  • numpy
  • scipy
  • scikit-learn
  • xgboost

Deep learning

  • Neural Networks
  • CNN
  • Tensorflow
  • Keras

Kaggle challenges