Demonstrate binary classification starting with a 2D chirp-boundary data set and ending with confusion-dots and ROC plots.
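A 2D chirp-boundary data set and an ROC curve for a fitted classifier can be sketched as below. The exact chirp boundary used in the notebook is not specified here, so the boundary formula, sample sizes, and classifier choice are assumptions for illustration:

```python
# Sketch: a 2D "chirp"-boundary dataset plus an ROC curve for a simple classifier.
# The chirp formula below is a hypothetical stand-in for the notebook's boundary.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve, roc_auc_score

rng = np.random.default_rng(42)
X = rng.uniform(-1, 1, size=(2000, 2))
# Hypothetical chirp boundary: a sine whose frequency increases with x
boundary = 0.5 * np.sin(2 * np.pi * (X[:, 0] + 1) ** 2)
y = (X[:, 1] > boundary).astype(int)  # class 1 above the chirp, class 0 below

X_train, X_test = X[:1500], X[1500:]
y_train, y_test = y[:1500], y[1500:]

clf = LogisticRegression().fit(X_train, y_train)
scores = clf.predict_proba(X_test)[:, 1]   # P(class 1) used as the ROC score
fpr, tpr, _ = roc_curve(y_test, scores)
auc = roc_auc_score(y_test, scores)
```

The `fpr`/`tpr` arrays can be passed straight to a matplotlib plot to draw the ROC curve shown in the notebook.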
The models in the notebook are listed here with their Test and [Training] accuracies (these will vary for different random training samples):
1- 87.5% [86.5%] The Known Chirp model. Just about the best that any model can do; it indicates the Bayes limit.
2- 49.6% [52.7%] A Random model. Random guessing gives ~50% for the chirp here.
3- 65.6% [69.2%] A Really Simple model. There looks to be more blue in the top half, so it predicts that...
4- 74.5% [75.0%] Logistic Regression. A simple linear boundary does OK; the test and training accuracies are similar, indicating an under-fitting model.
5- 72.6% [79.2%] Decision Tree model. The training data are over-fitted by a poorly-shaped (although cute) model...
6- 74.7% [80.0%] SVM with Polynomial Features (deg=7). Some over-fitting, but a reasonable model shape.
7- 75.1% [90.2%] SVM using a Kernel (poly, deg=15). This SVM has the flexibility to fit the training data well beyond the Bayes limit.
8- 79.5% [96.2%] Neural Network (hidden=[40,9], no regularization). Finally a model breaks the ~75% accuracy level; note the extreme over-fitting of the training data!
9- 82.1% [90.2%] Neural Network (hidden=[40,9] with L2 regularization). L2 regularization has tamed the over-fitting and improved the test accuracy.
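The train-versus-test pattern of models 4-7 can be reproduced on a synthetic chirp-like dataset. The dataset and hyperparameters here are assumptions (the synthetic boundary is noiseless, so the accuracies will differ from the table above):

```python
# Sketch of models 4-7 on a synthetic chirp-like dataset; the chirp formula
# and hyperparameters are assumptions, so accuracies will differ from the table.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.svm import SVC, LinearSVC
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(2000, 2))
y = (X[:, 1] > 0.5 * np.sin(2 * np.pi * (X[:, 0] + 1) ** 2)).astype(int)
Xtr, Xte, ytr, yte = X[:1500], X[1500:], y[:1500], y[1500:]

models = {
    "LogisticRegression": LogisticRegression(),
    # Unconstrained tree: memorizes the training set (train accuracy -> 100%)
    "DecisionTree": DecisionTreeClassifier(),
    "SVM+PolyFeatures(deg=7)": make_pipeline(
        PolynomialFeatures(degree=7), StandardScaler(), LinearSVC(dual=False)),
    "SVM kernel(poly, deg=15)": make_pipeline(
        StandardScaler(), SVC(kernel="poly", degree=15, coef0=1)),
}
for name, m in models.items():
    m.fit(Xtr, ytr)
    print(f"{name}: test={m.score(Xte, yte):.3f} train={m.score(Xtr, ytr):.3f}")
```

Comparing each model's test score against its train score makes the under-fitting (similar scores) versus over-fitting (large gap) distinction in the list above concrete.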
Models 4-7 are based on the book and accompanying repo Hands-On Machine Learning by Aurélien Géron.
The Neural Network model used here comes from Andrew Ng's Deep Learning course, specifically the Regularization exercise in Week 1 of Course 2. A model with 2 hidden layers and L2 regularization is implemented in the file reg_utils_dd.py here, which is modified from the course's reg_utils.py.
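The effect of L2 regularization in models 8-9 can be sketched without the course's numpy implementation by using scikit-learn's MLPClassifier, whose `alpha` parameter is an L2 penalty. The hidden-layer sizes [40, 9] match the text; the dataset, `alpha` values, and iteration count are assumptions:

```python
# Sketch of the L2-regularization comparison (models 8-9) using sklearn's
# MLPClassifier in place of the course-derived reg_utils_dd.py code.
# hidden_layer_sizes matches the text; alpha values here are assumptions.
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(2000, 2))
y = (X[:, 1] > 0.5 * np.sin(2 * np.pi * (X[:, 0] + 1) ** 2)).astype(int)
Xtr, Xte, ytr, yte = X[:1500], X[1500:], y[:1500], y[1500:]

for alpha in (0.0, 1.0):  # no regularization vs. an L2 penalty
    nn = MLPClassifier(hidden_layer_sizes=(40, 9), alpha=alpha,
                       max_iter=2000, random_state=0).fit(Xtr, ytr)
    print(f"alpha={alpha}: test={nn.score(Xte, yte):.3f} "
          f"train={nn.score(Xtr, ytr):.3f}")
```

With `alpha=0.0` the train-test gap is typically larger; raising `alpha` shrinks the weights and narrows that gap, which is the taming effect model 9 shows.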
Images of the Trained Models Classifying the Test Data