In this assignment, our objective is to classify the Citrus Leaves dataset using a CNN classifier. We compare the performance of different optimizers and hyper-parameters on metrics such as accuracy, precision, and recall.
- Introduction
- Approach
- Results
- Discussion of Results
- Key Takeaways
- Installation Guide
- References
- Contribution
Image classification is among the fundamental tasks handled by CNNs: the goal is to assign a label to an image, which requires the network to comprehend its content. In this assignment, the objective is not just to implement a CNN classifier but to understand and design it as well.
The original dataset contains 759 images of healthy and unhealthy citrus fruits and leaves. However, as of now the owners only export 594 images of citrus leaves, with the following four labels: Black Spot, Canker, Greening, and Healthy. The exported images are in PNG format with dimensions 256×256.
ImageDataGenerator was used to split the dataset into training, validation, and testing data (60%, 20%, 20%). This allowed us to randomly augment the training data by zooming in and out (30%), rotating (±180°), shifting height and width (30%), and flipping horizontally and vertically. Finally, both training and validation images were rescaled pixel-wise to intensities in the range [0, 1], as in the sketch below.
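A minimal sketch of this pipeline using the Keras `ImageDataGenerator` API; the directory paths and the exact split mechanics are assumptions for illustration:

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Augmentation for the training data only; parameter values mirror the text.
train_datagen = ImageDataGenerator(
    rescale=1.0 / 255,        # rescale pixel intensities to [0, 1]
    zoom_range=0.3,           # zoom in/out by up to 30%
    rotation_range=180,       # rotate by up to ±180°
    width_shift_range=0.3,    # shift width by up to 30%
    height_shift_range=0.3,   # shift height by up to 30%
    horizontal_flip=True,
    vertical_flip=True,
)
val_datagen = ImageDataGenerator(rescale=1.0 / 255)  # validation: rescaling only

# "citrus_leaves/train" and "citrus_leaves/val" are hypothetical paths to the
# 60% and 20% splits; class_mode="categorical" yields one-hot labels.
train_gen = train_datagen.flow_from_directory(
    "citrus_leaves/train", target_size=(256, 256),
    batch_size=32, class_mode="categorical",
)
val_gen = val_datagen.flow_from_directory(
    "citrus_leaves/val", target_size=(256, 256),
    batch_size=8, class_mode="categorical",
)
```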
We built a custom model for this task, with the following layers (a code sketch follows the list):
- Convolution (16 filters, 3×3 kernel, ReLU)
- MaxPooling (/2)
- Convolution (32 filters, 3×3 kernel, ReLU)
- MaxPooling (/2)
- Convolution (64 filters, 3×3 kernel, ReLU)
- MaxPooling (/2)
- Convolution (64 filters, 3×3 kernel, ReLU)
- MaxPooling (/2)
- Convolution (64 filters, 3×3 kernel, ReLU)
- MaxPooling (/2)
- Flatten [4096 units]
- Dense (512, ReLU)
- Dense (4, Sigmoid)
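A sketch of this stack in Keras; `padding="same"` is an assumption (with it, the flattened vector has 8 × 8 × 64 = 4096 units, matching the list above):

```python
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(16, (3, 3), activation="relu", padding="same",
                  input_shape=(256, 256, 3)),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(32, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),                       # 8 * 8 * 64 = 4096 units
    layers.Dense(512, activation="relu"),
    layers.Dense(4, activation="sigmoid"),  # one score per class label
])
```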
The inspiration for our architecture is drawn from AlexNet. Since the dataset is small and simple, it was important to control the number of parameters in order to control variance. This particular model was reached after more than a hundred iterative attempts to improve it: every attempt at adding regularization, changing activation functions, or altering the order or number of layers resulted in noticeably worse accuracy, so this architecture is the sweet spot we arrived at.
We tried Stochastic Gradient Descent (SGD) with momentum, Adam, and RMSprop; SGD performed the worst and RMSprop the best. The main hyper-parameters we tuned were:
- Number of Epochs
- Batch Size
We kept the number of epochs at 35 so as to neither underfit nor overfit the model, and chose the training and validation batch sizes (32 and 8) so that their ratio is close to the ratio of the two splits' shares of the dataset (60% : 20% = 3). A sketch of this training setup follows.
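A hedged sketch of this training setup; the loss function and exact compile arguments are assumptions:

```python
from tensorflow.keras.metrics import Precision, Recall

model.compile(
    optimizer="rmsprop",  # the best performer; "adam" and SGD were also tried
    loss="categorical_crossentropy",
    metrics=["accuracy", Precision(), Recall()],
)

history = model.fit(
    train_gen,                # training batches of 32 (see the sketch above)
    validation_data=val_gen,  # validation batches of 8
    epochs=35,                # chosen to neither underfit nor overfit
)
```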
| Optimizer | Loss   | Accuracy | Precision | Recall |
|-----------|--------|----------|-----------|--------|
| SGD       | 0.9554 | 0.4832   | 0.6321    | 0.3408 |
| RMSprop   | 0.5202 | 0.7849   | 0.8049    | 0.7374 |
| Adam      | 0.4764 | 0.8156   | 0.4140    | 0.9944 |
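The test-set numbers above could be obtained with a call like the following sketch, where `test_gen` is a hypothetical generator over the held-out 20% split:

```python
# evaluate() returns the metrics in the order they were compiled.
loss, accuracy, precision, recall = model.evaluate(test_gen)
print(f"loss={loss:.4f}  accuracy={accuracy:.4f}  "
      f"precision={precision:.4f}  recall={recall:.4f}")
```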
*(Figures: loss, accuracy, precision, and recall curves for SGD, RMSprop, and Adam.)*
We can see that the maximum accuracy reached by the two best optimizers is about 80%, which is not bad considering the small size of the dataset. From our observations, a sufficiently low learning rate combined with a large number of epochs yields the best validation and testing accuracy; the learning curves served as a guide for choosing a proper learning rate. In the case of RMSprop, the learning curves are quite 'jumpy', but overall the validation loss decreases and the validation accuracy increases as the number of epochs grows, so we know the model is learning.
A possible remedy for the fluctuations was to further decrease the learning rate and use an optimizer with momentum. However, when we tried this, it resulted in relatively worse loss and accuracy, so the model did not train as well as desired; decreasing the learning rate even by a factor of 5 caused the weights to get stuck in a local minimum. Therefore, even with its volatility, RMSprop yields satisfactory testing metrics and remains a good choice.
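For reference, a sketch of the optimizer variants discussed here; the learning-rate values are illustrative assumptions, not our exact settings:

```python
from tensorflow.keras.optimizers import RMSprop, SGD

rmsprop = RMSprop(learning_rate=1e-3)        # Keras default rate
rmsprop_slow = RMSprop(learning_rate=2e-4)   # ~5x lower: weights got stuck
sgd_momentum = SGD(learning_rate=1e-3, momentum=0.9)  # trained worse here
```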
Adam is also a very good option. It has accuracy close to RMSprop and a significantly higher recall, though a lower precision (see the table above). Overall, we can say that Adam is the best choice of optimizer in this case.
In this work, we have implemented classification of citrus leaf diseases using a custom-made Convolutional Neural Network, trained with different optimizers whose results have been discussed and compared. As the table shows, Adam and RMSprop performed similarly, with Adam the better overall choice given its scores across metrics. Our final testing accuracy is ~80%.
- Clone this repository using
  ```bash
  $ git clone https://github.com/Engineer1999/CSP520-Computer-Vision.git
  $ cd CSP520-Computer-Vision
  ```
- Install the dependencies using
  ```bash
  $ pip install -r requirements.txt
  ```
- To run locally, launch Jupyter Notebook using
  ```bash
  $ jupyter notebook
  ```
  or upload the `.ipynb` file on Google Colab.
- V. Fung, "An overview of ResNet and its variants," 17-Jul-2017. [Online]. Available: Link.
- "VGG16 – Convolutional Network for Classification and Detection," 24-Feb-2021. [Online]. Available: Link.
- "Keras Conv2D: Working with CNN 2D Convolutions in Keras." [Online]. Available: Link.
- Team : The Salvator Brothers
- Members : Manav Vagrecha, Kirtan Kalaria