Skip to content

This work applies two models to study the "image segmentation". One is the VGG+FCN8 and the other is UNet.

Notifications You must be signed in to change notification settings

WJiangH/Image_Segmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 

Repository files navigation

Image_Segmentation (VGG+FCN8 and UNet)

  1. VGG16 + FCN8 in the folder VFF_FCN8.
    This work uses VGG16 from scratch as a encoder and FCN8 as decoder to build a image segmentation model. The diagram of the network shown as,

fcn-8

The dataset uses dataset1 prepared by divamgupta.

The dataset contains folders for images and annotations. The images contain the video frames while the annotations contain the pixel-wise label maps. Each label map has the shape (height, width , 1) with each point in this space denoting the corresponding pixel's class. Classes are in the range [0, 11] (i.e. 12 classes) and the pixel labels correspond to these classes
data_processing.py contains functions of pre-processing dataset
image_processing.py includes functions of image visulization.

The VGG16 weights can be downloaded from VGG16_weight

Here is the result after fitting the model with 170 epochs receiving 0.85 accuracy(IOU and Dice were calculated to assess the performance):

  1. UNet in the folder UNet. The model will use the dataset Oxford Pets - IIT dataset, which is included in TensorFlow Datasets already. The UNet diagram is shown as,

unet

The validation accuracy after 20 epochs reaches 0.89, and one of the resulting examples is shown as,

About

This work applies two models to study the "image segmentation". One is the VGG+FCN8 and the other is UNet.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published