An End-to-End Object Detection with Vision Transformation

Official implementation of "Place of Attention Matters!" in PyTorch. The article is here.

This work was inspired by the Vision Transformer (ViT) and DETR.

GOOD NEWS!!
The pretrained model (56 epochs) is available on Drive.

Model Result

Preprocess

1- Download the COCO dataset ("annotations", "train2017"), unzip it, and put it in the dataset folder
2- Run python3 preprocess.py
3- This creates dataset_file_out.txt
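
For orientation, the sketch below shows one way a script could flatten COCO annotations into a text file with one line per image. It is not the repository's preprocess.py, whose output layout is not described here: the function names `flatten_coco` and `write_dataset_file` and the line layout (image path followed by `class_id,x,y,w,h` groups) are assumptions made for illustration. The only facts relied on are the standard COCO JSON keys (`images`, `annotations`, `category_id`, and `bbox` stored as `[x, y, width, height]`).

```python
import json
from collections import defaultdict


def flatten_coco(coco, image_dir):
    """Return one text line per annotated image (hypothetical layout)."""
    # Map image id -> file name using the standard COCO "images" list.
    file_names = {img["id"]: img["file_name"] for img in coco["images"]}

    # Group boxes by image; COCO stores bbox as [x, y, width, height].
    boxes = defaultdict(list)
    for ann in coco["annotations"]:
        x, y, w, h = ann["bbox"]
        boxes[ann["image_id"]].append(
            f'{ann["category_id"]},{x:.1f},{y:.1f},{w:.1f},{h:.1f}'
        )

    # Assumed line layout: "<image path> <cls,x,y,w,h> <cls,x,y,w,h> ..."
    return [
        f"{image_dir}/{file_names[image_id]} " + " ".join(items)
        for image_id, items in sorted(boxes.items())
    ]


def write_dataset_file(annotation_path, image_dir, out_path):
    """Read a COCO annotation file and write the flattened lines to out_path."""
    with open(annotation_path, encoding="utf-8") as f:
        coco = json.load(f)
    with open(out_path, "w", encoding="utf-8") as f:
        f.write("\n".join(flatten_coco(coco, image_dir)) + "\n")
```

Images without any annotation are skipped in this sketch, since they contribute no boxes.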

Train or Fine-tune

python3 train.py --train_file_path ./dataset/dataset_file_out.txt --model_path ./m.pth --pretrained ./x_56.pth
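
The flags in the command above suggest an argument parser along these lines. This is a guess at the interface, not the contents of train.py: the help strings, the required/optional choices, and the reading of `--model_path` as the output checkpoint and `--pretrained` as an optional starting checkpoint are all assumptions inferred from the command.

```python
import argparse


def build_parser():
    """Build a parser for the three flags seen in the training command."""
    parser = argparse.ArgumentParser(description="Train or fine-tune (sketch)")
    # Text file produced by the preprocess step.
    parser.add_argument("--train_file_path", required=True,
                        help="path to dataset_file_out.txt")
    # Assumed meaning: where the trained weights are written.
    parser.add_argument("--model_path", required=True,
                        help="output checkpoint path")
    # Assumed meaning: omit to train from scratch, pass to fine-tune.
    parser.add_argument("--pretrained", default=None,
                        help="optional checkpoint to start from")
    return parser
```

Under these assumptions, leaving out `--pretrained` would mean training from scratch, and passing `./x_56.pth` would mean fine-tuning from the released checkpoint.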

Inference

python3 inference.py --img_path ./dataset/train2017/000000580197.jpg --model_path ./x_56.pth --out_path ./out.jpg

Model Architecture

Place of Attention Matters!

An End-to-End Object Detection with Vision Transformation!

In object detection, the goal is to find the class of each object and a bounding box around it. Most works have focused on predicting the class without properly considering bounding-box features. We present a new method that uses the relationships between image patches as a feature for the bounding-box detector. We also combine a convolutional neural network, as a local feature detector, with a Transformer network, as a long-range feature detector. The way the Transformer models relationships between patches in the image also inspired our approach. Our implementation can run in real time and improves on the accuracy of previous works.
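
To make "relationships between patches" concrete, here is a minimal sketch of scaled dot-product self-attention over patch feature vectors, the general mechanism used in Transformers. It is written in plain Python for readability and omits the learned query/key/value projections, so it illustrates the idea only and is not the architecture from the article. The names `softmax` and `patch_attention` are chosen here for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def patch_attention(patches):
    """Self-attention over patch vectors, without learned projections.

    patches: list of equal-length feature vectors, one per image patch.
    Returns (weights, mixed): weights[i][j] is how strongly patch i attends
    to patch j; mixed[i] is the attention-weighted average of all patches.
    """
    dim = len(patches[0])
    scale = math.sqrt(dim)
    weights = []
    for query in patches:
        # Similarity of this patch to every patch, scaled by sqrt(dim).
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in patches]
        weights.append(softmax(scores))
    # Each output vector mixes all patches according to its attention row.
    mixed = [
        [sum(w * patch[d] for w, patch in zip(row, patches)) for d in range(dim)]
        for row in weights
    ]
    return weights, mixed
```

Calling `patch_attention([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])` gives attention rows that each sum to 1, and the first patch attends more strongly to the similar second patch than to the dissimilar third one.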

Repository: saeed5959/object-detection-transformer