Computer Vision, Deep Learning, Object-Detection, SOTA

State of the Art: Object detection (2/2)

22 May 2022 | 25 May 2023 | Foundations of DL

This article continues the work of the previous article. It summarizes other methods used by the top-ranking architectures on COCO, such as YOLOR, CBNet and the R-CNN series, CenterNet, DetectoRS and EfficientDet.

Computer Vision, Deep Learning, Object-Detection, SOTA

State of the Art: Object detection (1/2)

1 Mar 2022 | 6 Aug 2022 | Foundations of DL

This article gives a state of the art of object detection, evaluated on COCO and classified by architecture type. It then explains transformers, starting from the NLP domain and moving on to their adaptation to computer vision with the Swin Transformer and the Focal Transformer. The methods presented in the SwinV2-G paper to scale the Swin Transformer up to a 3-billion-parameter model are also explained.

AI, Computer Vision, Deep Learning, MOT, Object-Detection, Python

MOT, TF model customization and distributed training

6 Jan 2021 | 6 Aug 2022 | Foundations of DL

Python project, TensorFlow.

First, this article describes how to turn a simple object detector into a Multi-Object Tracking (MOT) system that keeps identities, so that subjects can be followed along a sequence. Second, it shows how to customize and retrain a model from the TensorFlow Object Detection API. Unlike the previous article, we parse the VOC2012 dataset with modern methods instead of implementing our own parser from scratch. We also distribute the training across multiple GPUs.
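To give an idea of what "keeping identities" means, here is a minimal sketch of one common approach: greedily matching each new detection to the previous frame's box with the highest IoU. This is an assumption chosen for illustration, not necessarily the method used in the article; the class name `GreedyIouTracker`, the `iou_threshold` value and the (xmin, ymin, xmax, ymax) box format are all hypothetical.

```python
from itertools import count


def iou(a, b):
    """Intersection over union of two boxes given as (xmin, ymin, xmax, ymax)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


class GreedyIouTracker:
    """Hypothetical tracker: assigns identities by greedy IoU matching."""

    def __init__(self, iou_threshold=0.3):
        self.iou_threshold = iou_threshold  # assumed value, tune per dataset
        self.tracks = {}                    # identity -> last known box
        self._ids = count()                 # source of fresh identities

    def update(self, boxes):
        """Return one identity per box, in the same order as `boxes`."""
        # Score every (detection, track) pair, best overlaps first.
        pairs = sorted(
            ((iou(box, prev), i, tid)
             for i, box in enumerate(boxes)
             for tid, prev in self.tracks.items()),
            reverse=True,
        )
        assigned, used = {}, set()
        for score, i, tid in pairs:
            if score < self.iou_threshold:
                break  # remaining pairs overlap too little to be the same subject
            if i in assigned or tid in used:
                continue
            assigned[i] = tid
            used.add(tid)
        # Detections without a match start a new identity.
        for i in range(len(boxes)):
            if i not in assigned:
                assigned[i] = next(self._ids)
        # Keep only tracks seen in this frame (no occlusion handling here).
        self.tracks = {assigned[i]: boxes[i] for i in range(len(boxes))}
        return [assigned[i] for i in range(len(boxes))]
```

Calling `update` once per frame with the detector's boxes returns a stable identity for each subject as long as it overlaps its previous position. A real tracker would also need to handle occlusions and missed detections, which this sketch deliberately leaves out.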

AI, Computer Vision, Deep Learning, Object-Detection, Python

SSD300 implementation

7 Nov 2020 | 6 Aug 2022 | Foundations of DL

Python project, TensorFlow.

This article describes how to implement a Deep Learning algorithm for object detection, following the Single Shot Detector architecture. It explains the implementation of the VGG16 backbone network, the SSD cone, the default box principle and the convolutions used to predict the box classes and to regress the offsets for their location. Finally, it shows how to convert the .xml annotations into data that such a network can use for training.
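As an illustration of the annotation step, the sketch below reads a Pascal VOC style .xml annotation and returns each object as a class name plus a box in normalized center form (cx, cy, w, h). The output format is an assumption about what a detector might consume, not necessarily the one used in the article, and `parse_voc_annotation` and `SAMPLE_XML` are hypothetical names with made-up values.

```python
import xml.etree.ElementTree as ET

# Made-up annotation following the Pascal VOC layout, for illustration only.
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>500</width><height>400</height><depth>3</depth></size>
  <object>
    <name>dog</name>
    <bndbox><xmin>100</xmin><ymin>100</ymin><xmax>300</xmax><ymax>300</ymax></bndbox>
  </object>
</annotation>
"""


def parse_voc_annotation(xml_text):
    """Return a list of (class_name, (cx, cy, w, h)) with values scaled to [0, 1]."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    width = float(size.findtext("width"))
    height = float(size.findtext("height"))

    objects = []
    for obj in root.iter("object"):
        box = obj.find("bndbox")
        xmin = float(box.findtext("xmin"))
        ymin = float(box.findtext("ymin"))
        xmax = float(box.findtext("xmax"))
        ymax = float(box.findtext("ymax"))
        # Convert corner coordinates to a normalized center/size representation.
        cx = (xmin + xmax) / 2 / width
        cy = (ymin + ymax) / 2 / height
        w = (xmax - xmin) / width
        h = (ymax - ymin) / height
        objects.append((obj.findtext("name"), (cx, cy, w, h)))
    return objects


if __name__ == "__main__":
    print(parse_voc_annotation(SAMPLE_XML))
```

Normalizing by the image size keeps the boxes independent of the input resolution, which is convenient when images are resized before training.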

AI Code Wizards

Category: Object-Detection
