SwinIR: image super-resolution, denoising and JPEG compression artifact reduction

This article describes SwinIR, a state-of-the-art architecture for super-resolution and image denoising. It follows the articles describing transformers and Swin transformers that can be found here. It also discusses Shallow and Deep feature extraction with Residual Swin Transformer Block, HQ image reconstruction and per-pixel / perceptual / Charbonnier losses.

Continue reading “SwinIR: image super-resolution, denoising and JPEG compression artifact reduction”

State of the Art: Object detection (1/2)

The aim of this article is to give a state of the art of object detection evaluated on COCO and classified by architecture type. Then, the transformers will be explained starting from the NLP domain to their adaptation to the computer vision domain with the Swin Transformers and the Focal Transformers. The methods presented in the SwinV2-G paper to adapt the Swin Transformer to a 3 billion parameters model will also be explained.

Continue reading “State of the Art: Object detection (1/2)”