Multi-Scale Context Aggregation By Dilated Convolutions

The semantic segmentation task in computer vision involves partitioning an image into a set of multiple non-overlapping and semantically interpretable regions. This entails assigning pixel-wise class labels to the entire image, making it a dense prediction task. Owing to the massive improvements in image classification performance achieved by CNNs over the recent years, there have been several works which successfully repurpose these popular image classification CNN architectures for dense prediction tasks. This paper questions this approach, and instead investigates if modules specifically designed for a dense prediction task would improve the segmentation performance even further. Unlike image classification networks which aggregate multi-scale contextual information through successive downsampling operations to obtain a global prediction, a dense prediction task like semantic segmentation requires “multi-scale contextual reasoning in combination with full-resolution output”. % However, increasing the receptive field of the convolution operator comes at the cost of more parameters. The authors therefore propose using the dilated convolution operator to address this. To this end, this paper makes threefold contributions: (a) a generalized form of the convolution operator to account for dilation, (b) a multi-scale context aggregation module that relies on dilated convolution, and (c) a simiplified front-end module which gets rid of “vestigial components” carried over from image classification networks. ...

September 21, 2020 · 3 min · Kumar Abhishek

Augmenting Data by Learning Spatial and Appearance Transformations

Here are some slides I made to present this CVPR 2019 paper in our reading group:

June 20, 2019 · 1 min · Kumar Abhishek

Deep NNs for Segmentation

November 28, 2018 · 0 min · Kumar Abhishek

Learning Active Learning from Data

Here are some slides I made to present this NIPS 2017 paper in our reading group:

November 18, 2018 · 1 min · Kumar Abhishek

CNNs for Brain Tumor Segmentation in MRI Scans (BraTS)

Drawing inspiration from the popular VGG networks, the paper proposes using a deep convolutional neural network architecture with small convolutional kernels for segmentation of gliomas in MRI images. The authors discuss the relative advantages of using small kernels, and also explore the use of intensity normalization as a pre-processing step, which was unconventional in CNN-based segmentation methods. The proposed algorithm obtained the first position for the complete, the core, and the enhancing regions in Dice Similarity Coefficient metric in the Brain Tumor Segmentation Challenge 2013 database (BraTS 2013). ...

November 14, 2018 · 3 min · Kumar Abhishek