Category Image Segmentation

Image Segmentation

140. Spatial Pyramid Pooling

Spatial Pyramid Pooling helps the network output the same shape regardless of any aspect ratio and input size. Instead of Pooling with a fixed filter size, it divides the input with different levels of ratio, so the output would not…

Kyosuke
June 30, 2022

Computer Vision, Image Segmentation

135. UNet

Unet may be one of the most basic researches on segmentation tasks. It consists of 3 parts: Encoding Phase (Apply Convolutions to classify object) -> Bridge -> Decoding Phase(Restore information so that the output would be 388×388). During the final…

Kyosuke
June 25, 2022

Computer Vision, Image Segmentation

134. HRNet

HRNet was a research done by Microsoft which lead to higher performance compared with state-of-the-art architectures. Traditional segmentation models utilize skip connections in order to recover spatial information from previous layers. The problem with this method is that it can’t…

Kyosuke
June 24, 2022

Computer Vision, Image Segmentation

133. FCN Upscaling

In order to classify images more precisely, the traditional way is to apply convolution and pooling to lower the dimension of the input so that the model can understand more complex features. This is ok for classification tasks because you…

Kyosuke
June 23, 2022

Computer Vision, Image Segmentation

132. Attention UNet

Attention Unet highlights only relevant activations during training. This can not only perform better when the target you want to detect is relatively tiny compared to the size of the picture, but it can also reduce unnecessary computations. The overall…

Kyosuke
June 22, 2022

Computer Vision, Image Segmentation

130. Loss Options for Semantic Segmentation

If I were to restart a semantic segmentation model project again, I would choose the loss function by considering the following. Is there any imbalance in your data? ⇒（NO）⇒ Binary Cross Entropy ⇓ (YES) ⇓ Is the area you want…

Kyosuke
June 16, 2022

Computer Vision, EdgeAI, Image Segmentation, Pytorch

127. TensorRT Engine Average IoU

I did an experiment to compare how model performance change when converted to TensorRT engine. These are the steps I took. 1. Train Several Binary Segmentation Model 2. Randomly Choose 30 images from the dataset 3. Run inference with the…

Kyosuke
June 13, 2022

Computer Vision, Image Segmentation

126. ArgMax Function

Argmax compares pixels in the same position across channels, and acquires the index of the highest channel. This can be useful for semantic segmentation. Semantic segmentation models outputs the same width and height as the input image and creates a…

Kyosuke
June 12, 2022

Image Segmentation, Pytorch

124. Preprocessing for Deepstream

I found out why my TensorRT engine model was not working as expected. I messed up with configuring the preprocessing step for Deepstream. When you use Deepstream to run inference, there is a property called net-scale-factor and offsets which you…

Kyosuke
June 10, 2022

Image Segmentation, Pytorch

123. TensorRT Engine Performance Comparison

I’ve compared the performance between the model before and after converting to TensorRT engine. The output from the TensorRT engine is flattened to a 1d array, so I had to convert that back to a 2d array to display the…

Kyosuke
June 9, 2022