SPANET: SPATIAL ADAPTIVE CONVOLUTION BASED CONTENT-AWARE NETWORK FOR AERIAL IMAGE SEMANTIC SEGMENTATION

SPANet: Spatial Adaptive Convolution Based Content-Aware Network for Aerial Image Semantic Segmentation

SPANet: Spatial Adaptive Convolution Based Content-Aware Network for Aerial Image Semantic Segmentation

Blog Article

Semantic segmentation of remote sensing images encounters four significant difficulties: 1) complex backgrounds, 2) large-scale differences, custom congratulations banner 3) numerous small objects, and 4) extreme foreground–background imbalance.However, the existing generic semantic segmentation models mainly focus on the modeling context information and rarely focus on these four issues.This article presents an enhanced remote sensing image semantic segmentation framework to solve these problems through the hierarchical atrous pyramid (HASP) module and spatial-adaptive convolution-based FPN decoder framework.On the one hand, HASP solved the problem of complex backgrounds and large-scale differences by further enlarging the receptive field of the network through the cascade of atrous convolution with various rates.

On the other hand, spatial adaptive convolution is embedded in FPN decoder framework step by step to solve the problems of numerous small objects and extreme foreground–background imbalance.Besides, a boundary-based loss function is constructed to help the mel axolotl network optimize the relevant segmentation results.Extensive experiments over iSAID and ISPRS Vaihingen datasets reflect the superiority of the presented structure to conventional the state-of-the-art semantic segmentation approaches.

Report this page