Understanding imbalanced semantic segmentation through neural collapse
Proceedings of the IEEE/CVF conference on computer vision and …, 2023•openaccess.thecvf.com
A recent study has shown a phenomenon called neural collapse in that the within-class
means of features and the classifier weight vectors converge to the vertices of a simplex
equiangular tight frame at the terminal phase of training for classification. In this paper, we
explore the corresponding structures of the last-layer feature centers and classifiers in
semantic segmentation. Based on our empirical and theoretical analysis, we point out that
semantic segmentation naturally brings contextual correlation and imbalanced distribution …
means of features and the classifier weight vectors converge to the vertices of a simplex
equiangular tight frame at the terminal phase of training for classification. In this paper, we
explore the corresponding structures of the last-layer feature centers and classifiers in
semantic segmentation. Based on our empirical and theoretical analysis, we point out that
semantic segmentation naturally brings contextual correlation and imbalanced distribution …
Abstract
A recent study has shown a phenomenon called neural collapse in that the within-class means of features and the classifier weight vectors converge to the vertices of a simplex equiangular tight frame at the terminal phase of training for classification. In this paper, we explore the corresponding structures of the last-layer feature centers and classifiers in semantic segmentation. Based on our empirical and theoretical analysis, we point out that semantic segmentation naturally brings contextual correlation and imbalanced distribution among classes, which breaks the equiangular and maximally separated structure of neural collapse for both feature centers and classifiers. However, such a symmetric structure is beneficial to discrimination for the minor classes. To preserve these advantages, we introduce a regularizer on feature centers to encourage the network to learn features closer to the appealing structure in imbalanced semantic segmentation. Experimental results show that our method can bring significant improvements on both 2D and 3D semantic segmentation benchmarks. Moreover, our method ranks first and sets a new record (+ 6.8% mIoU) on the ScanNet200 test leaderboard.
openaccess.thecvf.com
Showing the best result for this search. See all results