DC Field | Value | Language |
---|---|---|
dc.contributor.author | LEE, SEUNGYONG | - |
dc.contributor.author | PARK, SEONG JIN | - |
dc.contributor.author | HONG, KI SANG | - |
dc.date.accessioned | 2018-05-10T08:29:29Z | - |
dc.date.available | 2018-05-10T08:29:29Z | - |
dc.date.created | 2018-02-22 | - |
dc.date.issued | 2017-10-22 | - |
dc.identifier.uri | https://oasis.postech.ac.kr/handle/2014.oak/41856 | - |
dc.description.abstract | In multi-class indoor semantic segmentation using RGB-D data, it has been shown that incorporating depth features into RGB features helps improve segmentation accuracy. However, previous studies have not fully exploited the potential of multi-modal feature fusion; for example, they simply concatenate RGB and depth features or average RGB and depth score maps. To learn the optimal fusion of multi-modal features, this paper presents a novel network that extends the core idea of residual learning to RGB-D semantic segmentation. Our network effectively captures multi-level RGB-D CNN features by including multi-modal feature fusion blocks and multi-level feature refinement blocks. Feature fusion blocks learn residual RGB and depth features and their combinations to fully exploit the complementary characteristics of RGB and depth data. Feature refinement blocks learn the combination of fused features from multiple levels to enable high-resolution prediction. Our network can efficiently train discriminative multi-level features from each modality end-to-end by taking full advantage of skip-connections. Our comprehensive experiments demonstrate that the proposed architecture achieves state-of-the-art accuracy on two challenging RGB-D indoor datasets, NYUDv2 and SUN RGB-D. | - |
dc.publisher | ICCV | - |
dc.relation.isPartOf | The IEEE International Conference on Computer Vision (ICCV) | - |
dc.title | RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation | - |
dc.type | Conference | - |
dc.type.rims | CONF | - |
dc.identifier.bibliographicCitation | The IEEE International Conference on Computer Vision (ICCV), pp.4980 - 4989 | - |
dc.citation.conferenceDate | 2017-10-22 | - |
dc.citation.conferencePlace | IT | - |
dc.citation.endPage | 4989 | - |
dc.citation.startPage | 4980 | - |
dc.citation.title | The IEEE International Conference on Computer Vision (ICCV) | - |
dc.contributor.affiliatedAuthor | LEE, SEUNGYONG | - |
dc.contributor.affiliatedAuthor | PARK, SEONG JIN | - |
dc.contributor.affiliatedAuthor | HONG, KI SANG | - |
dc.description.journalClass | 1 | - |