DiCo-NeRF: Difference of Cosine Similarity for Neural Rendering of Fisheye Driving Scenes

  • Jiho Choi*
  • , Gyutae Hwang
  • , Sang Jun Lee
  • *Corresponding author for this work

Research output: Contribution to conferenceConference paperpeer-review

Abstract

Neural radiance fields have emerged in the field of autonomous driving, which contributes to improve perception of the complex 3D environment through the reconstruction of geometry and appearance. Moving objects and sky for outdoor environment is challenging to optimize the NeRF model. Previous work addresses these challenges through preprocessing such as masking; however, the masking process requires additional ground-truth data and a segmentation network. We propose DiCo-NeRF, an approach for driving scenes by leveraging cosine similarity map differences of vision-language aligned model. DiCo-NeRF investigates the correlation between rendered patches and predefined text and adjusts the loss of challenging patches, such as moving objects and the sky. Our neural radiance field utilizes embedding vectors from a pre-trained CLIP to obtain the cosine similarity maps. We introduce SimLoss, a loss function aimed at regulating the color field of NeRF based on the quantified distribution differences between ground-truth and rendered similarity maps. Unlike previous NeRF models that used driving datasets, our approach does not require additional input, such as sensor data, to the model. Experimental results demonstrate that the incorporation of language semantic cues improves the performance of the novel view synthesis task, particularly in complex driving environments. We conducted experiments that included fisheye driving scenes from the KITTI-360 and real-world datasets. Our code is available at https://github.com/ziiho08/DiCoNeRF.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
PublisherIEEE Computer Society
Pages7850-7858
Number of pages9
ISBN (Electronic)9798350365474
DOIs
StatePublished - 2024
Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024 - Seattle, United States
Duration: 2024.06.162024.06.22

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN (Print)2160-7508
ISSN (Electronic)2160-7516

Conference

Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Country/TerritoryUnited States
CitySeattle
Period24.06.1624.06.22

Keywords

  • Autonomous driving
  • Fisheye camera
  • Neural Radiance Fields
  • Vision-Language models

Quacquarelli Symonds(QS) Subject Topics

  • Computer Science & Information Systems
  • Engineering - Electrical & Electronic
  • Engineering - Petroleum
  • Data Science

Fingerprint

Dive into the research topics of 'DiCo-NeRF: Difference of Cosine Similarity for Neural Rendering of Fisheye Driving Scenes'. Together they form a unique fingerprint.

Cite this