Abstract
Remote photoplethysmography (rPPG) is a camera-based technique that enables non-invasive monitoring of physiological signals such as heart rate (HR) and respiration rate (RR). In light of this advantage, many researchers have suggested deep learning-based methods to measure physiological signals from video data. The 3D convolutional neural network (3D CNN) has been widely applied to capture spatio-temporal features of subtle rPPG changes from facial video. However, the limited receptive fields of the convolution operation leave room for improvement in obtaining global facial features that are crucial for accurate rPPG estimation. Recently, vision transformer trained with self-supervised learning have emerged as powerful tools for extracting high-level features than CNN and supervised ViT. In this study, we propose DINO-rPPG, a method that utilizes a pre-trained DINO to obtain features relevant to the face without additional training. The DINO representation is extracted by a DINO-based semantic extractor (DSE), which effectively captures the high-level semantic features of the face region. The spatio-temporal feature is important for estimating accurate rPPG, therefore, we enhance the spatial DINO representation by incorporating it with features from a spatio-temporal extractor (STE). We conducted experiments using the V4V dataset for estimating HR values, and the results demonstrated that DINO representation guidance is effective for rPPG estimation.
| Original language | English |
|---|---|
| Pages (from-to) | 91-101 |
| Number of pages | 11 |
| Journal | CEUR Workshop Proceedings |
| Volume | 3750 |
| State | Published - 2024 |
| Event | 3rd Vision-Based Remote Physiological Signal Sensing Challenge and Workshop, RePSS 2024 - Jeju, Korea, Republic of Duration: 2024.08.5 → … |
Keywords
- Heart rate estimation
- Remote photoplethysmography
- Self-supervised vision transformer
Quacquarelli Symonds(QS) Subject Topics
- Computer Science & Information Systems
Fingerprint
Dive into the research topics of 'DINO-rPPG: Remote photoplethysmography Measurement using Facial Representation from DINO Guidance'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver