Abstract
Blind image quality assessment (BIQA) has always been a challenging problem due to the absence of reference images. In this paper, we propose a novel dual-branch vision transformer for BIQA, which simultaneously considers both local distortions and global semantic information. It first extracts dual-scale features from the backbone network, and then each scale feature is fed into one of the transformer encoder branches as a local feature embedding to consider the scale-variant local distortions. Each transformer branch obtains the context of global image distortion as well as the local distortion by adopting content-aware embedding. Finally, the outputs of the dual-branch vision transformer are combined by using multiple feed-forward blocks to predict the image quality scores effectively. Experimental results demonstrate that the proposed BIQA method outperforms the conventional methods on the six public BIQA datasets.
| Original language | English |
|---|---|
| Article number | 103850 |
| Journal | Journal of Visual Communication and Image Representation |
| Volume | 94 |
| DOIs | |
| State | Published - 2023.06 |
Keywords
- Blind image quality assessment
- No-reference image quality assessment
- Perceptual image processing
- Vision transformer
Quacquarelli Symonds(QS) Subject Topics
- Computer Science & Information Systems
- Engineering - Electrical & Electronic
- Engineering - Petroleum
- Data Science
Fingerprint
Dive into the research topics of 'Dual-branch vision transformer for blind image quality assessment'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver