Adaptive Locally-Aligned Transformer for low-light video enhancement

Yiwen Cao,Yukun Su, Jingliang Deng, Yu Zhang,Qingyao Wu

COMPUTER VISION AND IMAGE UNDERSTANDING(2024)

引用 0|浏览1
暂无评分
摘要
Low-light enhancement is a crucial task that aims to enhance the under-exposed input in computer vision. While state -of -the -art static single -image enhancement methods have made remarkable progress, yet, few attempts are explored the spatial -temporal sequence problem in low-light video enhancement. In this paper, we propose a simple yet highly effective method, termed as Adaptive Locally-Aligned Transformer (ALAT) for low-light video enhancement based on visual transformers. ALAT consists of three parts: feature encoder, locally-aligned transformer block (LATB) and pyramid feature decoder. Specifically, the transformer block enables the network to model the long-range spatial and appearance dependencies in videos due to its selfattention parallel computing mechanism. However, different from some previous approaches directly using the vanilla transformer, we consider that locality is significant in low-level vision tasks since the misaligned contextual local features (i.e., edges, shapes) may affect the prediction quality. Therefore, the proposed LATB is designed to align the video pixel with its most relevant ones adaptively in the local region to preserve the regional content information. Furthermore, we publish a new real -world low-light video dataset, named ExpressWay, to fill the gaps in the lack of dynamic low-light video scenarios, which contains high-quality videos with moving objects in both dark- and bright-light conditions. We conduct experiments on five benchmarks under three comprehensive settings including synthesized, static and our proposed dynamic low-light video datasets. Extensive experimental results show that our ALAT can outperform the previous state -of -the -arts by a large margin of 0.20-1.10 dB. Our method can be also extended to other video enhancement applications. The project is available at https://github.com/y1wencao/LLVE-ALAT.
更多
查看译文
关键词
Low-light enhancement,Vision transformer,Adaptive align,Spatial-temporal sequence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要