Project Page | Paper | Data
Yujin Wang, Tianyi Xu, Fan Zhang, Tianfan Xue, Jinwei Gu
Image Signal Processors (ISPs) convert raw sensor signals into digital images, which significantly influence the image quality and the performance of downstream computer vision tasks. Designing ISP pipeline and tuning ISP parameters are two key steps for building an imaging and vision system. To find optimal ISP configurations, recent works use deep neural networks as a proxy to search for ISP parameters or ISP pipelines. However, these methods are primarily designed to maximize the image quality, which are sub-optimal in the performance of high-level computer vision tasks such as detection, recognition, and tracking. Moreover, after training, the learned ISP pipelines are mostly fixed at the inference time, whose performance degrades in dynamic scenes. To jointly optimize ISP structures and parameters, we propose AdaptiveISP, a task-driven and scene-adaptive ISP. One key observation is that for the majority of input images, only a few processing modules are needed to improve the performance of downstream recognition tasks, and only a few inputs require more processing. Based on this, AdaptiveISP utilizes deep reinforcement learning to automatically generate an optimal ISP pipeline and the associated ISP parameters to maximize the detection performance. Experimental results show that AdaptiveISP not only surpasses the prior state-of-the-art methods for object detection but also dynamically manages the trade-off between detection performance and computational cost, especially suitable for scenes with large dynamic range variations.
conda create -n adaptiveisp python=3.10
conda activate adaptiveisp
conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia
git clone https://github.com/OpenImagingLab/AdaptiveISP.git
cd AdaptiveISP
pip install -r requirements.txt
- Download the LOD dataset from Baidu Drive.
- Unzip the LOD, and modify the dataroot yolov3/data/lod.yaml
To train the AdaptiveISP model:
- Modify the dataroot yolov3/data/lod.yaml
- Training without runtime_penalty
CUDA_VISIBLE_DEVICES=0 python train.py \ --batch_size=8 \ --data_name=lod \ --data_cfg=yolov3/data/lod.yaml \ --save_path=adaptive-isp
- Training with runtime_penalty
CUDA_VISIBLE_DEVICES=0 python train.py \ --batch_size=8 \ --add_noise=False \ --data_name=lod \ --data_cfg=yolov3/data/lod.yaml \ --save_path=adaptive-isp \ --runtime_penalty \ --runtime_penalty_lambda=5e-3
Test the AdaptiveISP model on LOD dataset:
-
Modify the dataroot yolov3/data/lod.yaml
-
Download the pretrained model and put in pretrained folder.
-
ckpt-lod-df-1.0: training with discount facotr(1.0)
-
ckpt-lod-df-0.98: training with discount facotr (0.98)
-
yolov3: pretrined model on COCO
-
-
Run
CUDA_VISIBLE_DEVICES=0 python yolov3/val_adaptiveisp.py \ --project=results \ --isp_weights=pretrained/ckpt-lod-df-1.0.pth \ --data_name=lod \ --data=yolov3/data/lod.yaml \ --batch-size=1 \ --steps=5 \ --name=adptiveisp \ --save_image \ --save_param
@article{wang2024adaptiveisp,
title={AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection},
author={Yujin Wang and Tianyi Xu and Fan Zhang and Tianfan Xue and Jinwei Gu},
booktitle={Conference on Neural Information Processing Systems},
year={2024}
}
Related research projects and implementations. We thank the original authors for their excellent work.