Skip to content

Latest commit

 

History

History
executable file
·
94 lines (72 loc) · 4.7 KB

README.md

File metadata and controls

executable file
·
94 lines (72 loc) · 4.7 KB

TF_EnhanceDPED project

Prerequisites

  • Python + scipy, numpy packages
  • TensorFlow (>=1.0.1) + CUDA CuDNN
  • Nvidia GPU

File tree

├── data
│   ├── dped -> /home/***/datasets/dped/
│   ├── __init__.py
│   ├── load_dataset.py
│   └── pretrain_models
├── demo
├── experiments
│   ├── config
│   └── logs
├── loss
│   ├── GAN_loss.py
│   ├── __init__.py
│   ├── other_loss.py
│   └── vgg_loss.py
├── metrics
│   ├── __init__.py
│   ├── psnr.py
│   └── ssim.py
├── net
│   ├── __init__.py
│   └── resnet.py
├── README.md
├── tools
│   ├── inference.py
│   └── train.py
└── utils
    ├── __init__.py
    ├── logger.py
    └── utils.py
  • Refactored file structure is easy to expand.
  • Net, data, loss, metrics are folders for network, data import, loss definition and metrics respectively.
  • Demo, utils folder is used to provide some visualization, helper function folder.
  • experiments/config will provide the parameter parse, you can modify the hyperparameter from this file.
  • The DPED_* folder is automatically generated for each experiment, which stores the checkpoint file and the visually enhanced patch image.
  • The experiments/logs/ folder saves the event files in the training and validation phases, which can be visualized by commands tensorboard --lodir logs.

First steps congfig

  • Download the pre-trained VGG-19 model and put it into pretrain_models/ folder.

    • It should use the weight provided by the author, when use the matconvnet offical weight it will get error.
  • Download DPED dataset (patches for CNN training) and extract it into dped/ folder.

    • This folder should contain three subolders: sony/, iphone/ and blackberry/
  • Modify the config configuration file in the experiments.

Tain

  • Use this command to trian the models python tools/train_baseline.py --<parameter>.

Test

Coming soon optimization

  • Training is to read a part of the data into the memory for training. Each training can re-import a part of the data and load it into the memory.
  • So each training load the data is very slow, next step will optimization.
  • Will use Moving average loss.

Some useful paper

  • Range Scaling Global U-Net for Perceptual Image Enhancement on Mobile Devices,Jie Huang, Pengfei Zhu, Mingrui Geng, Jiewen Ran, Xingguang Zhou, Chen Xing, Pengfei Wan, Xiangyang Ji.
  • TALEBI H, MILANFAR P. Learned perceptual image enhancement, 2018[C]. IEEE, 2018
  • CHOI J, KIM J, CHEON M, et al. Deep Learning-based Image Super-Resolution Considering Quantitative and Perceptual Quality[J]. 2018.
  • HUA W, XIA Y. Low-Light Image Enhancement Based on Joint Generative Adversarial Network and Image Quality Assessment, 2018[C]. IEEE, 2018.
  • End-to-End Blind Image Quality Assessment Using Deep Neural Networks .Kede Ma, Wentao Liu, Kai Zhang, Zhengfang Duanmu, Zhou Wang, and Wangmeng Zuo.IEEE Transactions on Image Processing (TIP), vol. 27, no. 3, pp. 1202-1213, Mar. 2018.
  • TALEBI H, MILANFAR P. NIMA: Neural Image Assessment[J]. IEEE Transactions on Image Processing, 2018,27(8):3998-4011.

Thanks