Command Palette
Search for a command to run...
Zhu Jun-Yan Park Taesung Isola Phillip Efros Alexei A.

摘要
图像到图像的转换是一类视觉与图形问题,其目标是利用成对的图像训练数据,学习从输入图像到输出图像的映射关系。然而,在许多任务中,成对的训练数据难以获取。本文提出一种在缺乏成对样本的情况下,将图像从源域 X 转换到目标域 Y 的方法。我们的目标是学习一个映射函数 G:X→Y,使得生成图像 G(X) 的分布与目标域 Y 的分布在对抗性损失的约束下无法区分。由于该映射本身高度欠约束,我们引入一个逆映射 F:Y→X,并通过循环一致性损失强制满足 F(G(X))≈X(反之亦然)。我们在多个不存在成对训练数据的任务上展示了定性结果,包括风格迁移、物体形态转换、季节转换、照片增强等。与多种先前方法的定量对比表明,本文提出的方法具有显著优势。
代码仓库
基准测试
| 基准 | 方法 | 指标 |
|---|---|---|
| image-to-image-translation-on-cityscapes | CycleGAN | Class IOU: 0.11 Per-class Accuracy: 17% Per-pixel Accuracy: 52% |
| image-to-image-translation-on-cityscapes-1 | CycleGAN | Class IOU: 0.16 Per-class Accuracy: 22% Per-pixel Accuracy: 58% |
| image-to-image-translation-on-horse2zebra | CycleGAN | Frechet Inception Distance: 89.7 Number of params: 28.2M |
| image-to-image-translation-on-photo2vangogh | CycleGAN | Frechet Inception Distance: 151.4 Number of params: 28.2M |
| image-to-image-translation-on-rafd | CycleGAN | Classification Error: 5.99% |
| image-to-image-translation-on-vangogh2photo | CycleGAN | Frechet Inception Distance: 163.4 Number of Params: 28.2M |
| image-to-image-translation-on-zebra2horse | CycleGAN | Frechet Inception Distance: 110.5 Number of params: 28.2M |
| multimodal-unsupervised-image-to-image | CycleGAN | CIS: 0.076 IS: 0.813 |
| multimodal-unsupervised-image-to-image-1 | CycleGAN | Diversity: 0.012 Quality: 40.8% |
| multimodal-unsupervised-image-to-image-2 | CycleGAN | Diversity: 0.010 Quality: 36.0% |
| multimodal-unsupervised-image-to-image-3 | cycGAN | PSNR: 17.38 |
| unsupervised-image-to-image-translation-on-1 | cycGAN | PSNR: 18.57 |