Linfeng Tan Li Niu* Jiangtong Li Liqing Zhang*

Abstract
Image harmonization is an essential step in image composition that adjuststhe appearance of composite foreground to address the inconsistency betweenforeground and background. Existing methods primarily operate in correlatedRGB color space, leading to entangled features and limited representationability. In contrast, decorrelated color space (e.g., Lab) has decorrelatedchannels that provide disentangled color and illumination statistics. In thispaper, we explore image harmonization in dual color spaces, which supplementsentangled RGB features with disentangled L, a, b features to alleviatethe workload in harmonization process. The network comprises a RGBharmonization backbone, an Lab encoding module, and an Lab control module.The backbone is a U-Net network translating composite image to harmonizedimage. Three encoders in Lab encoding module extract three control codesindependently from L, a, b channels, which are used to manipulate thedecoder features in harmonization backbone via Lab control module. Our codeand model are available at\href{https://github.com/bcmi/DucoNet-Image-Harmonization}{https://github.com/bcmi/DucoNet-Image-Harmonization}.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-harmonization-on-hadobe5k-1024-times | DucoNet | MSE: 10.94 PSNR: 41.37 SSIM: 0.9886 fMSE: 80.69 |
| image-harmonization-on-iharmony4 | DucoNet | MSE: 18.47 PSNR: 39.17 fMSE: 212.53 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.