PaperHub
6.3
/10
Rejected4 位审稿人
最低3最高8标准差2.0
3
8
6
8
4.5
置信度
ICLR 2024

Decomposition Ascribed Synergistic Learning for Unified Image Restoration

OpenReviewPDF
提交: 2023-09-23更新: 2024-02-11

摘要

关键词
Image RestorationDecompositionOrthogonalitySignal formation

评审与讨论

审稿意见
3

The paper introduces a new method called Decomposition Ascribed Synergistic Learning (DASL) for restoring multiple image degradations using a single model. DASL decomposes the image into singular vectors and singular values. The decomposition allows to learn the relationship between different restoration tasks and to optimize the degraded singular vectors and singular values. The method  comprises two operators: a Singular Vector Operator (SVEO) and Singular Value Operator (SVAO). The method can be integrated into existing convolutional image restoration models.  The idea is exemplified on on several image restoration tasks (image deraining, dehazing, denoising, deblurring, and low-light enhancement).

优点

  • The paper addresses an important problem: how to handle multiple image degradations with a single model.
  • The proposed ideas can be incorporated into existing convolutional models with minimal additional overhead.

缺点

  • The paper is poorly organized and presented.
  • The theoretical basis of the paper is unclear.
  • The experimental results are limited. In a practical setup, we observe a combination of degradations such as blur, noise, and compression (all together within the same observation). It is not clear how the decomposition works in this scenario (consider the degradation pipeline in RealESRGAN).

问题

In its current form, the presentation is poor, and the technical soundness and rigor are questionable. Many important aspects are left unanswered or poorly explained. The writing needs to be significantly improved before the paper can be fairly evaluated.

Here are some specific comments:

  • The abstract mentions singular value decomposition (SVD) but does not explain what is the entity that its decomposed. The paper later reveals that SVD is used to represent the input image as a matrix.
  • The paper discusses Fourier analysis, but these concepts are not well-explained and their connection to what the authors are proposing is not clear.
  • Many sentences, paragraphs, and sections are difficult to understand. For example, the sentence "Basically, one straightforward way to implement our idea is to directly perform the decomposition of latent high-dimensional tensor, and conduct the optimization of the decomposed singular vectors and singular value, respectively." is unclear and does not explain what the paper is proposing to do. Section 3.1, which is supposed to be an overview, is also unclear.
  • The first sentence of Section 3.2 states that "The singular vector operator is proposed to optimize the degraded singular vectors of the latent representation." This sentence is unclear because it does not specify what element is being decomposed or what the latent representation is.
  • The experimental results do not address simultaneous degradations, such as blur + noise + compression. It is not clear how the model would perform in such cases. Additionally, many of the tested degradations are synthetically simulated.
  • Equation (1) does not define the variable W.
  • Equation (4) appears to be a reconstruction loss that enforces the decomposition of the clean image to be close to the decomposition of the degraded image. However, it is not clear if this is the case. Additionally, it is not clear how this equation would be different if Fourier decomposition were used instead of SVD.
  • The paper states that training is done for 1200 epochs. This is an unusually long training in epochs, and it is not clear why it is necessary.

Overall, in its current form, its hard to fairly judge the paper contributions.

评论

Thank you so much for your valuable comments. Actually, the well presentation of the paper has been widely appreciated by other three reviewers, and the attractive and compelling insight has also been broadly acknowledged. We suggest the dear reviewer to reconsider the contribution of this paper. Your kind suggestions have been carefully incorporated in the revised version. In the following, we first revisit the main contribution of our paper and then followed by point-by-point responses, which hope address your concerns well.

Recall the main contribution of the paper.

[1] The observed SVD-ascribed degradation analysis presents that the decomposed singular vectors and singular values of the image naturally undertake the different types of degradation information, ascribing various restoration tasks into two groups, i.e., singular vector dominated degradations, such as rain, noise, blur, and singular value dominated degradations, such as low-light, haze.

[2] The intention of the proposed DASL is to support the decomposed optimization of singular vectors and singular values of the degraded signal, through two developed operators, i.e., SVEO and SVAO. Note that both of them decently evade the explicit time-consuming SVD on signals. Therefore, the proposed DASL can achieve superior performance with even accelerated inference time, compared to baseline models (Tab. 5 of the paper).

[3] SVEO only optimize the singular vectors of the signal without touching singular values. We perform this by the fact that the orthogonal matrices multiplication only impacts singular vectors (Theorem 1 in sec. 3.2).

[4] SVAO only optimize the singular values of the signal which is decoupled from the optimization of singular vectors. We perform this by resort to the Fast fourier transform (FFT) as the transformed fourier coefficients undertake the same role as singular values in terms of signal formation principle, i.e., the combined coefficients for a set of basis (detailed in sec. 3.3). Note that FFT is substantially faster than SVD (516x), as shown in Tab. 1 of the paper.

Briefly, the proposed decomposed optimization in DASL built upon SVD-ascribed degradation analysis could greatly benefit the multiple degradation processing within a single model, where the potential degradation relationship within ascribed groups and the conflict between groups could be inherently utilized and alleviated.

About SVD entity and how SVD represents the image.

The proposed SVD-ascribed degradation analysis is performed in image space, namely, the decomposed singular vectors and singular values of the image naturally undertake the different types of degradation information. In fact, SVD operation provides us X=UΣVTX=U\Sigma V^T for signal decomposition. In the opposite, we can also represent the original signal XX with decomposed singular vectors UU, VV, and singular values Σ\Sigma, which are two sides of SVD.

About Fourier analysis and how related to DASL.

The fourier transform has been widely applied in image processing for a long term [1] and is also prevailing in existing low-level vision investigation [2], which is supposed to be common sense. Basically, fourier analysis transforms the signal to the frequency space through Discrete Fourier Transform (DFT), and reverses back to the spatial space through Inverse Discrete Fourier Transform (IDFT). The most relevant form to our method, i.e., the Inverse Discrete Fourier Transform (IDFT) is presented in Eq. (3) of the paper, in comparison with the signal formation principle of SVD in Eq. 2, where both of them can be regarded as a weighted sum on a set of basis. And the transformed fourier coefficients G(u,v)G(u, v) in Eq. (3) and decomposed singular values σi\sigma_{i} in Eq. 2 undertake the same role, i.e., the combined coefficients for the set of basis. Therefore, the unattainable singular value optimization can be translated to the accessible and efficient fourier coefficients optimization, which is the main idea behind the SVAO.

[1] RC Gonzales, Digital image processing, Addison-Wesley Longman Publishing Co., Inc., 1987.

[2] Fourmer: An Efficient Global Modeling Paradigm for Image Restoration, ICML, 2023.

评论

Clarify some misunderstandings.

“One Straightforward way" is unclear and what the DASL supposed to do?

As presented in Recall [2], the intention of the proposed DASL is to support the decomposed optimization of singular vectors and singular values of the degraded signal, based on the SVD-ascribed degradation analysis that the decomposed singular vectors and singular values naturally undertake the different types of degradation information. Therefore, directly performing the SVD on degraded signals (i.e., the latent features in baseline model), and conducting the optimization of the decomposed singular vectors and singular values could implement our idea definitely. However, the explicit SVD on latent high-dimensional features of the baseline model is extremely time-consuming which is intolerable. And the intention of the proposed DASL is to support the decomposed optimization of singular vectors and singular values of the degraded signal without performing the explicit time-consuming SVD. Consequently, DASL could achieve superior performance with even accelerated inference time, compared to baseline models.

Unclear about Section 3.1 for overview.

Sec. 3.1 presents how we integrate the DASL into the existing baseline models and brings some insights about what SVEO and SVAO concentrated on, which is supposed to be an overview. What the “latent representation” is in the first sentence of sec. 3.2. As presented in sec. 3.1, DASL is integrated into the existing baseline model. And the latent representation here is referred to the intermediate features of the integrated baseline model. Details about how we integrate the DASL into existing baseline model is presented in sec. 3.1, and other model details is provided in Appendix B.

What element is being decomposed in the above sentence?

As presented, the latent representation is supposed to be decomposed optimized by SVEO.

The unclear variable WW in Eq. (1).

As presented, WW represents the weight matrix in SVEO that to be orthogonal regularized.

Concern about generality of the proposed method and application on hybrid image restoration.

We note that DASL is proposed for unified image restoration, which handles multiple image degradations within a single model. Extensive experiments on five common image restoration problems demonstrate the effectiveness of the proposed method, as shown in Tab. 2 of the paper.

Noteworthy, the intrinsic problem of unified image restoration is completely different from the hybrid image restoration, where the unified image restoration aims to excavate and exploit the degradation relationship externally among training samples, while hybrid image restoration aims to excavate the degradation relationship internally as it concentrates all degradations in single image. Therefore, the proposed DASL is proficient at external degradation relationship utilization based on proposed SVD-ascribed degradation analysis, where the internal degradation relationship excavation is out of the purpose of this work, and can be released for future investigations.

Despite this, we present the comparison results of DASL integration and baseline in real-world under-display camera (UDC) image restoration dataset below. Typically, images captured under UDC system suffer from blurring due to the point spread function, and lower light transmission rate, which can be viewed as hybrid distortion.

MethodTOLEDPOLED
PSNRSSIMLPIPSPSNRSSIMLPIPS
MPRNet24.690.7070.3478.340.3650.798
DGUNet19.670.6270.3848.880.3910.810
DASL+MPRNet25.550.7330.3268.850.3920.788
DASL+DGUNet25.250.7270.3299.800.4100.783

It is worth noting that DASL is also capable of boosting the performance on real-world hybrid image restoration settings, experimentally, which validate the generality of the proposed method.

Delightfully, we also validate the generality of the proposed SVD-ascribed degradation analysis, where more degradation analysis including downsampling, compression, sharpness, underwater enhancement, and sandstorm enhancement have been presented in Appendix F, which ensures the stable foundation of the proposed method.

评论

Concern about the signal order in decomposition loss and what if Fourier decomposition used?

The decomposition regularization of Eq. (4) concentrates on whether the corrupted components, e.g., singular vectors and singular values, have been recovered according to the clean counterparts. Note that there exists the absolute operation in L1 loss, and the signal order of the reconstruction and clean is irrelevant. By the way, the update gradients are only backpropagate to the input degraded image of the model rather than clean image, and there is no reason that the clean signal will be pushed close to the degraded signal.

Note that the proposed DASL is based on SVD-ascribed degradation analysis, and the developed two operators, SVEO and SVAO are concentrated on decomposed optimization of singular vectors and singular values. The decomposition loss of Eq. (4) is to assist such decomposed optimization congruously, and the utilization of fourier decomposition will break the SVD framework of the DASL, which is unreasonable and harmful. We present the comparison experiments below, where Lfdec L_{fdec} denotes the fourier decomposition regularization.

FormRain100LBSD68GoProSOTSLOLAverage
LfdecL_{fdec}37.9331.4626.8525.6820.9628.58
LdecL_{dec}38.0231.5726.9125.8220.9628.66

It worth noting that despite the SVAO resorts to the Fourier transform with approximation for efficiency, the overall framework of DASL is still SVD based, considering the SVEO and the foundation of DASL, i.e., SVD-ascribed degradation analysis.

Concern about the training epochs.

The training of DASL takes 1200 epochs with around 4.8×1054.8\times10^{5} iterations. For comparison, AirNet [3] takes 1400 epochs for training, IDR [4] takes 1200 epochs for training, and MPRNet [5] takes 4×1054\times10^{5} iterations for training. It is worth noting that all comparison methods in Tab. 2 have been retrained with the same epochs as DASL on mixed dataset for fair comparison, and there are no benefits come from the training epochs.

[3] Li et al. All-in-one image restoration for unknown corruption, CVPR, 2022.

[4] Zhang et al. Ingredient-oriented Multi-Degradation Learning for Image Restoration, CVPR, 2023.

[5] Zamir et al. Multi-Stage Progressive Image Restoration, CVPR, 2021.

We hope our response address your concerns well, and we are expecting the dear reviewer to reconsider the contribution of this paper.

Thank you again for your time and valuable comments.

审稿意见
8

This paper mainly proposes Decomposition Ascribed Synergistic Learning (DASL) for multi-degradation removing task. This work conducts Singular Value Decomposition (SVD) to take different types of degradation information, such as rain, blur, noise, haze, and low-light. The proposed SVEO can optimize the degraded singular vectors of the latent representation, while another SVAO approximates inherently inaccessible singular values by moving to Inver Discrete Fourier Transform operation. With the proposed components, DASL outperformed state-of-the-art image restoration networks, in all-in-one manner.

优点

[S1] The motivation in the introduction is well-written and compelling. In particular, the visual results of SVD analysis are very intuitive. The observation of different properties depending on distortion types by employing SVD for solving multiple image restoration is interesting.

[S2] The correct proof for Theorems, which underlies the proposed SVEO and SVAO, is thoroughly provided. It makes this paper both theoretically and empirically sound.

[S3] The way to approximate the inaccessible singular values using IDFT is an interesting point. Thanks to moving to familiar Fourier domain, it becomes easier to optimize the networks in terms of both time complexity and network size.

[S4] The developed decomposition loss is technically sound, which is empirically proven to be effective.

[S5] Multi-degradation removal task is improved by the proposed DASL when compared to not only the baselines but also other SOTA methos. The performance gains are significant and can contribute to this domain.

缺点

[W1] In some cases, such as MPRNet on deraining and DGUNet on deblurring/low-light enhancement, there are some inferiorities than baselines, while average scores show obvious performance gain on multiple tasks. If evaluation results on other benchmarks of the above tasks are provided and support the effectiveness of DASL, it will make this paper more compelling.

问题

[Q1] Why did you define decomposition loss as Eq.(4)? I think the terms of this loss function can be divided into three terms like UrecUcle1\lVert U_{rec} - U_{cle} \rVert_1 + VrecVcle1\lVert V_{rec} - V_{cle} \rVert_1 + ΣrecΣcle1\lVert \Sigma_{rec} - \Sigma_{cle} \rVert_1. Is there any flaw in the above case, or did you conduct other experiments on decomposition loss variants?

[Q2] In Tab.2, the baselines chosen are CNN-based models. However, other networks, such as SwinIR and Restormer, are Transformer-based methods. Could DASL mechanism be applied to Transformer-based image restoration methods? Or is it limited to only CNN-based model?

[Q3] How could you get the results of other general image restoration methods presented in Tabs.2,3? Did you train all the models on multiple-degradation restoration settings? Or did you get them from other literatures?

评论

Thank you so much for your constructive comments and positive feedback, and we are really encouraged from that. Your suggestions have been carefully incorporated in the revised paper. In the following, we hope our point-by-point responses address your concerns well.

Performance on other benchmarks.

Thanks for pointing that out. Basically, considering the decomposed optimization of DASL for respective singular vector/value dominated degradations, the optimal intra-group optimization pattern will be favored in developed operators, i.e., SVEO and SVAO, for overall performance improvement, which may slightly decline the baseline in few cases. To further verify the effectiveness of DASL, we provide more comparison results of DASL integration and vanilla baselines below.

MethodRain100LBSD68GoProSOTSLOLAverageParams
NAFNet35.5631.0226.5325.2320.4927.7617.11M
MIRNetV233.8930.9726.3024.0321.5227.345.86M
DASL+NAFNet35.9231.2726.9225.5020.6328.0516.84M
DASL+MIRNetV233.9831.1626.5224.2921.6427.525.71M

The metric is reported as PSNR, where DASL brings consistent performance gain on all tasks with reduced model complexity, further validating the effectiveness of the proposed method.

Concern about the form of the decomposition loss.

We present the comparison experiments below, where LTdecL_{Tdec} denotes the above three terms formulation in [Q1], and LdecL_{dec} denotes the form of Eq. (4) in the paper.

FormRain100LBSD68GoProSOTSLOLAverage
LTdecL_{Tdec}32.3529.4625.0323.8819.7726.10
LdecL_{dec}38.0231.5726.9125.8220.9628.66

Experimentally, LdecL_{dec} is more appealing than LTdecL_{Tdec} for decomposition regularization. Particularly, considering that any signal can be regarded as a weighted sum on a set of basis from the SVD perspective, i.e., X=UΣVT=i=1kσiuiviT X=U\Sigma V^T=\sum_{i=1}^{k}\sigma_{i}u_{i}v_{i}^T. The SVD-ascribed degradation analysis proposed in DASL revealed that different types of degradation information primarily distribute in different portion of the signal, i.e., singular vectors (base components i=1kuiviT\cup_{i=1}^k\\{u_{i}v_{i}^T\\}) and singular values (combined coefficients i=1kσi\cup_{i=1}^k\\{\sigma_{i}\\}). Therefore, the regularization on base components in forms of LdecL_{dec} is sufficient, as we merely concentrate on whether the corrupted based components have been recovered regardless of how it is composed. While LTdecL_{Tdec} further delve into the base components is unnecessary which may harm the performance with undesirable strengthened regularization.

Could DASL be applied to transformer-based restoration methods.

We present comparison results of DASL integration and baseline in transformer-based methods below, including SwinIR and Restormer.

MethodRain100LBSD68GoProSOTSLOLAverageParams
SwinIR30.7830.5924.5221.5017.8125.040.91M
Restormer34.8131.4927.2224.0920.4127.6026.13M
DASL+SwinIR33.5330.8425.7224.1020.3626.910.88M
DASL+Restormer35.7931.6727.3525.9021.3928.4215.06M

It is worth noting that DASL is not only favorable for CNN-based models, but also applicable to transformer-based methods. The concerned architecture incompatibility problem is not come to be an obstacle. Note that we replace the projection layer at the end of the attention mechanism with developed decomposed operators for transformer-based methods. Thanks so much for pointing that out, which makes improvement to our paper.

How to get the results presented in Tabs.2, 3?

All comparison methods in Tab. 2 have been retrained on the same mixed dataset as DASL with their default settings for fair comparison. Results in Tab. 3 are derived from the models in Tab. 2 by simply testing on image denoising datasets across different noise ratio.

We hope our response address your concerns well, and we are delightful to receive feedbacks if there exist any questions.

Thank you so much again for your precious time and valuable comments.

评论

The authors sincerely addressed my concerns. The supplemented experiments have shown significance of the proposed method. In particular, DASL+[NAFNet, MIRNetV2] notably improved the performance on every task. Furthermore, the proposed DASL could significantly enhance Transformer-based networks, which shows larger gains than those of CNN-based MPRNet, DGUNet, and AirNet. The result that the proposed DASL is not limited to CNN-based models but extended to recent Transformer-based methods makes this paper more effective. At last, the proposed loss function was demonstrated more sound than other design. I believe that the tables reported in this rebuttal must be included in the revised paper, which will make this paper more contribute to the research field.

Therefore, I will maintain my first rating (8: accept, good paper), and thank for your magnificent research. Good luck.

审稿意见
6

This work proposes an interesting method for unified image restoration via SVD based on the observation that some degradations are related to SVD values and some are to SVD vectors. These observations led to the proposed DASL, with SVEO and SVAO. The proposed method was demonstrated in multiple image restoration tasks, yielding state-of-the-art performance.

优点

SVD based analysis for multiple degradations looks neat and novel. Great performance in multiple image restoration is promising.

缺点

SVD is known to be slow, so the practicality of the proposed method may not be strong. Computation time must be reported for fair comparisons in the benchmarks. According to the analysis, the dehaze does not seem to work well with singular values and vectors (see Fig 2 (a)). The degradation model using a linear SVD may be too simple to capture complex degradations that were not investigated in this work.

问题

Q1. There are a number of works related to unified image restoration this year, but there was no discussion in terms of recent works as follows. Please discuss and clarify to ensure the novelty and superiority of the proposed method over them (I will not put any work from ICCV 2023 since it was after the submission, but comparing with CVPR 2023 works seems reasonable):

  • Z Yang et al., Visual Recognition-Driven Image Restoration for Multiple Degradation with Intrinsic Semantics Recovery, CVPR 2023
  • J Zhang et al., Ingredient-oriented Multi-Degradation Learning for Image Restoration, CVPR 2023
  • D Park et al., All-in-one Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations, CVPR 2023 It will be great if the proposed method somehow outperform some of these, which will make this work stronger.

Q2. In fact, SVD has been used in multiple image restoration works a long time ago, so relating the proposed method with them will be beneficial (and properly acknowledging them seems important).

Q3. Please discuss the computation complexity and its practicality in terms of computation over other prior works. While employing SVD was an interesting idea, its practical computation issue may make this work obsolete.

Q4. It is unclear if most degradations will fall into one of the two proposed methods - see Fig. 2(a) : dehaze yielded similar error for both singular vector and singular value, which may imply that the proposed method may not capture other degradations well including haze.

评论

Relating the proposed method with recent unified image restoration methods.

Thanks very much for pointing that out which makes improvement to our paper.

VRD-IR [1] concentrated on low-for-high problem and proposed to align various degradations in semantic space for consequent recognition-driven enhancement, which is originally different from our method in spirit. Methodologically, they handle diverse degradations through forced normalization for potential conflicts, compared to our natural SVD-based degradation analysis.

IDR [2] and ADMS [3] concentrated on unified image restoration with diametrically opposite perspective. IDR proposed to correlate various degradations through underlying degradation ingredients, while ADMS advocated to separate the diverse degradations processing with specific attributed discriminative filters. Note that multiple tasks-specific models have to be trained in ADMS for filter attribution, which is excessively inconvenient.

We present the comparison results with IDR below, which is the most relevant to our method. Compared to DASL, IDR correlates various degradations from microcosmic perspective with underlying degradation ingredients, such as directionality and unnatural image layering. While DASL performs the correlation from macroscopic perspective through SVD-based degradation analysis, ascribing various degradations into two groups. Despite the parallel motivation, the SVD-based analysis may benefit more other low-level vision problems for its generality, and the experimental comparisons are presented below.

MethodRain100LBSD68GoProSOTSLOLAverageParams
IDR35.6331.6027.8725.2421.3428.3415.34M
DASL35.7931.6727.3525.9021.3928.4215.06M

The metric is reported as PSNR, where our DASL is integrated with Restormer baseline which is the same as IDR for fair comparison, and the superior performance verifying the effectiveness of the proposed method.

The above discussion and comparison results have been accordingly incorporated into the revised version.

Relating the proposed method with previous SVD-based image restoration methods.

Thanks so much for bringing our attention to previous SVD-based image restoration works.

Generally, SVD has been widely applied for a range of restoration tasks, such as image denoising, image compression, etc., attributing to the attractive rank properties [1] including truncated energy maximization and orthogonal subspaces projection. Particularly, the former takes the fact that SVD provides the optima low rank approximation of the signal in terms of dominant energy preservation, which greatly benefiting the signal compression. The latter exploits the fact that the separate order of SVD decomposed components are orthogonal, which inherently partition the signal into independent rank space, e.g., signal / noise space or range / null space for further manipulation, supporting the application of image denoising or even prevailing inverse problem solvers [2]. We acknowledge the great efforts the pioneer works made in cultivating the SVD potential for restoration problems.

Apart from above rank-based SVD methods, SVD-based degradation analysis proposed in DASL excavates another promising property that SVD possessed from the vector-value perspective, which is essentially different and has never been explored. Encouragingly, the above two SVD perspectives have the potential to collaborate well and the separate order properties are supposed to be incorporated into the DASL for sophisticated degradation relationship investigation in future works, as presented in sec. 4.4.

We hope our response address your concerns well, and we are delightful to receive feedbacks if there exist any questions.

Thank you again for your time and constructive comments.

[1] Yang et al. Visual Recognition-Driven Image Restoration for Multiple Degradation with Intrinsic Semantics Recovery, CVPR 2023

[2] Zhang et al. Ingredient-oriented Multi-Degradation Learning for Image Restoration, CVPR 2023

[3] Park et al. All-in-one Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations, CVPR 2023

[4] R. Sadek. SVD Based Image Processing Applications: State of The Art, Contributions and Research Challenges, IJACSA, 2012.

[5] Wang et al. Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model, ICLR, 2023.

评论

Thank you so much for your valuable comments, and we really appreciate that “SVD based analysis for multiple degradations looks neat and novel”. Your suggestions have been carefully incorporated in the revised paper. In the following, we hope our point-by-point responses address your concerns.

Computation complexity. The intension of the proposed DASL is to utilize the property of “SVD based degradations analysis” without explicitly performing the time-consuming SVD. We present the computation overhead of the proposed DASL below, which is also provided in Tab. 5 of the paper. Note that DASL achieves advanced performance compared to incorporated baselines with considerable inference acceleration, e.g.12.86% accelerated on MPRNet and 58.61% accelerated on AirNet.

MethodParams (M)FLOPs (B)Inference Time (s)
MPRNet15.74 / 15.155575.32 / 2905.140.241 / 0.210
DGUNet17.33 / 16.923463.66 / 3020.220.397 / 0.391
AirNet8.93 / 5.411205.09 / 767.890.459 / 0.190

(The metrics are reported as baseline / DASL integrated for comparison.)

Tips: Why faster and how to sidestep the time-consuming SVD?

DASL comprises two lightweight operators, i.e., SVEO and SVAO, to replace the basic calculation units of the baseline network for decomposed optimization of singular vectors and singular values, while both of them decently evade the time-consuming SVD. Conceptually, SVEO takes the fact that the simple orthogonal matrices multiplication can impact singular vectors of the signal without touching singular values (Theorem 1 in sec. 3.2). SVAO resorts to the Fast fourier transform (FFT) as the transformed fourier coefficients undertake the same role as singular values in terms of signal formation principle (detailed in sec. 3.3). Albeit efficient compared to SVD (516x faster as shown in Tab. 1 of the paper), the consequent overhead of FFT is also been considered, and we adopt the SVAO merely in bottleneck layers of the backbone considering the examined global statistical properties of singular values. Therefore, the decomposed optimization in DASL can be performed decently with efficiency.

Details about the integration layout of DASL is presented in sec. 3.1, and the unit reformulation strategy is provided in Appendix B.

Whether more degradations conform to the proposed SVD-based analysis, and the concern about dehazing.

We provide more degradation analysis in Appendix F to verify the generality of the proposed SVD-based analysis, including downsampling, compression, sharpness, underwater enhancement, and sandstorm enhancement. While the former three types are ascribed into singular vector dominated degradations and the latter two types are ascribed into singular value dominated degradations. Together with five deteriorations presented previously, SVD-based degradation analysis has been successfully applied to ten types of degradation!

Tips: A deeper understanding of the proposed SVD-based degradation analysis.

Experimentally, if we reexamine the two groups of degradation ascribed by SVD-based analysis, namely, rain, noise, blur, downsampling, compression, sharpness in singular vector dominated and hazy, low-light, underwater, sandstorm in singular value dominated, it can be concluded that the singular vectors responsible for the spatial content information, while the singular values represent the global statistical properties of the image. Note that degradations ascribed by such analysis, i.e., content corruption and global corruption, inevitably cover most of scenes.

Theoretically, the above conjecture can also be derived from the signal formation principle of SVD, where any signal can be regarded as a weighted sum on a set of basis, i.e., X=USigmaVT=sumi=1ksigmaiuiviTX=U\\Sigma V^T=\\sum_{i=1}^{k}\\sigma_{i}u_{i}v_{i}^T. From this perspective, the singular vectors i=1kuiviT\cup_{i=1}^{k}\\{u_iv_i^T\\} represent the base components of the signal for content composition, and singular values i=1kσi\cup_{i=1}^{k}\\{\sigma_i\\} represent the combined coefficients for statistical properties. We are confident that most degradations will conform to the proposed SVD-based analysis in virtue of the closed form of signal formation principle. The above discussion have been accordingly incorporated into the revised version.

Dehaze case: The MSE reconstruction error in Fig. 2 (a) may not suitable for the dehaze due to the sensitivity to global corruption, which may yield both large error for singular vectors and singular values regardless of visual effect. However, the disparate singular value distribution in Fig. 2 (b) and the lowest singular vector difference in Fig. 2 (c) indicate that the dehaze is definitely ascribed into singular value dominated degradation. The visual analysis of the dehazing in Fig. 1 and Appendix F are more intuitive and evident, validating that SVD-based degradation analysis capturing the dehaze well, and support the generality of the proposed method.

审稿意见
8

In this paper, authors propose a novel multiple degradation restoration methods by exploiting the relationship between different degradations. Specifically, authors observe that the decomposed singular vectors and singular values naturally undertake the different types of degradation information, which dividing various restoration tasks into two groups, i.e., singular vector dominated and singular value dominated. Based on above observation, the Decomposition Ascribed Synergistic Learning (DASL) is proposed to optimize aforementioned components and then integrate them into existing convolutional image restoration backbone to achieve image restorations. In the experimental part, the proposed method outperforms other baselines in both quantitative and qualitative comparisons, which verifies its effectiveness.

优点

There are several strengths here:

  1. The basic idea of the paper, i.e., leveraging the relationship between different degradations, is attractive and meaningful, which could give more insightful obversations from it.
  2. The observation of the paper is elegant and easy to follow, which may inspire other works.
  3. The paper is well-writing and origanized.

缺点

There are several concerns here:

  1. As shown in Table 6, adopting different components, i.e., SVAO, SVEO and L_dec respectively, will benefit different restoration tasks. For instance, SVEO will boost the performance on deblurring and SVAO will boost the low-light enhancement. I am wondering the reason about it. It would be better to give more discussions about it.
  2. I am wondering the reason why the proposed DASL could reduce the computations of baseline model in Table 5. In the common sense, integrating extra modules into existing network would introduce extra computations. It would be better to give more details information about it.
  3. Will the proposed methods sensitive to the hyper-parameters, e.g., \lambda_orth and \lambda_dec, which will influence the reproducibility of the model?
  4. Partial figures are needed to be improved. For instance, unexpected lines are shown in the bottom of Figure 2. It would be better to improve the figures in the paper.

问题

More detailed could be referred to Weakness.

伦理问题详情

This paper is not related to the ethics concerns.

评论

Thank you so much for your valuable comments and encouraging feedback, and we really appreciate that. Your suggestions have been carefully incorporated in the revised paper. In the following, we hope our point-by-point responses address your concerns well.

A Deeper understanding of SVEO and SVAO.

Recall that the SVD ascribed degradation analysis presents that the decomposed singular vectors and singular values naturally undertake the different types of degradation information. In this way, the intention of the proposed DASL is to support the decomposed optimization of singular vectors and singular values of the degraded signal, through developed operators, i.e., SVEO and SVAO. Particularly, SVEO merely optimizes singular vectors and is decoupled from the optimization of singular values, while SVAO operates in the opposite way. Without interference from the degradation-irrelevant portion of the signal, the decomposed operators, i.e., SVEO and SVAO, are capable of concentrating more on degradation processing, which interprets that SVEO is capable of boosting the performance on singular vector dominated degradations, including deblurring, and SVAO is capable of boosting the performance on singular value dominated degradations, including low-light image enhancement.

Tips: What the SVEO and SVAO truly focused on?

If we reexamine the singular vector dominated degradations, including rain, noise, blur, and singular value dominated degradations, including hazy, low-light, it can be conjectured that the singular vectors responsible for the spatial content information and texture details, and the singular values represent the global statistical properties of the image. The above conjecture can also be derived from the signal formation principle of SVD in theory, where singular vectors rerpesent the base components of the signal and singular values rerpesent the combined coefficients, as show in Eq. (2) of the paper. Therefore, SVEO is concentrated on the content corruption through the orthogonal regularized convolution for impacting singular vectors (Theorem 1 in sec. 3.2), while SVAO is concentrated on the global statistical corruption through approximated fourier coefficients for affecting singular values (homogeneous signal formation principle in sec. 3.3). From this point, the decomposed operators are developed precisely for respective singular vector/value dominated degradations.

Why DASL could reduce the computations of baseline model?

Thanks for your question. Actually, DASL replaces the basic calculation units of the baseline model with developed lightweight decomposed operators, instead of introducing extra modules into baseline model. Basically, we substitute half of the convolution layers in baseline model with SVEO, and only perform SVAO at the bottleneck layers of the backbone network, considering the global statistical properties of singular values and non-negligible overhead of Fast Fourier Transform (FFT). We present the integration layout of DASL in sec. 3.1, and the detailed unit reformulation strategy is provided in Appendix B.

Whether the proposed method sensitive to the hyper-parameters?

Faithfully, the proposed DASL is supposed to be sensitive to the involved two hyper-parameters, i.e., λorth\lambda_{orth} and λdec\lambda_{dec}, due to the large number of LorthL_{orth} and LdecL_{dec}. Concretely, LorthL_{orth} is related to tremendous network parameters for orthogonal regularization, and LdecL_{dec} is related to the decomposition space error, especially the huge discrepancy in singular values.

We present the ablation experiments for hyper-parameters λorth\lambda_{orth} and λdec\lambda_{dec} below. The metrics are reported on average of all tasks with MPRNet baseline.

λorth\lambda_{orth}λdec\lambda_{dec}PSNRSSIM
1e-30.127.930.884
1e-40.128.660.893
1e-50.128.240.889
1e-4128.560.892
1e-40.0128.320.890

Note that the default setting is marked in bold. We further present the ablation experiment of hyper-parameters with DGUNet baseline below.

λorth\lambda_{orth}λdec\lambda_{dec}PSNRSSIM
1e-30.128.060.885
1e-40.128.510.892
1e-50.128.280.888
1e-4128.460.890
1e-40.0128.350.889

Comfortingly, the hyper-parameters in DASL are agonistic to the baseline model, which exhibit the similar effect tendency as variation. Therefore, it is convenient to utilize the same hyper-parameter setting for various baseline models without cumbersome selection. And it is suggested that the DASL is best used with recommended hyper-parameters for promising performance.

Thanks so much for pointing this out.

The figure of the paper has been accordingly revised. We hope our response address your concerns well, and we are delightful to receive feedbacks if there exist any questions.

Thank you so much again for your time and constructive comments.

AC 元评审

Overall, there is a significant divergence in the final ratings for this work. Three reviewers have decided to accept it, while the other one reviewer explicitly stating rejection. After carefully reviewing the feedback from the reviewers and the rebuttals provided by the authors, while three reviewers acknowledge the contributions of the work and give positive ratings, the concerns raised by the other one reviewer also cannot be ignored. Reviewer 5Pyv is particularly concerned about the lack of methodological explanation in the work, and even after the authors' response, this reviewer indicates that their concerns are not adequately addressed, raising further questions about the effectiveness of the proposed method.

In summary, although the contributions of the work are acknowledged by some reviewers, the concerns raised by the remaining reviewers are substantial. The responses from the authors, unfortunately, seem to fall short of completely alleviating the reviewers' concerns, indicating room for improvement. Considering these factors, it appears that the work may not fully meet the standards for ICLR 2024, and I decide to reject it.

为何不给更高分

Please refer to the metareview.

为何不给更低分

N/A

最终决定

Reject