A customized low-rank prior model for structured cartoon–texture image decomposition

doi:10.1016/j.image.2021.116308

Signal Processing: Image Communication

Volume 96, August 2021, 116308

https://doi.org/10.1016/j.image.2021.116308

Abstract

The mathematical characterization of the texture component plays an instrumental role in image decomposition. In this paper, we study a cartoon–texture image decomposition model based on a low-rank texture prior, which uses a total variation norm and a global nuclear norm to characterize the cartoon and texture components, respectively. The model is not only extremely simple, but also works very well for globally well-patterned images, in the sense that it can recover cleaner texture (or details) than other recent models. Moreover, the model can be easily reformulated as a separable convex optimization problem, whose splitting structure allows us to employ a partially parallel splitting method (PPSM) to solve it efficiently. A series of numerical experiments on image restoration demonstrate that PPSM can recover slightly higher quality images than some existing algorithms, while taking fewer iterations or less computing time in many cases.

Introduction

Image decomposition is one of the most important problems in the era of artificial intelligence, since it has widespread applications in image restoration, biomedical engineering, astronomical imaging, pattern recognition, and computer vision, e.g., [1], [2], [3], to name just a few. Generally, image decomposition refers to the task of extracting two meaningful components from a given image: the cartoon component, which represents the piecewise-smooth part carrying the global structural and geometrical information, and the texture component, which is a collection of locally-patterned oscillating information. Mathematically, for a given noise-free image $b \in R^{m \times n}$ (note that a two-dimensional image can also be stacked as a column vector, e.g., in lexicographic order), the goal of image decomposition is to find the cartoon component $u \in R^{m \times n}$ and the texture component $v \in R^{m \times n}$ such that $b = u + v. \quad (1)$ Essentially, problem (1) is an underdetermined linear system since the number of unknowns is larger than the number of equations, which makes it theoretically challenging to identify the “ground truths” of cartoon $u$ and texture $v$ among the infinitely many solutions of (1). Certainly, such a problem can be solvable under mild prerequisites when favorable prior information, e.g., sparsity and compressibility, is attached to both the cartoon and texture parts (e.g., see [2]). In the literature, there are two popular types of approaches for image decomposition. The first type is the PDE-based method (e.g., [4]), which uses the well-known total variation (TV) norm to characterize the cartoon and exploits a special functional norm to extract the texture component from an image. The other type is the wavelet-based method (e.g., [5], [6], [7]), which employs transformation operators to transfer a real image into wavelet domains such that the two components can be efficiently extracted by sparse approximations under some tight frame systems. Both types of approaches usually model image decomposition as a convex optimization problem, in which the cartoon and texture components are characterized by appropriate convex priors, e.g., TV and nuclear norms.

One of the most popular optimization models for image decomposition is the so-named G-norm based model originally proposed by Meyer [3], where a TV norm is used to induce the cartoon $u$ and a negative semi-norm in Sobolev space, i.e., the G-norm, is used to promote the texture $v$. Although the G-norm is theoretically elegant for characterizing the texture of a noise-free image, solving G-norm based optimization models is often not an easy task due to the complicated structure of the negative semi-norm. To circumvent the difficulty caused by the G-norm, Vese and Osher [8] introduced a surrogate for the G-norm. In many real-world applications, however, the observed images are degraded by noise or incomplete information (missing pixels). Correspondingly, cartoon–texture based image restoration can be modeled as a generalization of (1), i.e., $b_{0} = Φ (u + v) + ε, \quad (2)$ where $b_{0}$ is an observed image, $Φ$ is a linear degradation operator, and $ε$ is additive white noise with known variance. Generally speaking, the generic problem (2) is ill-posed due to the presence of the degradation matrix $Φ$. Inspired by the work of [8], Ng et al. [9] introduced a structured optimization model to handle (2) so that the resulting problem can be easily solved by multiple-block splitting methods. However, as pointed out in [10], the G-norm is not a perfect regularizer for discriminating texture from noise, especially in the case of small magnitude (but well-patterned) texture. To handle the case where texture does not have a relatively large magnitude but is well-patterned, Schaeffer and Osher [10] proposed a low patch-rank (LPR) model, where the texture component is modeled as an alignment of patches. A key property of these patches is that they are almost linearly dependent, which means that the whole collection of texture patches should be low rank.
Numerically, the LPR model can be efficiently solved by the split Bregman method [10] and the partial splitting augmented Lagrangian method [11]. Moreover, such a model has been shown to be superior to other existing models in terms of extracting ideal texture from an image with well-patterned texture. However, the LPR model uses the nuclear norm to capture the low-rankness of the patch-vectors globally, i.e., the whole texture is optimized simultaneously at each iteration, and thus reflects a global feature of the texture. As a result, the LPR model seems less suitable for image decomposition when images contain several different texture patterns. To overcome this drawback of the LPR model, Ono et al. [12] proposed a block-wise low-rank model to characterize texture that is globally dissimilar but locally well-patterned. The core idea of the model proposed in [12] is the use of the so-named block nuclear norm (BNN) to characterize local blocks of the texture component. Computational results reported in [12] demonstrated that their model performs better than the LPR model when images have locally block-wise well-patterned texture. However, their model looks relatively complicated due to the presence of many block-wise nuclear norms characterizing sub-texture components. On the other hand, although the BNN model for gray images can be efficiently solved by the state-of-the-art alternating direction method of multipliers (ADMM), it seems a little difficult to implement on color images, which is also left as future work in [12].

Globally well-patterned textures often appear in real-world images arising from areas such as petrography, lumber processing, tiles, wallpaper and printed circuit boards, e.g., see [13], [14], [15] and references therein. When applying the LPR model to these images, the underlying patch operator yields one more equality constraint to guarantee that the proximity operator of the nuclear norm can be efficiently utilized (see [11]). In this situation, augmented Lagrangian-based methods, e.g., [11], may take more time to decompose large-scale images since they require more storage (or memory) for Lagrangian multipliers and auxiliary variables. On the other hand, applying the BNN model to globally well-patterned images, especially color images, results in many superfluous and expensive singular value decompositions (SVDs) for the block-wise nuclear norms. Therefore, a natural question is whether we can find a simple model that characterizes the globally patterned texture structure and that can, at the same time, be efficiently solved by an easily implementable algorithm.
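The proximity operator of the nuclear norm mentioned above has a well-known closed form, namely soft-thresholding of the singular values, which is why each evaluation costs one SVD. The sketch below, in Python with NumPy, illustrates this; the function name `svt`, the threshold value, and the synthetic rank-2 test matrix are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def svt(X, tau):
    """Proximity operator of tau * (nuclear norm) at X.

    Computes one SVD and soft-thresholds the singular values by tau.
    """
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)
    return (U * s_shrunk) @ Vt

# Hypothetical data: a rank-2 matrix plus a small perturbation.
rng = np.random.default_rng(1)
low_rank = rng.standard_normal((40, 2)) @ rng.standard_normal((2, 30))
X = low_rank + 0.01 * rng.standard_normal((40, 30))

Y = svt(X, tau=1.0)

s_in = np.linalg.svd(X, compute_uv=False)
s_out = np.linalg.svd(Y, compute_uv=False)
rank_after = np.linalg.matrix_rank(Y, tol=1e-8)
print(rank_after, s_in[:3], s_out[:3])
```

Because the small singular values contributed by the perturbation fall below the threshold, they are set to zero and the output is low rank. Applying such an operator once to the whole texture, rather than once per block, is the kind of saving in SVD computations that the question above is concerned with.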

Noticing that the globally well-patterned texture component has the low-rank property shown in [10], in this paper we propose a simpler but effective image decomposition model, which directly employs the nuclear norm and the TV norm to induce the texture and cartoon components of a given image, respectively. In what follows, we call the proposed model the customized low-rank prior (CLRP) model to distinguish it from the aforementioned models. Compared with the models in [10], [12], the proposed CLRP model is simpler, yet retains the ability to extract or recover high-quality cartoon and texture components from a degraded image, which is of benefit to imaging engineers. Another contribution of this paper is a structured reformulation of the CLRP model so that it can be solved efficiently by the partially parallel splitting method (PPSM) [16], which is globally convergent and can be parallelized for large-scale image decomposition problems. In this algorithmic framework, it is possible to make use of parallel computing devices, e.g., GPUs, for acceleration. A series of computational results demonstrate that our extremely simple model equipped with a customized PPSM works well on cartoon–texture image decomposition, especially when images have globally well-patterned texture. Moreover, the CLRP model performs well on color image restoration problems. All the demo codes of our approach can be downloaded from the website: https://github.com/Zhiyuan-Zhang510zg/CLRP.
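To make the roles of the two priors concrete, the following sketch evaluates a discrete TV norm and a nuclear norm on a synthetic cartoon and a synthetic globally patterned texture. It is an illustration under stated assumptions, not the authors' implementation: the anisotropic forward-difference discretization of TV, the function names `tv_norm` and `nuclear_norm`, and both test images are hypothetical choices.

```python
import numpy as np

def tv_norm(u):
    """Anisotropic discrete TV: sum of absolute forward differences."""
    dx = np.diff(u, axis=1)
    dy = np.diff(u, axis=0)
    return np.abs(dx).sum() + np.abs(dy).sum()

def nuclear_norm(v):
    """Sum of the singular values of the image viewed as a matrix."""
    return np.linalg.svd(v, compute_uv=False).sum()

# Hypothetical 64x64 test images (not taken from the paper).
yy, xx = np.mgrid[0:64, 0:64]
# Cartoon: a filled disk, piecewise constant with a single closed edge.
cartoon = ((xx - 32) ** 2 + (yy - 32) ** 2 <= 16 ** 2).astype(float)
# Texture: a small-amplitude pattern repeated over the whole image.
texture = 0.2 * np.sin(2 * np.pi * xx / 4.0) * np.sin(2 * np.pi * yy / 4.0)

tv_cartoon, tv_texture = tv_norm(cartoon), tv_norm(texture)
nn_cartoon, nn_texture = nuclear_norm(cartoon), nuclear_norm(texture)
rank_texture = np.linalg.matrix_rank(texture, tol=1e-8)

print("TV      cartoon / texture:", tv_cartoon, tv_texture)
print("nuclear cartoon / texture:", nn_cartoon, nn_texture)
print("numerical rank of texture:", rank_texture)
```

On these images the cartoon has the smaller TV norm and the texture has the smaller nuclear norm, which is the behavior a TV-plus-nuclear-norm model relies on when it assigns edges to $u$ and globally repeated patterns to $v$.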

The remainder of this paper is organized as follows. In Section 2, we briefly introduce some notations that will be used throughout this paper. In Section 3, we first introduce the customized image decomposition model. Then, we reformulate the CLRP model as a three-block separable optimization problem by introducing auxiliary variables and show the details of applying PPSM [16] to the underlying model. In Section 4, we investigate the performance of the CLRP model in four scenarios corresponding to different degradation operators $Φ$. A series of numerical results are reported to support the promising ability of the CLRP model for image decomposition and restoration. Finally, we conclude the paper with some remarks in Section 5.

Section snippets

Notations

In this section, we summarize some notations that will be used throughout this paper.

Let $R^{n}$ be an $n$ -dimensional Euclidean space. For a given $x \in R^{n}$ , we denote ${‖ x ‖}_{p}$ as the $ℓ_{p}$ norm of vector $x$ whose value is ${‖ x ‖}_{p} = {(\sum_{i = 1}^{n} {| x_{i} |}^{p})}^{\frac{1}{p}}$ for $1 \leq p < \infty$ , where $x_{i}$ is the $i$ th component of vector $x$ . In particular, we will use $‖ \cdot ‖$ to denote the standard $ℓ_{2}$ -norm for notational simplicity. Moreover, letting $M$ be a positive definite matrix (i.e., $M ≻ 0$ ), we define the $M$ -norm of $x$ by ${‖ x ‖}_{M} = \sqrt{x^{⊤} M x}$ . We denote the nuclear norm of

Model and algorithm

In this section, we first propose a customized low-rank prior model for decomposing structured images. Then, we reformulate our model as a separable convex problem so that we can employ the PPSM proposed in [16] to obtain a solution of the resulting optimization model.

Experimental results

In this section, we verify the effectiveness of the model (5) with different degradation operators $Φ$. More specifically, we test four cases: (i) clean image decomposition by setting $Φ = I$ with $I$ being an identity operator; (ii) image inpainting by taking $Φ = S$ with $S$ being a binary ‘mask’ (i.e., a down-sampling matrix), which corresponds to splitting images with missing information (i.e., pixels); (iii) image deblurring by specifying $Φ = B$ with $B$ being a known blurring matrix; (iv) image

Conclusion

We proposed a cartoon–texture based image decomposition model with low-rank texture prior. We used the full nuclear norm to characterize the texture component. An efficient algorithm named PPSM with guaranteed convergence was employed to solve the proposed model. The numerical results showed the effectiveness of the proposed model and the efficiency of the employed algorithm. We found that the proposed model can perform very well for pure cartoon–texture image decomposition when images are

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors would like to thank the four anonymous referees for their close reading and valuable comments, which helped us improve the quality of this paper. They are also grateful to Dr. Wenxing Zhang for useful advice and for sharing codes with us. This work was supported in part by the National Natural Science Foundation of China (No. 11771113) and the Zhejiang Provincial Natural Science Foundation of China (Grant No. LY20A010018).

References (26)

Rudin L. et al., Nonlinear total variation based noise removal algorithms, Physica D (1992)
Bertalmio M. et al., Simultaneous structure and texture image inpainting, IEEE Trans. Image Process. (2003)
Fadili M.J. et al., Image decomposition and separation using sparse representations: An overview, Proc. IEEE (2010)
Meyer Y., Oscillating Patterns in Image Processing and Nonlinear Evolution Equations: The Fifteenth Dean Jacqueline B. Lewis Memorial Lectures, Vol. 22 (2001)
Cai J.-F. et al., Image restoration: Total variation, wavelet frames and beyond, J. Amer. Math. Soc. (2012)
Cai J.-F. et al., Split Bregman methods and frame based image restoration, Multiscale Model. Simul. (2010)
Cai J.-F. et al., Simultaneous cartoon and texture inpainting, Inverse Probl. Imaging (2010)
Vese L.A. et al., Modeling textures with total variation minimization and oscillating patterns in image processing, J. Sci. Comput. (2003)
Ng M.K. et al., Coupled variational image decomposition and restoration model for blurred cartoon-plus-texture images with missing pixels, IEEE Trans. Image Process. (2013)
Schaeffer H. et al., A low patch-rank interpretation of texture, SIAM J. Imaging Sci. (2013)
Han D. et al., A partial splitting augmented Lagrangian method for low patch-rank image decomposition, J. Math. Imaging Vision (2015)
Ono S. et al., Cartoon-texture image decomposition using blockwise low-rank texture characterization, IEEE Trans. Image Process. (2014)
MacKenzie W., Atlas of Igneous Rocks and their Textures (1982)

Cited by (24)

Image cartoon-texture decomposition by a generalized non-convex low-rank minimization method
2024, Journal of the Franklin Institute
Image cartoon-texture decomposition is an important problem in image processing. In recent years, by exploiting low-rank priors of images, low-rank minimization methods have been widely adopted for image cartoon-texture decomposition. Since matrix rank minimization is an NP-hard problem, the convex nuclear norm is often used as a substitute for the matrix’s rank to realize the low-rank minimization methods. In this paper, we utilize a generalized non-convex surrogate of the matrix rank function to develop a novel low-rank minimization model for image cartoon-texture decomposition. We design a proximal alternating algorithm to solve the non-convex model and further demonstrate the global convergence of the algorithm. Numerical experiments illustrate that the proposed method can show much better performances than the existing state-of-the-art methods for image cartoon-texture decomposition.
Cartoon-texture guided network for low-light image enhancement
2024, Digital Signal Processing: A Review Journal
Recovering normal-exposure images from low-light images is a challenging task. Recent works have built a great deal of deep learning methods to address this task. Nevertheless, most of them treat cartoon and texture components in the same way, resulting in a loss of details. Recent effort, i.e. unfolding total variation network (UTVNet), is proposed, which recovers normal-light image by roughly decomposing the image into a noise-free smoothing layer and a detail layer using total variation (TV) regularization, and then processes the two components in different ways. However, its enhanced image exhibits color distortion owing to the limited representation ability of the TV model. To address this limitation, we design a cartoon-texture guided network named CatNet for low-light image enhancement. CatNet uses a cartoon-guided normalizing flow to retain cartoon information and an elaborated frequency domain attention mechanism in U-Net denoted as FAU-Net to recover texture information. Concretely, the ground-truth image is decomposed into cartoon and texture components to guide the corresponding recovery modules training, respectively. We also design a hybrid loss in the spatial and frequency domains to train the CatNet. Compared to state-of-the-art methods, our method gets better results, obtaining richer colors and more details. The source code and datasets have been made publicly available at https://github.com/shibaoshun/CatNet.
Application of computer image processing technology in old artistic design restoration
2023, Heliyon
Art designs exhibit different principles, textures, color combinations, and creative skills for vivid thinking visualizations. Art exhibits are far from ages, periods, and creators finding their digital patterns in recent years for resurrection. Degraded periodic artworks are digitally handled for reviving their legacy using digital image processing. This article introduces Textural Restoration Technique (TRT) using Deep Feature Processing (DFP) to augment such innovations. The proposed technique analyses the tampered image for its textures, and available features are extracted. The textures are expected to be sequential based on gradient distribution; the missing gradients are identified from the available features near the region of interest (ROI). The ROI is marked by combining missing and available features from which textural edges are sketched. In this process, recurrent learning is employed for verifying the gradient substitutions for even textures. The texture patterns are classified using high and low accuracy features exhibited between two successive ROIs. First, the learning model is trained using gradient distribution accuracy pursued by the texture completion edge. The second training is pursued by the first distribution, achieving the maximum restoration. The filled features and their gradient positions are marked by moving the ROIs for distinguishing textures. The restoration ratio is computed with high accuracy based on the filled edges.
Automated detection of gear tooth flank surface integrity: A cascade detection approach using machine vision
2023, Measurement: Journal of the International Measurement Confederation
The surface integrity of gear tooth flanks significantly impacts the efficiency and reliability of gear transmission systems. This article proposes a cascaded detection approach using machine vision to comprehensively localize and identify thermal damages on tooth flanks following the grinding process. This method utilized an image enhancement to correct non-uniform illumination and a saliency detection based on the spectral residual algorithm to extract individual tooth flanks. Additionally, an image semantic segmentation model, GBSU-Net, was put forward to detect thermal damage regions on the tooth flank and quantify the severity of grinding burn with the area ratio. The experimental results demonstrated the efficacy of the proposed method on the gear surface image dataset, with the Dice coefficient and IoU metrics achieving 84.29 % and 73.95 %, respectively. The proposed method is applicable for real-time detection during gear machining processes because of its swift and accurate detection capability.
Cartoon-Texture decomposition with patch-wise decorrelation
2023, Journal of Visual Communication and Image Representation
Citation excerpt:
Existing methods roughly fall into four categories. The first category takes the cartoon and the texture as functions in certain functional spaces, and regularizes them by norms of the corresponding functional spaces [7–18]; the second category regards the cartoon and the texture as vectors in the Euclidean space and regularizes them accordingly [19–23]; the third category considers patch-wise regularity from the matrix theory [24–27]; while some recent methods use unsupervised learning neural network [28,29]. In the second category, the most well-known method is the Morphological Component Analysis (MCA) [19].
Cartoon-Texture decomposition (CTD) is a fundamental task and has wide applications in image processing and computer vision. To enhance separation of the cartoon and texture, existing models explicitly introduce correlation terms to decorrelate the two components. However, existing correlations usually ignore the local geometric structure information, thus insufficient to decorrelate cartoon and texture. In this work, we propose the patch-wise cosine similarity to decorrelate the cartoon and texture. The proposed decorrelation term takes the local geometric information into account and is more effective in separating cartoon and texture. Combining our decorrelation term with the regularities for cartoon (Relative Total Variation (RTV)) and texture (div( $L^{1}$ )-norm), we propose a new CTD model. Extended experiments show that the proposed model outperforms existing methods in CTD, especially in preserving edges of the cartoon.
A doubly sparse and low-patch-rank prior model for image restoration
2022, Applied Mathematical Modelling
Citation excerpt:
Comparing the ROF model (i.e., (1.2) with a TV regularization term) with LPRM, one common feature is that both models have a TV regularization term, and one noticeable difference is that LPRM possesses an extra low-patch-rank regularization term. Promisingly, a series of numerical results in [5,7,8] showed that LPRM works better than the pure TV minimization model for image deblurring and inpainting. Indeed, it is not difficult to observe from the literature that the success of LPRM for image restoration is due to the ability of the low-patch-rank term maximally exploiting the inherent low rankness of an image (see an illustration example in Fig. 1 that the texture part appears approximately low rank).
Image restoration is a core problem in computer vision and image processing. In this paper, we introduce a unified low-patch-rank minimization model, which possesses one nuclear norm regularization term promoting the low-patch-rankness, and two sparse regularization terms including the classical total variation (TV) norm and a general sparse term under certain transform such as discrete cosine transform. By setting balancing parameters, our unified model reduces to the classical TV-regularized low-patch-rank minimization model and yields a new non-TV-regularized low-patch-rank prior image restoration model. Due to the multi-block structure of the model, we introduce a three-block alternating minimization algorithm to find approximate solutions of the proposed models. A series of computational results on image inpainting and deblurring further show that our approaches are reliable to recover high-quality images from degraded ones.
