Abstract

As a branch of the field of machine learning, deep learning technology is abrupt in various computer vision tasks with its powerful functional learning functions. The deep learning method can extract the required features from the original data and dynamically adjust and update the parameters of the neural network through the backpropagation algorithm so as to achieve the purpose of automatically learning features. Compared with the method of extracting features manually, the recognition accuracy is improved, and it can be used for the segmentation of copperplate printing images. This article mainly introduces the research on the key algorithm of the copperplate printing image segmentation based on deep learning and intends to provide some ideas and directions for improving the copperplate printing image segmentation technology. This paper introduces the related principles, watershed algorithm, and guided filtering algorithm of copperplate printing image synthesis process and establishes an image segmentation model. As a result, a deep learning-based optimization algorithm mechanism for the segmentation of copper engraving printing images is proposed, and experimental steps such as main color extraction in the segmentation of copper engraving printing images, adaptive main color extraction based on fuzzy set 2, and main color extraction based on fuzzy set 2 are proposed. Experimental results show that the average processing time of each image segmentation model in this paper is 0.39 seconds, which is relatively short.

1. Introduction

Image segmentation is one of the key issues in computer vision, digital image processing, and other fields. Image segmentation refers to the segmentation of the target image with relatively consistent features according to the consistency of the image attributes and the target image, so that the features of the same subregion have certain similarities and differences. The characteristics of the subregions show obvious differences. Finally, the target to be decomposed can be separated from the background and output the target. Currently, the number of image segmentation methods has reached thousands.

In recent years, with the continuous improvement of computer processing performance and computing power, especially the huge development of artificial intelligence, deep learning has achieved great success in many fields such as speech recognition, natural language processing, and computer vision. Different from traditional pattern recognition methods, the technology based on deep neural network does not need to manually design features; that is, it can automatically extract features with good information expression layer by layer through training, eliminating the tedious steps of manually extracting features. Therefore, the recognition of copperplate printing image segmentation has been slowly applied to deep learning.

Chen et al. found that classification is one of the most popular topics in hyperspectral remote sensing. In the past two decades, experts have proposed many methods to deal with the classification of hyperspectral data, but most of them did not extract hierarchically. Chen et al. introduced the concept of deep learning to the classification of hyperspectral data for the first time, first by following the classification based on classical spectral information to verify the qualifications of stacked autoencoders; secondly, a new method of spatially dominant information classification was proposed. A novel deep learning framework is proposed to integrate these two functions, from which the highest classification accuracy can be obtained. The framework is a mixture of principal component analysis, deep learning architecture, and logistic regression. Specifically, as a deep learning architecture, stacked autoencoders are designed to obtain useful advanced features. This research is relatively novel, but lacks experimental data support [1]. Postalcolu uses a convolutional neural network for image recognition. The dataset is named Fruits-360. 70% of the images are selected as training data. The remaining images are used for testing. The image size is set to 100 × 100 × 3. The training uses momentum stochastic gradient descent (sgdm), adaptive moment estimation (Adam), and root mean square boost (rmsprop) techniques. The training threshold is determined to be 98%. When the accuracy rate reaches 98% or more, the training is stopped. Use the trained network to calculate the final verification accuracy. This research is relatively one-sided and not practical [2]. Guo et al. found that adding a spatial penalty term to the fuzzy c-means (FCM) model is an important method to reduce the impact of noise in the image segmentation process. Although these algorithms improve the robustness to noise to a certain extent, they still have some shortcomings. First of all, they are usually very sensitive to the parameters that should be adjusted according to the intensity of the noise; secondly, in the case of uneven noise, it is obviously unreasonable to use constant parameters for different image regions, which usually leads to undesirable segmentation results. In order to overcome these shortcomings, Guo et al. proposed an adaptive FCM based on noise detection for image segmentation in this research. The new algorithm uses two image filtering methods to denoise and maintain detailed information. The variance of the gray values in each community is measured to calculate the parameters used to balance the two parts. This research procedure is relatively complicated and is not suitable for popularization in practice [3].

The innovations of this paper are (1) proposing a deep learning-based optimization algorithm mechanism for the segmentation of copper engraving printing images and (2) proposing the main color extraction in the etching image segmentation, the adaptive main color extraction based on fuzzy set 2, and main color extraction based on fuzzy set 2.

2.1. Related Principles of Copperplate Printing Image Synthesis Process

The printed matter must be as consistent as possible with the provided copper engravings; that is, the reproduction looks the same as the original under all lighting conditions, rather than achieving the “same spectrum” effect reproduction [4]. Usually, the transparent film provided by the picture provider has a larger color gamut than the printing technology application. When converting the image information to the printed matter, a compromise must be taken. Perform color gamut mapping according to the color management system, ignore and compress image feature information, or add spot colors. The spot colors are not in the image color separation information, and separate color plates are generated [5].

In the entire printing process, the original is decomposed into different colors, and the level, color, and definition are adjusted according to the printing needs and then screened, and finally, the printing equipment is overprinted color by color on the substrate to complete the color synthesis to obtain the final printed product to achieve color restoration and reproduction [6]. In color image synthesis, color separation is based on subtractive color mixing. In multicolor printing, halftone dots are independent of each other, but they also overlap. On paper, both subtractive color mixing (overprinting of monochromatic halftone dots) and additive color mixing (the comprehensive effect of monochromatic independent halftone dots observed by the observer) will occur [7].

2.2. Watershed Algorithm
2.2.1. Overview

The watershed algorithm compresses the input image to a topographic map, and the gray value of the digital image corresponds to the altitude in the topographic map. The higher the gray value, the higher the corresponding altitude. Continuously changing pixels constitute elements such as mountains, basins, elevations, and topographical changes in topography. They are undulating and continuous, just like gradient changes in graphics [8].

The realization process of the watershed algorithm can be understood according to the process from the rise of the water level to the flooding of the mountain. First, suppose that the groundwater level gradually rises from the deepest underground, and the first geographic location to be touched is the depression of the basin at the lowest altitude. As the water level continues to rise, the water level in the depressions of the basin will gradually rise, slowly filling the entire basin [9]. At this time, a zero-height dam was built around the basin to prevent water from overflowing in the basin. Then, the water level continued to rise, the dam was submerged, and the water in the basin passed over the dam on the edge of the basin and merged with the water in the adjacent basin. Finally, the water level continued to rise until the highest mountain was submerged, and the water level stopped rising. Here, the block-shaped area enclosed by the dam is called a stagnant basin, and the edge (or dam) of the stagnant basin is called a watershed [10].

2.2.2. Algorithm Steps

The image is composed of pixels, and each pixel has a specific value. Let the pixel in the image be p, the largest and smallest pixel value points in the image are denoted by , and the basin is denoted by C [11].

The threshold function is expressed as :where I (p) is the image function.

The basin function is expressed aswhere N is the local minimum in the image area.

The path distance function L (x, y) is expressed aswhere I is the path between any two points x and y in the divided area [12, 13].

2.2.3. Algorithm Improvement

The unified processing of the entire image is likely to confuse elements such as different targets, front and back scenes, and produce opposite effects. In order to further enhance the target and make the small target in the huge image clearer, this paper uses the watershed calculation to obtain the target contour not for preliminary segmentation, but for unified enhancement of the image in the region to enhance the difference between different regions [14]. Regarding the target area G obtained by the watershed algorithm, it is regarded as a target. There are the following processing methods for the target G:where c is 255, θ is the contrast correction parameter, and the value of θ is different for different colors of the area. In order to make the correction force in the same target area the same and the correction direction to be uniform, this paper confirms the θ value according to the distribution probability of the gray mean value:

Among them, k is the gray level threshold [15, 16].

2.3. Guided Filtering Algorithm
2.3.1. Overview

The purpose of the segmentation algorithm used in this chapter is to segment the copper engravings as a whole, and the color distribution of the copper engravings is often continuous in a certain range. Therefore, image-guided filtering is selected to denoise the segmented image to facilitate the extraction of the copper engravings as a whole [17]. Image-guided filtering is an edge-preserving filter that can smooth the image while maintaining the boundary. This algorithm needs a guided image when filtering. The guided image can be another image of the same size, or the original image itself [18]. When the original image itself is used as the guide image, the guided filtering can filter the image while keeping the edges. When another image is used as the guided image, the realization is to constrain the original image based on the gradient in the other image filtering operation. Guided filtering is widely used in image denoising, image dehazing, and detail smoothing [19].

2.3.2. Algorithm Steps

The guided filtering technique assumes that the filtered output image is q and the guided image is I, and there is a certain local linear relationship between the two [20]. Under this assumption, the linear transformation is given as follows:

Among them, is a square neighborhood window centered on pixel k. This paper takes 3 × 3. From formula (6), it can be found that the output image only has edges where there is a gradient in the guiding image [21, 22]. Define the output image q, which is obtained by subtracting some unnecessary noise elements n from the image p to be filtered:

Minimizing the difference between q and p, while satisfying the constraints of formula (6), can get the result of guided filtering. The most critical problem in guided filtering is to calculate the optimal solution of [23]. In order to obtain the best effect of image optimization while maintaining the local linear model in formula (6), the difference between the image p to be filtered and the output image q of the guided filtering is expressed by the minimized cost function [24]. The following formula exists:

In order to prevent the value from being too large, ε is introduced as a regularization parameter [25]. According to linear regression analysis, the optimal solution of can be obtained:

Since guided filtering has an overlapping window averaging strategy, this method can well consider the correlation of local information. The overlapping window averaging strategy means that a certain pixel i of the output image is included in many overlapping windows. The pixel value of this point is related to the value of all the windows covered. The value of the center of different windows is different. Therefore, it is necessary to calculate the mean value of all q obtained [26]. Assuming that the pixel point is i, the filtered output can be expressed as

2.4. Image Segmentation Model

The image has many different characteristics such as texture, spatial structure, and color. According to these characteristics, the image is divided into different areas, so that each area has its own characteristics, and there are different characteristics between areas. Image segmentation allows us to conduct further research on different regions [27]. Image segmentation is a very basic and important step in image processing. It is very important to understand the subsequent recognition and analysis, and it is also a bridge for image processing in image segmentation. The result of image segmentation directly affects the processing conditions of subsequent machines. Therefore, image segmentation is one of the important research topics in the current image technology field. Image segmentation algorithms are also diverse and are widely used in practical applications such as computer vision, pattern recognition, aesthetic image processing, and industrial fields [28].

Define an image on the area . Based on the segmentation results of the U-net model based on coarse calibration and the level set method, this paper proposes the following image segmentation model, denoted as Model I:where is the probability density function of pixel x inside and outside the object and can be calculated according to the segmentation result of the U-net model based on coarse calibration [29, 30]. I (x) is the image intensity value, and the function is defined aswhere is a Gaussian kernel with standard deviation σ and it is defined as

H (u) and δ (u) are Heaviside function and Delta function; they are defined as follows:

The method part of this paper adopts the above method to study the key algorithm for the segmentation of copper engraving printing image based on deep learning. The specific process is shown in Table 1.

3. Experiment on the Key Algorithm of Copper Engraving Printing Image Segmentation Based on Deep Learning

3.1. Optimization Algorithm Mechanism of Copper Engraving Printing Image Segmentation Based on Deep Learning
3.1.1. Attention Mechanism

The attention mechanism comes from the study of human vision. It usually refers to the fact that under the obstacles of information dissemination, people selectively pay attention to a part of all information, while ignoring other visible information is processing information. Researchers introduce this method into the field of computer vision information processing, such as traditional local image feature extraction and image detection. In the deep learning technology, the attention mechanism is usually used as an independent neural network bundle to determine the strictly different weights of different inputs or different parts.

3.1.2. Attention Mechanism in Spatial Domain

The spatial transformer network (STN) model proposes a unit called spatial transformation, which converts spatial sector information into images, so that key information can be derived. The spatial conversion is actually the realization of the attention mechanism, because the trained spatial converter can find the area where the image information needs to be paid attention to. At the same time, it can have the rotation function and the zoom function, so that the important part of the image can be converted and extracted. And this unit can be directly added to the original network structure as a new level.

The space conversion unit can identify the basic information of the superior tag. The information table is a distinguishable table because the information on each target point is actually a combination of all the information on the origin. In theory, such a unit can be added at any level because the unit can process channel information and matrix information at the same time.

3.1.3. Channel Domain Attention Mechanism

The principle of the channel zone attention mechanism can be understood from the perspective of signal conversion. In the analysis of signal systems, any signal can be written as a linear combination of semitone waves. After time-frequency transformation, a continuous half ion wave signal can be used to replace the frequency signal value in the time domain.

3.2. Image Segmentation of Copperplate Printing
3.2.1. Main Color Extraction

Color is the subjective feeling produced by external light stimulation acting on human visual organs. Therefore, the adaptive nature of human visual system must be considered. Studies have shown that the performance of colors depends on local contrast rather than absolute contrast values. It can be seen that the color characteristics of an image not only depend on the image itself, but also on the observer’s visual system and observation experience. Therefore, designing a main color extraction method that combines the attributes of the image and the human visual perception plays an important role in improving the segmentation effect of the copperplate printing image.

The method of obtaining the main color of the image based on the adaptive color image segmentation of fuzzy set 2 is the multithreshold method based on histogram to obtain the correlation (spatial information) between pixels through fuzzy identity, that is, the local information of the pixels, and make full use of the advantages of fuzzy theory, effectively reserving more low-level information for high-level processing. Therefore, this is a fuzzy threshold segmentation method that considers both global and local information of the image.

3.2.2. Adaptive Primary Color Extraction Based on Fuzzy Set 2

In the process of image processing and recognition, the human visual characteristics must be fully considered. The imaging process of an image is a process of mapping from many to one, so the image itself has many uncertainties and imprecision, that is, blur. The change of the image from black to white is also fuzzy, which is difficult to distinguish for human visual perception. This kind of uncertainty and inaccuracy is mainly reflected in the uncertainty of the image gray level, the uncertainty of the geometric shape, the uncertainty knowledge, etc., which cannot be solved by the classic mathematical theory. Fuzzy theory studies this uncertainty and imprecision and provides effective new processing technology for intelligent information processing. In the research, people found that fuzzy theory has a good ability to describe the uncertainty of images. Therefore, fuzzy theory can be introduced as a model and method to effectively describe image characteristics and human visual characteristics, analyze human judgment and perception, and recognize images and other behaviors.

Fuzzy sets are just like describing nonfuzzy phenomena with classical set theory; they are descriptions of fuzzy phenomena. Corresponding to the domain of the classical set theory, in the fuzzy theory, the domain X is used to represent the value range of the research object. This paper proposes an adaptive multithreshold technology based on fuzzy set 2. Through the analysis of the histogram, the histogram peak value is automatically obtained, assuming that each peak is a pixel class, then the number of peaks is the number of pixel classes N, thereby determining the threshold number as N-1. Using the relationship between the number of categories and the number of thresholds, the shape of the fuzzy curve is controlled by adjusting the membership window width to achieve a fully adaptive multithreshold technology.

3.2.3. Main Color Extraction Based on Fuzzy Set 2

Most of the fuzzy closed value methods in the past did not consider the correlation between image pixels, and the local information of the pixels was lost in the process of processing. Therefore, this paper considers the correlation between image pixels by defining fuzzy identity. Combined with the abovementioned multithreshold technology, an adaptive color image segmentation technology based on fuzzy set 2 is used to extract the main color. First, obtain the fuzzy identity histogram of each color component of the image, and then apply the multithreshold technology based on fuzzy set 2 to obtain the closed value set of each component, then cluster the image according to the threshold set, and finally perform the necessary area merging to prevent oversegmentation phenomenon.

This part of the experiment proposes that the above steps are used in the research experiment of the key algorithm for the segmentation of copperplate print images based on deep learning. The specific process is shown in Table 2.

4. Results of Key Algorithms for Image Segmentation of Copperplate Printing Based on Deep Learning

4.1. Antijamming Performance Analysis of the Algorithm for Gaussian White Noise

Gaussian white noise is a common noise in images, which obeys Gaussian distribution. In the image, Gaussian noise appears at almost every point, but its noise depth is random. In this experiment, we first segment the image, then add Gaussian white noise of different intensities to the image, and then use the algorithm in this paper to obtain the segmentation results. The noise intensity is 0.01–0.10, and each noise intensity is separated by 0.01. In order to further illustrate the anti-interference of the algorithm in this paper against Gaussian white noise, the global consistency error between the original image segmentation result and the noise-added image segmentation result is calculated as shown in Table 3 and Figure 1.

It can be seen from Table 3 and Figure 2 that as the intensity of Gaussian white noise continues to increase, the highest global consistency error between the original image segmentation result and the noise image segmentation result is 0.0221, which is almost negligible, indicating that this algorithm is correct Gaussian white noise having strong anti-interference.

4.2. Algorithm Target Number Analysis

In the definition of the target size in this article, based on the analysis of the dataset, the ISAID dataset is sorted in ascending order according to the pixels occupied by the target. According to the size of the pixels occupied by the target, among the 655451 targets contained in the dataset, the target size from the smallest tens of thousands of pixels to the largest tens of millions of pixels, we take one million pixels (1000 pixels × 1000 pixels) as the unit of the pixel points occupied by the target area, and calculate the target number as shown in Table 4 and Figure 2.

In order to make the classification of the target size reasonable, the number of targets under each category is evenly distributed, this article classifies the pixels that the target occupies less than 3000 pixels × 3000 pixels as small targets; the number of pixels occupied is greater than 3000 pixels × 3000 pixels less than 7000 pixels × 7000 pixels are classified as medium targets; the rest are large targets.

4.3. Algorithm Processing Analysis
(1)A large number of numerical experiments have been carried out in this paper, and it is found that the selection of the parameter p of in model I has a great impact on the experimental results. The selection of different values will make the model’s evaluation indicators PA and IoU reach the maximum. The number of iterations is different. Therefore, this paper fixes other parameters and selects different values to repeat the experiment. The experimental results are recorded and compared to select the appropriate value. The test results of the experiment are shown in Table 5 and Figure 3.According to the above experimental results, we choose the value of parameter p to be 1.2, which can reduce the number of iterations of the model and achieve better performance of the model. In this case, the segmentation result of the copper engraving printing image is shown in Figure 4 (picture from Baidu Encyclopedia).(2)In the training process of this paper, the weight of the network model on the ImageNet dataset is used as the pretraining model, and the weights of the deep learning part of the model are initialized. This article uses the Adam optimizer algorithm and cross-entropy loss function for training, and the comparative experimental test results are shown in Table 6 and Figure 5.It can be seen from the table and chart that the image segmentation neural network proposed in this paper is tested on the ImageNet dataset, and the overall segmentation effect is relatively good, and the intersection of two categories is relatively high. The image segmentation neural network in this paper has a better segmentation effect on the ImageNet dataset, which proves the feasibility of the algorithm in this paper for the ImageNet dataset.(3)Quantitative evaluation of several algorithms, statistics, and analysis of the processing time of each algorithm for the image segmentation of copper engraving printing is carried out. The processing time is shown in Table 7 and Figure 6.

It can be seen from the experimental results that, in terms of segmentation speed, the watershed algorithm, guided filter algorithm, and image segmentation model have little difference in processing speed for each sample, indicating that these algorithms have good anti-interference capabilities. The average processing time of each image segmentation model is 0.39 seconds, and the processing speed is relatively fast.

5. Conclusions

Image segmentation methods based on neural network technology have attracted the most attention of researchers in the field of vision. Improved algorithms, new methods, and new tools are constantly emerging, especially the recent deep learning technology, which has become an emerging topic in the field of image segmentation. This article discusses the current mainstream image segmentation methods. Aiming at the research status of deep learning technology, this paper analyzes and summarizes the related algorithms of copperplate print image segmentation. This article is mainly inspired by traditional methods combined with deep learning methods and made a little improvement. Although the improvement of this article has been significantly improved, it is an attempt on a relatively simple network with relatively few convolutional layers. In the future research, it is necessary to conduct deeper research and prove the influence of the relatively complex network of the convolutional layer.

Data Availability

No data were used to support this study.

Conflicts of Interest

The authors declare that that they have no conflicts of interest regarding the publication of this paper.