Abstract

To analyze the effect of government reward-penalty policies (RPPs) on the decisions of a dual-channel closed-loop supply chain (CLSC), this paper endogenizes government decision variables to maximize social welfare and builds four decision-making models (without RPP, with carbon emission RPP, with recycling amount RPP, and with double RPP) by using a Stackelberg dynamic game between the government and supply chain members. The research results show that, (1) in the four models, there exist optimal prices and reward-penalty coefficients to maximize the supply chain members’ profits and social welfare. (2) Comparing with model , under most conditions, three government RPPs decrease the demand for new products and increase the demand for remanufactured products. Comparing the case without RPP, R’s profit decreases, and when the carbon emission cap is very big and the lowest recycling amount is very small, M’s profit increases. (3) In most cases, the three government RPPs can effectively control the total carbon emission and increase the social welfare, but they damage the benefits of retailers and consumers. With the increase of the carbon emission intensity of remanufactured products, the government can implement the double RPP, the carbon emission RPP, and the recycling amount RPP in turn.

1. Introduction

As the concepts of sustainable manufacturing and green supply chain have gained global acceptance, remanufacturing of used products has been recognized as a significant innovation of manufacturing and environmental protection [15]. Remanufacturing refers to a manufacturing process for specialized repair or upgrading in order to transform remanufactured products into “like-new.” [6] More and more enterprises engage in recycling and remanufacturing the used products through dual-channel consisting of a network direct channel and traditional retail channel, which result in dual-channel closed-loop supply chains (CLSCs) [7]. The CLSC management means that the supply chain members share their resources and recycle the used products to reduce environmental pollution and change of biodiversity over time [814]. Meanwhile, governments have put forward environmental policy to promote remanufacturing, including the remanufacturing subsidy policy [15, 16]. Government subsidy policy holds a very important place in the development of remanufacturing, which is considered to be an efficient policy. The Chinese government has started to subsidize remanufacturers for each remanufactured truck engine sold [17]. Government policy is not only an attractive remanufacturing mechanism from an environmental perspective but it also provides a substantial source of revenue [18]. Therefore, CLSC members are increasingly accepting remanufacturing and are willing to take responsibility for product recycling and remanufacturing. In a dual-channel CLSC, the decisions of manufacturers and retailers are influenced by the government subsidy policy, and the government plays a leading role. Therefore, it has become a critical issue to investigate the effect of the government subsidy policy on the decisions of a dual-channel CLSC [19].

On the contrary, for the sustainable development of the environment, governments will levy taxes on excess carbon emission to control carbon emission, which together with the government subsidy policy constitutes the government reward-penalty policy (RPP). The carbon emission tax policies include carbon tax regulation [20], cap-and-trade regulation [21], and carbon cap regulation [22]. In 2008, the first global trading platform for carbon emission began to run. After that, more and more carbon trading platforms appeared, including the UK Emissions Trading Group and National Trust of Australia [2325]. Since climate change is a defining issue of our time [26, 27], the climate action summit held on September 23, 2019, and low carbon emission reduction is an important goal of our country’s enterprise management. In this process, as a social subject, governments should deal with manufacturing companies to take strong regulatory measures and control carbon emission. And thus, governments carry out reward-penalty for enterprises of achieving and meeting performance targets. Chinese also promulgated “the law of People’s Republic of China on air pollution prevention and control,” “the law of People’s Republic of China on energy conservation,” and “the measures for the administration for certification of energy-saving products.” In 2012, Beijing, Shanghai, Chongqing, Guangdong, Hubei, and Shenzhen were approved to carry out pilot carbon emission trading.

In a dual-channel CLSC, game theory is often used to research the decisions of CLSC members. Especially, the dynamics game theory, as a traditional and classical theory, reports that the action between the participants is in order, i.e., the latter can observe the former’s behavior choice, and then the former makes the appropriate choice according to this [28]. By considering the influence of government policies, the members of the dual-channel CLSC must change the traditional decision-making mode. Since the government plays an important role in the dual-channel CLSC, it should be regarded as a participant in the dynamic game in the dual-channel CLSC dynamic game model. In the dynamic game model of government participation, the government’s goal is to maximize social welfare [29]. We attempt to address this decision problem for a dual-channel CLSC by endogenizing government decisions in a dynamic game model.

More specifically, we will analyze the impact of one government RPP (carbon emission RPP or recycling amount RPP) and double government RPP (carbon emission and recycling amount RPP) on the decision-making of a dual-channel CLSC consisting of a manufacturer and a retailer, respectively. Four models are examined: (1) the decentralized model without RPP (model ), (2) the decentralized model with carbon emission RPP (model ), (3) the decentralized model with recycling amount RPP (model ), and (4) the decentralized model with double RPP (model ). In the dual-channel CLSC, the government first gives the intervention policy parameters to maximize social welfare. Then, the manufacturer wholesales new products to the retailer and collects the used products from the marketplace. The manufacturer is responsible for remanufacturing and sells remanufactured products to consumers through the online channel. The retailer subsequently retails the new products in the end market. In this decentralized dynamic model, the government is modeled as the leader followed by the manufacturer and lastly by the retailer. This research attempts to address the following three questions:(1)How to derive the optimal pricing decision, demand, profit, carbon emission, consumer surplus, and social welfare for the dual-channel CLSC in , , , and models based on a Stackelberg dynamic game?(2)What is the impact of the government RPP on the performance of the dual-channel CLSC?(3)What are the applicable conditions for each government RPP and how to set government RPP parameters for maximizing social welfare?

The remainder of this paper is organized as follows. Section 2 reviews related literature. In Section 3, we describe the problem and model assumptions. Section 4 gives the optimal decisions of a dual-channel CLSC in , , , and models based on the Stackelberg dynamic game, and then model is taken as a benchmark model, and the optimal decisions in models , R, and are compared with those in model to illustrate the impacts of government RPP on the performance of the dual-channel CLSC, respectively. Numerical illustration is provided in Section 5. Section 6 concludes the paper.

2. Literature Review

In a dual-channel CLSC, an extensive game theoretic approach has been applied to the supply chain, which has been carried out from various perspectives, such as CLSC’s performance, pricing decisions, dual-channel competition, and the effect of government intervention policy. Based on the research questions in this paper, we focus our review on the applications of a game theory in supply chains, the decision-making of dual-channel CLSCs, and the effect of government policies on supply chains.

2.1. The Applications of a Game Theory in Supply Chains

A critical issue in the supply chain is the competition relationship of supply chain members, which has attracted considerable attention in academia and practice. Most of the existing studies are using a game theory approach to depict the competition among supply chain members, but limited research has been carried out for game members including governments [3032]. For instance, an optimization fuzzy game model of three-player payoff is developed in a green supply chain, and this model proposes a practical solution to increase the players’ confidence to choose green strategies [33]. Babu and Mohan [34] explained and predicted social and economic sustainability for a public health insurance supply chain using evolutionary game theory. Raj et al. [35] studied the coordination issues in a sustainable supply chain using two-stage Stackelberg game-theoretic approach where the supplier acts as a Stackelberg leader. Pricing, recycling, and green product decisions in a multiproduct competitive 3-echelon supply chain are investigated, and the supply chain involves one manufacturer and multiple suppliers and retailers, in which the latter two compete horizontally while keeping Nash equilibrium, but all the three compete vertically while maintaining the Stackelberg equilibrium [36]. Guo et al. [37] proposed a two-echelon reverse supply chain and dealt with the differential game model by introducing recycling publicity and proposed coordination strategies between recyclers and processors in the collection of waste electrical and electronic equipment. Taleizadeh and Sadeghi [38] considered two collecting reverse supply chains and applied three game theory structures to obtain the optimal channel rewards: Nash, Nash–Stackelberg-first supply chain, and Nash–Stackelberg-second supply chain. The above research indicates that game models consisting of supply chain members are well suited to investigate the competition in the supply chain, but it remains an unsettled issue of considering the effect of the government action on the CLSC. In practice, governments usually are more powerful than supply chain members, and therefore, the Stackelberg game can be more suitable to characterize the interaction between them [39].

2.2. The Decision-Making of Dual-Channel CLSCs

An increasing number of researchers have paid attention to the decision-making problem of dual-channel CLSCs and focused on the pricing decisions, channel operational management, remanufacturing strategy, and coordination mechanism. In a dual-channel CLSC, Xie et al. [40] combined the revenue-sharing contract in the forward channel with the channel investment cost-sharing contract and introduced the Stackelberg game to investigate the contract coordination mechanism. Giri and Dey [41] extended the model of Jafari et al. [42] with a backup supplier, considering the uncertainty in the collection of used products and the performance of the supply chain is improved. On the background of an online/offline dual channel, Xie et al. [43] developed a revenue-sharing mechanism by taking the relationship between the recycling rate and the recycling revenue-sharing ratio into consideration. Yi et al. [44] targeted at the optimum strategies of the collection decision for a retailer-oriented closed-loop supply chain with dual recycling channel in the construction machinery industry. Giri et al. [45] derived analytically the pricing and return product collection decisions for the supply chain under five different scenarios, viz., centralized, decentralized (Nash game), manufacturer-led, retailer-led, and third-party-led decentralized scenarios. Huang et al. [46] investigated the optimal strategies for a closed-loop supply chain with dual recycling channels, within which the manufacturer sells products via the retailer in the forward supply chain, while the retailer and the third party competitively collect used products in the reverse supply chain. It is worth noting that the government plays an important role in a dual-channel CLSC, and government policies will affect the supply chain members’ decisions [7]. This motivates us to investigate the game between government and supply chain members [47].

2.3. The Effect of Government Policies on Supply Chains

Many existing studies consider different government policies in a supply chain [48, 49]. For example, He et al. [7] investigated the channel structure and pricing decisions of a dual-channel closed-loop supply chain (CLSC), where a manufacturer can distribute new products through an independent retailer and sell remanufactured products via a third-party firm or platform (3P) in the presence of possible government subsidy. Wan and Hong [50] investigated the impacts of subsidy policies and transfer pricing policies on the closed-loop supply chain with dual collection channels. Improving greening activities in manufacturing firms plays a major role in decreasing hazardous environmental impacts of products and increasing social welfare (SW); Zand et al. [51] considered a dyadic online to offline (O2O) closed-loop supply chain (CLSC) composed of a manufacturer and a retailer for trading a single green product. Chen et al. [52] endogenized a government subsidy in a research joint venture and presented a three-player game in which a government determines the amount of subsidy for a supply chain consisting of a manufacturer and a retailer conducting a research joint venture into a sustainable product. Sinayi and Rasti-Barzoki [53] considered a two-tier model which consists of a supply chain and a government that the government in a higher-level role as a leader for the whole supply chain. At both levels, three dimensions of sustainability, namely, economic, social, and environmental dimensions, are defined, and each is considered in modeling. Cheng et al. [54] established a cooperation decision model for a mixed carbon policy for carbon trading-carbon tax in a two-stage S-M supply chain and investigated the influence of mixed carbon policy with constraint of reduction targets on supply chain price, productivity, profits, and carbon emission reduction rate. These aforementioned papers all highlighted that government policies play a significant role in dual-channel CLSCs. Therefore, our study will investigate the changes in the supply chain members’ optimal decisions relative to government policy parameters, and especially when using government policies to increase social welfare.

3. Problem Description and Model Assumptions

We consider a dual-channel CLSC consisting of a government (G), a manufacturer (M), and a retailer (R), and they play a Stackelberg dynamic game. In the decentralized dynamic model, G firstly gives the intervention policy parameters to maximize social welfare. Then, M wholesales new products at a unit wholesale price to R. M collects used products at a price from the marketplace, and it is responsible for remanufacturing at a cost . And then, M sells remanufactured products at a unit price to consumers through the online channel. R subsequently retails the new products at a unit price in the end market. In this three-player Stackelberg dynamic game, G is modeled as the leader followed by M and lastly by R. Decision-making goal of M and R is to maximize their profits, and G’s decision-making goal is to maximize social welfare. Our primary objective is to derive the effect of government RPPs on the decision-making of the dual-channel CLSC by using a dynamic game theory.

In this research, two common government policies (carbon emission RPP and recycling amount RPP) are considered. The operational structure diagram of the dual-channel CLSC under the government RPP is shown in Figure 1. Then, four models are examined: (1) the decentralized model without RPP (model ), (2) the decentralized model with carbon emission RPP (model ), (3) the decentralized model with recycling amount RPP (model ), and (4) the decentralized model with double RPP (model ). Given the above model settings, we firstly derive the optimal pricing, resulting demand, profit, carbon emission, consumer surplus, and social welfare in the four models , , , and . Then, the effect characteristic of the government RPPs is obtained based on the optimal results. The numerical experiments are carried out to compare and evaluate the performance of the carbon emission RPP, recycling amount RPP, and double RPP, thereby useful managerial insights are given for supply chain managers and governments.

This paper only considers a single product, i.e., a new product or a remanufactured product. A consumer can only have a new product or a remanufactured product. It is assumed that new and remanufactured products coexist in the same market [55, 56], and the remanufactured products are homogeneous. All recycled used products can be produced as remanufactured products, and only one unit of recycled used products can produce one unit of remanufactured products.

The market consists of two types of consumers: new product consumers and remanufactured product consumers. New product consumers do not have used products when they buy new products, while remanufactured product consumers have used products before they buy remanufactured products. New product consumers can buy new products directly, while remanufactured product consumers must sell their used products when they buy remanufactured products [8].

Based on the problem description, we employ the symbols and notations given in Table 1 throughout this paper.

To make the analysis tractable, we introduce the following assumptions in this research.

Assumption 1. Consumers are heterogeneous in their willingness-to-pay for a new product, which is uniformly distributed between 0 and 1. Consumers’ willingness-to-pay for a remanufactured product is a fraction a of , where .
The utilities that consumers receive from new and remanufactured products are and , respectively [3, 8, 57]. Following the utility maximization principle, if , consumers will purchase the new product, resulting in a new product demand function . If , consumers will purchase the remanufactured product, leading to a remanufactured product demand function [8, 58, 59].

Assumption 2. A remanufactured product is cheaper to produce, i.e., .
It is common to simplify the modeling analysis process by assuming the unit manufacturing cost of new products to be positive and that of remanufactured products to be zero [3, 8, 60, 61]. The marginal manufacturing cost of new products is assumed to be to ensure a positive demand for new products [8, 62].

Assumption 3. To ensure profitable remanufacturing, is assumed.
encourages M to recycle and remanufacture used products to enjoy a cost advantage and offers both new and remanufactured products [3, 8, 55].

Assumption 4. The unit carbon emission of a remanufactured product is less than that of a new product, and the carbon emission intensity of the remanufactured product satisfies . The total amount of carbon emission is linearly increasing in its production output.
The unit carbon emission generated in producing remanufactured products is a certain percentage of that generated in producing new products [61]. In model similar processes, the unit carbon emission generated in producing new products is 1, and then the unit carbon emission generated in producing remanufactured products is [61, 62]. Thus, the total carbon emission of the two types of products is . Following Cachon [63], Yenipazarli [29], Cao et al. [21], and Cao et al. [61], the total environmental cost is assumed to be linearly increasing in the total emission, i.e., , where is the environmental cost coefficient.

4. Equilibrium Analysis of Four Dynamic Models

In the following part, we will derive the optimal results for the four dynamic models W, C, R, and D and compare the optimal decisions in the models C, R, and D with those in the model W, respectively.

4.1. Model W: Decentralized Model without RPP

In model W, G does not participate in the game; only M and R participate in the game. M determines and to maximize his profit. Then, R determines to maximize his profit. M is the leader, and R is the follower, and they play a Stackelberg dynamic game. Hence, the model is formulated as

By using the reverse induction method, we have the optimal results for optimization model (1) as shown in the following Proposition 1.

Proposition 1. In model W, the optimal prices are given as , , and . The resulting optimal demands are and .

The optimal profits of M and R are given as

Total social welfare consists of four parts: supply chain members’ profits, consumer surplus, government expenditure, and environment damage cost [29]. In model W, G does not participate in the game, government expenditure is zero, and then we have

Consumer surplus consists of the consumer surplus of new and remanufactured products [29], i.e.,

The total carbon emission generated in producing new and remanufactured products [29, 61], in this model W, is given as

And thus, the environment damage cost is equal to .

In this model W, the optimal consumer surplus and the total social welfare are given as

The proof is given in Appendix A.

4.2. Model C: Decentralized Model with Carbon Emission RPP

In model C, G implements the carbon emission RPP, and it participates in the game. Firstly, G determines the unit carbon emission reward-penalty coefficient by maximizing the total social welfare. Under this carbon emission RPP, M determines and to maximize his profit. Finally, R determines to maximize his profit. G, M, and R play a Stackelberg dynamic game. In this Stackelberg dynamic game, G is modeled as the leader followed by M and lastly by R. The three-echelon Stackelberg dynamic game model consisting of G, M, and R is formulated as

By using the reverse induction method, we have the optimal results of optimization model (7) as shown in the following Proposition 2.

Proposition 2. In model C, the optimal reward-penalty coefficient and price are given as , , , and . The resulting optimal demands are and .

The optimal profits of M and R are given as

In this model C, the total carbon emission is given as

Therefore, the optimal consumer surplus and the total social welfare are given as

The proof is given in Appendix B.1.

The effects of carbon emission RPP on promoting remanufacturing are first embodied in the demands of the two-type products, as shown in Proposition 3.

Proposition 3. In model C, the effects of the carbon emission RPP on the prices and demands are given as follows:(1), , and .(2), if , , and .(3).

The proof is given in Appendix B.2.

Proposition 3 indicates that the retail price of new products in model is higher than that in model W, while the demand for new products in model is lower than that in model . The demand for new products in model decreases with the increase of unit carbon emission reward-penalty coefficient. Because the cost and the retail price of new products increase under the carbon emission RPP, thus the demand for new products decreases. We can also find that the carbon emission RPP increases the retail price of remanufactured products due to the increase cost. When the carbon emission intensity of the remanufactured products is very small, the demand for remanufactured products in model is higher than that in model . With the increase of the unit carbon emission reward-penalty coefficient, the demand for the remanufactured products increases. However, the total market demand in model is smaller than that in model . Therefore, compared to the case without RPP, when the carbon emission intensity of the remanufactured product is very small, the carbon emission RPP can promote the recycling and remanufacturing products. But, it will reduce the total market demand.

The effect of carbon emission RPP on the total carbon emission is given in Proposition 4.

Proposition 4. In model , the effects of carbon emission RPP on the total carbon emission are given as and .

The proof is given in Appendix B.3.

Proposition 4 indicates that, with the increase of the unit carbon emission reward-penalty coefficient, the total carbon emission decreases. Compared to the case without RPP, the total carbon emission decreases. Because the demand for new products decreases under the carbon emission RPP and only when the carbon emission intensity of the remanufactured products is very small, the demand for remanufactured products increases, but the total market demand decreases, and thus, the total carbon emission decreases comparing the case without RPP.

The effects of the carbon emission RPP on the profits of R and M are described in Proposition 5.

Proposition 5. In model , the effects of carbon emission RPP on profits are given as follows:(1), .(2)If , then , , where .

The proof is given in Appendix B.4.

Proposition 5 shows that, with the increase of the unit carbon emission reward-penalty coefficient, R’s profit decreases, and the carbon emission RPP decreases R’s profit comparing the case without RPP. Since the carbon emission RPP makes M increase the wholesale price of new products, while the demand for new products decreases comparing the case without RPP, thus R’s profit decreases. From Proposition 5, we also find that when the carbon emission cap is very big, M’s profit increases as the unit carbon emission reward-penalty coefficient increases, and it is higher than that in model . Because the demand for the remanufactured products increases under the carbon emission RPP, the positive effect of remanufactured products on M’s profit is greater than the negative effect of new products on M’s profit, and thus, M’s profit increases.

4.3. Model : Decentralized Model with Recycling Amount RPP

In model , G implements a recycling amount RPP, and it participates in the game. Firstly, G determines the reward-penalty coefficient by maximizing the total social welfare. Under this recycling amount RPP, M determines and to maximize his profit. Finally, R determines to maximize his profit. G, M, and R play a Stackelberg dynamic game. In this Stackelberg dynamic game, G is modeled as the leader followed by M and lastly by R. The three-echelon Stackelberg game consisting of G, M, and R is formulated as

By using the reverse induction method, we have the optimal results for optimization model (11) as shown in the following Proposition 6.

Proposition 6. In model , the optimal reward-penalty coefficient and price are given as , , , and . The resulting optimal demands are and .

The optimal profits of M and R are given as

In this model , the total carbon emission is given as

Therefore, the optimal consumer surplus and the total social welfare are given as

The proof is given in Appendix C.1

The effects of recycling amount RPP on promoting remanufacturing are first embodied in the demands of the two-type products, as shown in Proposition 7.

Proposition 7. In model , the effects of the recycling amount RPP on the prices and demands are given as follows:(1), , and .(2), , and .(3).

The proof is given in Appendix C.2.

Proposition 7 indicates that the retail prices of new and remanufactured products in model are lower than those in model . The demand for new products in model is lower than that in model , while the demand for remanufactured products in model is higher than that in model . With the increase of the unit recycling amount reward-penalty coefficient, the demand for new products decreases, but the demand for remanufactured products increases. Because the cost decreases and M decreases the retail price of the two-type products under the recycling amount RPP. And thus, the demand for remanufactured products increases in model , while the demand for new products decreases in model . The total market demand increases, and the recycling amount RPP is conducive to promoting recycling and remanufacturing of products.

The effect of recycling amount RPP on the total carbon emission is given in Proposition 8.

Proposition 8. In model , the effects of recycling amount RPP on the total carbon emission are given as if , then and .

The proof is given in Appendix C.3.

Proposition 8 indicates that when the carbon emission intensity of the remanufactured products is very small, the total carbon emission decreases as the unit recycling amount reward-penalty coefficient increases. Compared to the case without RPP, the total carbon emission decreases. Because the increase amount of remanufactured product demand under the recycling amount RPP is larger than the decrease amount of new product demand, thus, the total carbon emission decreases comparing the case without RPP.

The effects of the recycling amount RPP on the profits of R and M are described in Proposition 9.

Proposition 9. In model , the effects of the recycling amount RPP on profits are given as follows:(1), ;(2)If , , and , where .

The proof is given in Appendix C.4.

Proposition 9 shows that, with the increase of the unit recycling amount reward-penalty coefficient, R’s profit decreases. And the recycling amount RPP decreases R’s profit comparing the case without RPP. Because the retail price and demand of new products decrease under the recycling amount RPP, thus R’s profit decreases. From Proposition 9, we also find that when the lowest recycling amount is very small, M’s profit increases as the unit recycling amount reward-penalty coefficient increases, and it is higher than that in model . Although the retail prices of two-type products and the demand for new products decrease under the recycling amount RPP, the demand for the remanufactured product and the total market demand increase. The negative effect of new products on M’s profit is lower than the positive effect of remanufactured products on M’s profit, and thus, M’s profit increases.

4.4. Model : Decentralized Model with Double RPP

In model , G implements a double RPP consisting of a carbon emission RPP and a recycling amount RPP, and it participates in the game. Firstly, G determines the unit carbon emission reward-penalty coefficient and the unit recycling amount reward-penalty coefficient by maximizing the total social welfare. Under this double RPP, M determines and to maximize his profit. Finally, R determines to maximize his profit. G, M, and R play a Stackelberg dynamic game. In this Stackelberg dynamic game, G is modeled as the leader followed by M and lastly by R. The three-echelon Stackelberg dynamic game consisting of G, M, and R is formulated as

By using the reverse induction method, we have the optimal results for optimization model (15) as shown in Proposition 10.

Proposition 10. In model , the optimal reward-penalty coefficients and prices are given as , , , , and . The resulting optimal demands are and .

The optimal profits of M and R are given as

In this model , the total carbon emission is given as

Therefore, the optimal consumer surplus and the total social welfare are given as

The proof is given in Appendix D.1.

The effects of double RPP on promoting remanufacturing are first embodied in the demands of the two-type products, as shown in Proposition 11.

Proposition 11. In model , the effects of the double RPP on the prices and demands are given as follows:(1), , and ; if , then .(2); if , then and , and if , then .(3)If , then .

The proof is given in Appendix D.2.

Proposition 11 indicates that the double RPP decreases the demand for the new products, and with the increase of carbon emission and recycling amount reward-penalty coefficient, the demand for new products decreases. When the environment damage coefficient is very small, the retail price of new products in model is lower than that in model . The total carbon emission is strictly controlled under the double RPP, which causes a great negative impact on new products. The demand for remanufactured products in model increases as the recycling amount reward-penalty coefficient increases. When the carbon emission intensity of remanufactured products is very small, the retail price of remanufactured products in model is lower than that in model , while the demand for remanufactured products in model is higher than that in model . The demand for remanufactured products in model increases as the carbon emission reward-penalty coefficient increases, and the total market demand in model is higher by comparing the case without RPP. Since the double RPP promotes product recycling, the positive effect of the remanufactured product is larger than the negative effect of new products.

The effect of the double RPP on the total carbon emission is given in Proposition 12.

Proposition 12. In model , the effects of double RPP on the total carbon emission are given as ; if , then ; if , then .

The proof is given in Appendix D.3.

Proposition 12 indicates that the total carbon emission decreases with the increase of carbon emission reward-penalty coefficient. When the carbon emission intensity of the remanufactured product is very small, the total carbon emission decreases with the increase of recycling amount reward-penalty coefficient, and the total carbon emission in model is lower than that in model . Because the demand for remanufactured products increases and the demand for new products decreases under the double RPP, thus, the total carbon emission decreases.

The effects of the double RPP on the profits of R and M are described in Proposition 13.

Proposition 13. In model , the effects of the double RPP on profits are given as follows:(1), , and .(2)If , then , and if , then , where and .

The proof is given in Appendix D.4.

Proposition 13 indicates that, with the reward-penalty coefficients increase, R’s profit decreases, and it is lower comparing the case without RPP. Since the double RPP increases the wholesale price of new products and decreases the demand for new products, it damages R’s profit. We also find that when the carbon emission cap is very big, M’s profit increases as the unit carbon emission reward-penalty coefficient increases. When the lowest recycling amount is very small, M’s profit increases as the unit recycling amount reward-penalty coefficient increases. The demand for remanufactured products and the total market demand increase under the double RPP. When the carbon emission cap is very big and the lowest recycling amount is very small, M can obtain more rewards through product recycling and remanufacturing.

From the above propositions, we can find that the three government RPPs can promote product recycling and remanufacturing, i.e., decreasing the demand for new products and increasing the demand for remanufactured products. The three government RPPs decrease the total carbon emission comparing the case without RPP and protect the environment. The three government RPPs damage R’s profit but improve M’s profit comparing the case without RPP. The comparative effects of the three government RPPs on supply chain profit, consumer surplus, and social welfare will be given in the following numerical analysis.

5. Numerical Experiment

In order to more intuitively describe the impact of the three government RPPs on the optimal decision of supply chain members, the supply chain profit, consumer surplus, and total social welfare, the numerical simulation analysis will be given in this part. The relevant parameters are designed to illustrate the managerial insights, which are set as , , , , , and . The carbon emission intensity of remanufactured products is set to . Then, we first have the impact of λ on the optimal reward-penalty coefficients, which is described in Figure 2. The impacts of on price, demand, profit, carbon emission, consumer surplus, and social welfare in the four models are described in Figures 38.

It can be found from Figure 2 that, with the increase of , the reward-penalty coefficient increases, while the reward-penalty coefficients , , and decrease. When the carbon emission intensity of remanufactured products, , is very small, the reward-penalty coefficient is the highest, while when is very big, the reward-penalty coefficient is the highest. And the reward-penalty coefficient in model R is the lowest, but the reward-penalty coefficients and in model are not the lowest. In order to ensure the reward-penalty coefficients are positive, must be lower than 0.45. That is, because maximizes the total social welfare, when the carbon emission intensity of remanufactured products is very big, in order to control the total carbon emission, increases the reward-penalty coefficient in model . However, decreases the reward-penalty coefficient in model R.

From Figure 3, we can find that, with the increase of , the retail prices of new and remanufactured products increase in models and , and the retail price of remanufactured products increases in model . However, the retail price of new products firstly decreases then increases with the increase of in model . The retail price of new products in model is the highest, that is, the lowest in model when is very small. The retail prices of new products in models and are higher than those in model W. When is very small, the retail price of remanufactured products in model is the lowest, while is very bigger, the retail price of remanufactured products in model is the highest, that is, because increases the reward-penalty coefficient in model , which increases M’s cost. And then, M will increase the retail price of remanufactured products and the wholesale price of new products, and R increases the retail price of new products.

It is found from Figure 4 that, with the increase of , the demands of new products increase in models , , and , while the demands of remanufactured products decrease in models , , and . When is very small, the demand for new products in model is the lowest, but the demand for remanufactured products in model is the highest. However, when is very big, the demand for new products in model R is the highest, but the demand for remanufactured products in model is the lowest. That is because when the carbon emission intensity of remanufactured products, , increases, the three government RPPs will decrease the demand for remanufactured products but increase the demand for new products. When is very small, the double RPP introduces the lowest demand for new products and the highest demand for remanufactured products in model . However, when is very big, the difference of carbon emission between new and remanufactured products is very small, and G has no incentive to set a higher reward-penalty coefficient of recycling amount. And then, the recycling amount RPP causes that the demand for new products is the highest, but the demand for remanufactured products is the lowest in model .

From Figure 5, we can find that, with the increase of , the profits of M and R increase in model , the profits of M and R first decrease, and then increase in model , but in model , M’s profit first decreases then increases, and R’s profit first increases then decreases. When is very small, M’s profit in model is the highest, and it is the lowest in model , but R’s profit in model is the highest, and it is the lowest in model . When is very big, M’s profit in model is the highest, but R’s profit in model is the highest. In the four models, M’s profits are all higher than R’s profits. Because when is very big, the reward-penalty coefficient is negative, and the recycling amount RPP increases the prices and demands of new and remanufactured products, which can increase the profits of M and R. G limits on carbon emission and recycling amount, which will damage the price and demand of new products and promote the price and demand of remanufactured products. And thus, comparing the case without RPP, the three government RPPs increase M’s profit but decrease R’s profit.

From Figure 6, we can find that when , the total carbon emission in model is higher than that in model , but in other cases, the total carbon emission under the three government RPPs is lower than that in model . From Proposition 8, when , the total carbon emission in model R is lower than that in model . We also find that when is very small, the total carbon emission in model is the lowest, but when is very big, the total carbon emission in model is the lowest. Therefore, when the carbon emission intensity of remanufactured products, , meets certain conditions, the three government RPPs can effectively control the total carbon emission. When the difference of carbon emission between new and remanufactured products is very big, the double RPP is the most effective to control the total carbon emission; otherwise, the carbon emission RPP is the most effective to control the total carbon emission.

Figure 7 gives that the consumer surplus in model is the lowest. When is very small, the consumer surplus in model is the highest; otherwise, the consumer surplus in model is the highest. In most conditions, the three government RPPs damage the consumer interests and surplus. Especially, the carbon emission RPP can most effectively control the total carbon emission, but under this government RPP, the consumer surplus is the lowest. The retail prices of new and remanufactured products are higher under the carbon emission RPP than those under other government RPPs. However, when is very small, the retail prices of new and remanufactured products in models and are lower than those in model , and thus, the consumer surplus in models and is higher than that in model .

From Figure 8, we can find that, with the increase of , the total social welfare in model decreases, while the total social welfare in models , , and first decreases then increases. When is very small, the total social welfare in model is the highest, and can implement the double RPP, while when is very big, the total social welfare in model is the highest, and can implement the recycling amount RPP; under other conditions, the total social welfare in model is the highest, G can implement the carbon emission RPP, and they are all higher than those in model . That is to say, the three government RPPs can all increase the total social welfare. That is because, in most conditions, comparing the case without RPP, although the three government RPPs decrease the consumer surplus, they can increase the profits of M and supply chain and decrease the total carbon emission, and thus increase the total social welfare. When the difference of carbon emission between two products is very big, G only can control the total carbon emission through setting the lowest recycling amount.

In order to analyze the impact of RPPs on supply chain members, we choose model as the analysis object. In the following part, the parameter is set to be . From Figure 2, we have the optimal reward-penalty coefficients t and as to be , , , and . To illustrate the impacts of the reward-penalty coefficients and on profit, total carbon emission, and social welfare, the reward-penalty coefficients and are set to be and . The effects of the double RPP are illustrated in Figures 911.

From Figure 9, we can find that, with the increase of , M’s profit increases, but R’s profit decreases. Under most conditions, M’s profit is higher than R’s profit. Only when is very small and is very big, R’s profit is higher than M’s profit. Because the more the reward-penalty coefficient is, the greater the damage to the demand for new products and the more benefit to the demand for remanufactured products. And thus, the double RPP is a benefit for M, but it is damage for R.

It can be found that, from Figure 10, with the increase of , the total carbon emission decreases, but it increases with increasing. Since when the reward-penalty coefficient is very high, the demand for new products decreases, but the demand for remanufactured products increases, and thus, the total carbon emission decreases. Combining Figure 6, we obtain that the total carbon emission in model is always lower than that in model . That is to say, the double RPP is effective to control the total carbon emission.

From Figure 11, it can be found that the total social welfare is a convex function of two reward-penalty coefficients, and then, there exists a unique optimal reward-penalty coefficient combination to maximize the total social welfare. Considering the supply chain member’s profits, consumer surplus, government expenditure, and environment damage cost, G must set appropriate reward-penalty coefficients to balance the interests of the four aspects to maximize the total social welfare.

6. Conclusions

This paper studies the role of government policies in a dual-channel CLSC and constructs four dynamic game models (i.e., model , model , model , and model ) by considering the game among M, R, and G. We first derive the optimal pricing decision, consumer demand, profit, carbon emission, consumer surplus, and social welfare for the dual-channel CLSC in models , , , and . Second, we take model as the benchmark model and compare the optimal results in the later three models with the optimal values of model to illustrate the impacts of the government RPPs on the performance of the dual-channel CLSC. Third, through numerical simulation analysis, the optimal results in the four models are compared, and model is taken as an example to analyze how to set the reward-penalty coefficients of the government RPPs by maximizing social welfare.

The results show that, (1) in the four models, there exist optimal prices and reward-penalty coefficients to maximize the supply chain members’ profits and social welfare. (2) Comparing with model , under most conditions, three government RPPs decrease the demand for new products and increase the demand for remanufactured products. When the difference of carbon emission between new and remanufactured products is very small, three government RPPs can decrease the total carbon emission comparing the case without RPP. Comparing the case without RPP, R’s profit decreases, and when the carbon emission cap is very big and the lowest recycling amount is very small, M’s profit increases. (3) In most cases, the three government RPPs can effectively control the total carbon emission and increase the social welfare, but they damage the benefits of retailers and consumers. When the difference of carbon emission between new and remanufactured products is very small, G can implement the double RPP, while when the difference of carbon emission between new and remanufactured products is very big, G can implement the recycling amount RPP, and otherwise, G can implement the carbon emission RPP.

In this paper, we do not consider the recycling channels, recycling, and remanufactured costs of used products. These factors will influence the decisions of supply chain members in a dual-channel CLSC and affect governments for setting their RPP’s parameters. This is worthy of further research. In addition, the three government RPPs damage the benefits of the retailer and consumers, and how to design a more appropriate government RPP for governments to balance various interests will also become a problem for further study. When governments implement RPPs with the limitation of capital budget, how to determine the optimal reward-penalty coefficients is also worth further investigation.

Appendix

A. Proof of Proposition 1 in Model

In model W, M and R play a manufacturer-dominated Stackelberg game. Since is a strictly concave function about . Since and is a strictly concave function about and , and there exist unique optimal solutions to model (1). Then, the optimal solutions in model can be obtained by the reverse induction, which are given as , , and . Hence, we have the optimal demands as follows:

Substituting , , , , and into profit functions in model (1) gives the optimal profits of M and R, carbon emission, consumer surplus, and social welfare.

B. Proof of Propositions in Model

B.1. Proof of Proposition 2 in Model

Similar to the proof of Proposition 1, given reward-penalty coefficient of carbon emission, we have the prices and demands as follows:

Hence, we have the profits of M and R, the total carbon emission, consumer surplus as follows: , , , and

Substituting the above results into the total social welfare maximizing problem, we can derive the optimal carbon emission reward-penalty coefficient as follows:

Substituting into , , , , , , , , , and gives the optimal price, demand, profit, carbon emission, consumer surplus, and social welfare.

B.2. Proof of Proposition 3 in Model

Comparing and analyzing the optimal results in Propositions 1 and 2, we have the following:(1), , and .(2), if , we can easily verify that , and .(3).

B.3. Proof of Proposition 4 in Model

Comparing and analyzing the total carbon emission in models and , we have ; since and , we have .

Similar to the above proof process, we can easily verify that .

B.4. Proof of Proposition 5 in Model

(1)Comparing and analyzing the optimal profits of R between models and , we can easily verify that(2)Comparing and analyzing the optimal profits of M between models and , we have

Hence, we can verify that if , then , , where .

C. Proof of Propositions in Model

C.1. Proof of Proposition 6 in Model

Similar to the proof of Proposition 1, given reward-penalty coefficient of recycling amount, we have the prices and demands as follows: , , , , and .

Hence, we have the profits of M and R, the total carbon emission, and consumer surplus as follows:

Substituting the above results into the social welfare maximizing problem, we can derive the optimal carbon emission reward-penalty coefficient as follows:

Substituting into , , , , , , , , , and gives the optimal price, demand, profit, carbon emission, consumer surplus, and social welfare.

C.2. Proof of Proposition 7 in Model

Comparing and analyzing the optimal results in the Propositions 1 and 6, we can easily verify that(1)(2)(3)

C.3. Proof of Proposition 8 in Model

Comparing and analyzing the total carbon emission in models and , we have and .

If , we can easily verify that and .

C.4. Proof of Proposition 9 in Model

(1)Comparing and analyzing the optimal R’s profits between models and , we can easily verify that(2)Comparing and analyzing the optimal M’s profits between models and , we have

If , it can be verified that and , where .

D. Proof of Propositions in Model

D.1. Proof of Proposition 10 in the Model

Similar to the proof of Proposition 1, given reward-penalty coefficient of carbon emission and reward-penalty coefficient of recycling amount, we have the prices and demands as follows:

Hence, we have the profits of M and R, the total carbon emission, and consumer surplus as follows:

Substituting the above results into the social welfare maximizing problem, we can derive the optimal carbon emission and recycling amount reward-penalty coefficients as follows:

Substituting and into , , , , , , , , , and gives the optimal price, demand, profit, carbon emission, consumer surplus, and social welfare.

D.2. Proof of Proposition 11 in Model

Comparing and analyzing the optimal results in Propositions 1 and 10, we can easily verify that(1)If , then , , , and .(2), if , then and , and if , then .(3)If , then .

D.3. Proof of Proposition 12 in Model

Comparing and analyzing the total carbon emission in models and , we have since and , .

It is easily verified that if , then ; if , then .

D.4. Proof of Proposition 13 in Model

(1)Comparing and analyzing the optimal R’s profits between models and , we can easily verify that(2)Comparing and analyzing the optimal M’s profits between models and , we have , where if , , where .

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (Grant no. 71771055) and Key Projects of Natural Science of Anhui (Grant no. KJ2019A0519).