Introduction

The major upheaval caused by the coronavirus crisis has shown how modern societies depend on electricity for healthcare, telecommunications, entertainment, shopping, work, and so on. In short, electricity is at the heart of modern economies and human life and provides an increasing share of energy services. The electrification of transportation and heat, the dependence on digitally connected devices, the growing services sector, and the increased use of technology have triggered increases in demand for electricity. Increasing electricity demand was a primary source of global CO2 emissions from the energy sector (impact of fossil fuels) that reached a record level in 2018 (IEA 2019). All these issues indicate that countries must develop long-term energy strategies that consider sustainability indicators.

Most countries tend to increase their use of sustainable energy to cope with environmental issues. In this regard, the transition from fossil fuels to renewable energy is an essential task for many countries. As a developing country, Turkey is mostly dependent on imports for electricity generation, in which natural gas has the highest share (30.34%), followed by hard coal (22.37%), hydro (19.66%), lignite (14.79%), wind (6.54%), solar PV (2.56%), geothermal (2.44%), others (1.19%), and liquid fuels (0.11%) (TEIAS 2020). In other words, a considerable share of total electricity in Turkey is generated by fossil fuels. In addition, strong economic and population growth and increasing income levels have triggered Turkey's greenhouse gas (GHG) emission growth to peak levels among the Organization for Economic Co-operation and Development (OECD) countries. However, Turkey set a mitigation target for 2030 under the UN Framework Convention on Climate Change. Nevertheless, energy is the primary source of GHG emissions in Turkey (OECD 2019).

Any decision regarding energy plays a vital role in future planning; thus, various aspects, such as economic, environmental, socioeconomic, and technical attributes, must be considered. Energy decision makers (DMs) need to take a hard, evidence-based view of current conditions and the results of their choices. Therefore, DMs need practical guiding tools to make the most appropriate decisions, particularly if many alternatives and conflicting criteria affect the choice of these alternatives. In this regard, multiple attribute decision making (MADM) is often adopted to examine specific alternatives and select the optimal choice based on multiple criteria (attributes). The evaluation of electricity generation options and the selection of the optimal option is a typical, critical, and highly influential MADM problem.

In MADM, the attribute weights play a vital role in the decision-making process. In other words, the result of the decision-making process is prominently affected by the attribute weights. There have been numerous approaches utilized for determining attribute weights. These methods can be grouped as subjective, objective, and integrated or hybrid based on the considered preferences and utilized data (Fu and Wang 2015; Yang et al. 2017). Subjective methods solely utilize the subjective evaluations of DMs to obtain the attribute weights. The most commonly used methods include the analytic hierarchy process (AHP) (Saaty 1986), the best–worst method (BWM) (Rezaei 2015), the simple multiattribute rating technique (SMART) (Edwards and Barron 1994), direct rating (Bottomley and Doyle 2001), point allocation (Doyle et al. 1997), linear programming techniques for multidimensional analysis of preferences (LINMAP) (Srinivasan and Shocker 1973), and the Delphi method (Hwang and Yoon 1981). Objective methods are utilized to prevent human-made instabilities and obtain more realistic results. The objective methods use mathematical models and data without considering the preferences of DMs. The most common objective approaches include the criteria importance through intercriteria correlation (CRITIC) (Diakoulaki et al. 1995), entropy (Xu 2004), standard deviation (SD) (Deng et al. 2000), mean weight (MW), and maximizing deviation method (Yingming 1997). Hybrid or integrated approaches combine the preferences of DMs with a decision matrix to obtain the criteria weights. Thus, they take advantage of both method types. Different forms of hybrid methods exist in the literature (Fan et al. 2002; Ma et al. 1999; Pei 2013; Rao et al. 2011; Wang and Parkan 2006).

In this study, the most common subjective approach (AHP) and a recently introduced and trending subjective approach (BWM) are considered as weighting approaches. In addition, four of the most common objective approaches (CRITIC, SD, MW, and entropy) are considered. These methods have been selected to reveal the differences among subjective methods, the differences among objective methods, and the differences among subjective and objective methods in general. Additionally, through evaluations of the weighting methods, the aim is to reveal the impacts of subjectivity and eliminate the uncertainty. Scholten et al. (2015) posited that uncertainty in attribute weights could result from inaccurate quantitative evaluations, personal prejudices, or the use of imprecise weights. In this context, uncertainties related to personal biases caused by subjective weighting methods (AHP and BWM) are minimized by including the objective weighting methods in the proposed methodology. Mean weights are also included in the study to reduce uncertainties that may be caused by attribute weights. In addition, uncertainties caused by inaccurate quantitative estimates are diminished with the inclusion of expert knowledge and experience in the proposed approach. Thus, potential uncertainties are minimized by considering various weighting sets. Notably, sensitivity analysis, which is defined as the analysis of the impact of uncertainty by Saltelli et al. (2000), is not conducted because various weight sets are already analyzed through several weighting methods.

The AHP is one of the most extensively implemented multiple criteria decision-making (MCDM) approaches because of its ease of application and success. The BWM is a new technique compared to the AHP. Nevertheless, it has attracted considerable interest thanks to its efficiency in reducing the number of pairwise comparisons and its success in preserving consistency between assessments. The benefits of the BWM over the AHP can be described as follows. First, the comparisons in the BWM take less time than the AHP because the AHP uses the whole matrix of comparisons. Second, the BWM eliminates redundant comparisons; thus, its performance in preserving the consistency of pairwise comparisons is better than that of the AHP. Last, the complexity of comparisons is reduced in the BWM because it utilizes a 1–9 scale rather than the 1/9–9 scale used in the AHP; this simplifies the evaluation process (Mi et al. 2019). The CRITIC method takes both the contrast intensity and contradictory character of the assessment criteria into account (Diakoulaki et al. 1995). The CRITIC, entropy, and SD methods consider the decision matrix to be the only source of information on the relative importance of indicators. In the MW approach, equal weights are given to the attributes. All these methods have been used for finding attribute weights either solely or as integrated with other MCDM (MADM) methods in various areas (Table 1).

Table 1 Literature review on hybrid weighting and MADM (including fuzzy) methods

To rank the alternatives, seven of the MADM methods, namely the elimination and choice expressing the reality (ELECTRE) method, the weighted sum method (WSM), the weighted product method (WPM), the organization, rangement et synthese de donnes relationnelles (ORESTE) method, the technique for order performance by similarity to the ideal solution (TOPSIS), the preference ranking organization method for the enrichment of evaluations (PROMETHEE), and the multiple criteria optimization compromise solution (VIKOR), are utilized in this study. The reasons for selecting these methods can be explained as follows. First, these are among the most widely used MADM techniques (Table 1). Second, they belong to different MADM family groups and follow different procedures. Third, each method is likely to produce a different result. Therefore, it is more reasonable to use multiple techniques and evaluate the results compared to the use of one method. In this context, Ozernoy (1992) claimed that there was no perfect MADM method to produce intended results for all problems. Ishizaka and Siraj (2018) proposed the use of multiple MADM methods to improve the accuracy of the results for a specific problem. To minimize the uncertainties resulting from MADM methods (Haddad and Sanders 2018; Mosadeghi et al. 2013), multiple MADM methods are included in the proposed methodology.

One of the main objectives of this study is to assess and compare the results of subjective and objective weighting methods and MADM methods. It also aims to reveal the most effective integrated methods in the sustainable energy context. Another main objective of this study is to rank the six leading electricity generation technologies, namely natural gas, coal, hydro, onshore wind, geothermal, and solar photovoltaic (PV), for Turkey through numerous integrated MADM methods. In this context, four main attribute groups, namely economic, environmental, socioeconomic, and technical, are determined in the scope of the sustainable development concept. Twelve subattributes are set under the main criteria based on the literature and expert knowledge. Then, a unique and comprehensive methodology involving various weighting and MADM methods, Spearman's correlation coefficient analyses, and the Borda, Copeland, and grade average methods, is implemented to reveal the optimal electricity generation option for Turkey.

The main contributions of this study can be summarized as follows:

  1. 1.

    This study is the first to compare the BWM, AHP, entropy, SD, MW, and CRITIC methods.

  2. 2.

    To minimize the subjectivity of the AHP and BWM and the solely data-based evaluations of the objective weightings of entropy, SD, MW, and CRITIC, all of them are utilized to determine attribute weights.

  3. 3.

    Because using one MADM method may not guarantee an accurate selection, seven MADM methods are applied to rank the electricity generation options.

  4. 4.

    A comprehensive analysis is performed using six weighting and seven MADM methods, which constitute 42 models in total. Additionally, some hybrid models, such as entropy-based ORESTE, are used in the evaluation of electricity generation technologies for the first time.

  5. 5.

    The electricity generation options of Turkey are evaluated by considering sustainability indicators and using current data to the extent possible.

  6. 6.

    Unlike other studies, the rankings of integrated methods, involving various weighting methods and numerous MADM methods, are combined to determine the optimal electricity generation technology.

  7. 7.

    A unique and comprehensive methodology involving various weighting and MADM methods, Spearman's correlation coefficient analyses, and the Borda, Copeland, and grade average methods is presented.

The rest of this study is organized as follows. In Sect. 2, the energy profile of Turkey is presented. Section 3 presents a description of each method and the implementation details of the proposed methodology in the energy case. Section 4 includes the results and a discussion regarding each method and analysis. Section 5 concludes the study and provides insights into possible future studies.

Energy and electricity generation profile of Turkey

Due to population and economic growth, the demand for energy and natural resources in Turkey is increasing. With an annual growth rate of 5.5% in electricity demand since 2002, Turkey has set a record for the fastest growth among OECD members. Additionally, energy demand is expected to increase by 50% in the next ten years. Despite the growing energy demand, Turkey is dependent on imports. As preventive and precautious actions, it is aimed to improve energy efficiency, to add nuclear to the energy mix, to increase the share of domestic and renewable energy, and to diversify imported oil and gas supply sources and resources (MFA 2020).

The evaluation and prioritization of electricity generation technologies that consider sustainability indicators are vital. To reveal the current situation, the annual development of installed capacity by the primary energy resources (options) of Turkey is shown in Fig. 1. The general view varies by year. To compare options accurately, 2014 is taken as a reference year because data for solar PV are available starting in 2014. Based on year-by-year changes, it can be seen that solar PV use increased by 124.9% in 2018 compared to 2014. In addition, geothermal energy increased by 2.2%, followed by renewable sources (including waste) by 1.7%, onshore wind by 0.9%, hard coal by 0.5%, hydro by 0.2%, natural gas by 0.1%, and lignite by 0.1%. However, others decreased by 0.3% and liquid fuels by 0.4%. Notably, Turkey has an installed capacity of 90,400 MW as of July 2019 (MFA 2020).

Fig. 1
figure 1

Annual development of Turkey's installed capacity by primary energy resources

The annual generation (GWh) of each electricity generation option and their shares are shown in Fig. 2. It can be seen that natural gas has the highest share, followed by hard coal, hydro, lignite, wind, solar PV, geothermal, others, and liquid fuels.

Fig. 2
figure 2

Annual generation and share of each electricity generation option in Turkey in 2018

Turkey is among the largest growing renewable markets in the world. Turkey is the sixth-largest electricity market in Europe, and it has the highest market growth rate in Europe. Both demand and supply are expected to double by 2030. Turkey’s 2023 goal includes increasing the share of renewables to 30%, maximizing the use of hydropower to reach 34 GW, increasing wind and solar installed capacities to 20 GW and 10 GW, respectively, and decreasing the share of natural gas to below 30% of electricity generation. The targets for renewable installed capacity (MW) are 34,000, 20,000, 10,000, 1500, and 1000 for hydro, wind, solar, geothermal, and biomass, respectively (Presidency of The Republic of Turkey 2017). These facts demonstrate why Turkey was selected for the analyses in the study.

Materials and Methods

In this study, four methods for determining attribute weights are considered. The procedures for these methods are defined in the following subsections. The methods utilize the decision matrix given as follows:

$$ A = \left[ {a_{ij} } \right]_{m \times n} = \left[ {\begin{array}{*{20}c} {a_{11} } & {a_{12} } & \ldots & {a_{1n} } \\ {a_{21} } & {a_{22} } & \ldots & {a_{2n} } \\ \vdots & \vdots & \ddots & \vdots \\ {a_{m1} } & {a_{m2} } & \ldots & {a_{mn} } \\ \end{array} } \right] $$
(1)

where \(a_{ij}\) denotes the value of alternative i concerning criterion j.

Weighting methods

Best–Worst method (BWM)

The BWM, which was introduced by Rezaei (2015), is a comparison-based approach for determining attribute weights. The outcome of the BWM depends heavily on the assessment of DMs. Like other such methods, it has benefits and drawbacks. The advantages of the BWM include less time required for comparisons and the delivery of more consistent and reliable outcomes compared to the AHP (Mi et al. 2019). Additionally, the BWM only uses integer numbers for pairwise comparisons. However, similar to other subjective methods, it relies on the subjective evaluations of DMs, which can be biased and lead to deceptive outcomes.

The BWM consists of the following steps (Rezaei 2015):

  1. i.

    A set of decision criteria is formed. While reaching a decision, the DM identifies n criteria \(\left\{ {c_{1} ,c_{2} , \ldots ,c_{n} } \right\}\).

  2. ii.

    The best (most important) and the worst (least significant) criteria are determined. The DM categorizes the best and the worst criteria overall. There is no need for comparisons during this step.

  3. iii.

    The preference of the best criterion over all the other criteria is specified using a number from 1 to 9. The best-to-others vector is formed as follows:

    $$ A_{B} = \left( {a_{B1} ,a_{B2} , \ldots ,a_{Bn} } \right) $$

    where \(a_{Bj}\) represents the preference of the best criterion B over criterion j and \(a_{BB} = 1\).

  4. iv.

    Pairwise comparisons between the worst criterion and the other criteria are formed. The others-to-worst vector is formed as follows:

    $$ A_{W} = \left( {a_{1W} ,a_{2W} , \ldots ,a_{nW} } \right)^{T} $$

    where \(a_{jW}\) denotes the preference of criterion j over the worst criterion W and \(a_{WW} = 1.\)

  5. v.

    The optimal weights \(\left( {w_{1}^{*} ,w_{2}^{*} , \ldots ,w_{n}^{*} } \right)\) are defined such that the maximum absolute differences \(\left| {\frac{{w_{B} }}{{w_{j} }} - a_{Bj} } \right|\) and \(\left| {\frac{{w_{j} }}{{w_{W} }} - a_{jW} } \right|\) are minimized for all j. The following min–max model is formed accordingly:

    $$ \begin{gathered} \min \mathop {\max }\limits_{j} \left\{ {{ }\left| {\frac{{w_{B} }}{{w_{j} }} - a_{Bj} } \right|,\left| {\frac{{w_{j} }}{{w_{W} }} - a_{jW} } \right|} \right\} \hfill \\ {\text{subject to}}:\, \hfill \\ \mathop \sum \limits_{j} w_{j} = 1 \hfill \\ w_{j} \ge 0, \quad {\text{for }}\;{\text{all }} j \hfill \\ \end{gathered} $$
    (2)

    The model given above can be transformed to the following model:

    $$ \begin{gathered} \min \xi \hfill \\ {\text{subject to}}: \hfill \\ \left| {\frac{{w_{B} }}{{w_{j} }} - a_{Bj} } \right| \le \xi ,\quad {\text{for}}\;{\text{all }} j \hfill \\ \left| {\frac{{w_{j} }}{{w_{W} }} - a_{jW} } \right| \le \xi , \quad {\text{for}}\;{\text{all }} j \hfill \\ \mathop \sum \limits_{j} w_{j} = 1 \hfill \\ w_{j} \ge 0,\quad {\text{for}}\;{\text{all }} j \hfill \\ \end{gathered} $$
    (3)

    After solving this model, the optimal weights and \(\xi^{*}\) are found.

  6. vi.

    The consistency of the comparison \(\left( {w_{1}^{*} ,w_{2}^{*} , \ldots ,w_{n}^{*} } \right)\) matrix is checked to ensure overall consistency. A consistency ratio is computed via the following function:

    $$ CR = \frac{{{ }\xi^{*} }}{CI} $$
    (4)

    where CR denotes the consistency ratio and CI represents the consistency index. The consistency index values are given in XX. The lower the CI, the more consistent the comparisons are.

Analytic hierarchy process (AHP)

The AHP, which was introduced by Saaty (1977), determines the weights of criteria through pairwise comparisons in Table 2. The AHP can also help the DM with relevant information to choose the optimal alternative by ranking a set of alternatives. If there are many alternatives, the AHP method may not be appropriate, as it requires a high number of pairwise comparisons. In such a case, the AHP is mostly utilized for weighting the attributes. The advantages and disadvantages of AHP are summarized in Table 3 (Ishizaka and Labib 2011).

Table 2 Consistency index (CI) values
Table 3 The advantages and disadvantages of AHP

The AHP decomposes a complex MADM problem into a hierarchy and evaluates the relative importance of decision attributes. The steps of the AHP can be explained as follows. First, the hierarchy is formed. The top level of the hierarchy represents the overall goal. The middle level(s) consists of decision attributes (and subcriteria, if any). The bottom level consists of decision alternatives. Second, pairwise comparisons of the criteria are formed based on the 1–9 scale given in Table 4. The relative importance of the attributes at each level is determined.

Table 4 The scale of numbers and definitions (Saaty 1987)

To determine the attribute weights, the AHP method forms a pairwise comparison matrix A as follows:

$$ {\text{A}} = \left[ {\begin{array}{*{20}c} {a_{11} } & {a_{12} } & \cdots & {a_{1n} } \\ {a_{21} } & {a_{22} } & \cdots & {a_{2n} } \\ \vdots & \vdots & \ddots & \vdots \\ {a_{n1} } & {a_{n2} } & \cdots & {a_{nn} } \\ \end{array} } \right] $$
(5)

where the criteria set is \(C = \left\{ {C_{j} | j = 1,2, \ldots ,n} \right\}\). In the evaluation matrix A(n × n), each element aij (i, j = 1, 2, …, n) denotes the relative preference of ith criterion to the jth criterion (Wang and Yang 2007). Then, mathematical operations take place to normalize and determine the relative weights for each matrix. The relative weights are denoted by the right eigenvector (w) matching to the largest eigenvalue (λmax) as follows:

$$ Aw = \lambda_{\max } w $$
(6)

If the pairwise comparisons are wholly consistent, matrix A has a rank of one and λmax = n. In such a case, weights can be found by normalizing any of the columns or rows of A. Last, consistency of the judgments is checked to verify the results. The accuracy of the AHP results depends mainly on the consistency of the pairwise comparison evaluations. The relation between the data of A: aij * ajk = aik defines the consistency. The CI is obtained through Eq. 7:

$$ CI = \frac{{\lambda_{\max } - n}}{n - 1} $$

The final CR is calculated by dividing CI by a random index (RI), as shown in Eq. 8:

$$ CR = \frac{CI}{{RI}} $$
(8)

where 0.1 is the upper limit of acceptable CR. If the final CR exceeds this limit, the assessment procedure must be performed again to achieve an acceptable consistency value.

Entropy

The entropy concept, which is a measure of uncertainty in information expressed in terms of probability theory, was introduced by Shannon (1948). Shannon's entropy approach interprets the relative intensities of the criterion importance depending on the discrimination among data to evaluate the relative weights (Monghasemi et al. 2015). As an objective method, the entropy method provides reliable results in a case in which the results of subjective methods can be misleading due to prejudiced or inadequate decisions by DMs. This may be considered one of the principal advantages of this objective method. However, data dependence may be a disadvantage. The steps of the entropy method can be described as follows:

In step 1, the decision matrix is normalized for benefit and cost attributes:

$$ r_{ij} = \frac{{a_{ij} }}{{\mathop \sum \nolimits_{i = 1}^{m} a_{ij} }}\quad {\text{for}}\; i = 1,2, \ldots ,m $$
(9)
$$ r_{ij} = \frac{{1/a_{ij} }}{{\mathop \sum \nolimits_{i = 1}^{m} \left( {1/a_{ij} } \right)}}\quad {\text{for}}\;i = 1,2, \ldots ,m $$
(10)

In step 2, entropy values are computed using the following equation:

$$ e_{j} = - \left( {\ln m} \right)^{ - 1} \mathop \sum \limits_{i = 1}^{m} r_{ij} \ln r_{ij} \quad {\text{for }} j = 1,2, \ldots ,n $$
(11)

In step 3, the weights of each criterion are obtained using the following equation:

$$ w_{j} = \frac{{1 - e_{j} }}{{(n - \mathop \sum \nolimits_{j = 1}^{n} e_{j} )}}\quad {\text{for }}\;j = 1,2, \ldots ,n $$
(12)

A low entropy value means that the degree of disorder in the system is low and the weight is high (Mohsen and Fereshteh 2017).

Criteria importance through intercriteria correlation (CRITIC)

In this method, the weights are determined based on the contrast intensity and conflict evaluation of the decision problem. Additionally, human intervention is not required for the evaluation process. The steps of the method can be described as follows:

  1. i.

    The decision matrix is normalized using the following equation:

    $$ a_{ij}^{ + } = \frac{{a_{ij} - a_{j}^{worst} }}{{a_{j}^{best} - a_{j}^{worst} }} $$
    (13)

    where \(a_{ij}^{ + }\) denotes the normalized value of the ith design on the jth response.

  2. ii.

    The following multiplicative aggregation formula is used to determine the amount of information contained in the jth response:

    $$ C_{j} = \sigma_{j} \mathop \sum \limits_{k = 1}^{n} \left( {1 - r_{jk} } \right) $$
    (14)

    where \(\sigma_{j}\) denotes the standard deviation of the jth response and \(r_{jk}\) represents the correlation coefficient between two different responses.

  3. iii.

    The objective weights (wj) are determined by using the following equation:

    $$ w_{j} = \frac{{C_{j} }}{{\mathop \sum \nolimits_{k = 1}^{m} C_{k} }} $$
    (15)

As a result, this method assigns a high value of weights to those responses with high standard deviation and low correlation with other responses.

Standard deviation (SD)

This approach determines the weights of attributes based on their standard deviations through the following equation:

$$ w_{j} = \frac{{\sigma_{j} }}{{\mathop \sum \nolimits_{k = 1}^{m} \sigma_{k} }},\quad j = 1,2, \ldots ,m $$
(16)

Mean weight (MW)

MW is a straightforward weighting approach that considers all attributes equally important through the following function:

$$ w_{j} = \frac{1}{m},\quad j = 1,2, \ldots ,m $$
(17)

The MADM methods

Seven MADM approaches are utilized to rank the alternatives and reveal a comparative analysis. Each approach has its procedures, ranking and selecting capabilities, drawbacks, and advantages (Dessler 2006). The algorithms and features of these MADM approaches are summarized in Table 5 but are not detailed in this study, as they are well known and frequently used in the literature.

Table 5 The MADM techniques and their algorithms, advantages, drawbacks, and references

ELECTRE, introduced by Benayoun et al. (1966a), consists of two main procedures. A multicriteria aggregation procedure that allows for the creation of one or more outranking relationship(s) aims at comprehensively comparing each pair of actions. An exploitation procedure leads to yield results according to the nature of the problem, including choosing, ranking, or sorting (Figueira et al. 2016). Among the ELECTRE family methods, ELECTRE III is preferred in this study, as it has been successfully implemented for numerous problems in various fields. The WSM, presented by Fishburn (1967), is a fundamental approach. The alternatives are ranked based on their WSM scores. The WSM score of each alternative is equal to the sum of the products. The weighted product method (WPM), introduced by Bridgman (1922), can be considered a different version of the WSM, as it was presented to overcome some of its drawbacks. ORESTE, introduced by Roubens (1982), is highly discriminatory in terms of conflicting alternatives and clearly shows incomparability (Pastijn and Leysen 1989). PROMETHEE, presented by Brans et al. (1986), is based on the pairwise comparison of alternatives along with each attribute and considers the internal relationships of each evaluation fact. Among all PROMETHEE versions, PROMETHEE II is used in this study due to its success in various problems. In the PROMETHEE II approach, the net flow is obtained as final values, and the full ranking of alternatives is provided. TOPSIS, introduced by Hwang and Yoon (1981), assumes that the utility of each attribute tends to increase or decrease monotonically (Evangelos 2000). Thus, defining the positive ideal and negative ideal solutions is possible. VIKOR, presented by Opricovic (1998), can solve MADM problems with contradictory criteria (perhaps in different units) grounded on assumptions that compromising is satisfactory for the resolution of conflict, the DM wants a solution that is the closest to the ideal, and the alternatives are evaluated based on all attributes.

Spearman's rank correlation coefficient

Spearman's rank correlation coefficient is utilized to determine the relationship between two sets of ranks produced by different methods. In this study, it is used to evaluate the results of weighting and MADM methods. A high coefficient value indicates a strong correlation between the two methods. The coefficient (rs) is calculated through the following function:

$$ r_{s} = 1 - 6\frac{{\sum d_{i}^{2} }}{{n\left( {n^{2} - 1} \right)}} $$
(18)

where di is the difference between the ranks of the two methods and n is the number of options.

Application of the proposed method to the selection of electricity generation option

The implementation of the proposed methodology is summarized in Fig. 3.

Fig. 3
figure 3

The framework of the proposed methodology

First, the decision problem is defined. The problem selected is to rank electricity generation options (natural gas, coal, hydro, onshore wind, geothermal, and solar PV) in Turkey based on sustainable indicators under the categories of economic, environmental, socioeconomic, and technical attributes (Table 6).

Table 6 Main attributes and regarding subattributes

These attributes are determined through expert knowledge and a literature review, as summarized in Table 7. These attributes are selected because they are commonly considered in the evaluation of such problems in the literature, and they reflect the economic, environmental, socioeconomic, and technical aspects that provide a complete assessment in terms of sustainability. The descriptions and objectives of these attributes are presented in the table. The "min" and "max" objectives indicate that the attributes are cost and benefit, respectively.

Table 7 The selected attributes and descriptions

The data for sustainability indicators in Turkey are collected from official reports, websites, articles, and periodical reports from reputable institutions. The units of attributes and the data sources for each indicator are given in Table 8. For some attributes, it is not possible to find Turkey-specific data. Therefore, average values are considered for such cases.

Table 8 Units of the attributes and data sources

Then, the performance matrix is formed based on the collected data, as presented in Table 9. The range of data in each column is different. Additionally, the unit of each attribute is different (Table 8). To ensure consistency in the calculations, the data are normalized during the weighting process and MADM methods. Each method may require different normalization methods.

Table 9 Raw data of sustainability attributes for each electricity generation technology

Then, each weighting method procedure is implemented. For subjective methods, the data are not required since they depend on subjective evaluations from the expert. In addition, the objective weighting methods process the performance matrix. Different attribute weight sets are obtained from each weighting method. The correlation between each method is evaluated through Spearman's correlation coefficient. Thus, the outcome of each method is analyzed, and their similarities/dissimilarities are revealed. By using the outputs of each weighting method as input, the MADM methods are implemented separately. Thus, the rankings of electricity generation options are obtained by 42 different models. Then, these results are analyzed through Spearman's rank correlation coefficients and variances. Finally, to reach an ultimate ranking, the Borda, Copeland, and grade average methods are implemented. In this way, an overall ranking is revealed, and the best option is recommended on a consensus basis. Overall, it can be stated that the proposed methodology is a unique hybrid decision-making approach.

Results and discussion

Based on the data in Table 9, each objective weighting method (CRITIC, entropy, SD, and MW) and subjective weighting approach (AHP and BWM) are implemented to obtain the relative importance (weight) of each attribute. The weights of all attributes provided by all methods are shown in Table 10. As seen in Table 10, the results of the AHP and BWM are similar, as they are based on the same expert knowledge. The most critical attribute is LCOE, followed by economic support and efficiency, according to the AHP and BWM. However, their results are not identical. The results of entropy indicate that water use is the most critical attribute, followed by accident-related fatalities and land use. For the SD and CRITIC method, the capacity factor is the most significant attribute, followed by the electricity mix share. All attributes are equally important in the MW method.

Table 10 Attribute weights obtained from subjective and objective methods

The calculated Spearman's correlation coefficients are given in Table 11. Notably, MW is excluded because all weights are equal. The results indicate that there is a significant correlation between the AHP and BWM. Additionally, the correlation between the CRITIC and SD can be regarded as moderate. Otherwise, the remaining correlations are weak.

Table 11 Spearman's correlation coefficients for weighting methods

The MADM methods (ELECTRE, ORESTE, PROMETHEE, TOPSIS, VIKOR, WPM, and WSM) are applied based on the data (Table 9) and the result of each weighting method. The ranking results are given in Tables 12, 13, 14, 15, 16, 17.

Table 12 Ranking results of the AHP-based MADM methods
Table 13 Ranking results of the BWM-based MADM methods
Table 14 Ranking results of the entropy-based MADM methods
Table 15 Ranking results of the SD-based MADM methods
Table 16 Ranking results of the CRITIC-based MADM methods
Table 17 Ranking results of the MW-based MADM methods

As seen in Table 12, all AHP-based MADM methods suggest that hydro is the best alternative. The rankings of AHP-ORESTE and AHP-PROMETHEE are identical. Additionally, AHP-TOPSIS and AHP-WSM provide the same rankings.

Table 13 reveals the rankings provided by BWM-based MADM methods. All methods suggest hydro as the optimal alternative. The rankings for BWM-TOPSIS and BWM-WSM are the same.

The rankings provided by entropy-based MADM methods are given in Table 14. Solar PV is recommended as the optimal option in general. However, Entropy-WPM suggests hydro and Entropy-VIKOR recommends wind as the ideal alternatives.

As seen in Table 15, SD-based MADM methods suggest hydro as the best option except for SD-based ELECTRE, whose best option is solar PV.

CRITIC-based MADM methods also suggest hydro as the most reasonable alternative except for CRITIC-based ELECTRE, which recommends geothermal as the best option (Table 16).

For equally weighted attributes, MW-based MADM methods suggest hydro as the best option except for MW-ORESTE, which suggest wind as the optimal option, as given in Table 17.

Overall, hydro is suggested as the most reasonable electricity generation option by most of the weighting–MADM method pairs. However, other options can also be the best, as seen from the results. These results reveal that each MADM method provides different rankings for the same case (weights) and verify the importance of using multiple MADM methods for evaluations. As stated in the literature, using multiple MADM methods is more reasonable than using a single method. The final decision should be made after evaluating various MADM models.

The correlation between the methods is expressed through Spearman's correlation coefficients in Table 18. The significant correlations are marked. Spearman's rank correlation coefficient “1” indicates a perfect association of the models. In other words, their rankings are the same. For instance, there is a very strong correlation between the AHP-based TOPSIS and the WSM. The higher the correlation coefficient is, the stronger the correlation between the models is.

Table 18 Spearman's correlation coefficients for the MADM methods

However, there is no consistency in these correlations. For instance, although the correlation between ORESTE and PROMETHEE is strong for the AHP, BWM, entropy, SD, and MW weights, the correlation is moderate for CRITIC weights. The reasons for these discrepancies among rankings of different MADM methods can be generally explained as follows.

First, these methods process weights in different ways in their calculations. Second, algorithms vary in their approach to choosing the optimal option. Third, some procedures require additional parameters that affect the selection of the option. Last, many procedures try to scale the objectives that affect the already chosen weights (Zanakis et al. 1998). The possible reasons or explanations can also be explained explicitly at the method level. In addition to the features of the methods in Table 5, the following points may be explanatory in this regard:

  • TOPSIS identifies a solution with the shortest distance (Euclidean) from the PIS and the farthest distance from the NIS, whereas it does not take into account the relative importance of these distances (Hwang and Yoon 1981; Yoon 1987).

  • Even though both the VIKOR and the TOPSIS methods are distance-based, a compromise solution in VIKOR is determined based on mutual concessions (Opricovic and Tzeng 2007).

  • The WPM overvalues extremes, leading to misleading results, as it substantially favors/disfavors the final assessment of any alternative that is far from the average.

  • ELECTRE and VIKOR are based on similar principles in terms of consideration of a specific global measure (concordance and group utility), and the opposition of the other attribute is not too strong (nondiscordance) (Opricovic and Tzeng 2007).

  • In PROMETHEE, differences in attribute values are not considered entirely. The degree of exceedance preference threshold is ignored. Uncertainty is handled with constant thresholds.

  • In ELECTRE III, differences in attribute values are not considered wholly. Regardless of how significant the value is, the criterion is better than another criterion. Uncertainty is handled with constant or proportional thresholds.

  • The ranking of the options differs between PROMETHEE and ELECTRE III; the procedure of PROMETHEE is additive, whereas ELECTRE III uses distillation that is grounded on the qualification of each option (Salminen et al. 1998).

  • ORESTE stands out because of its conflict analysis test, which enables separate indifference, incomparability, and preference situations (van Huylenbroeck 1995).

Apart from the characteristics of the MADM methods that cause different rankings, the impact of the weighting method is also apparent. In Tables 12, 13, 14, 15, 16, 17, the impact of the weighting methods on the rankings can be observed evidently. The ranking provided by the same MADM method may vary due to the weighting method. To better observe and further analyze this situation, such as evaluating the robustness of the MADM methods, Figs. 4, 5, 6, 7, 8, 9 and 10 are provided.

Fig. 4
figure 4

Ranking results of ELECTRE

Fig. 5
figure 5

Ranking results of ORESTE

Fig. 6
figure 6

Ranking results of PROMETHEE

Fig. 7
figure 7

Ranking results of TOPSIS

Fig. 8
figure 8

Ranking results of VIKOR

Fig. 9
figure 9

Ranking results of WPM

Fig. 10
figure 10

Ranking results of WSM

In Fig. 4, the ranking results of different weighting approach-based ELECTRE methods are illustrated. It can be inferred from the figure that the ranking of ELECTRE is susceptible to the weighting method—weights. Therefore, the right choice of the weighting method for ELECTRE is vital. For instance, hydro is ranked first by three of the methods, but it is ranked third by the remaining three methods. Similarly, the solar PV option is ranked first, third, fourth, and sixth based on different weights. These outcomes reveal the impact of the weights on the ELECTRE ranking.

In Fig. 5, the general overview of the results of ORESTE models is different from Fig. 4. For instance, hydro is ranked first in the four models, and the ranking average of hydro is lower compared to Fig. 4.

In Fig. 6, the rankings of PROMETHEE seem more stable than those of ELECTRE and ORESTE. The overall best option, hydro, is ranked first in five PROMETHEE models.

In Fig. 7, the impact of weights on the TOPSIS results can be observed. However, the impact may be regarded as limited compared to the previous models. For instance, the coal option is ranked fifth by all TOPSIS models regardless of the weighting approach.

In Fig. 8, the rankings of VIKOR also seem vulnerable to the method chosen for weighting. Nevertheless, hydro is ranked first by five VIKOR models out of six.

In Fig. 9, the rankings of the WPM can be observed as the most robust among all MADM methods used in this study. For instance, the rankings of hydro, wind, and coal options are the same for all WPM models, regardless of the weighting method.

In Fig. 10, the rankings of the WSM illustrated that hydro is ranked first by five of the WSM models.

The evaluations based on Figs. 410 are mostly grounded on observations. For further tangible analysis and evaluations, all outcomes are assessed based on variances. In this context, the values are calculated as given in Table 19. The variance measure mainly designates the variations from the mean and, consequently, the precision and effectiveness of the techniques. The minimum values of this measure are always desirable.

Table 19 Variance values of the models

The results presented in Table 19 indicate that the WPM is the least susceptible to the weighting method. The WPM provides the most consistent rankings regardless of the weighting method compared to the other MADM methods for the case considered. However, the rankings of ELECTRE are vulnerable to the weighting method. Therefore, the selection of the weighting method for ELECTRE is crucial.

After evaluating all the weighting and MADM methods, further calculations and evaluations need to be made to determine an optimal option. In this context, the rankings of all 42 models are demonstrated together in Fig. 11. Since there are 42 different rankings, they should be integrated to attain a final ranking to reveal the optimal electricity generation option. Thus, three of the approaches are utilized.

Fig. 11
figure 11

Rankings of the alternatives by 42 different models

Integrated ranking approaches

To determine a final ranking for various rankings produced by numerous models, three approaches are used.

Borda method

The Borda method was presented by Borda (1784). A pairwise comparison matrix is formed to calculate the Borda scores of the options. To score an option, for all other options that rank below the selected option, a value of one is assigned; otherwise, a zero value is assigned to the corresponding matrix value (Table 20). For this study, the results of 42 models are considered and compared. Once the pairwise comparison matrix is completed, the row sum for an option indicates the number of times that the option is preferred to the others. The Borda scores of each option are presented in Table 21.

Table 20 Calculation results for the Borda and Copeland methods for final ranking
Table 21 Comparison of the results of different ranking strategies for integrated ranking

Copeland method

The Copeland method is an extended version of the Borda method (Dortaj et al. 2020). In addition to the calculations for the Borda method, the column sum for each alternative is calculated. The Copeland value of an option is equal to the difference between the row sum and column sum of that option (Tables 20, 21).

The grade average method

The arithmetic average of rankings obtained from different methods is calculated for each option, and the final ranking is determined accordingly. As expected, the lower-ranking average an option has, the better option it is. The calculated averages for each option are presented in Table 21.

Based on Borda, Copeland, and grade average values, it can be inferred that hydro is the best electricity generation option for Turkey, followed by wind (onshore), solar PV, geothermal, natural gas, and coal. As several weighting models provided different weights and numerous models that resulted in various rankings, a sensitivity analysis was not needed. The cases in which different weights are allocated to different attributes already exist.

To compare the results of this study with the previous results, several studies are considered. It may be reasonable to consider the studies conducted for Turkey initially. Yilan et al. (2020) examined the electricity generation options in Turkey based on sustainability indicators through two MADM methods, namely the WSM and multiattribute utility theory. As a result, hydro was found to be the best option, similar to this study. Atilgan and Azapagic (2016) evaluated the electricity generation technologies for Turkey in terms of sustainability through one MADM method: a multiattribute value theory. The results of the study indicated that hydro was the most sustainable option, followed by geothermal and wind. Özkale et al. (2017) assessed renewable energy options based on PROMETHEE and found that hydro was the best alternative. Büyüközkan and Güleryüz (2017) implemented MADM methods, namely TOPSIS, ANP (analytic network process), and DEMATEL (decision-making trial and evaluation laboratory), and the best option was found to be geothermal. In addition, it may be helpful to compare the results of studies conducted for other countries to reveal the impact of data on the outcomes of MADM methods. In a comparative analysis study of the MADM methods, Lee and Chang (2018) used Shannon's entropy, TOPSIS, WSM, ELECTRE, and VIKOR methods to evaluate renewable energy options in Taiwan. They found that hydro was the best option for Taiwan, followed by solar, wind, biomass and geothermal. Ali et al. (2020) used a subjective weighting method (BWM) and objective method (integrated determination of objective criteria weights (IDOCRIW)) and an evaluation based on distance from average solution (EDAS) to rank power generation technologies in Bangladesh. Additionally, a comparative analysis was conducted through three MADM methods, namely combinative distance-based assessment (CODAS), weighted aggregated sum product assessment (WASPAS), and multiobjective optimization based on simple ratio analysis (MOOSRA). The results indicated that gas was the best option, followed by oil, coal, solar, hydro, and wind.

It can be seen that the ranking of electricity generation technologies varies in these studies, although the best option can be the same. The reasons for this outcome can be explained as follows. First, the weighting methods significantly affect the ranking results of the MADM methods. The preference of subjective or objective methods changes the result to some extent. Additionally, the choice of different attributes affects ranking. In this context, the fact that the data forming the decision matrix are ​​different changes the ranking. Last, the selection of MADM methods is crucial because the procedures of MADM methods vary, which produces different outcomes. Therefore, the use of hybrid methods might increase the accuracy of the outcomes.

As seen in these studies, some used only subjective or objective methods as the weighting method. If two method groups were preferred, mostly one method from each of these groups was used. Additionally, the number of MADM methods used differed from study to study; however, the number was generally limited. It can be said that no study has used all the methods in this study in an integrated approach. Thus, the present study differs from previous studies in terms of integrating numerous approaches to determine the optimal electricity generation technology for the first time and presenting a unique and comprehensive methodology involving various weighting and MADM methods, Spearman's correlation coefficient analyses, and the Borda, Copeland, and grade average methods. In this way, the disadvantages of each method are minimized, and the advantages of each are utilized through aggregation; thus, the accuracy of the assessment is increased.

In summary, the following advantages of the proposed method might explain the reasons for choosing these methods and proposing the hybrid approach. First, the advantages of both subjective and objective weighting methods were integrated. Second, the MADM methods from different groups were used to utilize the advantages of each. Third, the results of the subjective and objective methods were compared and explicitly analyzed. Additionally, the results of each MADM method were revealed and compared. Thus, the results of 42 different models were presented and compared. Notably, the result of each model can be the main topic of a separate study. Fourth, the performance and behavior of each MADM method were analyzed. Fifth, the results of each hybrid model were integrated through different aggregation methods to allow for comparisons. Ultimately, a final consensus of the 42 models was proposed as the best option.

Overall, by utilizing various methods and combining their advantages, the proposed methodology provides a more effective and accurate evaluation and analysis from a broader perspective. However, there are some limitations to this study. First, the availability of the updated country (Turkey)-specific data might increase the usefulness and accuracy of the results. Second, the drawback of each weighting method group may be considered a limitation. Being entirely dependent on the subjective evaluations of experts and relying on the statistical evaluations of the data in the performance matrix, regardless of the knowledge and experience of the decision maker, can be considered the disadvantages of subjective and objective methods, respectively. In this context, the inclusion of “combinative weighting” methods could further expand the scope of the study. Additionally, the number of MADM methods could be expanded for a broader comparative analysis. Last, the fuzzy set theory might be useful to handle uncertainty.

Conclusion

In this study, six different weighting and seven MADM methods were utilized for raking electricity generation options in Turkey based on sustainability attributes. The output of each weighting method was used as the input for each MADM method. Thus, rankings of the options were obtained through 42 different models. To determine the optimal technology, the rankings were integrated through the Borda, Copeland, and grade average methods. According to the evaluations and results, some of the findings of this study can be summarized as follows:

  1. 1.

    Hydro is the best energy generation option, followed by onshore wind, solar PV, geothermal, natural gas, and coal, based on the evaluation of 42 different models. The rankings in previous studies regarding Turkey are different. However, hydro, solar, and wind are generally recommended as the best options.

  2. 2.

    As mentioned previously, it is more reasonable to use multiple MADM methods. If one of the MADM methods is used, based on the integrated results, it can be inferred that any of the hybrid AHP-WPM, BWM-WPM, Entropy-WPM, SD-ORESTE, CRITIC-WPM, and MW-ELECTRE III models can be implemented for the selection of electricity generation option problems.

  3. 3.

    The rankings of ELECTRE are susceptible to the weighting method. Therefore, the selection of the weighting method for ELECTRE is crucial.

  4. 4.

    Subjective methods, such as the AHP and BWM, provided similar weights. However, objective methods provided different weights. Therefore, choosing any of them is a prominent task.

  5. 5.

    Subjective and objective weighting methods provided different weights for attributes. If possible, using them together will prevent subjectivity and solely data-based evaluations.

  6. 6.

    The rankings of the WPM are more stable than the rankings of the other six MADM methods. In other words, the impact of the weighting method on the WPM outcome is less.

  7. 7.

    It can be concluded that the share of renewable energy options should be increased based on economic, environmental, socioeconomic, and technical attributes.

In addition to these highlighted results, the present study contributes to the literature by presenting a unique integrated approach that utilizes various methods, including subjective and objective weighting, MADM, and aggregation. Given that each weighting and MADM method has its advantages and disadvantages, the proposed integrated approach reduces the disadvantages and makes most of the advantages, thus providing more effective and accurate evaluations. For further research, in considering the significance of each case, more analyses and calculations considering more comprehensive attributes can be conducted. Additionally, the number of MADM methods can be expanded. Machine learning methods can also be integrated into the model for future predictions.