Optimal components selection based on fuzzy-intra coupling density for component-based software systems under build-or-buy scheme

Kalantari, Samira; Motameni, Homayun; Akbari, Ebrahim; Rabbani, Mohsen

doi:10.1007/s40747-021-00449-z

Optimal components selection based on fuzzy-intra coupling density for component-based software systems under build-or-buy scheme

Original Article
Open access
Published: 28 August 2021

Volume 7, pages 3111–3134, (2021)
Cite this article

Download PDF

You have full access to this open access article

Complex & Intelligent Systems Aims and scope Submit manuscript

Optimal components selection based on fuzzy-intra coupling density for component-based software systems under build-or-buy scheme

Download PDF

Samira Kalantari¹,
Homayun Motameni ORCID: orcid.org/0000-0003-1309-6569¹,
Ebrahim Akbari¹ &
…
Mohsen Rabbani²

1491 Accesses
3 Citations
Explore all metrics

Abstract

Component-Based Software Engineering (CBSE) is an approach to building and developing software systems based on software components. In component-based software systems, there are various software components, including Commercial off the Shelf (COTS) and in-house components. Software developers can build their desired software component as in-house or COTS. The problem of deciding optimally between COTS and in-house components is one of the major challenges of software developers, which is known as the component selection problem. This can be resolved by evaluating the criteria for optimality in component selection and then solving the component selection problem by optimization techniques. In this paper, an attempt was made to optimize the component selection problem through the multi-objective optimization by maximizing the Fuzzy-Intra Coupling Density (Fuzzy-ICD) and functionality as objective functions, and also taking into account budget, delivery time, reliability, and Fuzzy-ICD as constraints of multi-objective problems. Fuzzy ICD is a more accurate criterion to calculate the relationship between Cohesion and Coupling of components, which is obtained through the fuzzy computing of each of them, based on the Meyers classification. Thus, after a two-criterion optimization model formulation, this optimization problem was solved by fuzzy multi objectives approach. Finally, the proposed method was evaluated by performing the case study of financial-accounting system. Comparison of the results showed that the proposed method could select optimal components with maximum functionality and Fuzzy-ICD and fewer rates of time and Budget (0.29, 0.43, 1.1 s, and 88$ were the improved rates of functionality, Fuzzy-ICD, time, and budget, respectively).

Software component evaluation and selection using TOPSIS and fuzzy interactive approach under multiple applications development

Article 29 August 2018

Shilpi Verma, Mukesh Kumar Mehlawat & Divya Mahajan

Optimal Component Selection Based on Cohesion and Coupling for Component-Based Software System

Software Component Selection in CBSE Considering Cost, Reliability, and Delivery Delay Using PSO-integrated MVO and ALO

Introduction

With the progressive development of software in recent years, software development has become more complex; to overcome this complexity, a lot of cost and time is needed. An acceptable and reasonable solution is reusability of software systems. Component-Based Software Engineering (CBSE) is an approach to building and development of software systems based on existing software components [1,2,3]. These software systems, which have the maximum use of reusable materials, are called Component-Based Software Systems (CBSS) [2]. CBSSs cause the efficiency of software development in terms of demanding lower cost, reducing the time to market, improving maintainability, increasing reliability, and improving other quality parameters [2, 3].

With the development of CBSE, various software components have been presented by many software developer organizations. These components are called Commercial off the shelf (COTS). The components of COTS help software developers to select a software component from among a set of alternative software components available in the market [4,5,6]. Thus, software developers can either build their desired software components as in-house or buy them as COTS (i.e., the build-or-buy strategy) [2].

These two different strategies have advantages and disadvantages that software developers should be aware of. Components of COTS are often built by independent teams of developers in different languages, and applicable to different platforms with less complexity; then, they are available as standard components in the market. The advantages of this type of components are the existence of different versions of a COTS product, design diversity, diversity of data, and the diversity of executive environment which are available in the market by different manufacturers. On the other hand, these products have some disadvantages, including issues with security, consistency, integrity and interoperability, procurement and licensing, etc. However, the customization of the components of in-house development makes it compatible with the system, reliability, and support. Unlike the components of COTS, components of in-house prolong the time of supplying the software product to the market [2]. Thus, always there is a discussion on whether to build-or-buy software components.

To overcome this problem, software researchers evaluate in-house and COTS software components based on some criteria to finally decide what components they should build and what components they should buy. These criteria include [7]:

Financial perspective (COTS cost, maintenance cost, upgrading cost, etc.)
Technical perspective (reliability, safety, performance, requirements, quality, etc.)
Business perspective (COTS vendor recognition, COTS vendor properties, Historical records, etc.)
Legal perspective (type of contract, license agreement, escrow, etc.).

Thus, the problem of selecting the optimal component has been recognized as an optimization problem.

In addition to the above mentioned criteria, it should be noted that the efficiency of the component-based software system significantly depends on the system architecture; coupling and cohesion have a major role in reducing the complexity associated with the design and determining the quality of a software system in terms of reliability, maintainability, and accessibility [2, 8]. Cohesion as an intra modules property refers to the amount of communication that the components within a module have with each other (operating power of a module). On the other hand, coupling as an inter-modules property refers to the amount of the communication of a module with other modules (dependency between two or more modules) [9, 10]. Thus, for an optimal software component selection, the interactions of components within the modules (cohesion) need to be maximum, while the interactions between the modules (coupling) need to be minimum. Software designs with high cohesion and low coupling will create independent modules that offer some advantages, including easier development, reduced complexity, facilitated maintenance and modifications, reduced error rate, increased reusability, parallel development, and simple implementation [10, 11]. Intra Coupling Density (ICD) is a measure used to describe the relationship between coupling and cohesion of modules [2, 12, 13].

Various optimization problems have been applied by different researchers to the selection of the optimum component in a component-based software system based on the build-or-buy strategy, which will be discussed in the next section.

In this paper, the multi-objective optimization problem is addressed to carry out the optimal component selection through the build-or-buy strategy for CBSS. Fuzzy ICD and functionality are considered as two-objective functions in this problem. A multi-objective optimization problem was formulated in this study by maximizing the Fuzzy-Intra Coupling Density (Fuzzy-ICD) and functionality, and also taking into account budget, delivery time, reliability, and Fuzzy-ICD as constraints of multi objectives problems. Fuzzy ICD is a more accurate criterion for calculating the relationship between cohesion and coupling of components (which is obtained through the fuzzy computing of each of them based on the Mayers classification). Since, fuzzy approach used as an effective tool for quickly obtaining a good compromised solution in these scenario [8], after the two-objective optimization model is formulated for optimum software component selection to build or buy in CBSS, the formulated optimization problem will be solved by fuzzy multi-objective approach.

In the following, the main contributions of this paper are summarized:

1.
Considering the application of the fuzzy measurement of coupling and cohesion to the problem of component selection: the efficiency of component-based software system depends greatly on the system architecture; coupling and cohesion have a major role in software nonfunctional requirements and reducing software complexity. Therefore, it is necessary to calculate them accurately.
2.
Applying Fuzzy-ICD to one of the objective functions in multi-objective component selection optimization: accurate calculation of ICD as a criterion for calculating the relationship between coherence and connection of parts in the software plays a major role in developing a qualitative evaluation criterion in software.
3.
Formulation of multi-objective optimization applicable to optimal software component selection: the multi-objective optimization problem is formulated by maximizing the Fuzzy-ICD and functionality and also taking into account the factors such as budget, delivery time, reliability, and Fuzzy-ICD as constraints. The formulated bi-objective optimization model for optimal software components selection in the build-or-buy strategy in CBSS will be solved by a fuzzy multi-objective approach.
4.
Evaluating the proposed formulation of multi-objective component selection optimization by applying it to case study of financial-accounting system used by authors in [2].

The rest of the paper is organized as follows: in “Related Work”, first, the existing literature regarding the software component selection in CBSS is reviewed, and then existing measurement methods of coupling and cohesion are investigated. In “Fuzzy Method for Calculation of Coupling and Cohesion”, the proposed method of fuzzy computing of cohesion and coupling is described. Then, the “Selecting the Optimal Software Components with Multi-objective Optimization Approach” reports the process of optimal choice of components in the form of a series of hypotheses and problem formulation and discusses the optimization problem solution. Next, the case study used in this article is introduced in “Case Study”. In section “The Result”, the proposed method is evaluated, and finally, in seventh section, the conclusion of the article is stated.

Related work

In section “Selection methods optimized software components”, the other studies related to the optimal choice of software components are reviewed; then, in “Calculation Methods of Coupling and Cohesion”, the studies conducted in relation to measurement are discussed.

Optimization methods of software components selection

In a general classification, the methods of selecting the optimal components in component-based software systems include the methods based on the Weighted scoring method, the methods based on Analytical Hierarchical Process (AHP), the methods based on artificial intelligence, and the methods based on optimization [14, 15].

Figure 1 represents a classification of component selection methods proposed in literature by researchers working in this field.

In these methods, the representations related to the available characteristics for the optimal selection of components are presented in the form of feature vector, XML Scheme, Requirement document templates, and Graph format.

Weighted scoring method

The Weighted scoring method (WSM) is one of the oldest methods in the selection of components and software packages. In cases where the issue of Multi-Criteria Decision Making (MCDM) exists for n number of candidate components and m criteria, manual weighting method will be applied [14]. The method proposed by Collier et al. in [16] uses weighting method to select the optimal software components.

This method, although, is simple and implementable, but if the customer needs change at the last minute, the score for each component changes according to the evaluation criteria, and it should be updated before the final calculation. As the process of weighting is manual, it is considered duplication and process is complicated [17].

Analytical hierarchical process

The Analytical hierarchical process (AHP) is a technique applicable to selection-related problems with several criteria. A lot of studies have been carried out by software developers to choose the optimal components based on AHP [4, 7, 18, 19]. Mitta et al. [19] considered reusability as an important criterion in selecting the components. They used the technique of AHP and ranked the criteria for reusability to select and evaluate the components of COTS. In another study, Garg et al. [4] applied the ranking based on fuzzy distance to the AHP technique to select COTS components on the Database Management System (DBMS). Despite the widespread application of AHP to both quantitative and qualitative parameters, its disadvantages are its lack of flexibility to changes of optimization criteria, uncertain ranking, and the lack of time optimization that is due to pairwise comparisons in the AHP method [14].

Artificial intelligence-based methods

The methods such as neural network, Decision Tree, fuzzy classifier, deductive method, and collective intelligence are a number of methods working based on artificial intelligence. In this category, a technique for training AI classifiers described to assist in the selection of software components for development projects. Researchers believe that when using AI, we are able to represent dependencies between attributes, overcoming some of the limitations of existing aggregation-based approaches to CS [1]. Maxville et al. [20] used neural networks and Decision Tree for optimal selection of components. They prepared the ideal profile data of the needed components in both training and testing categories and in the form of XML. In the neural network, they used the back propagation algorithm in the weka. The Decision Tree creates a tree by a combination of data collection and their classification. After that, the data group was pruned to create a control decision tree. In their research, they concluded that the Decision Tree (C4.5) provided better results than the neural network.

In addition, Jadhav et al. [21] offered a deductive method based on a combination of Rule Base Reasoning (RBR) and Case Base Reasoning (CBR). The RBR and CBR methods are two fundamental techniques of Knowledge Base System (KBS). In RBR, the ideal needs of the user are collected in the form of feature value, and to assess the decision-making criteria, simple if–then-else rules are used. In CBR, the ideal needs of the user are compared in the form of candidate software packages. The candidate packets are saved as "cases" in the case-based system. The collection of system results in the selection of replacement components is ranked based on similarity. This similarity identifies the software component that responds to the system ideal needs. Hybrid Knowledge Base System (HKBS), compared to the AHP and WSM methods, is more efficient because it offers the computational efficiency, ease of problem-solving, knowledge reuse, compatibility, and evaluation of the results.

The fuzzy method is used by decision makers to evaluate alternative components easily and directly using language requirements [22]. The authors in [23] used the fuzzy method to avoid ambiguity that human decision-making processes may suffer from when using AHP and WSM.

In [24], an algorithm was proposed based on the collective intelligence of ants and the left footprint of pheromone to select optimal software components. To choose the optimal software components, they considered both positive and negative feedbacks in the characteristic evaluation of the components. In their model, the positive feedback increases the amount of pheromone, whereas the negative feedback evaporates the pheromone. Finally, after enough repetition, the component with the highest pheromone is selected as the optimal component.

Optimization-based methods

The structure of the CS (Component selection) problem is more similar to the multi-part problem due to the involvement of different (and sometimes contradictory) criteria. Optimization-based methods have shown higher efficiency in solving these problems. The CS problem is transformed to an optimization problem that essentially looks for maximum/minimum values in one or more fitness functions. Optimization-based methods can be include single and multi-objective optimization, mathematical optimization and evolutionary algorithm.

Generally, optimization-based methods are divided into two categories: single-objective optimization and multi-objective optimization methods. In the former, an objective function with some constraints is formulated to choose the optimal components. This function can be one of the parameters of cost, reliability, delivery time, quality, etc. Optimization problem is finding the answer or answers from among a set of possible options (with respect the constraints of the problem) with the aim of optimizing the criterion or criteria of the problem. The multi-objective optimization problem is a branch of MCDM problem. On the other hand, the multi-objective optimization problem is originated from real-world situations where a decision maker faces a set of objectives with multiple contradictory criteria. In these types of problems, unlike the single-objective optimization, different solutions can be taken into consideration [2].

In one of the first studies related to the optimal choice of components with the use of single-objective function, Berman et al. [25] formulated the optimization problem. Their objective function was maximizing the reliability. And the constraint of their optimization was cost criterion that needed to be limited to a certain threshold value.

Cortellessa et al. [26] used the cost minimization as an objective function with reliability and delivery time constraints in the optimal choice of components in the build-or-buy frameworks in software architecture.

Kwong et al. [13] considered the maximum of relationships within the modules and minimum of relationships between modules as parameter ICD and used functionality as the objective function of the problem. Since multi-objective optimization is complex and difficult to solve, they adopted the weighting method for each of the objective functions (as expressed in Eq. (1)) to be able to convert multi-objective problem to single-objective problem and to solve it simply. Thus, with separate solution of the objective functions of functionality and ICD in minimizing and maximizing states, the values F_min, F_max, E_min, and E_max are calculated respectively. According to the obtained values and weighting of functionality and ICD, the objective function of the optimization problem is defined as below:

$$ \max w_{f} \frac{{F - F_{{\min }} }}{{F_{{\max }} - F_{{\min }} }} + w_{l} \frac{{E - E_{{\min }} }}{{E_{{\max }} - E_{{\min }} }} $$

(1)

Since the optimization is an NP-Complete problem [13], the authors in [13] used genetic algorithm to select the optimal components.

Gupta et al. [27] used the multi-objective optimization with the aim of increasing the quality, reducing the cost, increasing the reliability, and reducing the size and the time of delivery, and they discussed the constraints of time and consistency in the formulation of their proposed system. Then, they used the fuzzy approach to solve the multi-objective optimization problem, continued according to the principle of maximizing of Bellman-zadeh [28], and formulated the problem using the fuzzy membership functions suggested for solving the problem of multi-objective optimization.

Jung et al. [29] used two model of formulation in the optimal choice of components. In the first one, they built their issue with the objective function of quality and budget constraint. In the optimization process, they used the weighted value for the objective function of quality. They formulated the second model like the first one with only one difference: they added the compatibility between the components in their optimal choice to the constraints of the problem. In a similar study, Shen et al. [30] selected the optimal components by the objective function of quality weighted on budget constraint with the difference that they analyzed the budget constraint with the use of the fuzzy system.

According to Indumati et al. [8], a component-based software system uses a top-down approach. Based on this approach, at the first step, the operational needs are identified, then at the second step, the number and nature of software modules are determined. Finally, at the third step, the selection of the optimal components for each module is formulated. They considered the maximization of the ICD and reliability as the objective function, and the threshold on the ICD, reliability, cost, and time as the constraints. The authors mentioned above, despite their similar peer articles, solved the optimization problem as a nonlinear optimization problem.

Likewise, in [31], the authors examined the optimal choice of components, in a fault-tolerance modular software system. They considered simultaneously the maximization of the system reliability and minimization of the cost as an objective function. They used this method only to choose between the components of COTS. In another experiment, they considered the cost as the objective function, and reliability as the constraint. Finally, they concluded that by using multi-objective function with the aim of minimizing the cost and maximizing the reliability, and by using the technique of goal programming, favorable optimization results can be achieved.

Jha et al. [32] used the parameters of ICD, functionality, budget, and quality as the objective function to choose the optimal component in a component-based software system. Thus, they considered the objective functions of ICD, quality, and functionality as maximization and the objective function of budget as minimization in solving the optimization problem. Then, they used fuzzy multi-objective approach to solve their optimization problem. In another study [2], the optimal choice of components in modular software system with fuzzy two-criterion optimization model and under the build-or-buy scheme was formulated. This model of optimization attempts to increase the objective functions of ICD and functionality and, at the same time, takes into account the constraints of budget, reliability, and delivery time.

Also according to [50, 51], multi-objective evolutionary algorithms (MOEA) are artificial intelligence optimization problems that decompose multi-objective optimization problems into a set of simple optimization sub-problems and solve them in a common manner. This method plays a key role in tradeoffs between diversity and convergence in MOEA.

Statistical analysis result of related work

Although much research has been done on WSM, AHP, AI, and optimization-based methods, literature consists of some other methods such as semantic-based methods and cluster-based methods that are presented in the selection of components problem. The Ontology-based method proposed by Yesad and Boufaida in [33] is an example of methods based on different semantic and theories. Furthermore, in [34], Vescan et al. used fuzzy clustering algorithms.

Research shows that the use of the feature vector has been a popular approach among the researchers working in this field. The increase in the use of the feature vector is due to the greater use of feature vectors in Objective Optimization-based methods. However, it can be said that multi-objective optimization is still the best option for solving the optimal selection of components. As a result, the current paper focuses on multi-objective optimization. As can be observed in Fig. 2, the representation based on the feature vector with 69.7% has been the most used item among the papers reviewed by the authors in [1].

It is important to know what feature(s) of a component is the most important feature in optimizing components selection. To achieve this goal, researchers used Hundred Doller (100$) test in [35, 36]. First, they selected the most important features from a list. Then they chose their priority using the 100$ test. The results showed that cost was the most important feature for component selection.

By analyzing the results of approximately 40 criteria that could be examined in CS in [1], the statistical results of Table 1 were obtained. In these statistical results, nine types of the most popular practical criteria in CS have been evaluated.

Table 1 Statistical results for main criteria in related research

Optimal components selection based on fuzzy-intra coupling density for component-based software systems under build-or-buy scheme

Abstract

Similar content being viewed by others

Software component evaluation and selection using TOPSIS and fuzzy interactive approach under multiple applications development

Optimal Component Selection Based on Cohesion and Coupling for Component-Based Software System

Software Component Selection in CBSE Considering Cost, Reliability, and Delivery Delay Using PSO-integrated MVO and ALO

Introduction

Related work

Optimization methods of software components selection

Weighted scoring method

Analytical hierarchical process

Artificial intelligence-based methods

Optimization-based methods

Statistical analysis result of related work

Calculation methods of coupling and cohesion

Fuzzy method for calculation of coupling and cohesion

Determining the amount of coupling

Determining the cohesion

Selecting the optimal software components with multi-objective optimization approach

The assumptions of the optimization problem

Formulation of optimization problem based on fuzzy computing of ICD

Fuzzy-Intra Coupling Density (Fuzzy-ICD)

Functional performance

Threshold on ICD constraint

Building decision versus buying decision

Budget constraint

Delivery time constraint

The reliability of in-house components

Threshold on the reliability constraint

Solving the optimization problem using fuzzy multi-objective approach

Case study

The experimental results

Simulation conditions

Test No 1: the optimal choice of components with the objective functions of ICD and functionality

Test No 2 (the proposed method): the optimal choice of components with the objective functions of Fuzzy-ICD and functionality

Comparing the proposed method with some other methods

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation