Next Article in Journal
Exact Likelihood Inference for an Exponential Parameter under Generalized Adaptive Progressive Hybrid Censoring
Previous Article in Journal
Impact of Feature Selection Methods on the Predictive Performance of Software Defect Prediction Models: An Extensive Empirical Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Clustering Stock Performance Considering Investor Preferences Using a Fuzzy Inference System

by
Siti Nazifah Zainol Abidin
1,2,
Saiful Hafizah Jaaman
1,*,
Munira Ismail
1 and
Ahmad Syafadhli Abu Bakar
3
1
Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi 43600, Selangor, Malaysia
2
Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Melaka, Jasin Campus, Merlimau 77300, Melaka, Malaysia
3
Mathematics Division, Centre for Foundation Studies in Science, University of Malaya, Kuala 50603, Lumpur, Malaysia
*
Author to whom correspondence should be addressed.
Symmetry 2020, 12(7), 1148; https://doi.org/10.3390/sym12071148
Submission received: 19 May 2020 / Revised: 6 July 2020 / Accepted: 7 July 2020 / Published: 9 July 2020

Abstract

:
The fact that many stocks are traded in the marketplace makes the selection process of choosing the right stocks for investment crucial and challenging. In the literature on stock selection, cluster analysis-based methods have usually been used to group and to determine the best stock for investment. Many established cluster analysis-based methods often cluster stocks under consideration using the average of the variables, where stocks with similar scores are concluded as having the same performances. Nevertheless, the performance results obtained do not reflect the actual performance of the stocks. Depending only on the average score of each variable is inefficient, as market situations usually involve uncertain extreme values. Moreover, when grouping stock performance, the established clustering methods assume that investors’ selection preferences are single and unclear, when actually, in reality, investors’ selection preferences vary; some investors are pessimistic, while others may be more optimistic. Due to this issue, this paper presents a novel fuzzy clustering method using a fuzzy inference system to flexibly assess the consistent evaluations given to stock performance that differentiate between pessimistic and optimistic investors that are symmetrical in nature. All variables considered in this study were defined in terms of linguistic inputs, where the consensus among them was aggregated using rule bases. These rule bases provide assistance in obtaining the linguistic output, which is the actual performance of the stock. Next, each stock under consideration was ranked using the proposed novel stock selection strategy based on investors’ confidence levels and preferences. The proposed method was then applied to a case study of 30 Syariah stocks listed on the Malaysian stock exchange, where the results obtained were empirically validated with established cluster analysis-based methods.

1. Introduction

Stocks traded on the financial market are often observed as unpredictable and unstable. This is due to the uncertain fluctuations of the daily prices of stocks, which leads to hesitance in the process of selecting the right stocks to invest in [1,2,3]. Dubious investors’ selection preferences due to the hefty identification of the well-balanced interaction between risks and returns also contribute toward indefinite stock selection [1]. In making investment decisions regarding stock selection, investors usually aim to select stocks when both the risks and the returns are consistent, such that the prior is low and the latter is high [1,4]. Stocks providing high returns and low risks are ranked as profitable investments by investors [3,4].
In the literature of modern portfolio theory, the initial stock selection approach developed for investment focuses on clustering stocks based on risk tolerance and return expectations [3,5,6,7,8]. When employing this approach, risk is measured using variance, while asset return data are assumed to be normally distributed. However, this is misleading, as the risk measures obtained are not consistent with investors’ preferences and the assumption made on the asset return data is invalid for a real market scenario, which are actually left or right-skewed [2,9]. Thus, methods based on cluster analysis are presented as alternative stock selection approaches that aim at clustering multiple best stocks simultaneously, otherwise known as the optimal portfolio [1,10]. The established clustering analysis-based methods, such as k-means and hierarchical, generally cluster stocks based on the averages of variables involved, where stocks with similar average scores are presumed as having the same performances [4]. Among the advantages of using these methods in stock clustering are the reduced time taken to develop the optimal portfolio given many diverse stocks, the important investment-related information made available to investors, and the minimal risk level obtained on the optimal portfolio [7,8,11].
In the past two decades, clustering equities (i.e., stock and mutual funds) have received global attention from researchers utilizing numerous clustering techniques to study their local market situations [4,6,10,11,12,13]. A survey conducted by Chen and Huang [4] on Taiwan’s mutual funds used the k-means clustering method. In the evaluation, the funds were measured based on the return rates, standard deviation, turnover rate, and Treynor index, where the results obtained were clustered into four clustering performance groups—namely, inferior, stable, good, and aggressive. Among the four clustering performance groups, the researchers considered good and aggressive performances as the best group of funds for investment. Similarly, Kiliçman and Sivalingam [3] used 38 funds to examine the performance of the Malaysian stock market using the return rate, Treynor index, and variance. Employing the same method used by Chen and Huang [4], Mirnoori and Shariati [7] studied 39 funds and formed three clusters designated as inferior, good, and aggressive. Although these clustering methods have distinct procedures for obtaining the best stock or optimal portfolio, both achieve equivalent clustering results for investment [4]. Nevertheless, the performance results obtained by known established methods do not reflect the actual performance of stocks. Actual stock performance depends on its market situations, which are volatile in nature, by taking the outcome using average scores of each variable, although this is deemed to be insufficient when there exist extreme values of returns and risks [14,15,16]. In addition, the average score tends to be influenced by outliers [17,18].
In the literature, k-means clustering is the most applied clustering method [4,10]. A study on the Indian stock market by Nanda et al. [10] carried out analysis based on three distinct clustering methods, namely, k-means, self-organizing maps (SOMs), and fuzzy c-means clustering, that were applied to the Markowitz model to generate an efficient portfolio. The results show that k-means clustering is the best clustering method, as it produced 12 clusters. Instead of using the k-means clustering method, Tekin and Gümüş [6] used the hierarchical clustering method to evaluate the stock performance in Borsa Istanbul. The variables considered were the price per earnings ratio, the market value per book value ratio, the dividend yield, the return on assets, the return on equity, the change in sales and equity, the return on average, the returns, and the risks, producing 12 clusters. Meanwhile, Alqaryouti et al. [11] studied the Abu Dhabi stocks using the k-means and k-medoids clustering methods, testing 61 stocks and forming four distinct clusters.
K-means clustering is favored among researchers; however, the clustering results of this method depend on the initial value assigned, and this value needs to be predetermined, which is a disadvantage of this method [17,18]. This leads to inconsistency in forming clusters, depending on the situation. Another limitation is that the method is sensitive to outliers [17,18]. Other research [12,19] of cluster stocks was based on the individual stock performance, by comparing the values of the variables considered for each stock one by one. The main reason for evaluating stock performance individually is to consider their effects on fundraising by non-professional investors [19]. In [12], the authors evaluated the performance of 10 sectorial stock indices individually, using the Sharpe, Treynor, Jensen alpha, adjusted Sharpe, adjusted Jensen, and Sortino indices. Unfortunately, it is time consuming to analyze stocks one at a time, and thus, this concept is unsuitable for large data sets.
Apart from the variables that are related to stocks, investors’ preferences toward risk-taking have also been considered as one of the most important factors for evaluating stock performance. Shams and Rezvani [20] investigated investors’ risk aversion and risk taking by ranking the performance of investment companies using three loss aversion indices and comparing the results against the Treynor index. The result shows that the loss aversion behavior of investment companies is influenced by the outcome of previous performances. In [21], multiple hybrid methods were developed by combining SOM and k-means cluster stocks, the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) to rank stocks, and the genetic algorithm to establish different classes of investors with respect to their risk-taking levels.
The established clustering methods in the literature consider several options to ensure that the stocks preferred by investors are good and the best to invest in. Among investors’ preferences studied are stocks that have high return rates, low standard deviation and high Treynor index values [3,4], low standard deviations, moderate return rates, moderate turnover ratios, and moderate Sortino index values [7], or high return rates, high Sharpe index values, high appraisal ratios, high Sortino index values, low standard deviations, and downside risks [8]. However, investors’ preferences considered by these methods are ambiguous, since such evaluation focuses only on one unknown investor’s selection preference when clustering stock performance. These evaluations are inconsistent with investors’ genuine preferences, which can be either pessimistic or optimistic in nature; thus, clustering methods are unable to track the true performance of stocks [2,22]. The inefficiencies of the established clustering methods justify the motivation for this study.
As indicated above, the established clustering methods neglect to take into consideration the importance of diverse investors’ preferences when selecting stocks, and thus are inefficient in accurately clustering stocks. This study extends the works of [4,6] by proposing a novel fuzzy clustering method that has the capability to distinctively express investors’ vague preferences which establish clustering methods cannot capture. Furthermore, investors’ diverse preferences, which can be distinguished in various forms, such as pessimistic or optimistic, enhance and complement fuzzy inference systems for developing specific stock selection strategies for different types of investors. A fuzzy inference system was utilized in this study, as it possesses great capability for considering vague decision-makers’ preferences, as well as the uncertainty in the decision-making environment [23,24]. In this proposed method, the variables considered were defined as the linguistic inputs, while stock performance was defined as the linguistic output. All defined linguistic inputs were then aggregated using fuzzy rule bases that were developed in accordance with established investors’ preferences on stock clustering. In this case, rule base development is important to achieve rational interaction between the variables and the performances of stocks that are defined linguistically based on investors’ preferences [23,24]. As for the output, the proposed method produces two distinct views of investors’ preferences, which are pessimistic and optimistic, with each view consisting of multiple levels of investors’ preferences differentiated based on confidence levels and the frequency of stock performance. The novel differentiation process in this study is the first of its kind to be developed with the objective of assisting investors in selecting the best stock to invest, given their preferences. For efficiency purposes, the results obtained from the analysis of this proposed method were compared against the established clustering methods.
The rest of the paper is structured as follows. Section 2 explains the development of the novel proposed fuzzy clustering method. Section 3 presents a case study on clustering 30 Syariah-compliant stocks in Malaysia for the year 2011, and then validation of the results is provided in Section 4. The discussion and conclusion are presented in Section 5 and Section 6, respectively.

2. Research Formulation

As mentioned in the introduction section, previous works on clustering were unable to handle outliers, providing inconsistent numbers of clusters and neglecting investor’s preferences. Taking into consideration the limitations of previous studies, in this study, a novel fuzzy clustering method that is capable of distinctively expressing vague investors’ preferences is presented. Furthermore, investors’ diverse preferences, which can be distinguished in various forms, such as pessimistic or optimistic, enhance and complement fuzzy inference systems for developing specific stock selection strategies for different types of investors.
The development of the novel proposed fuzzy clustering method using a fuzzy inference system involved four steps. The first step was data collection and the identification of the inputs and outputs, as well as normalization. In this step, the inputs were variables related to the stocks—namely, return rates, standard deviations and Treynor index values—while stock performance served as the output. All inputs and outputs were then normalized to ensure that the data were in generic forms. In the second step, the results obtained from the normalization process were transformed into triangular fuzzy numbers—this process is known as fuzzification. All normalized inputs and outputs defined were in the form of linguistics terms described as triangular fuzzy numbers. For step 3, processes such as fuzzy rule base, fuzzy inference system, and defuzzification were performed. Fuzzy rule bases were developed based on the results of [3,4,7,8] and characterized by IF THEN rules. These rule bases were then aggregated in the fuzzy inference system, and the products were converted into crisp values that represent stock performance. This conversion process is known as defuzzification. The defuzzification process covers the limitation of outliers and inconsistent numbers of clusters. In step 4, the results obtained from defuzzification were projected according to confidence levels, where the confidence levels represent the actual levels of investors’ preferences. This step covered the limitation of neglecting investors’ preferences. For the purpose of distinguishing each stock based on its performance and on investors’ confidence levels, this study presents a unique stock selection strategy, whereby the best stocks are ranked based on investors’ preference priority. A flowchart on the development of the novel proposed fuzzy clustering method is given in Figure 1. The steps involved in the development of the novel proposed fuzzy clustering method are described below.

2.1. Step 1: Data Collection, Input and Output Identification, and Normalization

This study started with data collection, where information related to the input and output variables was identified. The inputs were return rates, standard deviations, and Treynor index values, while the output was the stock performance. Based on [4,12,25,26], a definition for each input variable is given, as follows.
Definition 1.
Return rates, R t : The return rate, R t , is the return gained from investment. A high value of R t indicates a high profit gain, and thus positive stock performance is a good sign for investors. As Chen and Huang [4] demonstrated, R t is defined based on the concept of net asset value (NAV), where the definition of R t is given as follows:
R t = N A V t N A V t 1 N A V t 1 × 100 %
where R t is the return rate for stock t, N A V t is the net asset value for the current transaction, and N A V t 1 is the net asset value of the previous transaction.
Definition 2.
Standard Deviation, S t : Standard deviation, S t , measures the volatility of returns denoted as the investment risk level [4,25]. The standard deviation, S t , can be calculated using Equation (2), shown as follows:
S t = t = 1 n ( R t i R ¯ t ) 2 n 1
where R t i is the rate return of stock t on the ith day, and R ¯ t is the average return rate for n period of time.
Definition 3.
Treynor Index, T t : The Treynor index, T t , is a measure of the excess return earned per unit of systematic risk [4]. The Treynor index was chosen in this study as it examines the stock portfolio against the market as a whole and is highly sensitive to market risk [12,26]. A high value of T t denotes a high return per market risk [4]. The Treynor index is given by Equation (3), as follows:
T t = R ¯ t R ¯ r f β .
where β is the systematic risk or the market risk, and R ¯ r f is the daily average risk-free rate for a week.
As mentioned earlier, step 1 of the novel proposed fuzzy clustering method involved the normalization process. In this case, all of the values obtained from the inputs were normalized using the following definition.
Definition 4.
Normalization, ϖ i : Let ϖ i be the normalization of input variables [9] with i = R t , S t   and   T t , and ϖ i is given as
ϖ i = ϖ i M i n ( ϖ i , j ) M a x j ( ϖ i , j ) M i n ( ϖ i , j )
where M i n i ( ϖ i , j ) and M a x i ( ϖ i , j ) is the minimum and maximum i with j = 1 , 2 , 3 n respectively.
With respect to all inputs defined above, all variables were normalized using Equation (4), as shown in Equations (5)–(7):
R t R t
S t S t
T t T t
where R t , S t and T t are the values of normalization for the return rates, standard deviations, and Treynor index values, respectively.

2.2. Step 2: Fuzzification

The results obtained from the normalization process in step 1 were then transformed into linguistic triangular fuzzy numbers shown by Equations (8)–(10).
R t R t = ( a R t , b R t , c R t ; 1 )
S t S t = ( a S t , b S t , c S t ; 1 )
T t T t = ( a T t , b T t , c T t ; 1 )
where R t , S t , and T t are the linguistic triangular fuzzy numbers for the return rates, standard deviations and Treynor index values, respectively. Figure 2 shows the generic linguistic triangular fuzzy numbers for R t .
Where a and c are the minimum and maximum values, respectively, while b is the modal value of the triangular fuzzy numbers [27,28].

2.3. Step 3: Fuzzy Rule Base, Fuzzy Inference System, and Defuzzification

In this step, rule bases in the form of linguistic terms were developed based on established stock performance decisions [23,24,29,30,31]. All inputs in step 2 were aggregated using the rule bases developed in this step and the output obtained represent the performance of the stocks. It is worth noting that the output produced underwent defuzzification where the linguistic triangular fuzzy numbers were transformed into crisp values. The interaction between the inputs, fuzzy rule bases, and outputs are generically given as follows:
  • IF R t is ….AND S t is ….AND T t is …. THEN …. Performance.

2.4. Step 4: Stock Performance, Investor Selection Preferences and Stock Selection Strategy

The stock performance obtained from the defuzzification process in step 3 was expressed as a single value; this value represents investors’ evaluation of the stocks. The evaluation was then projected onto the height of the linguistics triangular fuzzy numbers, where the two confidence levels were obtained. The confidence level represents the selection preferences of two types of investors, namely, pessimistic and optimistic investors. To distinguish the stocks in accordance to investors’ preference priority, a stock selection strategy was developed, and is displayed using Equation (11):
Investors   preferences   priority ,   P = ( k = 1 4 w k b k ) c ¯ l
where w k is the weight of the performance, b k is the number of performances obtained for the stock, k is the stock performance, and c ¯ l is the average of the confidence levels for stock performance.

3. Clustering Malaysia’s Stock Market

According to [32], in studies to develop a model, selecting perfect data for model illustration is not a key factor; it is sufficient to use a sample of real data to illustrate the realistic scenario of investment. As a small economy, Malaysia is vulnerable to global and regional developments, and in Malaysia a stock market crash led to economic downturn. The global downturn of the 2007 financial crisis hit Malaysia hard, and the market took about 3 years to recover to its pre-crisis level. In this study, data of the year 2011 were considered when the stock market started to recover [33] to observe investors’ preferences towards stocks selection that were not affected by the crisis. Generally, the proposed fuzzy clustering method can be applied to any data set, as carried out by [3,34] by employing a small sample and a shorter time frame for evaluating stock performance. Thus, consistent with previous studies where the objective was to present a novel clustering method, the capability of this novel method was demonstrated by employing real data of 30 stocks listed on Bursa Malaysia. These 30 stocks are listed under the Syariah category of consumer products and services sector, a dominant sector observed by the masses [35,36]. The details of the processes involved in the application of the proposed model are given below.

3.1. Step 1: Data Collection, Input/Output Identification and Normalization

In this study, data of 30 Syariah compliant companies listed under the consumer products and services sector from 3rd January 2011 to 30th December 2011 were collected. The data of the 30 stocks considered comprised stock prices, return rates, R t , standard deviations, S t , and Treynor index values, T t , as the input variables, while stock performance served as the output for this investigation. The companies’ stock prices were derived from DataStream, while the Syariah and Kuala Lumpur Composite Index (KLCI) indices are from Bursa Malaysia. Table 1 shows part of the variables evaluated for Aeon Co (M) Bhd, one of the 30 stocks examined.
Using the values from Equation (4), each variable was normalized, as displayed in Table 2.

3.2. Step 2: Fuzzification

The normalization results obtained in step 1 were transformed into linguistic triangular fuzzy numbers, where they were defined in linguistic terms as very high, high, moderate, low, and very low for the input variables, while for outputs, the linguistic terms were inferior, stable, good, and aggressive. Table 3 and Table 4 describe the linguistic terms and their respective triangular fuzzy numbers for the inputs and outputs, respectively.

3.3. Step 3: Fuzzy Rule Base, Fuzzy Inference System, and Defuzzification

In this step, fuzzy rule bases were developed to aggregate the inputs considered. These rule bases were obtained based on the results achieved from past known works [3,4,7,8]. In total, there were 125 rule bases developed, utilizing the five linguistics terms for the input variables and the four linguistic terms for the output variables defined in step 2. Some of the rules generated for Aeon Co (M) Bhd are given below.
  • IF R t is very lowAND S t is lowAND T t is highTHENinferior stock performance.
  • IF R t is highAND S t is highAND T t is very highTHENaggressive stocks performance.
  • IF R t is highAND S t is lowAND T t is very highTHENgood stocks performance.
  • IF R t is moderateAND S t is moderateAND T t is highTHENstable stocks performance.

3.4. Step 4: Stock Performance, Investor Selection Preferences, and Stock Selection Strategy

The results of the stock performance acquired in step 3 are presented in the form of a single value. Table 5 gives the results for the stock performance and the confidence levels with respect to the pessimistic and optimistic investors for Aeon Co (M) Bhd.
To determine the overall performance of a specific stock given by different types of investors, the performance frequency for the stocks was evaluated. Table 6 shows part of the performance evaluation for 10 stocks.
Based on Table 6, the performance that emerges most frequently indicates the actual performance of the stock. In this case, Aeon Co (M) is considered to have good performance from the point of view of pessimistic investors, while optimistic investors classify the stock performance as stable. Table 7 provides the results of the stock performance for all 30 stocks evaluated, as well as the confidence level average based on both pessimistic and optimistic investors.
The same technique was applied to evaluate market performance, and consistent results between the market and stock performances were obtained. Such evaluation is important to ensure that stocks and markets are coherent. Table 8 and Table 9 display the market performance evaluation for Syariah and KLCI.

Stock Selection Strategy

There are various ways to select stock in the market. However, this study introduces a novel stock selection strategy based on investors’ confidence levels and preferences of stocks. The strategy aims to provide investors with a unique stock priority evaluation, so that high priority stocks from the available pool of stocks will be selected first for investment. This is computed by assigning each performance a value, such that 0.1 refers to inferior performance, 0.2 stable, 0.3 good and 0.4 aggressive. Thus, the proposed strategy based on the novel stock priority evaluation for Aeon Co (M) Bhd was calculated using Equation (11).
P = ( ( 5 × 0.1 ) + ( 13 × 0.2 ) + ( 34 × 0.3 ) + ( 0 × 0.4 ) ) × 0.305 = 4.0565
Table 10 shows the stock priority evaluation and the ranking for the 30 stocks analyzed in this study.
The distinct selection made based on different investors’ preferences can be observed in Table 10. As shown, pessimistic investors select the stock of SHH Resources Holding Bhd, while optimistic investors choose the Milux Corporation Bhd stock to invest in.

4. Validation of the Results

For validation purposes, a comparative analysis between the novel proposed fuzzy clustering, k-means clustering, and hierarchical clustering methods is presented. The validation focused on obtaining the correlation between the actual stock performance rankings, the novel proposed fuzzy clustering method, the k-means method, and the hierarchical clustering method. Table 11 summarizes the stock performance evaluations for the k-means, hierarchical, and novel proposed fuzzy clustering methods.
Table 11 provides the ranking order decided by the different methods, namely, k-means, hierarchical clustering, and the novel proposed fuzzy clustering. This ranking order was then validated against the actual ranking performance using Spearman’s rank coefficient of correlation [37]. Table 12 describes the rankings and Spearman’s rank coefficient of correlation scores for the k-means, hierarchical clustering, and novel proposed fuzzy clustering methods.
Based on the result shown in Table 12, the novel proposed fuzzy clustering method outperforms the other established methods, providing higher correlation values for both pessimistic investors, at 70.28%, and optimistic investors, at 70.90%.

5. Discussion

As projected in Section 4, this study successfully extended the established clustering methods [4,6] by developing a novel fuzzy clustering method using a fuzzy inference system. The novel fuzzy clustering method is capable of determining stock performance based on investor preferences, as well as ranking stock based on priority. As shown in Table 5, four performance evaluations, namely, inferior, stable, good, and aggressive, were formed. As exhibited in this study, inferior performance consists of stocks that are unstable and in poor condition, yielding high risk and low return gains. Investment in this performance classification is deemed to be unworthy. The classification of stable performance for stocks consists of stocks that are still considered to be high risk and to have low return gains, but the performance is slightly better than that of inferior performance. Moreover, stable performance also consists of stocks that have moderate return rates and risk levels, but are unable to provide profit for shorter investment periods. Good performance stocks are the best stocks for investment, since the risk is low and the return rate is high, indicating that investors’ chances of losing are low and that they are able to secure high returns in investments. Finally, the aggressive performance classification consists of stocks that provide higher returns but with higher risk. This stock classification is for investors who are not intimidated by high investment risk to gain high profit returns. Stocks with a lack of investor preference are classified as inferior and stable, rather than good or aggressive. Therefore, it is suggested that investors invest in stocks classified as having good and aggressive performance.
Unlike established past works that only considered a single investors’ selection preferences, which cannot be justified, the proposed novel fuzzy clustering method is able to distinguish investors’ preferences based on stock performance. Real investors’ selection preferences can be either pessimistic or optimistic, as shown by the results obtained using the novel fuzzy clustering method. The types of investors are represented based on confidence levels, with a low value indicating pessimistic investors, while a high value denotes optimistic investors, as shown in Table 5 and Table 7. Even though investors express different preferences toward performance evaluation, the results of overall stock performance show that some stocks are given the same performance evaluation by both optimistic and pessimistic investors. This implies that investors’ preferences are an important element to consider when selecting appropriate stocks.
The stock performance presented using the novel fuzzy clustering method is in the form of a numerical value. This numerical value shows the strength of the stock performance, which can be used to rank stocks based on priority, and which established works were unable to do. Typically, most established works used more than one method to sort and rank stocks based on priority, as was done by [20,21]. Providentially, this study used only one method to determine stock performance, to cluster stock performance, to rank stocks based on priority, as well as to determine and rank priority stocks based on investor preferences.
In step 4, stocks based on priority were applied to search for the best stocks to invest in. At this stage, the stocks were rank based on the priority of the stock performance, as shown in Table 10. Such ranking was done for both pessimistic and optimistic investors. The performance of the proposed method was validated against established works, and was actual rank using Spearman’s rank coefficient of correlation. The results show that the proposed novel fuzzy clustering method is superior compared to the k-means and hierarchical clustering methods.

6. Conclusions

This paper presents a novel fuzzy clustering method for stock selection based on investors’ selection preferences. The novel proposed method provides precise and unambiguous investors’ selection preferences compared to established methods with regard to different types of investors, such as pessimistic and optimistic investors’ views. Moreover, unlike established methods, the novel investors’ selection strategy developed in this study ensures that high-priority stocks are chosen as the best stock and selected first for investment by employing the proposed stock priority method. The efficiency of the proposed method was illustrated and validated by clustering 30 Syariah stocks listed in Bursa Malaysia. This study successfully applied the novel fuzzy clustering method to the problem of stock selection based on investors’ preferences so as to assist investors in their investment choices. The results obtained from the validation justify this novel fuzzy clustering method, as it provided higher efficiency by achieving consistent ranking correlations against the actual results, unlike the established clustering methods. Although the novel clustering method obtained consistent results in terms of the actual stock performances, the ranking correlation values were not adequately sufficient, and thus, better computations will be needed to increase the level of accuracy. In addition, the novel fuzzy clustering method considers only the uncertain component of pessimistic and optimistic investors’ behaviors; for better results, the reliability, hesitancy, and bipolarity components may need to be embedded into stock selection. In the future, the authors aim to explore stock selection procedures in relation to investors’ reliability, hesitancy, and bipolarity.

Author Contributions

Individual contributions of authors are as follows: Conceptualization, S.N.Z.A., S.H.J., M.I. and A.S.A.B.; methodology, S.N.Z.A., S.H.J. and A.S.A.B.; software, A.S.A.B.; validation, S.N.Z.A., S.H.J., M.I. and A.S.A.B.; formal analysis, S.N.Z.A. and A.S.A.B.; data curation, S.N.Z.A.; writing—original draft preparation, S.N.Z.A.; writing—review and editing, S.H.J., A.S.A.B. and S.N.Z.A.; supervision, S.H.J. and M.I. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by research grant provided by Universiti Kebangsaan Malaysia code GUP-2019-038 And The APC was funded by grant GUP-2019-038.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kumari, S.K.; Kumar, P.; Priya, J.; Surya, S.; Bhurjee, A.K. Mean-value at risk portfolio selection problem using clustering technique: A case study. AIP Conf. Proc. 2019, 2112, 020178-1–020178-7. [Google Scholar]
  2. Mohamed, Z.; Mohamad, D.; Samat, O. A fuzzy approach to portfolio selection. Sains Malays. 2009, 38, 895–899. [Google Scholar]
  3. Kiliçman, A.; Sivalingam, J. Portfolio optimization of equity mutual funds: Malaysian case study. Adv. Fuzzy Syst. 2010, 2010, 879453. [Google Scholar]
  4. Chen, L.H.; Huang, L. Portfolio Optimization of Equity Mutual Funds with Fuzzy Return Rates and Risks. Expert Syst. Appl. 2009, 36, 3720–3727. [Google Scholar] [CrossRef]
  5. Markowitz, H. Portfolio selection. J. Financ. 1952, 7, 77–91. [Google Scholar]
  6. Tekin, B.; Gümüş, F.B. The classification of stocks with basic financial indicators: An application of cluster analysis on the BIST 100 index. Int. J. Acad. Res. Bus. Soc. Sci. 2017, 7, 104–131. [Google Scholar]
  7. Mirnoori, S.M.; Shariati, A. Fuzzy portfolio optimization using Chen and Huang model: Evidence from Iranian mutual funds. Afr. J. Bus. Manag. 2012, 6, 6608–6616. [Google Scholar]
  8. Gabriel, F.S.; Post, K.C.D.S.R.; Rogers, P. Clustering real estate investment trusts: Brazil versus United States. J. Manag. Res. 2015, 7, 166–190. [Google Scholar] [CrossRef] [Green Version]
  9. Rom, B.M.; Ferguson, K.W. Post-modern portfolio theory comes of age. J. Investig. 1994, 3, 11–17. [Google Scholar] [CrossRef]
  10. Nanda, S.R.; Mahanty, B.; Tiwari, M.K. Clustering Indian stock market data for portfolio management. Expert Syst. Appl. 2010, 37, 8793–8798. [Google Scholar] [CrossRef]
  11. Alqaryouti, O.; Farouk, T.; Siyam, N. Clustering stock markets for balanced portfolio construction. In Proceeding of the Springer Nature Switzerland AG 2019, International Conference on Advanced Intelligent Systems and Informatics; Springer: Cham, Switzerland, 2019; pp. 577–587. [Google Scholar]
  12. Robiyanto, R. Performance evaluation of stock price indexes in the Indonesia stock exchange. Int. Res. J. Bus. Stud. 2018, 10, 173–182. [Google Scholar] [CrossRef] [Green Version]
  13. León, D.; Aragón, A.; Sandoval, J.; Hernández, G.; Arévalo, A.; Niño, J. Clustering algorithms for Risk-Adjusted Portfolio Construction. In Proceedings of the International Conference on Computer Sciences, Zurich, Switzerland, 12–14 June 2017; pp. 1334–1343. [Google Scholar]
  14. Falk, M.; Guillou, A. Peaks-over-threshold stability of multivariate generalized Pareto distributions. J. Multivar. Anal. 2008, 99, 715–734. [Google Scholar] [CrossRef]
  15. Chan, N.H.; Zhang, R.M. Quantile inference for near-integrated autoregressive time series under infinite variance and strong dependence. Stoch. Process. Their Appl. 2009, 119, 4124–4148. [Google Scholar] [CrossRef] [Green Version]
  16. Stelzer, R. Multivariate Markov-switching ARMA processes with regularly varying noise. J. Multivar. Anal. 2008, 99, 1177–1190. [Google Scholar] [CrossRef] [Green Version]
  17. Govender, P.; Sivakumar, V. Application of k-means and hierarchical clustering techniques for analysis of air pollution: A review (1980–2019). Atmos. Pollut. Res. 2020, 11, 40–56. [Google Scholar] [CrossRef]
  18. Mehta, N.; Kathuria, M.; Singh, M. Comparison of Conventional & Fuzzy Clustering Techniques: A Survey. Int. J. Adv. Res. Comput. Commun. Eng. 2013, 2, 2248–2253. [Google Scholar]
  19. Tehrani, R.; Ahmadinia, H.; Hasbaei, A. Analyzing performance of investment companies listed in the Tehran stock exchange by selecting ratios and measures. Afr. J. Bus. Manag. 2011, 5, 7428–7439. [Google Scholar]
  20. Shams, S.; Rezvani, F. Performance measurement of investment companies with loss aversion in Tehran stock exchange. Risk Gov. Control Financ. Mark. Inst. 2015, 5, 81–87. [Google Scholar] [CrossRef] [Green Version]
  21. Goudarzi, S.; Jafari, M.J.; Afsar, A. A hybrid model for portfolio optimization based on stock clustering and different investment strategies. Int. J. Econ. Financ. Issues 2017, 7, 602–608. [Google Scholar]
  22. Li, J.; Xu, J. A novel portfolio selection model in a hybrid uncertain environment. Omega 2009, 37, 439–449. [Google Scholar] [CrossRef]
  23. Tsatiris, M.; Kitikidou, K. Sunflower for biodiesel production: A mamdani-type fuzzy inference system using the fuzzy logic toolbox graphical user interface (GUI) tools. J. Sci. Eng. Res. 2018, 5, 162–169. [Google Scholar]
  24. Izquierdo, S.; Izquierdo, L.R. Mamdani fuzzy systems for modeling and simulation: A critical assessment. J. Artif. Soc. Soc. Simul. 2017, 21, 1–15. [Google Scholar]
  25. Ma, L.; Tang, Y. Portfolio manager ownership and mutual fund risk taking. Manag. Sci. 2019, 65, 5518–5534. [Google Scholar] [CrossRef] [Green Version]
  26. Kuhle, J.L.; Lin, E.C. An evaluation of risk and return performance measure alternatives: Evidence from real estate mutual funds. Rev. Bus. Financ. Stud. 2018, 9, 1–11. [Google Scholar]
  27. Princy, S.; Dhenakaran, S.S. Comparison of triangular and trapezoidal fuzzy membership function. J. Comput. Sci. Eng. 2016, 2, 46–51. [Google Scholar]
  28. Shyamal, A.K.; Pal, M. Triangular fuzzy matrices. Iran. J. Fuzzy Syst. 2007, 4, 75–87. [Google Scholar]
  29. Kalia, H.; Dehuri, S.; Ghosh, A.; Cho, S.B. On the mining of fuzzy association rule using multi-objective genetic algorithms. Int. J. Data Min. Model. Manag. 2016, 8, 1–31. [Google Scholar] [CrossRef] [Green Version]
  30. Zadeh, L.A. Fuzzy sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef] [Green Version]
  31. Liagkouras, K.; Metaxiotis, K. Multi-period mean–variance fuzzy portfolio optimization model with transaction costs. Eng. Appl. Artif. Intell. 2018, 67, 260–269. [Google Scholar] [CrossRef]
  32. Ohlson, J.A. Financial ratios and the probabilistic prediction of bankruptcy. J. Account. Res. 1980, 18, 109–131. [Google Scholar] [CrossRef] [Green Version]
  33. Bank Negara Malaysia. Economic Developments in 2010-Bank Negara Malaysia. Available online: www.bnm.gov.my>files>publication>2010 (accessed on 15 December 2019).
  34. Yaakob, A.M.; Serguieva, A.; Gegov, A. FN-TOPSIS: Fuzzy networks for ranking traded equities. IEEE Trans. Fuzzy Syst. 2016, 25, 315–332. [Google Scholar] [CrossRef] [Green Version]
  35. Mansor, F.; Bhatti, M.I. The Islamic mutual fund performance: New evidence on market timing and stock selectivity. In Proceedings of the IACSIT International Conference on Economics and Finance Research IPEDR, Singapore, 26–28 February 2011; Volume 4, pp. 477–484. [Google Scholar]
  36. Bakar, M.A.A.A.; Ali, N. Performance measurement analysis: Shariah-compliant vs. non shariah-compliant securities. Manag. Account. Rev. MAR 2014, 13, 75–108. [Google Scholar]
  37. Serguieva, A. Computational Intelligence in Asset Risk Analysis; Brunel University: London, UK, 2004. [Google Scholar]
Figure 1. Flowchart of the novel proposed fuzzy clustering method.
Figure 1. Flowchart of the novel proposed fuzzy clustering method.
Symmetry 12 01148 g001
Figure 2. Triangular fuzzy number.
Figure 2. Triangular fuzzy number.
Symmetry 12 01148 g002
Table 1. The evaluation variables for Aeon Co (M) Bhd.
Table 1. The evaluation variables for Aeon Co (M) Bhd.
DateStock Price R t S t T t
18/3/111.45−0.740.95−15.51
25/3/111.490.650.8912.75
1/4/111.47−0.291.00−7.07
8/4/111.500.370.623.59
15/4/111.550.722.1510.73
22/4/111.570.293.112.14
29/4/111.49−1.052.76−25.27
6/5/111.500.180.60−0.09
13/5/111.550.671.779.99
20/5/111.590.540.447.28
27/5/111.610.250.441.44
3/6/111.610.000.35−3.66
10/6/111.701.040.9417.46
17/6/111.962.913.3355.71
24/6/111.78−1.832.56−40.99
Table 2. Normalization values of the variables for Aeon Co (M) Bhd.
Table 2. Normalization values of the variables for Aeon Co (M) Bhd.
Date R t S t T t
18/3/110.230.280.26
25/3/110.520.270.56
1/4/110.320.300.35
8/4/110.460.190.46
15/4/110.540.650.53
22/4/110.450.930.45
29/4/110.160.830.16
6/5/110.420.180.42
13/5/110.530.530.53
20/5/110.500.130.50
27/5/110.440.130.44
3/6/110.390.110.39
10/6/110.600.280.60
17/6/111.001.001.00
24/6/110.000.770.00
Table 3. Linguistic terms and triangular fuzzy numbers for the inputs.
Table 3. Linguistic terms and triangular fuzzy numbers for the inputs.
Linguistic TermsTriangular Fuzzy Number
Very low[0, 0, 0.2511]
Low[0, 0.2511, 0.5]
Moderate[0.2511, 0.5, 0.7511]
High[0.5, 0.7511, 1]
Very high[0.7511, 1, 1]
Table 4. Linguistic terms and triangular fuzzy numbers for the output variables.
Table 4. Linguistic terms and triangular fuzzy numbers for the output variables.
PerformanceTriangular Fuzzy Number
Inferior[0, 0, 0.3333]
Stable[0, 0.3333, 0.6667]
Good[0.3333, 0.6667, 1]
Aggressive[0.6667, 1, 1]
Table 5. Results of the stock performance and investors’ confidence levels for Aeon Co (M) Bhd.
Table 5. Results of the stock performance and investors’ confidence levels for Aeon Co (M) Bhd.
PerformancePessimisticConfidence LevelOptimisticConfidence Level
0.332Inferior0.004Stable0.996
0.426Good0.279Stable0.721
0.422Good0.267Stable0.733
0.428Good0.285Stable0.715
0.418Good0.255Stable0.745
0.242Inferior0.274Stable0.726
0.119Stable0.357Inferior0.643
0.442Good0.327Stable0.673
0.39Good0.171Stable0.829
0.495Good0.486Stable0.514
0.495Good0.486Stable0.514
0.516Stable0.451Good0.549
0.473Good0.420Stable0.580
0.892Good0.324Aggressive0.676
0.109Stable0.327Inferior0.673
Table 6. The performance evaluation for 10 stocks.
Table 6. The performance evaluation for 10 stocks.
Stocks/Investor PreferencesPessimisticOptimistic
ISGAISGA
Aeon Co (M) Bhd51334034081
Asia Brands Bhd412351131200
CAM Resources Bhd118330232180
CWG Holdings Bhd230173320290
Classic Century Bhd120292125260
AHB Holdings Bhd17926024271
Asia File Corporation Bhd516310127231
Euro Holdings Bhd6937024550
Emico Holdings Bhd4156015001
Eurospan Holdings Bhd019321131200
Note: I, inferior; S, stable, G, good; A, aggressive.
Table 7. Performance evaluation results for the 30 stocks and the confidence levels of the investors.
Table 7. Performance evaluation results for the 30 stocks and the confidence levels of the investors.
Stocks/Investor PreferencesPessimisticOptimistic
PerformanceCLPerformanceCL
Asia Brands BhdGood0.281Stable0.756
Aeon Co (M) BhdGood0.305Stable0.740
CAM Resources BhdGood0.300Stable0.780
CWG Holdings BhdStable0.163Good0.880
Classic Century BhdGood0.305Good0.799
AHB Holdings BhdGood0.503Stable0.813
Asia File Corporation BhdGood0.503Stable0.764
Euro Holdings BhdGood0.309Stable0.744
Emico Holdings BhdInferior0.062Stable0.918
Eurospan Holdings BhdGood0.366Stable0.655
Federal International HoldingsInferior0.078Stable0.892
Poh Huat Resources Holdings BhdGood0.261Stable0.779
Paragon Union BhdGood0.309Stable0.769
Ni Hsin Resources BhdGood0.366Stable0.737
Khind Holdings BhdGood0.298Stable0.762
SWS Capital BhdGood0.320Stable0.709
Sand Nisko Capital BhdGood0.254Stable0.817
SHH Resources Holdings BhdGood0.439Stable0.576
Latitude Tree Holdings BhdGood0.389Stable0.775
Milux Corporation BhdStable0.185Good0.859
Homeritz Corporation BhdStable0.446Stable0.748
Yoong Onn Corporation BhdGood0.306Stable0.737
Lee Swee Kiat Group BhdGood0.327Stable0.775
Tafi Industries BhdStable0.426Good0.669
Parkson Holdings BhdGood0.324Stable0.711
Lii Hen Resources BhdGood0.597Good0.811
Syf Resources BhdGood0.282Stable0.811
Jaycorp BhdGood0.255Stable0.753
Kamdar Group (M) BhdGood0.260Stable0.795
Cheetah Holdings BhdGood0.433Stable0.726
Note: CL, confidence level.
Table 8. The market performance evaluation.
Table 8. The market performance evaluation.
Market/Investor PreferencesPessimisticOptimistic
ISGAISGA
Syariah Index315331125251
KLCI Index412351131200
Note: I, inferior, S, stable, G good and A indicates aggressive.
Table 9. The performance of the market index.
Table 9. The performance of the market index.
Market IndicesPessimisticOptimistic
PerformanceConfidence LevelPerformanceConfidence Level
Syariah IndexGood0.432Stable/Good0.718/0.811
KLCI IndexGood0.457Stable0.771
Table 10. Stock ranking-based performance weightage.
Table 10. Stock ranking-based performance weightage.
Stocks/Investor PreferencesPessimisticOptimistic
PerformanceWeightageRankPerformanceWeightageRank
Asia Brands BhdGood13.78Stable12.311
Aeon Co (M) BhdGood13.312.5Stable11.122.5
CAM Resources BhdGood13.69.5Stable1214
CWG Holdings BhdStable12.525Good134
Classic Century BhdGood13.69.5Good12.95
AHB Holdings BhdGood11.324Stable11.122.5
Asia File Corporation BhdGood1316.5Stable12.86
Euro Holdings BhdGood13.511Stable10.725.5
Emico Holdings BhdInferior6.930Stable10.527.5
Eurospan Holdings BhdGood13.86.5Stable12.311
Federal International HoldingsInferior829Stable10.328
Poh Huat Resources Holdings BhdGood12.621Stable11.815.5
Paragon Union BhdGood13.312.5Stable11.717
Ni Hsin Resources BhdGood12.819Stable12.77.5
Khind Holdings BhdGood1316.5Stable10.527.5
SWS Capital BhdGood14.32Stable11.221
Sand Nisko Capital BhdGood12.322Stable10.824
SHH Resources Holdings BhdGood14.51Stable1123
Latitude Tree Holdings BhdGood13.114.5Stable12.311
Milux Corporation BhdStable1226Good13.81
Homeritz Corporation BhdStable11.828Stable12.77.5
Yoong Onn Corporation BhdGood14.23Stable11.419.5
Lee Swee Kiat Group BhdGood11.723Stable12.49
Tafi Industries BhdStable11.427Good13.22
Parkson Holdings BhdGood14.14Stable11.419.5
Lii Hen Resources BhdGood13.114.5Good13.13
Syf Resources BhdGood12.819Stable10.725.5
Jaycorp BhdGood13.86.5Stable11.618
Kamdar Group (M) BhdGood12.819Stable11.815.5
Cheetah Holdings BhdGood145Stable12.213
Table 11. Comparison results of the k-means, hierarchical, and novel proposed fuzzy clustering methods.
Table 11. Comparison results of the k-means, hierarchical, and novel proposed fuzzy clustering methods.
Stocks/PerformanceK-MeansHierarchicalNovel Proposed Fuzzy Clustering Method
Pessimistic WeightageOptimisticWeightage
Asia Brands BhdGoodGoodGood3.8497Stable9.2988
Aeon Co (M) BhdGoodGoodGood4.0565Stable8.214
CAM Resources BhdGoodGoodGood4.08Stable9.36
Classic Century BhdStableGoodGood5.8072Good10.3071
AHB Holdings BhdGoodGoodGood3.4465Stable9.0243
Asia File Corporation BhdStableStableGood6.89Stable9.7792
Euro Holdings BhdGoodGoodGood4.1715Stable7.9608
Eurospan Holdings BhdStableStableGood5.0508Stable8.0565
Paragon Union BhdGoodStableGood4.1097Stable8.9973
SWS Capital BhdGoodGoodGood4.576Stable7.9408
Sand Nisko Capital BhdGoodStableGood3.1242Stable8.8236
Latitude Tree Holdings BhdGoodGoodGood5.0959Stable9.5325
Yoong Onn Corporation BhdGoodStableGood4.3452Stable8.4018
Tafi Industries BhdStableGoodStable4.8564Good8.8308
Parkson Holdings BhdStableStableGood4.5684Stable8.1054
Lii Hen Resources BhdGoodGoodGood7.8207Good10.6241
Jaycorp BhdGoodGoodGood3.519Stable8.7348
Kamdar Group (M) BhdGoodGoodGood3.328Stable9.381
Table 12. Stock ranking and Spearman’s rank coefficient of correlation scores for all clustering methods under consideration.
Table 12. Stock ranking and Spearman’s rank coefficient of correlation scores for all clustering methods under consideration.
StocksActualK-MeansHierarchicalA Novel Proposed Method
PessimisticOptimistic
Asia Brands Bhd1476.5147
Aeon Co (M) Bhd1376.51314
CAM Resources Bhd676.5126
Classic Century Bhd2166.532
AHB Holdings Bhd1076.5168
Asia File Corporation Bhd31615.523
Euro Holdings Bhd1876.51017
Eurospan Holdings Bhd111615.5516
Paragon Union Bhd7715.5119
SWS Capital Bhd1576.5718
Sand Nisko Capital Bhd16715.51811
Latitude Tree Holdings Bhd476.544
Yoong Onn Corporation Bhd9715.5913
Tafi Industries Bhd5166.5610
Parkson Holdings Bhd81615.5815
Lii Hen Resources Bhd176.511
Jaycorp Bhd1776.51512
Kamdar Group (M) Bhd1276.5175
Correlation −14.55%10.99%70.28%70.90%

Share and Cite

MDPI and ACS Style

Zainol Abidin, S.N.; Jaaman, S.H.; Ismail, M.; Abu Bakar, A.S. Clustering Stock Performance Considering Investor Preferences Using a Fuzzy Inference System. Symmetry 2020, 12, 1148. https://doi.org/10.3390/sym12071148

AMA Style

Zainol Abidin SN, Jaaman SH, Ismail M, Abu Bakar AS. Clustering Stock Performance Considering Investor Preferences Using a Fuzzy Inference System. Symmetry. 2020; 12(7):1148. https://doi.org/10.3390/sym12071148

Chicago/Turabian Style

Zainol Abidin, Siti Nazifah, Saiful Hafizah Jaaman, Munira Ismail, and Ahmad Syafadhli Abu Bakar. 2020. "Clustering Stock Performance Considering Investor Preferences Using a Fuzzy Inference System" Symmetry 12, no. 7: 1148. https://doi.org/10.3390/sym12071148

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop