Skip to main content
Log in

Classifying Two Populations by Bayesian Method and Applications

  • Published:
Communications in Mathematics and Statistics Aims and scope Submit manuscript

Abstract

This article proposes some related issues to classification problem by Bayesian method for two populations. They are relationships between Bayes error (BE) and other measures and the results for determining the BE. In addition, we propose three methods to find the prior probabilities that can make to reduce BE. The calculation of these methods can be performed conveniently and efficiently by the MATLAB procedures. The new approaches are tested by the numerical examples including synthetic and benchmark data and applied in medicine and economics. These examples also show the advantages of the proposed methods in comparison with existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Berger, J.O.: Statistical Decision Theory and Bayesian Analysis. Springer, Berlin (1985)

    Book  MATH  Google Scholar 

  2. Devijver, P.A., Kittler, J.: Pattern Recognition: A Statistical Approach. Prentice Hall, New York (1982)

    MATH  Google Scholar 

  3. Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters. Cybern 3(3), 32–57 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  4. Ghosh, A.K., Chaudhuri, P., Sengupta, D.: Classification using kernel density estimates. Technometrics 48(1), 377–392 (2012)

    Google Scholar 

  5. Inman, H.F., Bradley, E.L.: The overlapping coefficient as a measure of agreement between probability distributions and point estimation of the overlap of two normal densities. Commun. Stat. Theory Methods 18(10), 3851–3874 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  6. James, I.: Estimation of the mixing proportion in a mixture of two normal distributions from simple, rapid measurements. Biometrics 5, 265–275 (1978)

    Article  MATH  Google Scholar 

  7. Jasra, A., Holmes, C., Stephens, D.: Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling. Stat. Sci. 12, 50–67 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  8. Kraft, C.H.: Some conditions for consistency and uniform consistency of statistical procedures. University of California (1955)

  9. Martinez, W.L., Martinez, A.R.: Computational Statistics Handbook with MATLAB. CRC Press, Boca Raton (2007)

    MATH  Google Scholar 

  10. Matusita, K.: On the notion of affinity of several distributions and some of its applications. Ann. Inst. Stat. Math. 19(1), 181–192 (1967)

    Article  MathSciNet  MATH  Google Scholar 

  11. McLachlan, G.J., Basford, K.E.: Mixture Models. Inference and Applications to Clustering. Dekker, New York (1988)

    MATH  Google Scholar 

  12. Miller, G., Inkret, W., Little, T., Martz, H., Schillaci, M.: Bayesian prior probability distributions for internal dosimetry. Radiat. Protect. Dosim. 94(4), 347–352 (2001)

    Article  Google Scholar 

  13. Nguyentrang, T., Vovan, T.: A new approach for determining the prior probabilities in the classification problem by Bayesian method. Adv. Data Anal. Classif. 11(3), 629–643 (2017)

    Article  MathSciNet  Google Scholar 

  14. Nielsen, F.: Generalized Bhattacharyya and Chernoff upper bounds on Bayes error using quasi-arithmetic means. Pattern Recognit. Lett. 42, 25–34 (2014)

    Article  Google Scholar 

  15. Pal, N.R., Bezdek, J.C.: On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Syst. 3(3), 370–379 (1995)

    Article  Google Scholar 

  16. Pham-Gia, T., Turkkan, N., Bekker, A.: Bounds for the bayes error in classification: a bayesian approach using discriminant analysis. Stat. Methods Appl. 16(1), 7–26 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  17. Pham-Gia, T., Turkkan, N., Vovan, T.: Statistical discrimination analysis using the maximum function. Commun. Stat. Simul. Comput. 37(2), 320–336 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  18. Scott, D.R.: Multivariate Density Estimation: Theory Practice and Visualization. Wiley, New York (1992)

    Book  MATH  Google Scholar 

  19. Silverman, B.W.: Density Estimation for Statistics and Data Analysis. CRC Press, Boca Raton (1986)

    Book  MATH  Google Scholar 

  20. Toussaint, G.: Some inequalities between distance measures for feature evaluation. Comput 21, 389–394 (1972)

    MathSciNet  MATH  Google Scholar 

  21. Vovan, T.: \(L^1\)-distance and classification problem by Bayesian method. J. Appl. Stat. 44(3), 385–401 (2017)

    Article  MathSciNet  Google Scholar 

  22. Vovan, T., Nguyentrang, T.: Fuzzy clustering of probability density functions. J. Appl. Stat. 44(4), 583–601 (2017)

    Article  MathSciNet  MATH  Google Scholar 

  23. VoVan, T., Pham-Gia, T.: Clustering probability distributions. J. Appl. Stat. 37(11), 1891–1910 (2010)

    Article  MathSciNet  Google Scholar 

  24. Webb, A.R.: Statistical Pattern Recognition, 2nd edn. Wiley, Hoboken (2002)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tai Vovan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vovan, T., Tranphuoc, L. & Chengoc, H. Classifying Two Populations by Bayesian Method and Applications. Commun. Math. Stat. 7, 141–161 (2019). https://doi.org/10.1007/s40304-018-0139-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s40304-018-0139-8

Keywords

Mathematics Subject Classification

Navigation