Development of CART model for prediction of tuberculosis treatment loss to follow up in the state of São Paulo, Brazil: A case–control study
Introduction
Tuberculosis (TB) is the leading cause of infectious disease-related death, surpassing the immunodeficiency virus (HIV) [1]. TB control is based on early and appropriate diagnosis and treatment, interrupting the transmission chain [2]. The National Tuberculosis Control Plan (PNCT) is a strategy implemented by the Brazilian government. The Directly Observed Treatment Short-Course (DOTS), is a part of this strategy and has assisted in the country's reach to an 85% cure rate and has reduced the loss to follow up rate in TB treatments by 5% [2]. However, loss to follow up treatment and irregular medication use contribute to the persistence of morbidity and mortality, increasing drug resistance cases [3], [4].
TB is related to poverty and different social causes. Therefore, patients are difficult to monitor. Factors such as initial improvement, lack of patient follow-up and monitoring are all associated with poor treatment adherence [5]. After the first few months of using the correct medications during treatments, patients present improvements in their overall physical well-being and most of their symptoms are gone, which then leads to patients interrupting the treatment themselves, even though they are not cured. Social factors, treatment methodology and provided health services are factors associated with treatment loss to follow up [6], [7], [8].
In the last 10 years, computational techniques for predicting the outcomes of TB treatment have been explored. Neural networks have been applied to predict the TB treatment loss to follow up in some cities of Espírito Santo in Brazil [9]. A logistic regression model was used to select features [10] and predict loss to follow up TB treatment [5]. Another study compared the application of different machine learning methods to classify treatment outcomes, among which the C4.5 decision tree showed better performance [11].
With a computational model that is able to identify patients that are prone to not follow-up treatment, teams would then be able to focus their efforts in reducing the rate of relapse, dissemination and loss to follow-up treatment. Although there are studies about the subject [9], [10], [5], [11], no study has developed an effective human-readable model that can be applied by health professionals in TB treatment control. Decision trees can be represented as a set of rules that would assist healthcare professionals, the ones with and the ones without access to an information system, to control TB. Thus, this study aims to develop a CART predictive model to predict TB treatment loss to follow up cases.
Section snippets
Materials and methods
For the development of this case-control study, 208,620 TB treatment cases were used, all between the years of 2006 and 2016 and in the state of São Paulo. The sample had sensitive and drug-resistant tuberculosis cases. The state of São Paulo is one of 27 federal states of Brazil and is located in the southeast region of the country. It has 645 cities and an area of 248,219,491 km2 [12]. It has the greatest population in the country, with around 45.9 million inhabitants [12].
The inclusion
Results
The final population of the study contained 91,823 controls and 12,023 cases (Fig. Figure 1 ). Table Table 2 shows the distribution for the population demographic data.
Most of the patients are male adults between the ages of 20 and 39 (24.67%). Less than 2% of the population represents the youth and elders.
The sampled population is mainly composed of white (39.84%) and brown (26.69%) ethnic groups. Only 0.52% of them were from an indigenous ethnic group. Most patients had basic education (less
Discussion
The decision tree created can be used as a guideline among health professionals to detect treatment loss to follow up. The accuracy of the model is similar to other models already presented in the literature [11]. For health professionals, the decision tree is a much more intuitive and understandable approach than a statistical model, since it can be visualized and explored by anyone, as well as be applied without a computational system. To exemplify, the decision tree for this study can be
Conclusion
It was possible to build the predictive model of treatment loss to follow up of tuberculosis through CART. During model development, new features for TB treatment loss to follow up predictors were identified. Although the fact that the ideal predictive ability was not achieved, it seems reasonable to propose the use of CART models. We aim in future studies to improve the model accuracy and implement the model in an informational system for tuberculosis treatment and management.
Summary Points
What was already known on the topic:
- •
Treatment loss to follow up and irregular medication use contribute to increased bacillus drug resistance and has a negative impact on disease control.
- •
Other studies apply different models to predict the TB treatment outcome.
- •
There is evidence that, among different models, decision trees present better performance.
What this study added to our knowledge:
- •
The CART model was able to perform equally or better than other models in the literature, additionally
Acknowledgements
We would like to thank the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001 and São Paulo Research Foundation (FAPESP) (Grant Nos. 2018/23963-2 and 2018/00307-2).
References (15)
Global Tuberculosis Report 2019, Geneva
(2019)Plano Nacional pelo Fim da Tuberculose como Problema de Saúde Pública
(2017)- et al.
Perfil de casos de tuberculose notificados e fatores associados ao abandono, Belo Horizonte, MG
Rev. Saúde Pública
(2007) Abandono do tratamento da tuberculose na Atenção Primária à Saúde: uma análise segundo o enfoque familiar do cuidado
(2008)- et al.
Modelo preditivo ao abandono do tratamento da tuberculose
Saúde em Debate
(2014) - et al.
Abandono de tratamento e recidiva da tuberculose: Aspectos de episódios prévios, Campinas, SP, Brasil, 1993–1994
Rev. Saude Publica
(2000) - et al.
Noncompliance with tuberculosis treatment by patients at a tuberculosis and AIDS reference hospital in midwestern Brazil
Braz. J. Infect. Dis.
(2002)
Cited by (4)
Clinical pathways and hierarchical clustering for tuberculosis treatment outcome prediction
2023, Procedia Computer SciencePreliminary results to predict tuberculosis outcomes applying traditional and automated machine learning models
2023, Procedia Computer Science