Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database

Souza, Julio; Pimenta, Diana; Caballero, Ismael; Freitas, Alberto

doi:10.1007/s11219-020-09504-3

Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database

Published: 12 June 2020

Volume 28, pages 1043–1061, (2020)
Cite this article

Software Quality Journal Aims and scope Submit manuscript

386 Accesses
2 Citations
Explore all metrics

Abstract

Some countries have adopted the diagnosis-related groups (DRG) system to pay hospitals according to the number and complexity of patients they treat. Translating diseases and procedures into medical codes based on international standards such as ICD-9-CM or ICD-10-CM/PCS is at the core of the DRG systems. However, certain types of coding errors undermine this system, namely, upcoding, in which data is manipulated by deliberately using medical codes that increase patient’s complexity, resulting in higher reimbursements. In this sense, ensuring data credibility in the context of upcoding is critical for an effectively functioning DRG system. We developed a method to measure data credibility in the context of upcoding through a case study using data on pneumonia-related hospitalizations from six public hospitals in Portugal. Frequencies of codes representing pneumonia-related diagnosis and comorbidities were compared between hospitals and support vector machine models to predict DRGs were employed to verify whether codes with discrepant frequencies were related to upcoding. Data were considered not credible if codes with discrepant frequencies were responsible for increasing DRG complexity. Six pneumonia-related diagnoses and fifteen comorbidities presented a higher-than-expected frequency in at least one hospital and a link between increased DRG complexity, and these targeted codes was found. However, overall credibility was very high for nearly all conditions, except for renal disease, which presented the highest percentage of potential upcoding. The main contribution of this paper is a generic and reproducible method that can be employed to monitor data credibility in the context of upcoding in DRG databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Toward the Measure of Credibility of Hospital Administrative Datasets in the Context of DRG Classification

Underestimated prevalence of heart failure in hospital inpatients: a comparison of ICD codes and discharge letter information

Article Open access 17 April 2018

Mathias Kaspar, Georg Fette, … Stefan Störk

Identifying Candidates for Medical Coding Audits: Demonstration of a Data Driven Approach to Improve Medicare Severity Diagnosis-Related Group Coding Compliance

References

Administração Central do Sistema de Saúde (2019). Benchmarking hospitais - grupos e instituições. http://benchmarking.acss.min-saude.pt/BH_Enquadramento/GrupoInstituicoes, .
Administração Central do Sistema de Saúde (2014). Agrupador de GDH All Patient Refined DRG. http://www2.acss.min-saude.pt/Portals/0/CN22.pdf. .
Aelvoet, W., Terryn, N., Windey, F., Redivo, M., van Sprundel, M., & Faes, C. (2009). Miscoding: A threat to the hospital care system. How to detect it? Revue d’epidemiologie et de sante publique, 57(3), 169–177.
Article Google Scholar
Aiello, F. A., & Roddy, S. P. (2017). Inpatient coding and the diagnosis-related groups. Journal of Vascular Surgery, 66(5), 1621–1623.
Article Google Scholar
Alonso, V., Santos, J. V., Pinto, M., Ferreira, J., Lema, I., Lopes, F., & Freitas, A. (2019). Health records as the basis of clinical coding: Is the quality adequate? A qualitative study of medical coders' perceptions. Health Information Management Journal. https://doi.org/10.1177/1833358319826351.
Averill, R. F., McCullough, E. C., Goldfield, N. I., Hughes, J. S., Bonazelli, J., Bentley, L. (2013). 3M APR-DRG classification system methodology overview, version 31. 3M health information systems. https://www.hcup-us.ahrq.gov/db/nation/nis/grp031_aprdrg_meth_ovrview.pdf. .
Barros, P. P., & Braun, G. (2017). Upcoding in a national health service: The evidence from Portugal. Health Economics, 26(5), 600–618.
Article Google Scholar
Carter, G. M., Newhouse, J. P., & Relles, D. A. (1990). How much change in the case mix in-dex is DRG creep? Journal of Health Economics, 9(4), 411–428.
Article Google Scholar
Carter, G. M., Newhouse, J. P., & Relles, D. A. (1991). Has DRG creep crept up? Decomposing the case mix index change between 1987 and 1988. Santa Monica, California: RAND Corporation.
Google Scholar
Centers for Medicare and Medicaid Services. (2014). International classification of diseases. Clinical Modification: Ninth Revision https://www.cms.gov/Medicare/Coding/ICD9ProviderDiagnosticCodes/codes.html. .
Google Scholar
Centers for Medicare and Medicaid Services. (2019). International classification of diseases. Clinical Modification: Tenth Revision https://www.cms.gov/Medicare/Coding/ICD10/index.html. .
Google Scholar
Chong, W. F., Ding, Y. Y., & Heng, B. H. (2011). A comparison of comorbidities obtained from hospital administrative data and medical charts in older patients with pneumonia. BMC Health Services Research, 11, 105.
Article Google Scholar
Chu, A., Ahn, H., Halwan, B., Kalmin, B., Artifon, E. L., Barkun, A., Lagoudakis, M. G., & Kumar, A. (2008). A decision support system to facilitate management of patients with acute gastrointestinal bleeding. Artificial Intelligence in Medicine, 42(3), 247–259.
Article Google Scholar
Dafny, L. S. (2005). How do hospitals respond to price changes? American Economic Review, 95(5), 1525–1547.
Article Google Scholar
Di Giacomo, M., Piacenza, M., Siciliani, L., & Turati, G. (2017). Do public hospitals respond to changes in DRG price regulation? The case of birth deliveries in the Italian NHS. Health Economics, 26, 23–37.
Article Google Scholar
Feder, S. L. (2018). Data quality in electronic health records research: Quality domains and assessment methods. Western Journal of Nursing Research, 40(5), 753–766.
Article Google Scholar
Freitas A., Lema I., da Costa-Pereira A. (2016) Comorbidity coding trends in hospital administrative databases. In: Rocha Á., Correia a., Adeli H., Reis L., Mendonça Teixeira M. (eds), New Advances in Information Systems and Technologies. Advances in intelligent systems and computing, vol 445. Springer, Cham.
Goodpasture, H., Nguyen-Dang, C., Lee, T. H., Ghazarian, P. G., & Fulton, M. A. (2004). Miscoding as a cause of elevated simple pneumonia mortality. The Joint Commission Journal on Quality and Safety, 30(6), 335–341.
Article Google Scholar
Hebert, P. L., McBean, A. M., & Kane, R. L. (2005). Explaining trends in hospitalizations for pneumonia and influenza in the elderly. Medical Care Research and Review, 62(5), 560–582.
Article Google Scholar
Hsia, D. C. (1990). Accuracy of Medicare reimbursement for cardiac arrest. Journal of the American Medical Association, 264(1), 59–62.
Article Google Scholar
Hsia, D. C., Ahern, C. A., Ritchie, B. P., Moscoe, L. M., & Krushat, W. M. (1992). Medicare reimbursement accuracy under the prospective payment system, 1985 to 1988. Journal of the American Medical Association, 268(7), 896–899.
Article Google Scholar
ISO/IEC 25012 (2006). ISO/IEC 25012: Software product quality – Data quality model. https://iso25000.com/index.php/en/iso-25000-standards/iso-25012.
Januleviciute, J., Askildsen, J. E., Kaarboe, O., Siciliani, L., & Sutton, M. (2016). How do hospitals respond to price changes? Evidence from Norway. Health Economics, 25(5), 620–636.
Article Google Scholar
Jarman, B., Gault, S., Alves, B., Hider, A., Dolan, S., Cook, A., Hurwitz, B., & Iezzoni, L. I. (1999). Explaining differences in English hospital death rates using routinely collected data. British Medical Journal, 318(7197), 1515–1520.
Article Google Scholar
Lau, E. C., Mowat, F. S., Kelsh, M. A., Legg, J. C., Engel-Nitz, N. M., Watson, H. N., Collins, H. L., Nordyke, R. J., & Whyte, J. L. (2011). Use of electronic medical records (EMR) for oncology outcomes research: Assessing the comparability of EMR information to patient registry and health claims data. Clinical Epidemiology, 3, 259–272.
Google Scholar
Lungen, M., & Lauterbach, K. W. (2000). Upcoding: A risk for the use of diagnosis-related groups. Deutsche Medizinische Wochenschrift, 125(28-29), 852–856.
Article Google Scholar
Luo, W., & Gallagher, M. (2010). Unsupervised DRG upcoding detection in healthcare databases. In 2010 IEEE International Conference on Data Mining Workshops, Sydney, NSW (pp. 600–605).
Chapter Google Scholar
Mathauer, I., & Wittenbecher, F. (2013). Hospital payment systems based on diagnosis-related groups: Experiences in low- and middle-income countries. Bulletin of the World Health Organization, 91(10), 746–756.
Article Google Scholar
Ministério da Saúde. (2017). Portaria n.o 207/2017 - Diário da República n.o 132/2017, série i de 2017-07-11. http://www.acss.min-saude.pt/wp-content/uploads/2016/12/Portaria_207_2017-1.pdf. .
Pimenta D., Souza J., Caballero I., Freitas A. (2019) Toward the measure of credibility of hospital administrative datasets in the context of DRG classification. In: Piattini M., Rupino da Cunha P., García Rodríguez de Guzmán I., Pérez-Castillo R. (eds) Quality of Information and Communications Technology. QUATIC 2019. Communications in Computer and Information Science, vol 1010. Springer, Cham.
Platt, J. (1998). Fast training of support vector machines using sequential minimal optimization. https://pdfs.semanticscholar.org/d1fa/8485ad749d51e7470d801bc1931706597601.pdf. Accessed 22 October 2019.
Pongpirul, K., & Robinson, C. (2013). Hospital manipulations in the DRG system: A systematic scoping review. Asian Biomedicine, 7, 301–310.
Google Scholar
Psaty, B. M., Boineau, R., Kuller, L. H., & Luepker, R. V. (1999). The potential costs of upcoding for heart failure in the United States. The American Journal of Cardiology, 84(108–9), A9.
Google Scholar
Quan, H., Sundararajan, V., Halfon, P., Fong, A., Burnand, B., Luthi, J.-C., Saunders, L. D., Beck, C. A., Feasby, T. E., & Ghali, W. A. (2005). Coding algorithms for defining comorbidities in icd-9-cm and icd-10 administrative data. Medical Care, 43(11), 1130–1139.
Article Google Scholar
Rea, S., Bailey, K. R., Pathak, J., Haug, P. J. (2013). Bias in recording of body mass index data in the electronic health record. AMIA Joint Summits on Translational Science Proceedings. AMIA Summit on Translational Science, 2013:214-218.
Reid, B., Allen, C., & McIntosh, J. (2005). Investigation of leukaemia and lymphoma ar-drgs at a Sydney teaching hospital. Health Information Management, 34(2), 34–39.
Article Google Scholar
Reid, B., Palmer, G., & Aisbett, C. (2000). Under-coding in Australia limits the performance of drg groupers. Health Information Management, 29(3), 113–117.
Article Google Scholar
Scott, I., Youlden, D., & Coory, M. (2004). Are diagnosis specific outcome indicators based on administrative data useful in assessing quality of hospital care? BMJ Quality & Safety, 13(1), 32–39.
Article Google Scholar
Silverman, E., & Skinner, J. (2004). Medicare upcoding and hospital ownership. Journal of Health Economics, 23(2), 369–389.
Article Google Scholar
Singh, A., Thakur, N., & Sharma, A. (2016). A review of supervised machine learning algorithms. In 2016 3rd international conference on computing for sustainable global development (INDIACom), New Delhi (pp. 1310–1315).
Google Scholar
Sjoding, M. W., Iwashyna, T. J., Dimick, J. B., & Cooke, C. R. (2015). Gaming hospital-level pneumonia 30-day mortality and readmission measures by legitimate changes to diagnostic coding. Critical Care Medicine, 43(5), 989–995.
Article Google Scholar
Souza, J., Santos, J. V., Lopes, F., Viana, J., & Freitas, A. (2018). Miscoding alerts within hospital datasets: An unsupervised machine learning approach. In A. Rocha, H. Adeli, L. P. Reis, & S. Costanzo (Eds.), Trends and advances in information systems and technologies, Advances in intelligent systems and computing, vol (Vol. 746, pp. 1198–1207). Cham p: Springer.
Chapter Google Scholar
Spangler, W. E., May, J. H., Strum, D. P., & Vargas, L. G. (2002). A data mining approach to characterizing medical code usage patterns. Journal of Medical Systems, 26(3), 255–275.
Article Google Scholar
Strong, D. M., Lee, Y. W., Wang, R. Y., Strong, D., Lee, Y. W., & Wang, R. (1997). 10 potholes in the road to information quality. IEEE Computer, 30, 38–46.
Article Google Scholar
Vapnik, V. (1995). The nature of statistical learning theory. New York, NY: Springer-Verlag.
Book MATH Google Scholar
Verplancke, T., Van Looy, S., Benoit, D., Vansteelandt, S., Depuydt, P., De Turck, F., & Decruyenaere, J. (2008). Support vector machine versus logistic regression modeling for prediction of hospital mortality in critically ill patients with haematological malignancies. BMC Medical Informatics and Decision Making, 8, 56.
Article Google Scholar
Weiskopf, N. G., & Weng, C. (2013). Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research. Journal of the American Medical Informatics Association, 20(1), 144–151.
Article Google Scholar
Yates, D., Moore, D., & McCabe, G. (1999). The practice of statistics. New York: Freeman.
Google Scholar

Download references

Acknowledgments

The authors would like to thank the Central Authority for Health Services, I.P. (ACSS) for providing access to the data. We would also like to thank to project GEMA: Generation and Evaluation of Models for Data Quality (Ref.: SBPLY/17/180501/000293) and the Master Program in Medical Informatics of the Faculty of Medicine and Faculty of Sciences of the University of Porto for financial support. Finally, we thank the project ECLIPSE (RTI2018–094283-B-C31), co-funded by the Spanish Ministry of Science, Innovation and Universities and Fundo Europeu de Desenvolvimento Regional (FEDER) funds.

Author information

Authors and Affiliations

MEDCIDS – Department of Community Medicine, Information and Health Decision Sciences, Faculty of Medicine, University of Porto, Alameda Prof. Hernâni Monteiro, 4200-319, Porto, Portugal
Julio Souza, Diana Pimenta & Alberto Freitas
CINTESIS – Center for Health Technology and Services Research, R. Dr. Plácido da Costa, 4200-450, Porto, Portugal
Julio Souza & Alberto Freitas
Information Systems and Technologies Institute (ITSI), University of Castilla-La Mancha c\Moledores s/n, 13071, Ciudad Real, Spain
Ismael Caballero

Authors

Julio Souza
View author publications
You can also search for this author in PubMed Google Scholar
Diana Pimenta
View author publications
You can also search for this author in PubMed Google Scholar
Ismael Caballero
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Freitas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julio Souza.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(DOCX 13 kb)

ESM 2

(DOCX 15 kb)

ESM 3

(DOCX 15.5 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Souza, J., Pimenta, D., Caballero, I. et al. Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database. Software Qual J 28, 1043–1061 (2020). https://doi.org/10.1007/s11219-020-09504-3

Download citation

Published: 12 June 2020
Issue Date: September 2020
DOI: https://doi.org/10.1007/s11219-020-09504-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database

Abstract

Access this article

Similar content being viewed by others

Toward the Measure of Credibility of Hospital Administrative Datasets in the Context of DRG Classification

Underestimated prevalence of heart failure in hospital inpatients: a comparison of ICD codes and discharge letter information

Identifying Candidates for Medical Coding Audits: Demonstration of a Data Driven Approach to Improve Medicare Severity Diagnosis-Related Group Coding Compliance

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Electronic supplementary material

ESM 1

ESM 2

ESM 3

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database

Abstract

Access this article

Similar content being viewed by others

Toward the Measure of Credibility of Hospital Administrative Datasets in the Context of DRG Classification

Underestimated prevalence of heart failure in hospital inpatients: a comparison of ICD codes and discharge letter information

Identifying Candidates for Medical Coding Audits: Demonstration of a Data Driven Approach to Improve Medicare Severity Diagnosis-Related Group Coding Compliance

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Electronic supplementary material

ESM 1

ESM 2

ESM 3

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation