Skip to main content

Advertisement

Log in

Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database

  • Published:
Software Quality Journal Aims and scope Submit manuscript

Abstract

Some countries have adopted the diagnosis-related groups (DRG) system to pay hospitals according to the number and complexity of patients they treat. Translating diseases and procedures into medical codes based on international standards such as ICD-9-CM or ICD-10-CM/PCS is at the core of the DRG systems. However, certain types of coding errors undermine this system, namely, upcoding, in which data is manipulated by deliberately using medical codes that increase patient’s complexity, resulting in higher reimbursements. In this sense, ensuring data credibility in the context of upcoding is critical for an effectively functioning DRG system. We developed a method to measure data credibility in the context of upcoding through a case study using data on pneumonia-related hospitalizations from six public hospitals in Portugal. Frequencies of codes representing pneumonia-related diagnosis and comorbidities were compared between hospitals and support vector machine models to predict DRGs were employed to verify whether codes with discrepant frequencies were related to upcoding. Data were considered not credible if codes with discrepant frequencies were responsible for increasing DRG complexity. Six pneumonia-related diagnoses and fifteen comorbidities presented a higher-than-expected frequency in at least one hospital and a link between increased DRG complexity, and these targeted codes was found. However, overall credibility was very high for nearly all conditions, except for renal disease, which presented the highest percentage of potential upcoding. The main contribution of this paper is a generic and reproducible method that can be employed to monitor data credibility in the context of upcoding in DRG databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Administração Central do Sistema de Saúde (2019). Benchmarking hospitais - grupos e instituições. http://benchmarking.acss.min-saude.pt/BH_Enquadramento/GrupoInstituicoes, .

  • Administração Central do Sistema de Saúde (2014). Agrupador de GDH All Patient Refined DRG. http://www2.acss.min-saude.pt/Portals/0/CN22.pdf. .

  • Aelvoet, W., Terryn, N., Windey, F., Redivo, M., van Sprundel, M., & Faes, C. (2009). Miscoding: A threat to the hospital care system. How to detect it? Revue d’epidemiologie et de sante publique, 57(3), 169–177.

    Article  Google Scholar 

  • Aiello, F. A., & Roddy, S. P. (2017). Inpatient coding and the diagnosis-related groups. Journal of Vascular Surgery, 66(5), 1621–1623.

    Article  Google Scholar 

  • Alonso, V., Santos, J. V., Pinto, M., Ferreira, J., Lema, I., Lopes, F., & Freitas, A. (2019). Health records as the basis of clinical coding: Is the quality adequate? A qualitative study of medical coders' perceptions. Health Information Management Journal. https://doi.org/10.1177/1833358319826351.

  • Averill, R. F., McCullough, E. C., Goldfield, N. I., Hughes, J. S., Bonazelli, J., Bentley, L. (2013). 3M APR-DRG classification system methodology overview, version 31. 3M health information systems. https://www.hcup-us.ahrq.gov/db/nation/nis/grp031_aprdrg_meth_ovrview.pdf. .

  • Barros, P. P., & Braun, G. (2017). Upcoding in a national health service: The evidence from Portugal. Health Economics, 26(5), 600–618.

    Article  Google Scholar 

  • Carter, G. M., Newhouse, J. P., & Relles, D. A. (1990). How much change in the case mix in-dex is DRG creep? Journal of Health Economics, 9(4), 411–428.

    Article  Google Scholar 

  • Carter, G. M., Newhouse, J. P., & Relles, D. A. (1991). Has DRG creep crept up? Decomposing the case mix index change between 1987 and 1988. Santa Monica, California: RAND Corporation.

    Google Scholar 

  • Centers for Medicare and Medicaid Services. (2014). International classification of diseases. Clinical Modification: Ninth Revision https://www.cms.gov/Medicare/Coding/ICD9ProviderDiagnosticCodes/codes.html. .

    Google Scholar 

  • Centers for Medicare and Medicaid Services. (2019). International classification of diseases. Clinical Modification: Tenth Revision https://www.cms.gov/Medicare/Coding/ICD10/index.html. .

    Google Scholar 

  • Chong, W. F., Ding, Y. Y., & Heng, B. H. (2011). A comparison of comorbidities obtained from hospital administrative data and medical charts in older patients with pneumonia. BMC Health Services Research, 11, 105.

    Article  Google Scholar 

  • Chu, A., Ahn, H., Halwan, B., Kalmin, B., Artifon, E. L., Barkun, A., Lagoudakis, M. G., & Kumar, A. (2008). A decision support system to facilitate management of patients with acute gastrointestinal bleeding. Artificial Intelligence in Medicine, 42(3), 247–259.

    Article  Google Scholar 

  • Dafny, L. S. (2005). How do hospitals respond to price changes? American Economic Review, 95(5), 1525–1547.

    Article  Google Scholar 

  • Di Giacomo, M., Piacenza, M., Siciliani, L., & Turati, G. (2017). Do public hospitals respond to changes in DRG price regulation? The case of birth deliveries in the Italian NHS. Health Economics, 26, 23–37.

    Article  Google Scholar 

  • Feder, S. L. (2018). Data quality in electronic health records research: Quality domains and assessment methods. Western Journal of Nursing Research, 40(5), 753–766.

    Article  Google Scholar 

  • Freitas A., Lema I., da Costa-Pereira A. (2016) Comorbidity coding trends in hospital administrative databases. In: Rocha Á., Correia a., Adeli H., Reis L., Mendonça Teixeira M. (eds), New Advances in Information Systems and Technologies. Advances in intelligent systems and computing, vol 445. Springer, Cham.

  • Goodpasture, H., Nguyen-Dang, C., Lee, T. H., Ghazarian, P. G., & Fulton, M. A. (2004). Miscoding as a cause of elevated simple pneumonia mortality. The Joint Commission Journal on Quality and Safety, 30(6), 335–341.

    Article  Google Scholar 

  • Hebert, P. L., McBean, A. M., & Kane, R. L. (2005). Explaining trends in hospitalizations for pneumonia and influenza in the elderly. Medical Care Research and Review, 62(5), 560–582.

    Article  Google Scholar 

  • Hsia, D. C. (1990). Accuracy of Medicare reimbursement for cardiac arrest. Journal of the American Medical Association, 264(1), 59–62.

    Article  Google Scholar 

  • Hsia, D. C., Ahern, C. A., Ritchie, B. P., Moscoe, L. M., & Krushat, W. M. (1992). Medicare reimbursement accuracy under the prospective payment system, 1985 to 1988. Journal of the American Medical Association, 268(7), 896–899.

    Article  Google Scholar 

  • ISO/IEC 25012 (2006). ISO/IEC 25012: Software product quality – Data quality model. https://iso25000.com/index.php/en/iso-25000-standards/iso-25012.

  • Januleviciute, J., Askildsen, J. E., Kaarboe, O., Siciliani, L., & Sutton, M. (2016). How do hospitals respond to price changes? Evidence from Norway. Health Economics, 25(5), 620–636.

    Article  Google Scholar 

  • Jarman, B., Gault, S., Alves, B., Hider, A., Dolan, S., Cook, A., Hurwitz, B., & Iezzoni, L. I. (1999). Explaining differences in English hospital death rates using routinely collected data. British Medical Journal, 318(7197), 1515–1520.

    Article  Google Scholar 

  • Lau, E. C., Mowat, F. S., Kelsh, M. A., Legg, J. C., Engel-Nitz, N. M., Watson, H. N., Collins, H. L., Nordyke, R. J., & Whyte, J. L. (2011). Use of electronic medical records (EMR) for oncology outcomes research: Assessing the comparability of EMR information to patient registry and health claims data. Clinical Epidemiology, 3, 259–272.

    Google Scholar 

  • Lungen, M., & Lauterbach, K. W. (2000). Upcoding: A risk for the use of diagnosis-related groups. Deutsche Medizinische Wochenschrift, 125(28-29), 852–856.

    Article  Google Scholar 

  • Luo, W., & Gallagher, M. (2010). Unsupervised DRG upcoding detection in healthcare databases. In 2010 IEEE International Conference on Data Mining Workshops, Sydney, NSW (pp. 600–605).

    Chapter  Google Scholar 

  • Mathauer, I., & Wittenbecher, F. (2013). Hospital payment systems based on diagnosis-related groups: Experiences in low- and middle-income countries. Bulletin of the World Health Organization, 91(10), 746–756.

    Article  Google Scholar 

  • Ministério da Saúde. (2017). Portaria n.o 207/2017 - Diário da República n.o 132/2017, série i de 2017-07-11. http://www.acss.min-saude.pt/wp-content/uploads/2016/12/Portaria_207_2017-1.pdf. .

  • Pimenta D., Souza J., Caballero I., Freitas A. (2019) Toward the measure of credibility of hospital administrative datasets in the context of DRG classification. In: Piattini M., Rupino da Cunha P., García Rodríguez de Guzmán I., Pérez-Castillo R. (eds) Quality of Information and Communications Technology. QUATIC 2019. Communications in Computer and Information Science, vol 1010. Springer, Cham.

  • Platt, J. (1998). Fast training of support vector machines using sequential minimal optimization. https://pdfs.semanticscholar.org/d1fa/8485ad749d51e7470d801bc1931706597601.pdf. Accessed 22 October 2019.

  • Pongpirul, K., & Robinson, C. (2013). Hospital manipulations in the DRG system: A systematic scoping review. Asian Biomedicine, 7, 301–310.

    Google Scholar 

  • Psaty, B. M., Boineau, R., Kuller, L. H., & Luepker, R. V. (1999). The potential costs of upcoding for heart failure in the United States. The American Journal of Cardiology, 84(108–9), A9.

    Google Scholar 

  • Quan, H., Sundararajan, V., Halfon, P., Fong, A., Burnand, B., Luthi, J.-C., Saunders, L. D., Beck, C. A., Feasby, T. E., & Ghali, W. A. (2005). Coding algorithms for defining comorbidities in icd-9-cm and icd-10 administrative data. Medical Care, 43(11), 1130–1139.

    Article  Google Scholar 

  • Rea, S., Bailey, K. R., Pathak, J., Haug, P. J. (2013). Bias in recording of body mass index data in the electronic health record. AMIA Joint Summits on Translational Science Proceedings. AMIA Summit on Translational Science, 2013:214-218.

  • Reid, B., Allen, C., & McIntosh, J. (2005). Investigation of leukaemia and lymphoma ar-drgs at a Sydney teaching hospital. Health Information Management, 34(2), 34–39.

    Article  Google Scholar 

  • Reid, B., Palmer, G., & Aisbett, C. (2000). Under-coding in Australia limits the performance of drg groupers. Health Information Management, 29(3), 113–117.

    Article  Google Scholar 

  • Scott, I., Youlden, D., & Coory, M. (2004). Are diagnosis specific outcome indicators based on administrative data useful in assessing quality of hospital care? BMJ Quality & Safety, 13(1), 32–39.

    Article  Google Scholar 

  • Silverman, E., & Skinner, J. (2004). Medicare upcoding and hospital ownership. Journal of Health Economics, 23(2), 369–389.

    Article  Google Scholar 

  • Singh, A., Thakur, N., & Sharma, A. (2016). A review of supervised machine learning algorithms. In 2016 3rd international conference on computing for sustainable global development (INDIACom), New Delhi (pp. 1310–1315).

    Google Scholar 

  • Sjoding, M. W., Iwashyna, T. J., Dimick, J. B., & Cooke, C. R. (2015). Gaming hospital-level pneumonia 30-day mortality and readmission measures by legitimate changes to diagnostic coding. Critical Care Medicine, 43(5), 989–995.

    Article  Google Scholar 

  • Souza, J., Santos, J. V., Lopes, F., Viana, J., & Freitas, A. (2018). Miscoding alerts within hospital datasets: An unsupervised machine learning approach. In A. Rocha, H. Adeli, L. P. Reis, & S. Costanzo (Eds.), Trends and advances in information systems and technologies, Advances in intelligent systems and computing, vol (Vol. 746, pp. 1198–1207). Cham p: Springer.

    Chapter  Google Scholar 

  • Spangler, W. E., May, J. H., Strum, D. P., & Vargas, L. G. (2002). A data mining approach to characterizing medical code usage patterns. Journal of Medical Systems, 26(3), 255–275.

    Article  Google Scholar 

  • Strong, D. M., Lee, Y. W., Wang, R. Y., Strong, D., Lee, Y. W., & Wang, R. (1997). 10 potholes in the road to information quality. IEEE Computer, 30, 38–46.

    Article  Google Scholar 

  • Vapnik, V. (1995). The nature of statistical learning theory. New York, NY: Springer-Verlag.

    Book  MATH  Google Scholar 

  • Verplancke, T., Van Looy, S., Benoit, D., Vansteelandt, S., Depuydt, P., De Turck, F., & Decruyenaere, J. (2008). Support vector machine versus logistic regression modeling for prediction of hospital mortality in critically ill patients with haematological malignancies. BMC Medical Informatics and Decision Making, 8, 56.

    Article  Google Scholar 

  • Weiskopf, N. G., & Weng, C. (2013). Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research. Journal of the American Medical Informatics Association, 20(1), 144–151.

    Article  Google Scholar 

  • Yates, D., Moore, D., & McCabe, G. (1999). The practice of statistics. New York: Freeman.

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank the Central Authority for Health Services, I.P. (ACSS) for providing access to the data. We would also like to thank to project GEMA: Generation and Evaluation of Models for Data Quality (Ref.: SBPLY/17/180501/000293) and the Master Program in Medical Informatics of the Faculty of Medicine and Faculty of Sciences of the University of Porto for financial support. Finally, we thank the project ECLIPSE (RTI2018–094283-B-C31), co-funded by the Spanish Ministry of Science, Innovation and Universities and Fundo Europeu de Desenvolvimento Regional (FEDER) funds.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julio Souza.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(DOCX 13 kb)

ESM 2

(DOCX 15 kb)

ESM 3

(DOCX 15.5 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Souza, J., Pimenta, D., Caballero, I. et al. Measuring data credibility and medical coding: a case study using a nationwide Portuguese inpatient database. Software Qual J 28, 1043–1061 (2020). https://doi.org/10.1007/s11219-020-09504-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11219-020-09504-3

Keywords

Navigation