RSS-Feed abonnieren

DOI: 10.1055/s-0042-1757880
Developing Automated Computer Algorithms to Phenotype Periodontal Disease Diagnoses in Electronic Dental Records

Abstract
Objective Our objective was to phenotype periodontal disease (PD) diagnoses from three different sections (diagnosis codes, clinical notes, and periodontal charting) of the electronic dental records (EDR) by developing two automated computer algorithms.
Methods We conducted a retrospective study using EDR data of patients (n = 27,138) who received care at Temple University Maurice H. Kornberg School of Dentistry from January 1, 2017 to August 31, 2021. We determined the completeness of patient demographics, periodontal charting, and PD diagnoses information in the EDR. Next, we developed two automated computer algorithms to automatically diagnose patients' PD statuses from clinical notes and periodontal charting data. Last, we phenotyped PD diagnoses using automated computer algorithms and reported the improved completeness of diagnosis.
Results The completeness of PD diagnosis from the EDR was as follows: periodontal diagnosis codes 36% (n = 9,834), diagnoses in clinical notes 18% (n = 4,867), and charting information 80% (n = 21,710). After phenotyping, the completeness of PD diagnoses improved to 100%. Eleven percent of patients had healthy periodontium, 43% were with gingivitis, 3% with stage I, 36% with stage II, and 7% with stage III/IV periodontitis.
Conclusions We successfully developed, tested, and deployed two automated algorithms on big EDR datasets to improve the completeness of PD diagnoses. After phenotyping, EDR provided 100% completeness of PD diagnoses of 27,138 unique patients for research purposes. This approach is recommended for use in other large databases for the evaluation of their EDR data quality and for phenotyping PD diagnoses and other relevant variables.
Keywords
periodontal disease - data quality - automated algorithms - electronic dental record - phenotypeProtection of Human Subjects and Animals in Research
This study was reviewed and approved by our institutional review board (IRB: 28321) granted authorization. In this retrospective study, we used deidentified patient datasets from the patients' EDR records; therefore, informed consents were not required to obtain.
Publikationsverlauf
Eingereicht: 26. Februar 2022
Angenommen: 30. August 2022
Artikel online veröffentlicht:
22. November 2022
© 2022. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany
-
References
- 1 Bruland P, Doods J, Storck M, Dugas M. What information does your EHR contain? Automatic generation of a Clinical Metadata Warehouse (CMDW) to support identification and data access within distributed clinical research networks. Stud Health Technol Inform 2017; 245: 313-317
- 2 Song M, Liu K, Abromitis R, Schleyer TL. Reusing electronic patient data for dental clinical research: a review of current status. J Dent 2013; 41 (12) 1148-1163
- 3 Coorevits P, Sundgren M, Klein GO. et al. Electronic health records: new opportunities for clinical research. J Intern Med 2013; 274 (06) 547-560
- 4 Siddiqui Z, Wang Y, Patel J, Thyvalikakath T. Differences in medication usage of dental patients by age, gender, race/ethnicity and insurance status. Technol Heal Care 2021; 29 (06) 1099-1108
- 5 Patel J, Siddiqui Z, Krishnan A, Thyvalikakath TP. Leveraging electronic dental record data to classify patients based on their smoking intensity. Methods Inf Med 2018; 57 (5-06): 253-260
- 6 Thyvalikakath TP, Duncan WD, Siddiqui Z. et al; National Dental PBRN Collaborative Group. Leveraging electronic dental record data for clinical research in the National Dental PBRN Practices. Appl Clin Inform 2020; 11 (02) 305-314
- 7 Watson JI, Patel JS, Ramya MB. et al. Longevity of crown margin repairs using glass ionomer cement: a retrospective study. Oper Dent 2021; 46 (03) 263-270
- 8 Thyvalikakath TP, Padman R, Vyawahare K, Darade P, Paranjape R. Utilizing dental electronic health records data to predict risk for periodontal disease. Stud Health Technol Inform 2015; 216: 1081
- 9 Krois J, Ekert T, Meinhold L. et al. Deep learning for the radiographic detection of periodontal bone loss. Sci Rep 2019; 9 (01) 8495
- 10 Weiskopf NG, Weng C. Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc 2013; 20 (01) 144-151
- 11 Martin S, Wagner J, Lupulescu-Mann N. et al. Comparison of EHR-based diagnosis documentation locations to a gold standard for risk stratification in patients with multiple chronic conditions. Appl Clin Inform 2017; 8 (03) 794-809
- 12 Patel J, Zai A, Kumar K. et al. Retrospective study of deriving periodontal disease diagnosis from periodontal findings. J Dent Res 2020. Available at: https://iadr.abstractarchives.com/abstract/20iags-3323278/retrospective-study-of-deriving-periodontal-disease-diagnosis-from-periodontal-findings
- 13 Patel J, Zai A, Kumar K. et al. Utilizing electronic dental record data to monitor periodontal disease progression. mobilize Comput Biomed Knowl; July 18–19 2019; Bethesda, Maryland; Abstract 18. Accessed September 22, 2022.
- 14 Eke PI, Thornton-Evans GO, Wei L, Borgnakke WS, Dye BA, Genco RJ. Periodontitis in US adults: National Health and Nutrition Examination Survey 2009-2014. J Am Dent Assoc 2018; 149 (07) 576-588.e6
- 15 Ramseier CA, Anerud A, Dulac M. et al. Natural history of periodontitis: disease progression and tooth loss over 40 years. J Clin Periodontol 2017; 44 (12) 1182-1191
- 16 Eke PI, Dye BA, Wei L. et al. Update on prevalence of periodontitis in adults in the United States: NHANES 2009 to 2012. J Periodontol 2015; 86 (05) 611-622
- 17 Albandar JM. Epidemiology and risk factors of periodontal diseases. Dent Clin North Am 2005; 49 (03) 517-532 , v–vi
- 18 Trombelli L, Farina R, Silva CO, Tatakis DN. Plaque-induced gingivitis: case definition and diagnostic considerations. J Clin Periodontol 2018; 45 (Suppl. 20) S44-S67
- 19 Weiskopf NG, Bakken S, Hripcsak G, Weng C. A data quality assessment guideline for electronic health record data reuse. EGEMS (Wash DC) 2017; 5 (01) 14
- 20 Patel JS. Utilizing Electronic Dental Record Data to Track Periodontal Disease Change. 2020 . Available at: https://scholarworks.iupui.edu/bitstream/handle/1805/23677/Patel_iupui_0104D_10455.pdf?sequence=1
- 21 Chan KS, Fowles JB, Weiner JP. Review: electronic health records and the reliability and validity of quality measures: a review of the literature. Med Care Res Rev 2010; 67 (05) 503-527
- 22 Patel J, Mowery D, Krishnan A, Thyvalikakath T. Assessing information congruence of documented cardiovascular disease between electronic dental and medical records. AMIA Annu Symp Proc 2018; 2018: 1442-1450
- 23 Mullins J, Yansane A, Kumar SV. et al. Assessing the completeness of periodontal disease documentation in the EHR: a first step in measuring the quality of care. BMC Oral Health 2021; 21 (01) 282
- 24 Tonetti MS, Greenwell H, Kornman KS. Staging and grading of periodontitis: Framework and proposal of a new classification and case definition. J Periodontol 2018; 89 (Suppl. 01) S159-S172
- 25 Pathak J, Bailey KR, Beebe CE. et al. Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium. J Am Med Inform Assoc 2013; 20 (e2): e341-e348
- 26 Koleck TA, Dreisbach C, Bourne PE, Bakken S. Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review. J Am Med Inform Assoc 2019; 26 (04) 364-379
- 27 Khalifa A, Meystre S. Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes. J Biomed Inform 2015; 58 (Suppl): S128-S132
- 28 Liu J, Li C, Xu J, Wu H. A patient-oriented clinical decision support system for CRC risk assessment and preventative care. BMC Med Inform Decis Mak 2018; 18 (Suppl. 05) 118
- 29 GitHub—chrisleng/ehost: Annotation Tool: The extensible Human Oracle Suite of Tools (eHOST). Accessed September 9, 2022 at: https://github.com/chrisleng/ehost
- 30 Ravanelli M, Parcollet T, Bengio Y. The Pytorch-kaldi Speech Recognition Toolkit. ICASSP—IEEE International Conference on Acoustics, Speech and Signal Processing; 2019
- 31 Loper E, Bird S. NLTK: The Natural Language Toolkit. Proceedings of the ACL-02 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Philadelphia, PA: Association for Computational Linguistics. . doi:10.48550/arxiv.cs/0205028
- 32 Hammami L, Paglialonga A, Pruneri G. et al. Automated classification of cancer morphology from Italian pathology reports using natural language processing techniques: a rule-based approach. J Biomed Inform 2021; 116: 103712
- 33 Geng W, Qin X, Wang Z, Kong Q, Tang Z, Jiang L. Model-based reasoning methods for diagnosis in integrative medicine based on electronic medical records and natural language processing. medRxiv 2020; :2020.07.12.20151746.
- 34 Wu L, Dodoo NA, Wen TJ, Ke L. Understanding Twitter conversations about artificial intelligence in advertising based on natural language processing. Int J Advert 2021; 41 (04) 685-702
- 35 Yang F, Wang X, Ma H, Li J. Transformers-sklearn: a toolkit for medical language understanding with transformer-based models. BMC Med Inform Decis Mak 2021; 21 (2, Suppl 2): 90
- 36 Lalkhen AG, McCluskey A. Clinical tests: sensitivity and specificity. Contin Educ Anaesth Crit Care Pain 2008; 8 (06) 221-223