CC BY-NC-ND 4.0 · Yearb Med Inform
DOI: 10.1055/s-0039-1677903
Working Group Contribution
IMIA and Georg Thieme Verlag KG

Artificial Intelligence in Clinical Decision Support: Challenges for Evaluating AI and Practical Implications

A Position Paper from the IMIA Technology Assessment & Quality Development in Health Informatics Working Group and the EFMI Working Group for Assessment of Health Information Systems
Farah Magrabi
1  Macquarie University, Australian Institute of Health Innovation, Sydney, Australia
,
Elske Ammenwerth
2  UMIT, University for Health Sciences, Medical Informatics and Technology, Institute of Medical Informatics, Hall in Tyrol, Austria
,
Jytte Brender McNair
3  Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
,
Nicolet F. De Keizer
4  Amsterdam UMC, University of Amsterdam, Department of Medical Informatics, Amsterdam Public Health research institute, The Netherlands
,
Hannele Hyppönen
5  National Institute for Health and Welfare, Information Department, Helsinki, Finland
,
Pirkko Nykänen
6  Tampere University, Faculty for Information Technology and Communication Sciences, Tampere, Finland
,
Michael Rigby
7  Keele University, School of Social Science and Public Policy, Keele, United Kingdom
,
Philip J. Scott
8  University of Portsmouth, Centre for Healthcare Modelling and Informatics, Portsmouth, United Kingdom
,
Tuulikki Vehko
5  National Institute for Health and Welfare, Information Department, Helsinki, Finland
,
Zoie Shui-Yee Wong
9  St. Luke’s International University, Tokyo, Japan
,
Andrew Georgiou
1  Macquarie University, Australian Institute of Health Innovation, Sydney, Australia
› Author Affiliations
Further Information

Publication History

Publication Date:
25 April 2019 (eFirst)

  

Summary

Objectives: This paper draws attention to: i) key considerations for evaluating artificial intelligence (AI) enabled clinical decision support; and ii) challenges and practical implications of AI design, development, selection, use, and ongoing surveillance.

Method: A narrative review of existing research and evaluation approaches along with expert perspectives drawn from the International Medical Informatics Association (IMIA) Working Group on Technology Assessment and Quality Development in Health Informatics and the European Federation for Medical Informatics (EFMI) Working Group for Assessment of Health Information Systems.

Results: There is a rich history and tradition of evaluating AI in healthcare. While evaluators can learn from past efforts, and build on best practice evaluation frameworks and methodologies, questions remain about how to evaluate the safety and effectiveness of AI that dynamically harness vast amounts of genomic, biomarker, phenotype, electronic record, and care delivery data from across health systems. This paper first provides a historical perspective about the evaluation of AI in healthcare. It then examines key challenges of evaluating AI-enabled clinical decision support during design, development, selection, use, and ongoing surveillance. Practical aspects of evaluating AI in healthcare, including approaches to evaluation and indicators to monitor AI are also discussed.

Conclusion: Commitment to rigorous initial and ongoing evaluation will be critical to ensuring the safe and effective integration of AI in complex sociotechnical settings. Specific enhancements that are required for the new generation of AI-enabled clinical decision support will emerge through practical application.