Keywords
Artificial intelligence - consumer informatics - patient - participatory health -
machine learning - social media - consumer
1 Introduction
For this 28th edition of the Yearbook of Medical Informatics, the topic of “Artificial
Intelligence (AI) in Health: New Opportunities, Challenges, and Practical Implications”
is thought-provoking, especially when it comes to ’re-imagining’ the future for patients
and consumers as AI technologies are increasingly introduced into our daily lives.
Over the past years, our ability to store large repositories of data has surpassed
the ability to effectively and efficiently develop actionable knowledge from these
sources.
For example, cloud artificial intelligence and machine learning platform services,
known collectively as AI Platforms as a Service (PaaS), are placed at the peak of
the Gartner hype curve in 2018 [1]. Due to the opportunity of having access to large quantities of data that were previously
unavailable, and the technical challenge of not being able to deliver real-time actionable
knowledge from these sources, many researchers have worked on enhancing machine learning
algorithms to extract meaning from these sources. As a result, many of the AI applications
we are witnessing today are reliant on having access to large repositories of data.
Success of these data-driven approaches varies across disciplines and depends on the
quality and quantity of the data available, the specificity of the task, the appropriate
choice of algorithms, the rigour in the execution, as well as the domain expertise
available to guide the analysis and interpretation.
In health, recent AI developments that are showing promising results are data-driven
approaches, specifically for clinician-facing applications such as image analysis
and interpretation in radiology [2], [3]. In the world of patients and consumers, recent AI applications have also taken
a data-driven approach. Social media platforms, including online health communities,
have become popular sources for individuals to connect and exchange support. Due to
the relative ease to access publicly-available data, many researchers have tapped
into mining social media as a way to explore how AI can be applied to get a better
understanding of patient and consumer experiences. As a result, a majority of recent
studies reporting AI approaches for patients and consumers focuses on secondary analyses
of social media data [4]. While social media could be a good source to understand how individuals cope with
and manage their conditions, they also present high risks due to widespread dissemination
of poor-quality or incorrect information. As a result, researchers have proposed various
data-driven approaches to analyse patients’ online behaviours and address the problems
they experience online, such as detecting disclosure of personal health information
on Twitter [5] and determining which online health forum threads require moderator assistance [6].
However, these data-driven approaches, regardless of whether they are focused on clinicians,
consumers, or patients, represent a narrow focus of AI [7]. In this paper, we examine how AI approaches are currently used for patients and
consumers, present papers that are representative in the year 2018, and highlight
untapped opportunities for research in AI for patients and consumers.
2 Methodology
2.1 Search Strategy
We used PubMed to conduct our search, capturing papers relevant to consumer health
informatics and artificial intelligence published in the year 2018. The search strategy
was based on the PICO framework, P-Population/Problem, I-Intervention, C-Comparison,
O-Outcome, where ’Problem’ refers to the various digital environments consumers and
patients participate in, e.g., social media, online health communities; “Intervention” comprises of various AI
methods and technologies; and “Outcome” outlines the impact or resulting effects of
participatory health, i.e., patient-centred initiatives to empower individuals in their health decisions and behaviours
[4]. A “comparison” intervention was not included as it is not relevant in this review.
We started with a search query adopted from previous work [2], [4]. Step by step, we refined the query to include keywords related to digital/social
media (41 keywords), artificial intelligence (52 keywords), and participatory health
(24 keywords). MeSH terms and the syntax “[All Fields]” were used wherever possible
to ensure our search strategy was comprehensive. The final search query is listed
below:
((2018[DP] NOT pubstatusaheadofprint) NOT Bibliography[pt] NOT Comment[pt] NOT Editorial[pt]
NOT Letter[pt] NOT News[pt] NOT Case Reports[pt] NOT Published Erratum[pt] NOT Historical
Article[pt] NOT legislation[pt] NOT (“review”[pt] OR “review literature as topic”[MeSH]
OR “literature review”[All Fields]))
AND (“social media”[All Fields] OR “facebook”[All Fields] OR “twitter”[All Fields]
OR “youtube” [All Fields] OR “instagram” [All Fields] OR “pinterest”[All Fields] OR
“google trends”[All Fields] OR “snapchat”[All Fields] OR “whatsapp” [All Fields] OR
“posts”[All Fields] OR “blog”[All Fields] OR “microblog”[All Fields] OR “wiki”[All
Fields] OR “health communities”[All Fields] OR “social network site”[All Fields] OR
“social web” [All Fields] OR “online social network”[All Fields] OR “social environment”[All
Fields] OR “social process”[All Fields] OR “social com- petition” [All Fields] OR
“social norm”[All Fields] OR “social feedback”[All Fields] OR “social influence”[All
Fields] OR “social com- parison”[All Fields] OR “social network”[All Fields] OR “discussion
group”[All Fields] OR “support group”[All Fields] OR “social support”[All Fields]
OR “community net- work”[All Fields] OR “online community”[All Fields] OR “second
life”[All Fields] OR “virtual worlds”[All Fields] OR “virtual real- ity”[All Fields]
OR “web 2.0”[All Fields] OR “web 3.0”[All Fields] OR “medicine 2.0”[All Fields] OR
“health 2.0”[All Fields] OR “digital health”[All Fields] OR “platform”[All Fields]
OR “nontraditional data sources”[All Fields] OR “novel data streams”[All Fields])
AND (“data science”[All Fields] OR “artificial intelligence”[All Fields] OR “learning
systems”[All Fields] OR “big data”[All Fields] OR “machine learning”[All Fields] OR
“deep learning”[All Fields] OR “reinforcement learning”[All Fields] OR “supervised
learning”[All Fields] OR “unsupervised learning”[All Fields] OR “active learning”[All
Fields] OR “neural net- work”[All Fields] OR “convolutional neural network”[All Fields]
OR “recurrent neural network”[All Fields] OR “natural language processing”[All Fields]
OR “text mining”[All Fields] OR “support vector machine”[All Fields] OR “support vector
network”[All Fields] OR “support vector classifier”[All Fields] OR “naive bayes”[All
Fields] OR “bayesian network”[All Fields] OR “bayesian learning”[All Fields] OR “boosting”[All
Fields] OR “machine intelligence”[All Fields] OR “computational intelligence”[All
Fields] OR “decision tree”[All Fields] OR “ensemble trees”[All Fields] OR “random
forest”[All Fields]] OR “clustering”[All Fields] OR “classification”[All Fields] OR
“validation”[All Fields] OR “first-order logic”[All Fields] OR “fuzzy model”[All Fields]
OR “cellular automaton”[All Fields] OR “markov model”[All Fields] OR “swarm intelligence”[All
Fields] OR “knowledge reasoning”[All Fields] OR “computational inference”[All Fields]
OR “model stacking”[All Fields] OR “intelligent agent”[All Fields] OR “multiagent
system”[All Fields] OR “conversational agent”[All Fields] OR “case-based reasoning”[All
Fields] OR “rule-based system”[All Fields] OR “knowledge-based reasoning”[All Fields]
OR “knowledge representation”[All Fields] OR “qualitative reasoning”[All Fields] OR
“decision-theoretic planning”[All Fields] OR “computer reasoning”[All Fields] OR “prediction”[All
Fields] OR “genetic algorithms”[All Fields] OR “evolutionary algorithms”[All Fields]
OR “evolutionary computing”[All Fields])
AND (“digital behavior”[All Fields] OR “compliance”[All Fields] OR “observance”[All
Fields] OR “pharmaco epidemiology”[All Fields] OR “digital epidemiology”[All Fields]
OR “infoveillance”[All Fields] OR “participatory health”[All Fields] OR “participatory
medicine” [All Fields] OR “patient engagement”[All Fields] OR “participatory medicine”[All
Fields] OR “patient empowerment” [All Fields] OR “shared decision making” [All Fields]
OR “patient-practitioner relationship” [All Fields] OR “consumer health” [All Fields]
OR “consumer empowerment” [All Fields] OR “citizen health” [All Fields] OR “citizen
engagement” [All Fields] OR “citizen empowerment” [All Fields] OR “participative medicine”
[All Fields] OR “personalized medicine” [All Fields] OR “precision medicine” [All
Fields] OR “predictive medicine” [All Fields] OR “preventive medicine” [All Fields]
OR “disease management” [All Fields])
2.2 Bibliometrics Analyses
To understand the state of the literature, we applied various bibliometrics tools
onto the original set of articles returned from the search query. The “Bibliometrix”
package from R [8] was used on the retrieved articles to report frequency of keywords used by authors,
and a topic dendrogram to cluster keywords and examine whether these clusters follow
a hierarchical structure.
3 Results
3.1 State of the Literature
Ninety-nine articles were returned from the search query. [Figure 1] reports the frequency of the 50 most common keywords used by authors in these retrieved
articles. The five most frequent keywords are (from most frequent to least frequent):
precision medicine, social media, big data, infodemiology, and machine learning, illustrating
the current focus on data-driven approaches to AI for patients and consumers.
Fig. 1 Frequency of authors' keywords in the 99 retrieved articles
[Figure 2] illustrates a dendrogram applied to the 99 retrieved articles to explore whether
keywords used by authors tended to cluster in groups, and whether these clusters follow
a hierarchical structure. Three small clusters emerged, illustrating the keywords
that were commonly used together by authors when describing AI approaches for patients
and consumers. For example: “machine learning” and “precision” in cluster 1; “infode-
miology”, “mortality”, and “outcome” in cluster 2; and “forensic DNA phenotyping”
and “DNA methylation” in cluster 3.
Fig. 2 Topic dendrogram of author’s keywords throughout the 99 retrieved articles.
We did not observe a clear hierarchical structure amongst these clusters but “digital
health” emerged as a unique term associated at the root of the tree. Furthermore,
there is a list of keywords near the centre of the dendrogram that do not seem discriminant
enough to form a cluster (e.g., natural language processing, big data, depression, twitter, internet, text mining,
etc.). The inability to form clusters amongst these keywords may suggest that although
there is research activity in each of these individual areas, different authors used
different approaches and there is not yet a body of work that combines these concepts,
methods, or techniques when authors describe their work in the literature.
3.2 Best Paper Selection
The 99 retrieved articles were then screened by section editors, which resulted in
14 articles considered for best paper selection. Elements that were considered in
the screening decision include: 1) level of relevance regarding the 2019 yearbook
topic “Artificial Intelligence in Health: New Opportunities, Challenges, and Practical
Implications”; 2) whether the AI application was focused only on patients and consumers;
3) nature of the healthcare problem addressed; and 4) level of innovative approach.
The selected 14 articles were then presented to a panel of international experts for
full paper review and scoring according to the IMIA Yearbook best paper selection
process. The three papers that received the highest scores were then discussed in
a consensus meeting, and it was agreed upon that they were representative papers on
artificial intelligence in health for patients and consumers for the year 2018.
The final three best papers selected after peer review process are listed in [Table 1].
Table 1
Best paper selection of articles for the IMIA Yearbook of Medical Informatics 2019
in the section ‘Education and Consumer Health Informatics’. The articles are listed
in alphabetical order of the first author’s surname.
Section
Education and Consumer Health Informatics
|
▪ Abdellaoui R, Foulquie P, Texier N, Faviez C, Burgun A, Schuck S. Detection of Cases
of Noncompliance to Drug Treatment in Patient Forum Posts: Topic Model Approach. J
Med Internet Res 2018;20(3):e85.
|
▪ Jones J, Pradhan M, Hosseini M, Kulanthaivel A, Hosseini M. Novel Approach to Cluster
Patient-Generated Data Into Actionable Topics: Case Study of a Web-Based Breast Cancer
Forum. JMIR Med Inform 2018;6(4):e45.
|
▪ Park A, Conway M, Chen AT. Examining Thematic Similarity, Difference, and Membership
in Three Online Mental Health Communities from Reddit: A Text Mining and Visualization
Approach. Comput Human Behav 2018 Jan;78:98-112.
|
These papers shared a common methodology of using data-driven algorithms (such as
text mining, topic modelling, or Latent Dirichlet allocation modelling), combined
with insight-led approaches (e.g. visualisation, qualitative analysis, or manual review),
to uncover patient and consumer experiences of health and illnesses in online communities.
For example, Abdellaoui et al, [9] outlined a methodology to detect medication non-compliance behaviours amongst people
on antidepressant and antipsychotic medications by modelling the way dosage variation
and treatment interruption behaviours were discussed online. Jones et al., [10] demonstrated it was possible to uncover the hidden, less obvious aspects of breast
cancer management and recovery in online discussion forums, including aspects that
are not easily ascertainable in patient clinics (e.g., side effects while in remission, financial challenges experienced by cancer survivors
over time). Similarly, Park et al., [11] identified subtle differences in the types of concerns expressed online by individuals
experiencing different mental health conditions (e.g., people with depression often discussed events associated with changes in mood whereas
discussion topics amongst people with anxiety or post-traumatic stress disorder clustered
around treatment- and medication-related issues).
4 Conclusions
Despite the attention and expectations given to artificial intelligence (AI), we did
not find eligible articles published in 2018 that reported AI applications designed
specifically for patients or consumers, nor literature that elicited patient and consumer
input on AI. Currently, the most common use of AI for patients and consumers lies
in secondary analysis of social media data (e.g., online discussion forums). In particular, the three 2018 best papers share a common
methodology of using data-driven algorithms, combined with insight-led approaches,
to uncover patient and consumer experiences in online communities.
Currently, there is a lack of direction and evidence on how AI would actually benefit patients and consumers. Perhaps instead of focusing on data and algorithms,
researchers should engage with patients and consumers early in the AI research agenda to ensure we are indeed asking the right questions, and
that important use cases and critical contexts are identified together with patients and consumers. Without a clear understanding on why patients and consumers
need AI in the first place, how AI could support individuals with their healthcare
needs, and what are the capabilities and limitations of AI, it is difficult to imagine
the kinds of AI applications that would have meaningful and sustainable impact on
individuals’ daily lives.
Artificial Intelligence in 2018 may not yet be at the state that meets the expectations
of patients and consumers. However, this presents a number of untapped opportunities
for research. While many have already made way in using data-driven and machine learning
approaches in health, perhaps the challenge of AI for patients and consumers lies
in how people will interact with the technology (i.e. human-computer interaction)
[4]. For patients and consumers to truly benefit from AI, the design of the technology
may need to be embedded deeply in their environment or perhaps even invisibly in their
daily routine [4]. For example, with the rise of voice-only or voice-first interfaces, one could explore
whether conversational agents have a role to support patients and consumers with their
daily tasks [12]. In addition, real-life decision support for patients and consumers remains an open
opportunity provided the right problem, use case, and interaction mode are identified.
To conclude, we leave readers with the words of Heht [13]: “The public’s view of artificial intelligence might not be accurate, but that doesn’t
mean researchers can ignore it”.