
DOI: 10.1055/a-2577-3928
Performance of AI Approaches for COVID-19 Diagnosis Using Chest CT Scans: The Impact of Architecture and Dataset
Leistungsfähigkeit von KI-Methoden zur COVID-19-Diagnose mittels Thorax-CT: Der Einfluss von KI-Architektur und Datensätzen
Supported by: Jilin Provincial Key Laboratory of Medical Imaging & Big Data 20200601003JC
Supported by: Radiology and Technology Innovation Center of Jilin Province 20190902016TC
Supported by: China International Medical Foundation, Imaging Research, SKY Z-2014-07-2003-03
Supported by: RACOON (NUM), „NUM 2.0“ FKZ: 01KX2121
- Abstract
- Zusammenfassung
- Abbreviations
- Introduction
- Materials and methods
- Results
- Discussion
- Conclusion
- References
Abstract
Purpose
AI is emerging as a promising tool for diagnosing COVID-19 based on chest CT scans. The aim of this study was to compare AI models for COVID-19 diagnosis. To this end, we: (1) trained three distinct AI models for classifying COVID-19 and non-COVID-19 pneumonia (nCP) using a large, clinically relevant CT dataset, (2) evaluated the models’ performance using an independent test set, and (3) compared the models both algorithmically and experimentally.
Materials and Methods
In this multicenter, multi-vendor study, we collected n=1591 chest CT scans of COVID-19 (n=762) and nCP (n=829) patients from China and Germany. In Germany, the data was collected from three RACOON sites. We trained and validated three COVID-19 AI models with different architectures: COVNet based on a 2D CNN, DeCoVnet based on a 3D CNN, and AD3D-MIL based on a 3D CNN with an attention module. A total of 991 CT scans were used for training the AI models using 5-fold cross-validation, and 600 CT scans from 6 different centers were used for independent testing. The models’ performance was evaluated using accuracy (Acc), sensitivity (Se), and specificity (Sp).
Results
The average validation accuracy of the COVNet, DeCoVnet, and AD3D-MIL models over the 5 folds was 80.9%, 82.0%, and 84.3%, respectively. On the independent test set with n=600 CT scans, COVNet yielded Acc=76.6%, Se=67.8%, Sp=85.7%; DeCoVnet provided Acc=75.1%, Se=61.2%, Sp=89.7%; and AD3D-MIL achieved Acc=73.9%, Se=57.7%, Sp=90.8%.
Conclusion
The classification performance of the evaluated AI models is highly dependent on the training data rather than the architecture itself. Our results demonstrate a high specificity and moderate sensitivity. The AI classification models should not be used unsupervised but could potentially assist radiologists in COVID-19 and nCP identification.
Key Points
- This study compares AI approaches for diagnosing COVID-19 in chest CT scans, which is essential for further optimizing the delivery of healthcare and for pandemic preparedness.
- Our experiments using a multicenter, multi-vendor, diverse dataset show that the training data is the key factor in determining the diagnostic performance.
- The AI models should not be used unsupervised but as a tool to assist radiologists.
Citation Format
- Jaiswal A, Fervers P, Meng F et al. Performance of AI Approaches for COVID-19 Diagnosis Using Chest CT Scans: The Impact of Architecture and Dataset. Rofo 2025; DOI 10.1055/a-2577-3928
Zusammenfassung
Ziel
Aktuell existieren verschiedenste Künstliche Intelligenz (KI)-Modelle zur Detektion und Klassifikation von Pneumonien in Thorax-CTs, aber unabhängige Vergleiche fehlen meist. In dieser Studie haben wir (1) drei verschiedene KI-Modelle zur Klassifizierung von COVID-19- und Nicht-COVID-19-Pneumonien (nCP) anhand eines klinisch relevanten CT-Datensatzes trainiert, (2) die Leistung der Modelle anhand eines unabhängigen Testsatzes bewertet und (3) die Modelle sowohl algorithmisch als auch experimentell verglichen.
Materialien und Methoden
In dieser multizentrischen, retrospektiven Studie haben wir insgesamt 1591 Thorax-CTs von COVID-19- (n=762) und nCP (n=829)-Patienten aus China und Deutschland zusammengestellt; in Deutschland wurden die CT-Daten von 3 RACOON-Standorten eingeschlossen. Es wurden 3 open-source KI-Modelle mit unterschiedlichen Architekturen trainiert und validiert: COVNet basierend auf 2D-CNN, DeCoVnet basierend auf 3D-CNN, und AD3D-MIL basierend auf 3D-CNN mit Attention-Modul. Die Performance der Modelle wurde anhand von Genauigkeit (Acc), Sensitivität (Se) und Spezifität (Sp) bewertet.
Ergebnisse
Die durchschnittliche Validierungsgenauigkeit der Modelle COVNet, DeCoVnet und AD3D-MIL über die 5-Fach-Validierung im Training mit n=991 CTs betrug 80,9%, 82,0% bzw. 84,3%. Auf dem unabhängigen Testsatz mit n=600 CTs lieferte COVNet: Acc=76,6%, Se=67,8%, Sp=85,7%; DeCoVnet: Acc=75,1%, Se=61,2%, Sp=89,7%; und AD3D-MIL: Acc=73,9%, Se=57,7%, Sp=90,8%.
Schlussfolgerung
Die Klassifizierungsleistung der evaluierten KI-Modelle hängt in hohem Maße von den Trainingsdaten und weniger von der Architektur selbst ab. Unsere Ergebnisse zeigen eine hohe Spezifität und eine moderate Sensitivität bei der Differenzierung von COVID-19- und Nicht-COVID-19-Pneumonien. Die KI-Klassifikationsmodelle sollten nicht unkritisch verwendet werden, könnten Radiologen jedoch unterstützen.
Kernaussagen
- Vorliegende Studie vergleicht KI-Ansätze zur bildbasierten Diagnose von COVID-19 in Thorax-CTs, was für die weitere Optimierung der Gesundheitsversorgung und die Vorbereitung auf mögliche künftige Pandemien relevant ist.
- Unsere Experimente mit einem multizentrischen, herstellerübergreifenden Datensatz zeigen, dass die Trainingsdaten der entscheidende Faktor für die diagnostische Leistungsfähigkeit sind.
- KI-Modelle sollten nicht autonom eingesetzt werden, sondern als unterstützendes Werkzeug in die radiologische Befundung integriert werden, um die diagnostische Entscheidungsfindung zu ergänzen, nicht zu ersetzen.
Abbreviations
Introduction
The coronavirus disease 2019 (COVID-19) pandemic is a poignant reminder of how rapidly a global health crisis can emerge and pose a significant challenge to the world's healthcare systems. The potential for other pathogens to cause future pandemics, with the lungs as a primary target, underscores the critical need for comprehensive pandemic preparedness measures. As with COVID-19, fast detection and prompt patient isolation will be crucial for curbing the spread of future pandemics. Chest computed tomography (CT) is a method of choice for imaging COVID-19 and other viral and bacterial pneumonias [1]. Due to a lack of alternative diagnostic methods in the early pandemic, chest CT was frequently performed to diagnose the disease [2]. SARS-CoV-2 reverse transcription polymerase chain reaction (RT-PCR) and antibody tests have since become widely accessible and are considered the most reliable methods for diagnosing COVID-19 [3]. Nevertheless, chest CT still has a potential role in the diagnosis of COVID-19 pneumonia and in determining the disease stage [1]. The Fleischner Society recommends chest CT as a diagnostic tool if RT-PCR resources are limited and testing could delay isolation or crucial treatment [4]. Furthermore, a patient with a suspected false-negative RT-PCR test and at least moderate clinical features qualifies for a chest CT scan [4]. Although the diagnosis of COVID-19 is currently the domain of laboratory testing, chest CT is frequently performed to provide detailed information about the severity and extent of lung involvement [5].
With the aim of supporting radiologists, numerous AI approaches have been developed for the automatic detection of COVID-19 based on CT scans. These algorithms are often based on convolutional neural networks (CNN) with two dimensions (2D) [6] [7] [8] [9] [10] [11] [12] [13] or three dimensions (3D) [14] [15] [16] [17] [18]. 2D approaches learn features from individual slices of a volumetric CT scan [6] [7] [8] [9] [10] [11] [12] [13]; slice-level results are often aggregated to obtain patient-level predictions. 3D approaches, on the other hand, utilize the 3D volume for feature extraction and directly generate patient-level predictions [14] [15] [16] [17] [18]. Different approaches require patient-level [9] [10] [14] [15] [18], slice-level [8], or pixel-level [19] labels for training. Along with conventional CNNs, machine-driven design [20] [21] has also been explored, and a hybrid strategy that combines traditional machine learning with deep learning [22] has been proposed. A summary of various algorithms proposed in the literature is shown in [Table 1].
| COVID-19 diagnosis algorithm | 2D CNN | 3D CNN | Caps-Net | Pixel-level label | Slice-level label | Patient-level label | Machine-driven design | Hybrid |
|---|---|---|---|---|---|---|---|---|
| Xiong et al. (2020) [6], Rahimzadeh et al. (2021) [8], Wang et al. (2021) [12], Wang et al. (2021) [13] | X | | | | X | | | |
| Song et al. (2021) [7], Jin et al. (2020) [9], Li et al. (2020) [10] | X | | | | | X | | |
| Qian et al. (2020) [11] | X | | | | X | X | | |
| Wang et al. (2020) [14], Han et al. (2020) [15], Lee et al. (2021) [16], Javaheri et al. (2021) [17], Wang et al. (2020) [18] | | X | | | | X | | |
| Zhang et al. (2020) [19] | X | | | X | | | | |
| Wu et al. (2021) [23] | X | | | X | | X | | |
| Amyar et al. (2020) [24], Wang et al. (2021) [25], Gao et al. (2021) [26] | X | | | X | | | | |
| Afshar et al. (2022) [27] | | | X | | | X | | |
| Qi et al. (2022) [28] | | X | | | | X | | |
| Gunraj et al. (2020) [20], Gunraj et al. (2022) [21] | X | | | | | | X | |
| Qi et al. (2021) [22], Mei et al. (2020) [29] | X | | | | | X | | X |
| Hou et al. (2021) [30] | X | X | | | X | X | | |
Many algorithms proposed in the literature lack an external validation dataset (e.g., [8] [10] [11]), which is important for assessing the generalization of AI models. An independent comparison of multiple approaches based on a common dataset can play a guiding role for both radiologists and AI developers. To address these issues, in this study, we collected a large and diverse set of CT scans from China and Germany. Chest CT data from three RACOON sites (Cologne, Frankfurt, Heidelberg) were utilized in this study. RACOON is a nationwide RAdiological COOperative Network of 36 university hospitals in Germany. It is supported by the National University Medicine Network (NUM), funded by the German Federal Ministry of Education and Research (BMBF). The unique RACOON infrastructure supports large-scale AI studies and could play a key role in Germany's pandemic preparedness program.
Using datasets from China and Germany, we aimed to assess and compare the performance of three distinct AI approaches (based on 2D and 3D CNNs) for distinguishing between COVID-19 pneumonia and non-COVID-19 pneumonia (nCP). Our overall goal was to assess three publicly available AI tools for COVID-19 diagnosis and to determine whether these tools could potentially be used to support radiologists in clinical decision-making.
Materials and methods
Dataset
For this retrospective IRB-approved study, we collected a multicenter, multi-vendor chest CT dataset consisting of n=1591 chest CT scans of COVID-19 (n=762) and nCP (n=829) patients from China and Germany ([Fig. 1]). For the COVID-19 class, the inclusion criteria were pulmonary infiltration and a positive RT-PCR test within 48 h before the CT examination. For the nCP class, the inclusion criteria were: (1) inflammatory infiltrations on CT scans acquired before the outbreak of COVID-19 (February 2016 to December 2019), (2) an additional negative RT-PCR test after the outbreak of COVID-19 (January 2020). The exclusion criteria were imaging features consistent with lung tumors, tuberculosis, or traumatic and postoperative scarred lesions.


Our dataset is balanced across the two disease classes. The COVID-19 class includes cases at different disease stages (early: 0–3 days, progressive: 4–7 days, peak: 8–14 days, and absorption: ≥15 days), assessed based on CT morphology and the interval between CT scanning and symptom onset [31]. Based on laboratory etiological confirmation, the nCP class includes pneumonia caused by viral and bacterial pathogens. The number of scans and their distribution over the two classes and the subcategories for the training and test datasets are presented in [Table 2]. Detailed patient demographics across the different centers are given in [Table 3]. The entire Chinese dataset, which contains only full-dose CT scans, was acquired on scanners from seven different CT device manufacturers. The dataset from Germany, which contains 53.7% low-dose and 46.3% full-dose CT scans, was acquired on scanners from two CT device manufacturers ([Table 3]). Further information about vendors, protocols, etc. can be found in a recent article [32].
We used n=991 CT scans (n=462 COVID-19 and n=529 nCP) from three different centers in China to train the AI models using a five-fold cross-validation approach. In each fold, a non-overlapping 80% of the CT scans was used for training and the remaining 20% for validation. CT scans from all six centers in China (n=300; centers: Jilin, Wuhan, Ningbo) and Germany (n=300, external dataset; centers: Cologne, Frankfurt, Heidelberg) were used for independent testing to obtain an indication of the generalization of the trained models. This independent test set contained a total of n=600 CT scans with balanced COVID-19 and nCP classes, none of which the algorithms saw during training or validation. The internal test set came from the same sites as the training dataset, whereas the external test set came from different sites. A detailed description of the training and validation process was recently published [32].
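The five-fold split described above can be sketched as follows. This is a minimal illustration with a stand-in label array; scikit-learn's `StratifiedKFold` is an assumed substitute for whatever split procedure was actually used:

```python
# Illustrative 5-fold split of the 991 training scans (462 COVID-19,
# 529 nCP). StratifiedKFold keeps the class ratio similar in each fold.
import numpy as np
from sklearn.model_selection import StratifiedKFold

labels = np.array([1] * 462 + [0] * 529)   # 1 = COVID-19, 0 = nCP (stand-in)
scan_ids = np.arange(labels.size)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
folds = [(train_idx, val_idx) for train_idx, val_idx in skf.split(scan_ids, labels)]
# Each fold: ~80% of scans for training, ~20% for validation.
```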
Automatic COVID-19 diagnosis
For this study, we selected AI algorithms based on criteria such as: (1) different network architectures and training strategies, (2) availability of code and documentation, and (3) the ability to train with only patient-level labels. For more details about our literature search and model selection, see supplement 1. We trained and validated the selected models for diagnosing COVID-19 using chest CT scans: COVNet [10] based on a 2D CNN, DeCoVnet [14] based on a 3D CNN, and AD3D-MIL [15] based on a 3D CNN with an attention module. All three approaches consist of two steps ([Fig. 2]): the first step segments the lung area to avoid the effect of irrelevant regions, and the second step classifies the lung-masked CT scan as COVID-19 or nCP.


Lung segmentation
For lung segmentation, we utilized three different models: Seg-Net [33], U-Net [9] [34], and U-Net(R-231) [34] [35]. The models were trained on different datasets and employ slightly different preprocessing. The Seg-Net [33] model was trained and tested using 44,500 CT slices; preprocessing involved resampling (1×1 mm), rescaling (512×512), windowing [-1000, 500 HU], and normalization (0–1). The 2D U-Net [9] [34] model was trained and tested using 16,223 CT slices; preprocessing included resampling (1×1 mm), windowing [-1200, 700 HU], and normalization (0–1). The U-Net(R-231) [35] model is based on U-Net [34] with batch normalization and was trained on a diverse dataset of 62,224 CT slices; preprocessing included body cropping, rescaling (256×256), windowing [-1024, 600 HU], and normalization (0–1).
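The windowing and normalization steps listed above can be illustrated in a few lines. The window values follow the Seg-Net preprocessing ([-1000, 500] HU, rescaled to 0–1); the function name is our own:

```python
# Clip a CT volume (in Hounsfield units) to an intensity window,
# then linearly rescale the window to [0, 1].
import numpy as np

def window_and_normalize(hu, lo=-1000.0, hi=500.0):
    clipped = np.clip(hu.astype(np.float32), lo, hi)
    return (clipped - lo) / (hi - lo)

volume = np.array([[-2000.0, -1000.0], [0.0, 1200.0]])  # toy HU values
out = window_and_normalize(volume)
# Air (<= -1000 HU) maps to 0.0; everything >= 500 HU maps to 1.0.
```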
COVID-19 classification
The literature review (see supplement 1) yielded the following three algorithms for assessment in this study:
1. COVID-19 detection neural network (COVNet)
COVNet is based on ResNet-50 [36], a 50-layer residual network. COVNet takes a lung-masked CT scan as input and provides a patient-level prediction as output. As shown in [Fig. 3](a), ResNet-50 captures slice-level features using 2D convolutions (2D CNN), and CT-level features are obtained by max pooling. Preprocessing steps include resampling (224×224 in-plane), downsampling (by a factor of 5 in the Z-direction), intensity clipping (-1250, 250 HU), and normalization (0–1). During training, data augmentation was applied in the form of random rotation, flipping, and added Gaussian noise. The weights of ResNet-50 were initialized using weights optimized on the ImageNet database [37].
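The 2D-then-aggregate idea can be sketched as a toy: per-slice feature vectors are reduced to one CT-level vector by max pooling over the slice axis. The "backbone" below is a hand-written stand-in for ResNet-50, not the real network:

```python
# Toy COVNet-style aggregation: slice-level features -> CT-level features.
import numpy as np

def fake_backbone(slices):
    """Stand-in slice encoder: one 4-D feature vector per slice."""
    return np.stack([np.array([s.mean(), s.std(), s.max(), s.min()])
                     for s in slices])

scan = np.random.default_rng(0).normal(size=(40, 224, 224))  # 40 slices
slice_features = fake_backbone(scan)       # shape (40, 4)
ct_features = slice_features.max(axis=0)   # max pooling -> shape (4,)
```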


2. 3D deep convolutional neural network to detect COVID-19 (DeCoVnet)
In contrast to COVNet, DeCoVnet is based on a 3D ResNet, which performs 3D convolutions (3D CNN) to learn features. As shown in [Fig. 3](b), DeCoVnet consists of a network stem, residual blocks (ResBlocks), and a progressive classifier. The four ResBlocks include shortcut connections and pass on 3D feature maps. The classifier progressively extracts important features using 3D max pooling and directly yields CT-level class probabilities. The input to DeCoVnet is a lung-masked CT scan. Preprocessing included resampling (224×336), intensity clipping (-1200, 600 HU), and normalization (0–1). During training, data augmentation comprising random affine transformations and color jittering was performed. The weights of the model were initialized using the Kaiming initialization method [38].
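The progressive 3D max pooling in the classifier stage can be illustrated with a pure-numpy stand-in that repeatedly halves each spatial dimension of a 3D feature map (the real model additionally uses learned 3D convolutions, which are omitted here):

```python
# Progressive 3D max pooling: halve each dimension per step via reshape.
import numpy as np

def max_pool_3d(x, k=2):
    d, h, w = (s // k for s in x.shape)
    return x[:d*k, :h*k, :w*k].reshape(d, k, h, k, w, k).max(axis=(1, 3, 5))

feat = np.random.default_rng(1).normal(size=(16, 32, 32))  # toy feature map
while min(feat.shape) > 2:          # pool until the map is nearly collapsed
    feat = max_pool_3d(feat)
```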
3. Attention-based deep 3D multiple instance learning (AD3D-MIL)
AD3D-MIL frames COVID-19 detection as multiple instance learning (MIL), where instances are automatically generated using the convolutional layers of DeCoVnet [14]. Once instances are obtained, attention-based pooling is applied to concentrate on important instances by weighting them. Next, a two-layer fully connected neural network provides the final class probabilities. Compared to the above two methods, AD3D-MIL is thus based on a 3D ResNet with attention pooling ([Fig. 3](c)). AD3D-MIL takes lung-masked CT scans as input. Preprocessing consisted of resampling (256×256), intensity clipping (-1024, 600 HU), and normalization (0–1). Data augmentation included color jittering and random affine transformations. For training, the model was initialized using random weights following Kaiming initialization [38].
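The attention-based pooling step can be sketched in numpy: instance features are combined using softmax attention weights. The parameters `V` and `w` below are random stand-ins for the trained attention layer:

```python
# Attention-based MIL pooling: weight instances, then average them.
import numpy as np

def attention_pool(instances, V, w):
    """instances: (n, d); V: (d, h); w: (h,). Returns a (d,) bag feature."""
    scores = np.tanh(instances @ V) @ w        # one score per instance
    alphas = np.exp(scores - scores.max())
    alphas = alphas / alphas.sum()             # softmax attention weights
    return alphas @ instances, alphas

rng = np.random.default_rng(0)
inst = rng.normal(size=(10, 8))                # 10 instances, 8-D features
bag, alphas = attention_pool(inst, rng.normal(size=(8, 4)), rng.normal(size=4))
```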
Training
For diagnosing COVID-19, we trained, validated, and tested the three models (COVNet, DeCoVnet, and AD3D-MIL) using exactly the same set of images. The models were trained using a five-fold cross-validation approach, and the best model from each fold was selected for inference based on the highest validation accuracy (Acc). Pretrained models were used for lung segmentation: we used the lung masks obtained by Seg-Net [33] for COVNet, by U-Net [9] for DeCoVnet, and by U-Net(R-231) [35] for AD3D-MIL, in accordance with the original [14] [15] or previous publications [32]. Hyperparameters are presented in supplementary Table 2. During inference on the independent test set, the predictions of the five best models were ensembled using majority voting.
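The majority-vote ensembling over the five fold models reduces to a simple count. The predictions below are made up for illustration (each row is one fold model's 0/1 predictions for four scans):

```python
# Majority voting over five fold models; with five voters there are no ties.
import numpy as np

fold_preds = np.array([
    [1, 0, 1, 1],
    [1, 0, 0, 1],
    [0, 0, 1, 1],
    [1, 1, 1, 0],
    [1, 0, 1, 1],
])

# Predict COVID-19 (1) whenever at least 3 of the 5 models say 1.
ensemble = (fold_preds.sum(axis=0) >= 3).astype(int)
```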
Statistics
Statistical analysis was performed in Python using the SciPy (Stats) [39] and Scikit-learn (Metrics, Calibration) [40] packages and in R using the pROC [41] package. Figures were plotted using the Matplotlib (Pyplot) package [42]. Statistical hypothesis testing of the non-parametric dichotomous performance data was performed on pairwise 2×2 contingency tables using McNemar’s test. A bootstrapping approach was applied to calculate confidence intervals, and DeLong’s test was used to compare AUCs. Statistical significance was defined as p < .05.
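A hand-rolled sketch of the two paired procedures named above (not the exact SciPy calls used in the study): McNemar's test computed from the discordant cells of a 2×2 table, and a percentile bootstrap confidence interval for accuracy. The counts are made up for illustration:

```python
# McNemar's test (continuity-corrected) and a bootstrap CI for accuracy.
import math
import random

def mcnemar_p(b, c):
    """b, c: discordant counts (model A right/B wrong and vice versa)."""
    stat = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of chi^2 with 1 df: P(X > x) = erfc(sqrt(x/2)).
    return math.erfc(math.sqrt(stat / 2))

def bootstrap_acc_ci(correct, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI; `correct` is a list of 0/1 outcomes."""
    rng = random.Random(seed)
    accs = sorted(
        sum(rng.choices(correct, k=len(correct))) / len(correct)
        for _ in range(n_boot)
    )
    return accs[int(n_boot * alpha / 2)], accs[int(n_boot * (1 - alpha / 2)) - 1]

p = mcnemar_p(25, 35)                        # e.g. 25 vs. 35 discordant pairs
lo, hi = bootstrap_acc_ci([1] * 75 + [0] * 25)
```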
Results
Lung segmentation
For lung segmentation, we used three different CNN models: Seg-Net [33], U-Net [9] [34], and U-Net(R-231) [34] [35], trained on different datasets. Upon visual analysis, we found that the lung masks produced by all three models were of sufficient quality. [Fig. 4] shows exemplary slices from nCP and COVID-19 CT scans along with the lung masks obtained by using the three lung segmentation models.


COVID-19 classification
In [Table 4], we present the performance (validation accuracy) of the best models from the five folds on the corresponding validation data; CV1 to CV5 represent cross-validation folds 1 to 5. The mean accuracies obtained by the COVNet, DeCoVnet, and AD3D-MIL models over the 5 folds are 80.9%, 82.0%, and 84.3%, respectively. In [Fig. 5], we show slices from CT scans of nCP (viral pneumonia, bacterial pneumonia) and COVID-19 patients as well as the corresponding patient-level predictions provided by the three diagnostic models. These scans depict different patterns, including unilobar and bilobar infiltrations, and different disease stages and etiologies. As can be seen from these examples, all three COVID-19 diagnostic models yielded similar predictions.


We quantified the diagnostic performance using accuracy (Acc), sensitivity (Se), and specificity (Sp). [Table 4] presents the performance and the Brier score for the three AI models on the independent test set. COVNet yielded Acc=76.6%, Se=67.8%, Sp=85.7%; DeCoVnet provided Acc=75.1%, Se=61.2%, Sp=89.7%; and AD3D-MIL resulted in Acc=73.9%, Se=57.7%, Sp=90.8%. Each model yielded a moderate sensitivity and a relatively high specificity. The three models achieved similar performance with respect to the independent test set. The difference between the models’ performance was not statistically significant (COVNet vs. DeCoVnet: p=.49; COVNet vs. AD3D-MIL: p=.20; DeCoVnet vs. AD3D-MIL: p=.56). ROC curves, AUCs, and calibration curves for the three models for the test set (including scans from Germany and China) are presented in [Fig. 6]. Comparable AUCs (COVNet: 0.86; DeCoVnet: 0.86; AD3D-MIL: 0.84) and Brier scores (COVNet: 0.16; DeCoVnet: 0.18; AD3D-MIL: 0.20) indicate that the models have comparable discrimination performance and levels of calibration.
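For reference, the reported metrics follow the standard definitions, sketched here on toy labels and predicted probabilities (the values are illustrative, not study data):

```python
# Accuracy, sensitivity, specificity, and Brier score from toy predictions.
import numpy as np

y_true = np.array([1, 1, 1, 0, 0, 0])              # 1 = COVID-19, 0 = nCP
p_covid = np.array([0.9, 0.4, 0.8, 0.2, 0.3, 0.6])  # predicted probabilities
y_pred = (p_covid >= 0.5).astype(int)

tp = np.sum((y_pred == 1) & (y_true == 1))
tn = np.sum((y_pred == 0) & (y_true == 0))
acc = (tp + tn) / y_true.size
se = tp / np.sum(y_true == 1)                      # sensitivity
sp = tn / np.sum(y_true == 0)                      # specificity
brier = np.mean((p_covid - y_true) ** 2)           # lower = better calibrated
```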


In addition to the complete test set, performance (accuracy, AUC, ROC and calibration curves) with respect to the test data from Germany (external) and China (internal) is also shown in [Table 4] and [Fig. 6]. Supplementary Table 3 presents the DeLong test for comparing AUCs. The AUCs of the three models are not significantly different for any of the internal and external test sets. AD3D-MIL performs better on the internal test set compared to the external test set. AUCs for COVNet and DeCoVnet do not differ significantly between internal and external test sets.
For a comprehensive comparison, we show datasets and classification results from the original publications of COVNet [10], DeCoVnet [14], and AD3D-MIL [15] in [Table 5]. Additionally, the results obtained from other studies (Hou et al. (2021) [30], Qian et al. (2020) [11], Wang et al. (2021) [12], Wang et al. (2021) [13]) that compared their approaches with COVNet and DeCoVnet are also shown.
| Published study | Dataset | Classes | Perf. | COVNet | DeCoVnet | Theirs |
|---|---|---|---|---|---|---|
| COVNet, Li et al. (2020) [10] | 4352 CT scans, 3322 patients, six centers | COVID-19, CAP, non-pneumonia | AUC | > 0.95 | – | – |
| DeCoVnet, Wang et al. (2020) [14] | 630 CT scans, 540 patients, one center | COVID-19, healthy | Acc (%) | – | 90.1 | – |
| AD3D-MIL, Han et al. (2020) [15] | 460 CT scans, 309 patients, multi-center | COVID-19, common pneumonia + no pneumonia | Acc (%) | – | 96.1 | 97.9 |
| Hou et al. (2021) [30] | 801 scans, 707 patients, inhouse* | COVID-19, H1N1, CAP | Acc (%) | 68.2 | 84.9 | 90.5 |
| Qian et al. (2020) [11] | 734 patients, inhouse | COVID-19, H1N1, CAP, healthy | Acc (%) | 68.8 | 93.8 | 95.2 |
| Wang et al. (2021) [12] | 1164 CT scans, local hospitals | COVID-19, CAP, SPT, healthy | Se (%) | 89.8 | 91.1 | 95.6 |
| Wang et al. (2021) [13] | 640 images, 284 patients | COVID-19, healthy | Acc (%) | 93.8 | 90.3 | 97.1 |

\* Hou et al. (2021) [30] included two additional public datasets with 2D slices in their experiment. We only mention their 3D dataset and the corresponding classification performance because of the focus of our study.
Discussion
Since the beginning of the pandemic, CT imaging has played an important role in the diagnosis, severity assessment, and management of COVID-19. In general, given the potential for AI to improve patient care in radiology by aiding in detection and classification tasks, it is crucial to investigate methods and underlying principles for improving these AI algorithms in order to further optimize the delivery of healthcare. While various classification models have been proposed for diagnosing COVID-19 pneumonia on chest CT, this study fills a gap in the literature by performing an independent comparison of three AI models (COVNet, DeCoVnet, and AD3D-MIL) with different architectures for classifying COVID-19 and nCP, and thereby diagnosing COVID-19. Studies by Garg et al. [43] and Ardakani et al. [44] compared different neural networks. However, their models were trained using individual slices, 3D models were not evaluated, and the datasets were relatively small (collected from a single country; n=210 and n=194 patients, respectively). To the best of our knowledge, this is the first study to perform an independent comparison of these approaches using a multicenter, multi-vendor, multi-country dataset. From a clinical perspective, independent performance and robustness assessments are important for evaluating which AI models can potentially be used to support radiologists in clinical decision-making.
The three AI models compared in this study have architectural differences (COVNet: 2D-CNN, DeCoVnet: 3D-CNN, and AD3D-MIL: 3D-CNN with attention module). COVNet [10] is based on a 2D CNN with ResNet-50 as a backbone to learn slice-level features which are aggregated to obtain global features. Different ResNet variants have been employed in various studies [7] [8] [9] [11] [30]. In contrast to 2D CNN, DeCoVnet [14] utilizes 3D ResBlocks and learns volumetric features using 3D convolutions. Compared to 2D CNN, it exploits multiple slices simultaneously and learns rich features. AD3D-MIL [15] additionally utilizes attention-based pooling that focuses on the most important instances for making a decision.
Using the validation sets, the three models achieved good and comparable performance. The similar mean accuracies and low standard deviations indicate the stability of the models’ performance across the folds ([Table 4]). On the test dataset, our classification results (obtained by an ensemble of the five best models) showed good accuracy (Acc=73.9–76.6%) with high specificity (Sp=85.7–90.8%) and moderate sensitivity (Se=57.7–67.8%). The performance assessment shows that these models are useful in distinguishing between COVID-19 and nCP with good discriminating performance (AUC=0.84–0.86). Moreover, the models performed well on both the internal (AUC=0.87, 0.88) and external (AUC=0.81–0.84) test sets. However, because of the only moderate sensitivity, their unsupervised clinical use is not recommended. Yet, they can potentially be used to assist radiologists: AI assistance in differentiating between COVID-19 and nCP has been found to increase performance in terms of accuracy, diagnostic time, and diagnostic confidence [32]. In addition to the predicted classes, model confidence might play a role in certain clinical applications, which would benefit from recalibration of the models.
Other studies that evaluated COVNet and DeCoVnet on their own independent datasets have reported varying levels of performance ([11] [12] [13] [30]; see [Table 5]). Hou et al. [30] and Qian et al. [11] reported that COVNet yielded accuracies below 70% (68.2% and 68.8%, respectively) on their respective datasets, which is lower than the accuracy we obtained with COVNet (76.6%). In the present study, the evaluated models achieved similar performance, which is analogous to the previous findings of Wang et al. [12], who obtained comparable sensitivities for COVNet (Se=89.8%) and DeCoVnet (Se=91.1%). In another report by Wang et al. [13], COVNet (Acc=93.8%) performed slightly better than DeCoVnet (Acc=90.3%), whereas DeCoVnet outperformed COVNet in the other studies [11] [12] [30]. These findings indicate that whether a 2D or a 3D CNN performs better depends on the underlying dataset.
Compared to the performance we achieved, the first reports of COVNet [10], DeCoVnet [14], and AD3D-MIL [15] as well as the other studies [11] [12] [13] [30] reported >90% performance (Acc >90%, AUC >0.95, or Se >95%) for their proposed models ([Table 5]). One reason could be the inclusion of healthy controls in their datasets, whereas our study focused exclusively on COVID-19 and nCP patients with pulmonary infiltrations; distinguishing healthy controls from COVID-19 patients is a comparatively easy task. Pneumonia in the nCP class and in the COVID-19 class can show similar disease patterns on CT scans [45]. These similarities in imaging patterns make the classification problem challenging even for experienced radiologists. COVID-19 pneumonia can show a large overlap of imaging patterns with non-COVID infective lung disease, which limits the diagnostic performance of CT [46]. Our moderate classification performance is therefore in line with this commonly acknowledged limitation.
Another reason for the difference between our results and those reported in other studies could be dataset diversity. The studies shown in [Table 5] often use a small dataset collected from local sources, i.e., data from one center [11] [14] [30] or from multiple centers in a single country [10] [12] [15], and do not describe the inclusion of different COVID-19 stages [11] [13]. In contrast, our large, diverse, and balanced dataset includes images from different scanners from China and Germany. Moreover, our data cover different disease stages (see examples in [Fig. 4]): the COVID-19 set includes scans at the early, progressive, peak, and absorption stages, and the viral and bacterial pneumonias in the nCP set have mild, moderate, and extensive severities. Although this diversity in the dataset may result in lower performance metrics, it also means that the AI algorithms are better equipped to address the radiologically relevant cases that require accurate differentiation support.
The models trained in this study have the advantage of firstly not including healthy subjects in the training dataset, secondly having a heterogeneous disease stage within the COVID-19 cases, and thirdly requiring differentiation between viral as well as bacterial non-COVID-19 cases. As a result, the models might be more focused on the challenging cases where augmented diagnostic decision-making could offer clinical benefits.
In addition to the architectural differences, the models use slightly different preprocessing and initialization and, in our experiments, employed different lung segmentation models. Despite these differences, the three models achieved comparable performance. As discussed above, these architectures achieved high performance on the different datasets used in their original publications, whereas on our diverse dataset, in which the two classes show similar patterns, their performance is inferior. This main finding of our study indicates that the classification performance depends primarily on the training data rather than on the underlying CNN architecture itself.
This study has certain limitations. The retrospective study design focuses on COVID-19 and excludes other classification options encountered in daily clinical reporting. Moreover, human-machine interaction was not evaluated. One of the model selection criteria for this study was the ability to train with only patient-level labels; although such labels can be easily obtained and enable quick experimentation with AI models, diagnostic performance using slice- or pixel-level labels was not explored. Follow-up studies should also evaluate AI models exploiting clinical parameters along with CT imaging. In the future, an extended dataset with scans from other centers as well as public datasets could be used. Furthermore, understanding the models’ predictions using explainable AI techniques as well as analyzing the models’ confidence will be a focus of our future studies.
Conclusion
In summary, we trained and compared three AI models for diagnosing COVID-19. The models trained on our diverse dataset resulted in comparable performance for the independent test dataset despite fundamental algorithmic differences. The heterogeneity of the training data and the considered classification options determine the diagnostic performance. The only moderate performance of all included models with respect to the independent test set underlines that these models should not be used unsupervised but rather as a tool to assist radiologists.
Conflict of Interest
Rahil Shahzad is an employee of Philips Healthcare; the other authors declare no conflicts of interest.
References
- 1 Inui S, Gonoi W, Kurokawa R. et al. The role of chest imaging in the diagnosis, management, and monitoring of coronavirus disease 2019 (COVID-19). Insights Imaging 2021; 12: 155
- 2 Kwee TC, Kwee RM. Chest CT in COVID-19: What the radiologist needs to know. Radiographics 2020; 40: 1848-1865
- 3 Sethuraman N, Jeremiah SS, Ryo A. Interpreting Diagnostic Tests for SARS-CoV-2. JAMA 2020; 323: 2249-2251
- 4 Rubin GD, Ryerson CJ, Haramati LB. et al. The role of chest imaging in patient management during the COVID-19 pandemic: A multinational consensus statement from the Fleischner Society. Radiology 2020; 296: 172-180
- 5 Fervers P, Fervers F, Jaiswal A. et al. Assessment of COVID-19 lung involvement on computed tomography by deep-learning-, threshold-, and human reader-based approaches—an international, multi-center comparative study. Quant Imaging Med Surg 2022; 12: 5156-5170
- 6 Xiong Z, Wang R, Bai HX. et al. Artificial Intelligence Augmentation of Radiologist Performance in Distinguishing COVID-19 from Pneumonia of Other Origin at Chest CT. Radiology 2020; 296: E156-E165
- 7 Song Y, Zheng S, Li L. et al. Deep Learning Enables Accurate Diagnosis of Novel Coronavirus (COVID-19) with CT Images. IEEE/ACM Trans Comput Biol Bioinform 2021; 18: 2775-2780
- 8 Rahimzadeh M, Attar A, Sakhaei SM. A fully automated deep learning-based network for detecting COVID-19 from a new and large lung CT scan dataset. Biomed Signal Process Control 2021; 68
- 9 Jin C, Chen W, Cao Y. et al. Development and evaluation of an artificial intelligence system for COVID-19 diagnosis. Nat Commun 2020; 11: 5088
- 10 Li L, Qin L, Xu Z. et al. Using Artificial Intelligence to Detect COVID-19 and Community-acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy. Radiology 2020; 296: E65-E71
- 11 Qian X, Fu H, Shi W. et al. M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging. IEEE J Biomed Health Inform 2020; 24: 3539-3550
- 12 Wang SH, Nayak DR, Guttery DS. et al. COVID-19 classification by CCSHNet with deep fusion using transfer learning and discriminant correlation analysis. Information Fusion 2021; 68: 131-148
- 13 Wang SH, Govindaraj VV, Górriz JM. et al. Covid-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network. Information Fusion 2021; 67: 208-229
- 14 Wang X, Deng X, Fu Q. et al. A Weakly-Supervised Framework for COVID-19 Classification and Lesion Localization from Chest CT. IEEE Trans Med Imaging 2020; 39: 2615-2625
- 15 Han Z, Wei B, Hong Y. et al. Accurate Screening of COVID-19 Using Attention-Based Deep 3D Multiple Instance Learning. IEEE Trans Med Imaging 2020; 39: 2584-2594
- 16 Lee EH, Zheng J, Colak E. et al. Deep COVID DeteCT: an international experience on COVID-19 lung detection and prognosis using chest CT. NPJ Digit Med 2021; 4: 11
- 17 Javaheri T, Homayounfar M, Amoozgar Z. et al. CovidCTNet: an open-source deep learning approach to diagnose COVID-19 using a small cohort of CT images. NPJ Digit Med 2021; 4: 29
- 18 Wang S, Zha Y, Li W. et al. A fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis. European Respiratory Journal 2020; 56
- 19 Zhang K, Liu X, Shen J. et al. Clinically Applicable AI System for Accurate Diagnosis, Quantitative Measurements, and Prognosis of COVID-19 Pneumonia Using Computed Tomography. Cell 2020; 181: 1423-1433.e11
- 20 Gunraj H, Wang L, Wong A. COVIDNet-CT: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases From Chest CT Images. Front Med (Lausanne) 2020; 7
- 21 Gunraj H, Sabri A, Koff D. et al. COVID-Net CT-2: Enhanced Deep Neural Networks for Detection of COVID-19 From Chest CT Images Through Bigger, More Diverse Learning. Front Med (Lausanne) 2022; 8
- 22 Qi S, Xu C, Li C. et al. DR-MIL: deep represented multiple instance learning distinguishes COVID-19 from community-acquired pneumonia in CT images. Comput Methods Programs Biomed 2021; 211
- 23 Wu YH, Gao SH, Mei J. et al. JCS: An Explainable COVID-19 Diagnosis System by Joint Classification and Segmentation. IEEE Transactions on Image Processing 2021; 30: 3113-3126
- 24 Amyar A, Modzelewski R, Li H. et al. Multi-task deep learning based CT imaging analysis for COVID-19 pneumonia: Classification and segmentation. Comput Biol Med 2020; 126
- 25 Wang B, Jin S, Yan Q. et al. AI-assisted CT imaging analysis for COVID-19 screening: Building and deploying a medical AI system. Appl Soft Comput 2021; 98
- 26 Gao K, Su J, Jiang Z. et al. Dual-branch combination network (DCN): Towards accurate diagnosis and lesion segmentation of COVID-19 using CT images. Med Image Anal 2021; 67
- 27 Afshar P, Rafiee MJ, Naderkhani F. et al. Human-level COVID-19 diagnosis from low-dose CT scans using a two-stage time-distributed capsule network. Sci Rep 2022; 12: 4827
- 28 Qi Q, Qi S, Wu Y. et al. Fully automatic pipeline of convolutional neural networks and capsule networks to distinguish COVID-19 from community-acquired pneumonia via CT images. Comput Biol Med 2022; 141
- 29 Mei X, Lee HC, Diao Ky. et al. Artificial intelligence-enabled rapid diagnosis of patients with COVID-19. Nat Med 2020; 26: 1224-1228
- 30 Hou J, Xu J, Jiang L. et al. Periphery-aware COVID-19 diagnosis with contrastive representation enhancement. Pattern Recognit 2021; 118
- 31 Jin YH, Cai L, Cheng ZS. et al. A rapid advice guideline for the diagnosis and treatment of 2019 novel coronavirus (2019-nCoV) infected pneumonia (standard version). Mil Med Res 2020; 7: 4
- 32 Meng F, Kottlors J, Shahzad R. et al. AI support for accurate and fast radiological diagnosis of COVID-19: an international multicenter, multivendor CT study. Eur Radiol 2022;
- 33 Badrinarayanan V, Kendall A, Cipolla R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans Pattern Anal Mach Intell 2017; 39: 2481-2495
- 34 Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. In: Lecture Notes in Computer Science. Springer; 2015: 234-241
- 35 Hofmanninger J, Prayer F, Pan J. et al. Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem. Eur Radiol Exp 2020; 4: 50
- 36 He K, Zhang X, Ren S. et al. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770–778). 2016
- 37 Deng J, Dong W, Socher R. et al. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 248–255). 2009
- 38 He K, Zhang X, Ren S. et al. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE international conference on computer vision (pp. 1026–1034). 2015
- 39 Virtanen P, Gommers R, Oliphant TE. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods 2020; 17: 261-272
- 40 Pedregosa F, Varoquaux G, Gramfort A. et al. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research 2011; 12: 2825-2830
- 41 Robin X, Turck N, Hainard A. et al. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 2011; 12: 77
- 42 Hunter JD. Matplotlib: A 2D graphics environment. Computing in Science & Engineering 2007; 9 (03) 90-95
- 43 Garg A, Salehi S, la Rocca M. et al. Efficient and visualizable convolutional neural networks for COVID-19 classification using Chest CT. Expert Syst Appl 2022; 195
- 44 Ardakani AA, Kanafi AR, Acharya UR. et al. Application of deep learning technique to manage COVID-19 in routine clinical practice using CT images: Results of 10 convolutional neural networks. Comput Biol Med 2020; 121
- 45 Bai HX, Hsieh B, Xiong Z. et al. Performance of Radiologists in Differentiating COVID-19 from Non-COVID-19 Viral Pneumonia at Chest CT. Radiology 2020; 296: E46-E54
- 46 Hochhegger B, Zanon M, Altmayer S. et al. COVID-19 mimics on chest CT: a pictorial review and radiologic guide. Br J Radiol 2021; 94
Correspondence
Publication History
Received: 02 August 2024
Accepted after revision: 17 February 2025
Article published online:
29 April 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/).
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany