Archives of Academic Emergency Medicine. 2021; 9(1): e15
https://doi.org/10.22037/aaem.v9i1.1060

OR I G I N A L RE S E A RC H

Determining the Need for Computed Tomography Scan
Following Blunt Chest Trauma through Machine Learning
Approaches
Mohsen Shahverdi Kondori1, Hamed Malek1∗

1. Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran.

Received: December 2020; Accepted: December 2020; Published online: 24 January 2021

Abstract: Introduction: The use of computed tomography (CT) scan is essential for making diagnoses for trauma pa-
tients in emergency medicine. Numerous studies have been conducted on guiding medical examinations in
light of advances in machine learning, leading to more accurate and rapid diagnoses. The present study aims to
propose a machine learning-based method to help emergency physicians prevent performance of unnecessary
CT scans for chest trauma patients. Methods: A dataset of 1000 samples collected in nearly two years was used.
Classification methods used for modeling included the support vector machine (SVM), logistic regression, Naïve
Bayes, decision tree, multilayer perceptron (four hidden layers), random forest, and K nearest neighbor (KNN).
The present work employs the decision tree approach (the most interpretable machine learning approach) as
the final method. Results: The accuracy of 7 machine learning algorithms was investigated. The decision tree
algorithm was of higher accuracy than other algorithms. The optimal tree depth of 7 was chosen using the train-
ing data. The accuracy, sensitivity and specificity of the final model was calculated to be 99.91% (95%CI: 99.10%
– 100%), 100% (95%CI: 99.89% – 100%), and 99.33% (95%CI: 99.10% – 99.56%), respectively. Conclusion: Con-
sidering its high sensitivity, the proposed model seems to be sufficiently reliable for determining the need for
performing a CT scan.

Keywords: Radiography; Tomography, X-Ray Computed; Clinical Decision Rules; Decision Trees; Machine Learning

Cite this article as: Shahverdi Kondori M, Malek H. Determining the Need for Computed Tomography Scan Following Blunt Chest Trauma

through Machine Learning Approaches. Arch Acad Emerg Med. 2021; 9(1): e15.

1. Introduction

A number of studies have been published, which preferred

to use chest computed tomography (CT) scan rather than

chest X ray (CXR) in evaluation of traumatic thoracic injuries

(1, 2). It may be impossible to completely evaluate patients

and provide rapid medical services when they go to emer-

gency departments due to limitations in time, human re-

sources, and equipment, particularly during natural disas-

ters with a high number of visits. In such situations, the use

of clinical decision rules may be very effective in accelerat-

ing the decision-making process and can determine the pri-

ority of caring for patients and accelerate the discharge of

those who do not need further care (3). Evidence-based in-

∗Corresponding Author: Hamed Malek; Shahid Beheshti University, Shahid
Shahriari Square, Daneshjou Boulevard, Shahid Chamran Highway, Tehran,
Iran. Email: h_malek@sbu.ac.ir, Phone/Fax: +98 (21) 29904106, ORCID:
https://orcid.org/0000-0003-4314-6539

dications for CT scan in blunt thoracic trauma have not been

extensively reviewed. In an attempt in this regard, Safari et

al. showed that cases with normal CXR may skip chest CT

scan (4). Accordingly, the present study proposes a model

to predict whether a chest CT scan is necessary, using ma-

chine learning and artificial intelligence tools and a dataset

collected from patients who underwent chest CT scans.

2. Methods

2.1. Dataset

The dataset consisted of the data of 1000 trauma patients

who referred to Shohadaye Tajrish and Imam Hossein Hospi-

tals, which are two large trauma research centers in Tehran,

from January 2017 to July 2018. All of the patients underwent

initial examinations and CT scans. The data were collected

from adult patients at ages above 18 years and included pa-

tients’ personal information (i.e., age and gender), incident

details, trauma mechanism (either high or low energy), vi-

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


M. Shahverdi Kondori et al. 2

tal signs (i.e., heart rate, respiratory rate, blood pressure, and

oxygen saturation), level of consciousness, clinical examina-

tion and history taking findings (including dyspnea, respira-

tory sounds, reduced cardiac sounds, chest wall deformity,

distracting pain, generalized tenderness, chest wall tender-

ness, chest wall abrasion, crepitation, jugular venous pres-

sure ( JVP), and chest wall pain), CT scan findings, X-ray im-

ages, and sonography results. Table 1 provides the data.

2.2. Preprocessing

In the first preprocessing stage, a number of dataset fields

were found to be irrelevant and were excluded, including

gender and transportation to the hospital, as shown in col-

umn 3 of table 1. It should be noted that the exclusion of ir-

relevant data increases the model’s accuracy. Glasgow coma

scale (GCS) field, which indicates the level of consciousness

was also excluded as it divided patients into conscious and

unconscious patients. The items excluded in this stage are

shown in column 3 of table 1.

In the second stage, preprocessing methods used in machine

learning algorithms were applied to the data. Then, irrele-

vant or non-effective items obtained in the second prepro-

cessing stage based on model training results, including O2

saturation and hemoglobin level, were excluded from the

dataset. Then, X-ray and sonography data were excluded,

since they had a high correlation with the target value, as

shown in column 4 of table 1. This is further explained be-

low.

Before performing the learning process with the remaining

items, some categorical data, including chest CT scan find-

ings and type of high energy trauma, were quantized using

the one-hot vector method (5), as presented in column 2 of

table 2. The one-hot vector transforms categorical data into

binary values, allowing for building a better model through

machine learning methods. In the third stage, the remaining

data were reviewed by an expert, excluding the medically ir-

relevant fields and CT scan-requiring fields from the dataset,

as shown in column 5 of table 1. Then, a number of items that

were deemed to have the same implications by the expert

were integrated, as provided in column 7 of table 1. Thus,

only column 7 remained for the learning process. Then, ma-

chine learning algorithms were trained using the remaining

data.

In dealing with trauma patients, some signs necessitate CT

scans, regardless of other conditions. For example, a chest

wall deformity requires the medical team to perform a CT

scan. The GCS level is another sign that leads physicians to

prescribe CT scans – if a patient is not conscious, a CT scan

must be performed. Thus, these items were also excluded

from the dataset for model training.

2.3. Machine learning algorithms

Machine learning is one of the most commonly employed

artificial intelligence classes. It adjusts and discovers prac-

tices and algorithms by which computers and systems can

learn. Classification account for a set of machine learning

algorithms. The main objective of classification algorithms

is to classify data into distinct groups that can detect new

data. Classification methods include the support vector ma-

chine (SVM), logistic regression, Naïve Bayes (6), decision

tree, multilayer perceptron (four hidden layers)(7), random

forest, and K nearest neighbor (KNN) (8). Such methods have

advantages and disadvantages, and the best method to ad-

dress the problems should be chosen based on the specific

problem and its requirements. The decision tree approach

was chosen in the present study as it provides more explana-

tion for the results, which was importance in this study.

2.4. Decision tree

The decision tree approach is a decision support tool that

uses trees for modeling. Decision trees are typically em-

ployed in different operations, such as decision analysis, to

find the best strategy to classify data. A condition is inves-

tigated in each node of a decision tree. The algorithm fol-

lows one of the two branches of a node based on whether the

condition is met. This continues until a leaf is reached. Fi-

nally, decisions are made based on the number of each class

of samples in a given leaf. Particularly, after investigating the

entire conditions on the input data in the proposed problem,

the algorithm will produce a positive outcome if the num-

ber of training samples that suggest performing a CT scan is

higher than those that do not suggest performing a CT scan;

otherwise, it will produce a negative outcome. Each move

from a node to another adds a unit to the tree depth. The tree

depth is a parameter that should be either identified during

the learning process, or chosen based on the optimal depth

determined using optimal depth identification methods.

2.5. Data Segmentation

In a machine learning algorithm, data are segmented into

training, validation, and test data. It should be noted that

classification should be performed randomly. Accordingly,

60%, 20%, and 20% of the data were selected as training, val-

idation, and test data, involving 600, 200, and 200 samples,

respectively. The SVM, logistic regression, Naïve Bayes, deci-

sion tree, multilayer perceptron (four hidden layers), random

forest, and KNN algorithms were applied to the data. Then,

the models were evaluated using the validation data. The val-

idation results showed a higher accuracy for the decision tree

algorithm. Thus, the decision tree model was adopted. Then,

70% of the data were used to find the optimal depth, while the

remaining 30% were employed as the test data.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


3 Archives of Academic Emergency Medicine. 2021; 9(1): e15

2.6. Evaluation criteria

An evaluation criterion should be used to compare machine

learning algorithms and detect their efficiency. The present

study employed accuracy as the evaluation criterion. Then,

sensitivity was used to determine the optimal tree. In gen-

eral, sensitivity is of great importance in analyzing medical

data, since it represents how accurate a model is in diagno-

sis.

3. Results

The above-mentioned machine learning algorithms were in-

vestigated by the criterion of accuracy. Table 2 provides the

training and validation accuracy of different machine learn-

ing algorithms. As can be seen, the decision tree algorithm

had a higher accuracy compared to other algorithms. It

should be noted that the results were obtained after exclud-

ing the X-ray and sonography data. In fact, the idea is to pro-

pose a model that can be employed even without X-ray and

sonography equipment.

Table 3 shows the accuracy of the proposed decision tree in

different depths of the model. As can be seen, a decision tree

depth of 7 was chosen. After choosing the decision tree, the

tree’s depth should be measured as a parameter. The optimal

tree depth was selected using the training data based on Ta-

ble 4.

Finally, a rule (algorithm) was obtained to be proposed to

emergency physicians based on the obtained model, appli-

cation of X-ray and sonography data, and incorporation of

the data that were excluded from the procedure by the ex-

pert. Figure 1 presents the final model.

4. Discussion

In this work an interpretable machine learning model was

introduced to help emergency physicians to prevent perfor-

mance of unnecessary CT scans for chest trauma patients.

Due to the simplicity of the model, it is a very good choice

for patient classification in order to prevent the crowding

problems in critical conditions such as natural disasters like

earthquakes, floods, and volcanoes. This model has good ac-

curacy and high generalizability due to being usable in the

presence or absence of sonography and x-ray results. The

model, which is the final and pruned model of the decision

tree, can be easily implemented in the rule diagram.

Shapley value (9) is an analytical method in game theory,

which is used in machine learning in order to increase the

interpretability of models (10). The Shapley value explores

the hypothesis space by considering the presence or absence

of each parameter. Finally, the contribution of each param-

eter to the accuracy of the model is obtained as a result. In

order to evaluate the sensitivity of the model to each param-

eter, we analyzed each of the parameters used in the model to

find out the impact of each parameter on model output mag-

nitude. In Figure 4, X-axis represents the effect of each pa-

rameter on the accuracy of the model and Y-axis represents

the order of importance of the model parameters. As can be

seen, GCS categories, age, and loss of pulmonary sound have

the most impact on the results of the model in detecting the

correct classes.

It was demonstrated that the proposed model’s sensitivity is

high in identifying cases for which CT scan should be per-

formed, and its specificity is acceptable. The model is proved

to be effective with high reliability in reducing the number of

patients that need CT scans.

5. Conclusion

Trauma poses a challenge in emergency departments regard-

ing providing early care for patients. Proper hospital equip-

ment is required to perform CT scans on trauma patients and

its cost is high. The present study proposed a decision tree-

based model to determine whether a CT scan is necessary

early on. Considering its high sensitivity, the proposed model

seems to be sufficiently reliable in determining the need for

performing a CT scan.

6. Declarations

6.1. Acknowledgment

Not applicable.

6.2. Author contributions

All authors passed the criteria for authorship contribution

based on recommendations of the International Committee

of Medical Journal Editors.

6.3. Funding

None.

6.4. Conflict of interest

The authors have no conflict of interest to declare

References

1. Sangster GP, González-Beicos A, Carbo AI, Heldmann

MG, Ibrahim H, Carrascosa P, et al. Blunt traumatic in-

juries of the lung parenchyma, pleura, thoracic wall,

and intrathoracic airways: multidetector computer

tomography imaging findings. Emergency Radiology.

2007;14(5):297-310.

2. Traub M, Stevenson M, McEvoy S, Briggs G, Lo SK, Leib-

man S, et al. The use of chest computed tomography ver-

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


M. Shahverdi Kondori et al. 4

sus chest X-ray in patients with major blunt trauma. In-

jury. 2007;38(1):43-7.

3. Shafaf N, Malek H. Applications of Machine Learning

Approaches in Emergency Medicine; a Review Article.

Archives of Academic Emergency Medicine. 2019;7(1).

4. Safari S, Farbod M, Hatamabadi H, Yousefifard M,

Mokhtari N. Clinical predictors of abnormal chest

CT scan findings following blunt chest trauma: A

cross-sectional study. Chinese Journal of Traumatology.

2020;23(1):51-5.

5. Digital Design and Computer Architecture - 2nd Edition.

6. Khanna D, Sharma A, editors. Kernel-Based Naive Bayes

Classifier for Medical Predictions2018 2018. Singapore:

Springer.

7. Hastie T, Tibshirani R, Friedman J. The Elements of Sta-

tistical Learning: Data Mining, Inference, and Prediction,

Second Edition. 2nd edition ed. New York, NY: Springer;

2016 2016/01/01/. 767 p.

8. Jiang L, Cai Z, Wang D, Jiang S, editors. Survey of Improv-

ing K-Nearest-Neighbor for Classification. Fourth Inter-

national Conference on Fuzzy Systems and Knowledge

Discovery (FSKD 2007); 2007 2007/08//.

9. Molnar C. Interpretable Machine Learning: Lulu.com;

2020 2020/02/28/. 320 p.

10. Maleki S, Tran-Thanh L, Hines G, Rahwan T, Rogers A.

Bounding the Estimation Error of Sampling-based Shap-

ley Value Approximation. arXiv:13064265 [cs]. 2014.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


5 Archives of Academic Emergency Medicine. 2021; 9(1): e15

Table 1: The patient data of the dataset

Field Name Data type Deleted in
step one

Deleted in
step two

Deleted by ex-
pert comment

Change@ Merge Used#

Chest X-Ray binary × p
Chest X-Ray Binary ×
Age Numerical ×
Gender Binary ×
Fast Sonography Binary × p
Drug history Binary ×
Chest Wall Deformity Binary × p
Transfer to hospital Categorical ×
Chest Wall Tenderness Binary

p
Systolic blood pressure Numerical × •
Distracting Pain Binary

p
Diastolic BP Numerical × •
Loss of Cardiac Sound Binary

p
Glasgow coma scale (GCS) Numerical ×
Chest Wall Abrasion Binary

p
Respiratory rate Numerical × •
Generalized Tenderness Binary
O2 Saturation Numerical ×
Chest Wall Pain Binary

p
High energy trauma Binary ×
Medical History Binary

p
High energy trauma Categorical × ×
Heart rate Binary × •
Dyspnea Binary

p
Chest CT Scan Binary

p
Tachypnea Binary

p
Hemoglobin level Numerical ×
Pulmonary Sound* Binary

p
Chest CT Scan finding Categorical ×
Crepitation Binary

p
Trauma mechanism Categorical ×
JVP enlargement Binary

p
Age categories Binary

p
Unstable hemodynamics Binary × • p
GCS categories Binary × p
@: Change to one hot; #: used for final model, *: Loss of pulmonary sound;
BP: blood pressure; JVP: jugular vein pressure; GCS: Glasgow coma scale; CT: computed tomography.

Table 2: The accuracy of different models in training and validation phases

Model list Accuracy (95% CI)
Training Validation

Support Vector Machine 84.6 (79.87 - 89.33) 80.5 (74.62 – 86.38)
K Nearest Neighbor (k = 5) 81.33 (76.38 - 86.28) 76.5 (71.23 - 81.77)
Logistic Regression 82 (74.33 – 89.66) 77 (69.22 – 84.78)
Random Forest 85.33 (80.01 - 90.65) 80.5 (74.53 - 86.47)
Naive Bayes 80.5 (73.44 - 87.56) 74 (66.18 - 81.82)
Multilayer Perceptron (3 hidden layers) 85 (81.67 – 88.33) 82 (77.97 – 86.03)
Decision Tree (Depth = 7) 87 (84.12 - 89.88) 85 (81.58 – 88.42)
Data are presented with 95 % confidence interval.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


M. Shahverdi Kondori et al. 6

Table 3: Different decision tree depths and accuracies obtained on the test, validation, and sensitivity data

Depth* Accuracy (95% CI) Test Sensitivity
Training Test

3 81.85 (78.75 – 84.95) 83.33 (79.53 – 87.13) 95.45 (92.55 – 98.35)
4 83.14 (80.08 – 86.2) 83.66 (79.8 – 87.52) 97.47 (94.66 – 100)
5 84.28 (81.27 – 87.29) 84 (80.28 – 87.72) 95.95 (93.43 – 98.47)
6 85.28 (82.41 – 88.15) 84.33 (80.66 – 88) 95.45 (93.44 – 97.46)
7 86.57 (83.79 – 89.35) 85 (81.51 – 88.49) 97.47 (95.59 – 99.35)
8 87.28 (84.56 – 90) 85.33 (81.73 – 88.93) 97.47 (95.55 – 99.39)
9 88.14 (85.52 – 90.76) 83.66 (79.94 – 87.38) 93.93 (91.96 – 95.9)
10 88.85 (86.25 – 91.45) 84.66 (80.73 – 88.59) 94.94 (92.88 – 97)
*: Model depth. Data are presented with 95% confidence interval.

Table 4: The decision tree results with and without considering the chest X-ray and sonography findings

Model Accuracy (95% CI) Test
Training Test Sensitivity Specificity

With 99.95 (99.28 – 100) 99.91 (99.1 – 100) 100 (99.81 – 100) 99.33 (99.1 – 99.56)
Without 85.5 (93.32 – 87.68) 84.66 (81.9 – 87.42) 98.96 (98.05 – 99.87) 77.83 (72.49 – 83.17)
Data are presented with 95% confidence interval.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


7 Archives of Academic Emergency Medicine. 2021; 9(1): e15

Figure 1: The final model obtained by re-including the excluded

data.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


M. Shahverdi Kondori et al. 8

Figure 2: The contribution of parameters to model accuracy. JVP: jugular vein pressure; GCS: Glasgow coma scale.

This open-access article distributed under the terms of the Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).
Downloaded from: http://journals.sbmu.ac.ir/aaem


	Introduction
	Methods
	Results
	Discussion
	Conclusion
	Declarations
	References