
Formulating Module Assessment for Improved Academic Performance Predictability in Higher Education

Mohammed Alsuwaiket
Department of Computer Science, Loughborough University, Loughborough, Leicestershire, UK
m.alsuwaiket@lboro.ac.uk

Anas H. Blasi (corresponding author)
Department of Computer Information Systems, Mutah University, Karak, Jordan
ablasi1@mutah.edu.jo

Ra'Fat Al-Msie'deen
Department of Computer Information Systems, Mutah University, Karak, Jordan
rafatalmsiedeen@mutah.edu.jo

Abstract—The choice of an effective student assessment method is an issue of interest in Higher Education. Various studies [1] have shown that students tend to get higher marks when assessed through coursework-based assessment methods, i.e. in modules assessed either fully by coursework or by a mixture of coursework and examinations, than when assessed by examination alone. A large number of educational data mining (EDM) studies pre-process data through conventional data mining processes, including the data preparation process, but they use transcript data as they stand, without considering the weighting of examination and coursework results, which could affect prediction accuracy. This paper proposes a different data preparation process by investigating more than 230,000 student records in order to prepare students' marks based on the assessment methods of the enrolled modules. The data have been processed through several stages in order to extract a categorical factor through which students' module marks are refined during the data preparation process. The results of this work show that students' final marks should not be isolated from the nature of the enrolled modules' assessment methods; rather, these methods must be investigated thoroughly and considered during EDM's data pre-processing phases. More generally, it is concluded that educational data should not be prepared in the same way as other data types, due to differences in data sources, applications, and the types of errors they contain. Therefore, an attribute, the coursework assessment ratio (CAR), is proposed in order to take the different modules' assessment methods into account while preparing student transcript data. The effect of CAR on the prediction process using the random forest classification technique was investigated. It is shown that considering CAR as an attribute increases the accuracy of predicting students' third-year averages based on their first- and second-year results.

Keywords-EDM; data mining; higher education; machine learning; module assessment

I. INTRODUCTION  

Although educational data have been recorded and analyzed 
from educational software for a long time, only recently has 
this process been formed into a new field, educational data 
mining (EDM). The EDM process converts raw data from 
educational systems into useful information that could potentially have a great impact on educational research and
practice [1]. Additionally, EDM uses a wide range of methods 
to analyze data, including, but not limited to, supervised and 
unsupervised model induction, parameter estimation, 
relationship mining, etc. [2, 3]. During the last few decades, the 
use of coursework-based module assessment increased in the 
UK and other countries due to various educational arguments. 
Additionally, students appear to prefer assessment based on either coursework alone or a mix of coursework and exams, because these types of assessment tend to yield higher marks than exam-based assessment alone [4]. The increased adoption of coursework-
based assessment has contributed to an increase over time in 
the marks on individual modules and in the proportion of good 
degrees across entire programmes [5]. Accurate and fair 
student assessment is an issue of concern in higher education 
(HE). Changes in the use of different assessment methods have 
given rise to an increasing number of universities shifting from 
traditional exam-based to continuous assessment throughout 
the semester (coursework-based) [6]. Coursework-based 
assessment methods differ from exam-based assessment 
methods where knowledge or skill is tested for a very specific 
period of time. Moreover, it has been widely acknowledged 
that the chosen assessment method will determine the style and 
content of student learning and skill acquisition [6]. 
Coursework marks are a better predictor of long-term learning of course content than exams are [7]. Nonetheless, it appears that none of the studies in the EDM research field has reflected the assessment method used in modules on the final marks. Therefore, this paper aims to pre-process students' transcript data differently: to extract a factor from the assessment method and use it to refine student marks, ensuring greater accuracy prior to processing them with EDM techniques.

II. ASSESSMENT METHODS IN HIGHER EDUCATION 

Regardless of the inability to absolutely ensure student learning through different assessment methods, assessment is still an essential tool through which teachers influence the ways students respond to courses. On the other hand, there are clear steers from the UK government towards coursework-based assessment focused on employability that should apply across
all degree subjects [8]. However, other studies show that not all 
assessment methods suit different programmes or even courses 
[9]. Thus, student assessment methods in HE can be generally 
divided into two main categories: Exam-based assessments, 
which include different forms such as closed and open book 
examinations, essay-type exams, multiple choice exams, etc. 
and coursework-based assessments, which include research 
projects, assignments, etc. Different studies proved that 
students tend to gain higher marks from coursework-based 
assignments than they do from examinations [4]. Authors in 
[10] found that combining exam-based and coursework-based 
assessment methods produced up to 12% higher average marks 
than did examinations alone. As a result, this paper tries to take 
into account the different assessment methods of universities’ 
modules and their effect on the students’ academic 
performance, and also formulates the differences in assessment 
methods into a new attribute that can be part of further data 
mining processes. 

III. STUDENT TRANSCRIPT DATA 

In this study, 4662 modules with different assessment methods, collected from a UK university, are investigated. Average module marks in various departments (Business, Computer Science, Math, Electronic and Computer Systems Engineering, Civil Engineering, and Mechanical Engineering) are calculated for students taking modules with exam-based, coursework-based, or mixed assessment methods. A simple t-test was applied to the data (Table I) in order to measure the difference between the means of each pair of variables. The results show a statistically significant difference between the exam-based and coursework-based assessments at the 95% confidence level (i.e. statistical significance at the p<0.05 level), with a t-value of -5.06 and a p-value of 0.001.
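To make the comparison concrete, the sketch below runs a two-sample t-test with SciPy, using the departmental averages from Table I as illustrative samples; this is a minimal sketch only, not the authors' code, and the study applied the test to the full student records rather than to these six averages.

```python
# Minimal illustrative sketch: two-sample t-test on average module marks under
# exam-based vs. coursework-based assessment, using the departmental averages
# from Table I as stand-in samples (the full transcript data are not reproduced here).
import numpy as np
from scipy import stats

exam_based = np.array([59.77, 58.78, 58.18, 59.55, 61.59, 58.80])
coursework_based = np.array([60.83, 63.74, 64.40, 63.26, 66.00, 64.26])

t_stat, p_value = stats.ttest_ind(exam_based, coursework_based)
print(f"t = {t_stat:.2f}, p = {p_value:.4f}")  # p < 0.05 indicates a significant difference
```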

TABLE I.  AVERAGE MARKS BASED ON ASSESSMENT METHOD

Department                                    Student number   Average module mark of students
                                                               Exam-based   Coursework-based   Mixed
Business                                      54960            59.77        60.83              60.01
Civil Engineering                             34892            58.78        63.74              60.70
Computer Science                              19800            58.18        64.40              58.87
Electronic and Computer Systems Engineering   13740            59.55        63.26              57.00
Math                                          24152            61.59        66.00              61.17
Mechanical Engineering                        31385            58.80        64.26              60.24
 

Applying the t-test to measure the significance of the difference between each pair of assessment methods (exam vs. coursework, exam vs. mixed, and coursework vs. mixed) gives the results in Table II. As shown in Table II, the p-value of the t-test between the exam-based and coursework-based assessments (0.002), and between the mixed and coursework-based assessments (0.004), is less than 0.05, which indicates a statistically significant difference between these assessment methods. On the other hand, there seems to be no statistically significant difference between the exam-based assessment and the mixture of both exam and coursework assessment methods, since the p-value was 0.749, which is greater than 0.05.

TABLE II.  P-VALUES OF T-TEST 

Assessment method       p-value   t-value
Exam and coursework     0.002     -4.5
Mixed and exam          0.749      0.39
Mixed and coursework    0.004     -3.99

 

Throughout the literature, module assessment methods in 
HE have been investigated. Various studies such as [4] show 
that students tend to get higher marks when assessed using 
coursework than when assessed using exam-based assessment. 
Table I shows that this study initially does not contradict previous studies, confirming that students who are assessed using coursework tend to get higher marks than those who are assessed using exams or a mixture of both coursework and exams. The next section explains the educational data used and their attributes before processing the data and applying data mining techniques.

A. Understanding Student Transcript Data 

Student transcript data were collected, consisting of files with hundreds of thousands of records. Standard DM practice suggests that the data in these files must first be understood, then cleaned, and finally the most significant factors must be highlighted in order to further process the data using EDM techniques. Normally, educational data are discrete, i.e. either numeric or categorical, and noise-free. The lack of noise in educational data is due to the fact that they are not measured; they are either collected automatically or checked carefully [8]. On the other hand, missing values usually exist in cases where students, for example, skip answering a given questionnaire, or where teachers skip checking attendance. Humans normally make this type of error, referred to as data entry errors [11]. The investigated data consist of 230,823 student records representing 19,886 students in a total of six departments of a UK university. Each department's data sheet contains a number of student records. For each record, a number of attributes that represent a student's academic accomplishments during three levels (preparatory, 1st, and 2nd levels) are divided as follows:

• Student-Related Attributes: These attributes highlight the 
status of the students, including: (i) Module Mark: 
Student’s mark in a certain module, (ii) Exam Mark: The 
mark achieved by a student on the exam-based assessment 
part, and (iii) Cswk Mark: The mark achieved by a student 
on the coursework-based assessment part. 

• Module-Related Attributes: A group of attributes that describe a certain module and its characteristics, including: (i) Exam Weighting (EXW): the ratio of exam-based assessment to the total mark of a module, and (ii) Coursework Weighting (CWW): the ratio of coursework-based assessment to the total mark of a module.




For each module, the values of these two attributes complement each other to reach the maximum total mark (i.e. 100). For example, if the Exam Weighting for a certain module is 75, the Coursework Weighting must be 25 in order to reach 100, and so on. For each department, the different ratios between exam weighting and coursework weighting will be described in detail.
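To make the record layout concrete, the following sketch models one transcript row as a small Python dataclass whose fields follow the attributes listed above; the field and class names are hypothetical and do not reflect the actual schema used in the study.

```python
# Illustrative sketch of one transcript record (hypothetical field names, not the study's schema).
from dataclasses import dataclass

@dataclass
class TranscriptRecord:
    module_mark: float    # student's overall mark in the module
    exam_mark: float      # mark achieved on the exam-based assessment part
    cswk_mark: float      # mark achieved on the coursework-based assessment part
    exam_weighting: int   # EXW, e.g. 75
    cswk_weighting: int   # CWW, e.g. 25

    def is_consistent(self) -> bool:
        # EXW and CWW complement each other to reach the maximum total mark of 100
        return self.exam_weighting + self.cswk_weighting == 100

record = TranscriptRecord(module_mark=62.0, exam_mark=45.0, cswk_mark=17.0,
                          exam_weighting=75, cswk_weighting=25)
assert record.is_consistent()
```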

B. Causes of Errors in Educational Data 

Prior to storing educational data, they are normally processed by human interaction, computation, or both. Sources of errors in databases are categorized into four main types: data entry errors, measurement errors, distillation errors, and data integration errors [11]. As mentioned before, since educational data are not measured using instruments, the errors are caused by humans (i.e. through data entry errors), so these data have minimal or zero noise compared to other, non-educational data types. The following section presents the students' transcript data pre-processed in a different way: a new factor is extracted from the assessment method and used to refine student marks, ensuring greater accuracy prior to processing them using EDM.

IV. COURSEWORK ASSESSMENT RATIO (CAR) 

In order to refine students' marks based on the modules' assessment methods, the data have been processed through several stages. Each of these stages enhances the data in terms of readability by statistical software and the ability to extract knowledge easily. The first step is to categorize the CW to EX weighting ratios. The categorization relies on the number of classes that the CW to EX weighting ratio can take, based on the ratios present in the original data; each department therefore has its own classification of CW to EX weighting ratios. CAR represents the ratio of CW weighting for each module, which by default also reflects the ratio of EX weighting. Table III shows the different classes.

TABLE III.  CLASSES OF CW TO EX WEIGHTING RATIOS

          Module assessment method (CW weighting)
CW        0 10 15 20 25 30 35 40 45 50 55 60 65 66 70 75 100
Business  ✓ ✓  ✓ ✓ ✓  ✓  ✓  ✓ ✓    ✓
CEng      ✓  ✓ ✓ ✓ ✓  ✓ ✓ ✓      ✓ ✓
CS        ✓ ✓  ✓ ✓ ✓  ✓  ✓ ✓ ✓ ✓  ✓  ✓
ECSEng    ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓  ✓  ✓  ✓   ✓
Math      ✓ ✓  ✓ ✓ ✓  ✓  ✓     ✓  ✓
MEng      ✓ ✓  ✓ ✓ ✓  ✓  ✓     ✓ ✓ ✓
 

Table III shows that no department shares the same ratio classes with the other departments, i.e. each department has its own unique CW to EX ratios. Thus, filling in the missing values in the table is not a solution, since doing so would yield incorrect data. This research starts with the department with the greatest number of classes and then generalizes the findings to the other departments while bearing in mind the change of ratios. Two departments have 12 complete ratios (the CS and ECSEng departments); in this paper, the CS department was chosen for further processing.
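As a rough illustration of this categorization step, the sketch below derives CAR for each module as its coursework weighting divided by 100 and collects the distinct CAR classes occurring in one department's data (cf. Table III); the module records are hypothetical and the code is an assumed reading of the procedure, not the authors' implementation.

```python
# Assumed categorization logic (illustrative only): CAR = CWW / 100 per module,
# and each department's CAR classes are the distinct values found in its modules.
modules = [
    {"code": "COA101", "cswk_weighting": 25},    # hypothetical module records
    {"code": "COB202", "cswk_weighting": 0},
    {"code": "COC303", "cswk_weighting": 100},
    {"code": "COD204", "cswk_weighting": 25},
]

for m in modules:
    m["CAR"] = m["cswk_weighting"] / 100.0       # CAR mirrors CWW; EX weighting is 1 - CAR

department_classes = sorted({m["CAR"] for m in modules})
print(department_classes)                        # e.g. [0.0, 0.25, 1.0]
```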

V. REFINING STUDENTS' MARKS BASED ON CAR

In order to obtain the relation between the student's module mark and CAR, which represents the CWW to EXW ratio, simple quadratic regression was used. Regression analysis is used to infer the relationship between the independent (CAR) and dependent (refined module mark) variables. The CW ratio was used as the single variable, since simple quadratic regression is suitable for a one-variable relation. The choice of quadratic over linear regression is based on the R-squared (coefficient of determination): the higher the R-squared, the better the model fits the data. For the quadratic regression the coefficient is 2.90%, which is higher than in the linear model (2.77%). By applying simple quadratic regression to the data, with CAR as the variable and the module mark as the response, the following fitted regression line was obtained:

RMM = MM - 12.77(CAR) + 5.873(CAR)²    (1)

where RMM is the refined module mark after fitting and MM is the current module mark.
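A small sketch of how (1) might be applied to transcript rows is shown below; the row contents are hypothetical and the function simply evaluates the fitted line, assuming CAR lies in [0, 1].

```python
# Sketch of applying (1): RMM = MM - 12.77*CAR + 5.873*CAR**2 (CAR = 0 leaves the mark unchanged).
def refined_module_mark(mm: float, car: float) -> float:
    return mm - 12.77 * car + 5.873 * car ** 2

rows = [
    {"module": "COA101", "MM": 62.0, "CAR": 0.25},   # hypothetical transcript rows
    {"module": "COB202", "MM": 48.0, "CAR": 0.0},
    {"module": "COC303", "MM": 70.0, "CAR": 1.0},
]
for r in rows:
    r["RMM"] = round(refined_module_mark(r["MM"], r["CAR"]), 2)

print(rows)   # the CAR = 0 module keeps its mark; the others are adjusted downwards
```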

By applying (1) to the student transcript data of the CS department, an additional field containing the RMM for each student in the department is added. The RMMs are self-explanatory when referring to Table I, which compares the average module marks of students attending modules with different assessment methods: the higher the percentage of CWW, the larger the adjustment applied to MM, and vice versa. Any EDM study that considers student marks under different assessment methods should add a sub-process within the data pre-processing phase that takes the differences between those assessment methods into account. Figure 1 shows the addition of sub-processes 2-5 on raw students' module marks; the task of these sub-processes is to take the differences in assessment methods into account. These sub-processes are used because applying conventional DM processes to educational data may not produce accurate results.

 

 
Fig. 1.  Refining students’ module marks sub-process 

VI. THE EFFECT OF CAR ON DM’S PREDICTION ACCURACY 

In order to verify the effect of the newly constructed variable, the prediction of a student's third-year average mark using his first- and second-year averages was compared to the predicted third-year average using the first- and second-year averages plus the newly constructed variable, CAR. This comparison highlights how much more accurate the prediction process can become by adding attributes that reflect the nature of the module. The Orange Canvas tool is a simple data analysis tool with data visualization and interactive data exploration for rapid qualitative analysis. There are many prediction techniques that can be investigated using the Orange data mining tool; one of the most popular is Random Forest [12], which was used in this study to evaluate the role of CAR in increasing the prediction accuracy. The data of 406 undergraduate students of the Computer Science department included their first-, second- and third-year average marks and their average CAR for all years. The comparison was done both including and excluding CAR as an input attribute. Tables IV and V show the confusion matrices of the comparisons and how CAR affected the accuracy of predicting students' third-year average marks.
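A minimal sketch of this comparison is given below using scikit-learn's RandomForestClassifier in place of the Orange workflow described in the paper; the data are synthetic placeholders, the target is simplified to a binary class, and the cross-validated AUC is computed with and without CAR as an input feature.

```python
# Illustrative sketch (not the study's Orange workflow): compare cross-validated AUC
# of a Random Forest with and without CAR as an input attribute, on synthetic data.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 406                                           # same cohort size as in the paper
year1 = rng.uniform(40, 80, n)                    # hypothetical first-year averages
year2 = rng.uniform(40, 80, n)                    # hypothetical second-year averages
car = rng.uniform(0, 1, n)                        # hypothetical average CAR values
# Placeholder binary target (the paper predicts six degree classes for the third year)
labels = (0.5 * year1 + 0.5 * year2 + rng.normal(0, 5, n) > 60).astype(int)

def cv_auc(features: np.ndarray) -> float:
    clf = RandomForestClassifier(n_estimators=100, random_state=0)
    proba = cross_val_predict(clf, features, labels, cv=5, method="predict_proba")[:, 1]
    return roc_auc_score(labels, proba)

auc_without_car = cv_auc(np.column_stack([year1, year2]))
auc_with_car = cv_auc(np.column_stack([year1, year2, car]))
print(f"AUC without CAR: {auc_without_car:.4f}, with CAR: {auc_with_car:.4f}")
```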

TABLE IV.  PREDICTING THIRD YEAR'S AVERAGE FROM FIRST AND SECOND YEAR'S EXCLUDING CAR

Correct class \ Prediction   Fail   First   Lower second   Pass   Third   Upper second   Total
Fail                            0      1        3            0      5         3            12
First                           0     36        0            0      0        37            73
Lower Second                    0      0        0            0      6        31            47
Pass                            0      0        0            0      0         3             3
Third                           0      0        8            0      9        13            30
Upper Second                    0      5        4            0      0       110           119
Total                           0     42       25            0     20       197           284

TABLE V.  PREDICTING THIRD YEAR'S AVERAGE FROM FIRST AND SECOND YEAR'S INCLUDING CAR AS AN ATTRIBUTE

Correct class \ Prediction   Fail   First   Lower second   Pass   Third   Upper second   Total
Fail                            3      2        6            0      0         2            13
First                           0     59        0            0      0        23            82
Lower Second                    0      4       16            0      0        26            46
Pass                            0      0        1            0      0         2             3
Third                           1      1       14            0      0         9            25
Upper Second                    0     12        4            0      0        99           115
Total                           4     78       41            0      0       161           284

 

The ROC (receiver operating characteristic) curve has been introduced to evaluate the ranking performance of machine learning algorithms [13]. The author in [14] compared popular machine learning algorithms using the area under the ROC curve (AUC), i.e. the area under the curve of true positive rate against false positive rate, and found that AUC exhibits several desirable properties compared to classification accuracy (CA). Table V shows an enhancement in the prediction for each of the mark classes (Fail, Pass, Third, Lower Second, Upper Second, and First) compared to the predictions shown in Table IV. The Random Forest scored the highest AUC when CAR was considered (Table V), with an AUC value of 0.9304 and an error rate of 0.0696. In other words, the larger the area under the curve, the more items are classified as presumed, and as AUC increases, the mean profit difference also increases. When CAR was excluded as an input (Table IV), the AUC was lower, with a value of 0.9073 and an error rate of 0.0927.

VII. RESULTS AND DISCUSSION 

This study shows that the pre-processing phase of educational data should include additional sub-phases that deal not only with noise or missing data, but also with data refinement, so as to cope with the differences between various educational systems and their data. Therefore, the attribute CAR was constructed in order to take the different modules' assessment methods into account while preparing student transcript data. The effect of CAR on the prediction process using Random Forest classification was investigated by applying the RMM equation. It was shown that considering RMM increases the accuracy of predicting students' marks. By refining students' marks, they are adjusted by an amount that depends on the ratio of CWW to EXW for each module the student took during his study. For instance, consider a student x who enrolled in 32 modules during his/her study at the Department of CS. Nineteen of the 32 modules are 100% exam-based assessed modules, 7 are assessed by a mixture of coursework and examination, while only 6 modules are assessed 100% by coursework. Although the majority of the modules are assessed through examination only, which means that the student gets no extra marks compared with coursework-based modules, the rest of the modules give the student extra (coursework-related) marks and hence add to his overall average. In numbers, 19 modules have a CAR value of 0, which means that RMM=MM. On the other hand, the rest of the modules have CAR values ranging from 0.1 to 1, which means that the RMM is always less than MM for those courses. This decrement in the marks is due to the fact that students get higher marks in modules that are assessed by coursework or by a mixture of coursework and examinations. Therefore, to balance the module marks and the overall average, the formula decrements the module marks by a varying percentage depending on the CWW to EXW ratios. Table VI shows the differences in module marks and overall average for the example student.

TABLE VI.  REFINING STUDENT X'S MARKS BASED ON ENROLLED MODULES' ASSESSMENT METHODS

                                Average MM   Average RMM
19 exam-based modules              48.6         48.6
6 coursework-based modules         60.3         52.7
7 mixed EX and CW modules          60.4         58.3
Total (32 modules)                 56.4         53.2

 

By following the procedure in Figure 1 on a real student's marks, Table VI shows that the RMM remains unchanged when the assessment method of the enrolled modules is purely exam-based. Alternatively, when the assessment method of the enrolled modules is purely coursework-based, the RMM is refined down by 7.6 marks on average compared to MM. Finally, a mixture of EX and CW based modules yields a smaller refinement of the RMM (2.1 marks) compared to MM for the same student. This means that the student in the example, who took 32 modules of different types, may have his average mark refined down by 3.2 when applying the proposed procedure. Upon deriving CAR and examining its effect on students' marks, the Random Forest prediction technique was used to validate the enhancement in AUC. It was shown that, by including CAR as an input for predicting students' third year from their first and second years, the AUC was enhanced from 0.9073 to 0.9304. The derived CAR enhances the predictability of students' third-year marks from their first two years, which means that additional preparation steps, such as this paper's module mark refinement based on modules' assessment methods, are required on the student transcript data prior to applying DM techniques for improved predictability. Additionally, the findings of this paper can be generalized to different departments and universities that may have various assessment methods; in other words, the refinement procedure should be considered as a sub-process within the EDM data pre-processing phase when dealing with different modules and their marks.

VIII. CONCLUSIONS AND FUTURE WORK 

During the last few decades, there has been an increased interest in coursework-based assessment in the UK and other countries due to various educational and personal arguments, such as its learning effectiveness and the lack of time limits and stress. This increased interest led to the finding that students who are assessed by coursework tend to achieve higher marks than those who are assessed by examinations in the same modules. However, it seems that no studies have considered this increase in marks in the data pre-processing phase. More generally, this leads to the conclusion that applying conventional DM processes to educational data may not produce accurate results. In this paper, a model that refines students' marks based on the enrolled modules' assessment methods has been proposed. The model represents a sub-process through which module assessment methods are considered for further processing using a new attribute that reflects the ratio of coursework weightings. The key to refining students' marks is a ratio that represents the CW weighting of each module, which by default also reflects the EX weighting. Based on this ratio, the coursework assessment ratio (CAR), and a simple quadratic regression on the MM (module mark) variable, RMMs (refined module marks) were calculated and compared to the original MMs. Although this increase in marks has been proved in the literature and throughout this research, none of the previous studies considered this increase as feedback to the data pre-processing phase of EDM, which has mostly been following conventional DM pre-processing methods while neglecting the differences between the types of data being processed. Therefore, it is vital to consider this feedback for the EDM pre-processing phase; the formulation of the relation between CAR and the current MM should be part of the EDM pre-processing phase. This study investigated the effect of CAR on the prediction process using the Random Forest classification technique. It was shown that considering CAR as an attribute increases the accuracy of predicting students' third-year averages based on their first- and second-year averages. For future work, the predictability of individual module results based on their assessment method, and further investigation into group coursework, can be explored to improve predictability.

REFERENCES 

[1] C. Romero, S. Ventura, M. Pechenizkiy, R. Baker, Handbook of Educational Data Mining, Chapman & Hall, 2010
[2] R. S. J. D. Baker, K. Yacef, "The state of educational data mining in 2009: A review and future visions", Journal of Educational Data Mining, Vol. 1, No. 1, pp. 3-16, 2009
[3] C. Romero, S. Ventura, "Educational data mining: A survey from 1995 to 2005", Expert Systems with Applications, Vol. 33, No. 1, pp. 135-146, 2007
[4] J. T. E. Richardson, "Coursework versus examinations in end-of-module assessment: A literature review", Assessment & Evaluation in Higher Education, Vol. 40, No. 3, 2015
[5] P. Bridges, A. Cooper, P. Evanson, C. Haines, D. Jenkins, D. Scurry, H. Woolf, M. Yorke, "Coursework Marks High, Examination Marks Low: Discuss", Assessment & Evaluation in Higher Education, Vol. 27, No. 1, 2002
[6] J. Heywood, Assessment in Higher Education: Student Learning, Teaching, Programmes and Institutions, Jessica Kingsley, 2000
[7] G. Gibbs, C. Simpson, "Conditions Under Which Assessment Supports Students' Learning", Learning and Teaching in Higher Education, Vol. 1, pp. 3-31, University of Gloucestershire, 2005
[8] R. Dearing, Higher Education in the Learning Society: Main Report, National Committee of Inquiry into Higher Education, 1997
[9] G. Gibbs, "Using assessment strategically to change the way students learn", in: Assessment Matters in Higher Education, Vol. 4, pp. 41-53, 1999
[10] P. E. Morris, C. Fritz, "Conscientiousness and procrastination predict academic coursework marks rather than examination performance", Learning and Individual Differences, Vol. 39, pp. 193-198, 2015
[11] J. M. Hellerstein, Quantitative Data Cleaning for Large Databases, United Nations Economic Commission for Europe, 2008
[12] R. Asif, S. Hina, S. I. Haque, "Predicting student academic performance using data mining methods", International Journal of Computer Science and Network Security, Vol. 17, No. 5, pp. 187-191, 2017
[13] R. Kohavi, F. Provost, "Glossary of terms", Machine Learning, Vol. 30, No. 2, pp. 271-274, 1998
[14] A. P. Bradley, "The use of the area under the ROC curve in the evaluation of machine learning algorithms", Pattern Recognition, Vol. 30, No. 7, pp. 1145-1159, 1997