Microsoft Word - Mtebe.docx


 ISSN: 2311-1550 

 
Vol. 6, No. 3, pp. 228-244 

 
This work is licensed under a Creative Commons Attribution ShareAlike 4.0 International License. 

 

 

Mining Students’ Data to Analyse Usage Patterns in  
eLearning Systems of Secondary Schools in Tanzania 

Joel S. Mtebe and Aron W. Kondoro 

 The University of Dar es Salaam, Tanzania 
 

Abstract: The adoption and use of various eLearning systems to enhance the quality of education in 
secondary schools in Tanzania is becoming common. However, there is little evidence to suggest 
that students actually use them. Existing studies tend to focus on investigating students’ attitude 
towards using these systems through surveys. Nonetheless, data from surveys is normally subject 
to the possibility of distortion, low reliability, and rarely indicate the causal effects. This study 
adopted WEKA and KEEL as data mining tools to analyze students’ usage patterns and trends 
using 68,827 individual records from the log file of the Halostudy system implemented in 
secondary schools in Tanzania. The study found that the system usage is moderate and in decline. 
There is also variability in the usage of multimedia elements with biology having the highest 
number while mathematics has the lowest. Students from Dar es Salaam, Mwanza, and Arusha, in 
that order, had the highest system usage with the lowest being from the peripheral regions. The 
possible challenges limiting system usage are discussed. These findings show that data mining tools 
can be used to indicate usage patterns of systems implemented in sub-Saharan Africa and to help 
educators to find ways of maximising systems usage. 

Keywords: educational data mining, learning analytics, eLearning systems, eLearning secondary 
schools. 

Introduction 
The adoption and use of various eLearning systems, such as Moodle, Sakai, and Blackboard, to 
improve the quality of teaching and learning at various levels of education in sub-Saharan Africa is 
becoming common (Ssekakubo, Suleman, & Marsden, 2011; Unwin et al., 2010; Venter, Rensburg, & 
Davis, 2012). In Tanzania, for instance, more than 50% of surveyed institutions were found to have 
installed eLearning systems of various kinds (Mtebe & Raisamo, 2014). In the beginning, the majority 
of these systems were implemented within higher learning institutions. In recent years, many of these 
systems have increasingly been implemented in secondary schools. Some good examples of eLearning 
systems implemented in secondary schools in Tanzania include the Tans-eL system (Kalinga, Bagile, 
& Trojer, 2006), Retooling system (Mtebe, Kibga, Mwambela, & Kissaka, 2015), Christian Social 
Services Commission system (CSSC, 2014), Shule direct system (Mtebe & Kissaka, 2015), and Brain 
share system (Mwakisole, Kissaka, & Mtebe, 2018). 

With the increased number of students, which stretches beyond the limit of available teachers and 
learning resources, these initiatives aim to provide digital content accessible via the Internet where 
students can use these resources to enhance their learning activities. The majority of these initiatives 
aim to provide quality digital content to students with minimum intervention from teachers. For 
instance, the Shuledirect system has digital content for various subjects such as Physics, Chemistry, 



 229 

Biology, English, Geography, Civics, Mathematics and Kiswahili (ShuleDirect, 2019). The Tans-eL 
system has digital content for Mathematics, Biology, and Chemistry (Kalinga et al., 2006). 

Despite the continued investment in these systems, there is little evidence to suggest students across 
the country actually use them. These systems cannot help students improve learning if they are not 
used. Many studies have strongly shown that there is a correlation between eLearning system usage 
and students’ performance (Filippidi, Tselios, & Komis, 2010; Jo, Kim, & Yoon, 2014) and students’ 
satisfaction (Naveh, Tubin, & Pliskin, 2012; Tarigan, 2011). Despite these benefits, the lack of actual 
usage or underuse of systems implemented in sub-Saharan Africa is a common problem (Ssekakubo et 
al., 2011; Unwin et al., 2010), and has been a major setback against their success (Bervell & Umar, 2018; 
Lwoga, 2014). The need to ensure that students make full use of these systems is important so that the 
anticipated benefits are attained. 

Many of the existing studies have focused on students’ attitude towards using these systems through 
surveys (Mselle & Kondo, 2013; Msoka, Mtebe, Kissaka, & Kalinga, 2015). The findings from the 
majority of these studies tend to indicate students have positive attitudes but when it comes to actual 
usage their attitudes are more reserved. Moreover, data from surveys is normally subject to the 
possibility of distortion and low reliability and rarely indicate the causal effects (Jo et al., 2014). 
Therefore, there is a need for more sophisticated means for investing how students use these systems, 
which is important to avoid inefficient investments and ensure maximum utilisation of the installed 
systems.  

Recently, data mining technologies have been making a lot of headway in capturing and analysing 
massive amounts of data generated by these systems (Romero, Ventura, & García, 2008). The 
eLearning system keeps a record of all the activities that students perform in the log files which can be 
analyzed and used to provide immediate feedback to educators (Romero, Espejo, Zafra, Romero, & 
Ventura, 2013). Despite these great potential benefits of data mining technologies, few studies have 
utilised them in investigating eLearning system usage amongst students in secondary schools in sub-
Saharan Africa and Tanzania, in particular.  

This study utilised data from a Halostudy system (https://halostudy.ac.tz/) log to investigate students’ 
usage patterns in the system implemented in secondary schools in Tanzania. The Halostudy system 
was customised from the Moodle system to suit the context of secondary education. The study 
adopted WEKA and KEEL as data mining tools, involving 68,827 individual records, accessed the 
system for nearly 14 months. The findings from this study show that data mining tools can be used to 
indicate usage patterns of systems implemented in sub-Saharan Africa and help educators to find 
ways of maximising systems usage. The description of the Halostudy system is explained next. 

The Halostudy System 
The implementation of the Halostudy system can be traced back to 2013 when the Ministry of 
Education and Vocational Training (MoEVT) of the government of Tanzania implemented the 
retooling project in collaboration with the College of Information and Communication Technologies 
(Mtebe, Kibga, et al., 2015). The retooling project in this context was a project aimed at reskilling or 
upgrading subject content knowledge of teachers in science and mathematics subjects in secondary 
schools in Tanzania. Generally, the project aimed at addressing the low success rates of students in 
science and mathematics subjects in secondary schools through enhancing teachers’ subject content 



 230 

knowledge of the subjects they teach. It was claimed that the failure rates of students in these subjects 
were linked to inadequate secondary school teachers’ knowledge on the subjects. 

To address this problem, under the retooling project, the multimedia enhanced digital content (in the 
form of animations, simulations, video, and audio) were developed and shared with teachers across 
the country via the customised Moodle system. The developed content was for only topics perceived 
to be difficult to understand for an assessment conducted in various regions in Tanzania. A total of 70 
topics and 147 subtopics were developed and extensively supported with multimedia elements. More 
specifically, 93 videos were recorded, 57 animations were developed, and 201 still pictures were 
captured and integrated into the digital content (Mtebe, Kibga, et al., 2015). The developed 
multimedia enhanced content was then pilot tested with 2,000 teachers in 858 schools located in 13 
regions in Tanzania. During the pilot phase, the developed multimedia enhanced content was 
uploaded into the customised Moodle system where teachers accessed the system for three months 
before conducting a follow-up study using SMS quizzes. The result showed that multimedia enhanced 
content helped to improve subject content knowledge of teachers who participated in the retooling 
project (Mtebe, Kondoro, Kissaka, & Kibga, 2015). In a separate study, the customised Moodle system 
was perceived to be easy to use by the majority of surveyed teachers (Mtebe, Mbwilo, & Kissaka, 
2016).  More details about the retooling project can be obtained in Mtebe, Kibga, et al., 2015. 

Building from the success of the retooling project, the College of Information and Communication 
Technologies (CoICT) in partnership with Viettel Tanzania Ltd, aka Halotel, developed an Internet 
based application with multimedia enhanced content drawn from the retooling project for all topics of 
science and mathematics subjects for secondary education in Tanzania. While the retooling project 
focused on enhancing teachers’ subject content knowledge of the subjects they teach, this project 
focused on students. This is because the content developed during the retooling project could be used 
by students to enhance their subject content knowledge as this was the same content used by the 
teachers. 

In order to disseminate the content and ensure that many students benefit from it, the College entered 
into an agreement with Halotel to use its network to facilitate dissemination of the content to the 
learners in secondary schools in Tanzania. The Halotel has connected 427 secondary schools with the 
Internet, which could potentially benefit from the developed content.  The content was made free of 
charge to the students who had access to the Internet through www.halostudy.ac.tz. Moreover, the 
Halostudy app was also developed for those who have mobile devices and is available at the 
GooglePlay Store.  Figure 1 shows an interface of the Halostudy platform. 



 231 

 

Figure 1: The interface of the Halostudy system. (Source: https://halostudy.ac.tz/) 

 

Since the launching of Halostudy in August 2017, students all over the country have been accessing 
the content to enhance the content knowledge of their subjects. Just like other similar initiatives, the 
Halostudy system was meant to facilitate students’ centered learning; whereby students could access 
and use the multimedia enhanced content anywhere, anytime, without interaction with teachers. 
Similarly, teachers could also use the content to enhance their subject content knowledge for 
complementing classroom environments. Despite the continuous use of the Halostudy system 
throughout secondary schools in Tanzania, little information on how students use the system and 
what features are mostly used was available. In this study, we utilised data from a system log to 
investigate students’ usage patterns in the Halostudy system using WEKA and KEEL as data mining 
tools. 

Related Works 
The eLearning systems tend to keep records of all the activities that students have performed in the 
form of a log file. The activities which are kept in the log file include the time, Internet Protocol 
address,  name of the student, the action completed, and the activities performed in different modules 
(Kadoic & Oreski, 2018). Due to the large quantities of data these systems can generate daily, it is very 



 232 

difficult  for educators to manage them manually (Estacio & Raga, 2017; Romero et al., 2008).The data 
mining tools have adapted and been used to  explore the unique types of data from eLearning systems 
to better understand how students learn and identify the settings in which they learn to improve 
educational outcomes (Romero & Ventura, 2013).  

Given these advantages, studies have been using these mining tools to investigate systems features 
that have an influence on improving students’ learning performance. For instance, Macfadyen and 
Dawson (2010) used data mining tools to analyse students data from a Blackboard system. Authors 
extracted the total number of discussion messages posted, the total number of mail messages sent, and 
the total number of assessments completed as key variables. Through regression analysis with the 
final grade, it was found that the variables explained more than 30% of the variation in student final 
grade.  

Similarly, Mwalumbwe and Mtebe (2017) extracted data from the Moodle log of two courses delivered 
at the Mbeya University of Science and Technology, using a developed mining tool and subjected into 
linear regression analysis with students’ final results. The study found that discussion posts, peer 
interaction, and exercises were determined to be significant factors for students’ academic 
achievement in blended learning. However, time spent in the system, number of downloads, and 
login frequency were found to have no significant impact on students’ learning performance.  

Kadoic and Oreski (2018) used Moodle plugins to extract data from the log file and analyse students’ 
behavior at the Faculty of Organization and Informatics at the University of Zagreb. The results of the 
students’ behavior were interpreted in a bid to improve students’ learning. Similarly, Romero et al 
(2013) extracted data from Moodle logs and used them to predict student performance. Generally, the 
study found that the regular usage of features such as assignments, quizzes, and forum activity had 
an impact on students’ final grade.  

In general, these studies and many others, such as those in Estacio and Raga (2017); Hung and Zhang 
(2008); Kotsiantis, Tselios, Filippidi, and Komis (2013); Lotsari, Verykios, and Panagiotakopoulos 
(2014); Podgorelec and Kuhar (2011); Wen and Rose (2014); Yu and Jo (2014); and Zorrilla, García, and 
Álvarez (2010), have successfully used data mining tools to predict students’ performance based on 
log data from learning environments. Very few studies applied data mining tools to analyse log data 
to provide informed feedback on how students use eLearning systems in secondary schools in 
Tanzania. In a review of 74 articles, published between 2007 and 2017, Mtebe and Raphael (2018) 
found that the majority of articles focused on users’ attitude and perceptions towards using these 
systems. With increased adoption and use of these systems in secondary schools in Tanzania, the use 
of data mining tools that will use log data to inform educators on the usage pattern is increasingly 
important. 

Methodology 
Research Design 

This study is focused on investigating the access pattern of students using the eLearning system. We 
were interested in understanding how the system was being used and by whom. To find this out, we 
started by investigating the level of access based on the identified variables as shown  in Table 1. 

 



 233 

 

Table 1: Variables Extracted for the Study 
No. Variable Description 
1. Course Name Identification string of the course in which the action is related. This variable 

helped to differentiate between various courses accessed in the system 

2. Source of Access This data enabled to identify the region/places where users accessed the system 

3. Type of Content The system has text, video, animations, and simulations as types of content. This 
variable aimed to find out to what extent users have been using various types of 
the content installed in the system. 

4. Time Date and time stamp of when the action was executed which enabled to estimate 
the geographical location of users when they accessed the system.  

5. IP Address Unique numerical label assigned to the device used by the user in order to 
determine the type of the device used when accessing the system. 

6. Action  Type of action initiated which enabled to determine the number of most and least 
active users used the system. This number was later grouped per months.  

Using variables in Table 1, we were able to get a general picture on the behavioral pattern of students 
who used the system. To do so, usage data was extracted from the Halostudy system implemented in 
secondary schools in Tanzania in the form of text-based logs. These logs are a record of time-based 
events that occur in the system. Every time a user performs a certain action, information about it is 
recorded in the logs. The recorded data contains attributes with variables shown in Table 1. The data 
was extracted from the time the system was launched in August 2017 to the time that this study was 
conducted (September 2018) and exported to a CSV file. The extracted file had a total of 68,827 
individual records of raw data before being further analysed using data mining tools. The data mining 
tools used to analyse the obtained data are explained next. 

Data Mining Tools and Data Analysis 

The Halostudy is based on the Moodle platform and therefore the data mining tools compatible with 
Moodle system were selected and adopted. Therefore, Waikato Environment for Knowledge Analysis 
(WEKA) and Knowledge Extraction based on Evolutionary Learning (KEEL) data mining tools were 
adopted in extracting data from the Halostudy system in the form of text-based logs. The WEKA 
(https://www.cs.waikato.ac.nz/~ml/weka/index.html) is a  an open source software, written in Java, 
aiming at allowing users to compare different machine learning methods on new data sets (Hall et al., 
2009). It consists of visualisation tools and algorithms for data analysis and predictive modeling, 
together with a graphical user interface for easy access to these functions (Holmes, Donkin, & Witten, 
1994).  

The KEEL tool (http://www.keel.es/) is also an open source software that supports data management 
and the design of experiments. The KEEL pays attention to the implementation of evolutionary 
learning and soft computing based techniques for data mining problems including regression, 
classification, clustering, pattern mining and so on (Ernández, Uengo, & Errac, 2011). The WEKA and 
KEEL tools were selected due to the fact that they are both open source tools, developed in Java, and 
use the same dataset external representation format (ARFF files). The CSV file was exported into the 
WEKA in order to generate some patterns based on the identified variables in Table 1. In areas where 



 234 

the WEKA tool had limitations in generating the intended variables, the  KEEL tool was used.  The 
data from WEKA and KEEL tools were exported to Microsoft Excel for analysis with variables being 
grouped in the form of tables, charts and graphs for easy understanding. 

Ethical Issues 

This study used data mining tools to extract data from the Halostudy system in order to understand 
usage patterns amongst users from the onset of the project. However, like many other data mining 
studies, the issues of security, privacy, and individuality of data need to be respected and protected to 
make sure that people are judged and treated fairly (van Wel & Royakkers, 2004). In this study, the 
extracted data were treated confidentially by ensuring that the identified variables that might identify 
users were excluded. The excluded information included login credentials, personal user profiles, and 
personal details from quizzes, and the user’s data in discussion activities.  

Findings 
Access per Subject 

The level of user activity for each subject was determined by comparing the access percentage of each 
subject. Therefore, the total number of records for each subject was expressed as the percentage ratio 
of the total number of records in the whole file. Of the 68,827 individual records, there was a small 
difference in access levels across the four subjects with biology having the highest number of accesses 
(19,960) equivalent to 29% compared to other subjects. Chemistry and physics had the lowest with 
23% each as shown in Figure 2. 

 
Figure 2: The percentage of users’ access per subject. 

In Tanzania’s education system, students are allowed to drop chemistry and physics subjects at Form 
II if they are going into arts streams, while biology and mathematics are normally compulsory 
subjects. Therefore, those who accessed biology are likely the same students who accessed 
mathematics. On the other hand, the low percentage in chemistry and physics could be due to the fact 
that some students who dropped these subjects at Form II did not access them in subsequent classes, 
making a slightly small difference in access percentages.  

29

23
25

23

0

5

10

15

20

25

30

35

Biology Chemistry Maths Physics



 235 

Number of Users per Range of Access 

The activity patterns in terms of numbers of most and least active users were analysed. Users were 
categorised into different groups depending on the number of times they accessed the system while 
filtering records with duplicate users. Each group had an access range of 100, starting from 0-100,  
100-200, and so on. The study shows that the majority of users accessed the system between 0-100 
times followed by 102 to 201 times. Few users accessed the system 1400 times plus in the studied 
period.  Figure 3 shows the number of users per range of access for the studied period. 

 

Figure 3: The number of users per range of access in the Halostudy system. 

The study has shown that the access levels of students is moderate, with the majority of students 
accessing the system in a range starting from 0-100 in nearly 14 months.  Clearly, despite 68,827 
individual records available in the system, the majority of students do not access the Halostudy 
regularly. There are many reasons which could have contributed to this irregularity of Halostudy 
access, some of which are discussed in the challenges section in this study. 

Multimedia Access per Subject 

The multimedia content such as audio, video, and animations play a key role in the learning process. 
They are thought to enhance the understanding of abstract and difficult concepts  that cannot be easily 
grasped from words alone (Steinke, Huk, & Floto, 2003). Moreover, they are thought to support 
students with different learning styles by presenting content in a variety of multimedia (video, audio 
and sound) (Woodcock, Burns, Mount, Newman, & Gaura, 2005). As shown in Figure 4, there is 
variability in the usage of multimedia elements, with biology having the highest number (more than 
1500 times) while mathematics has the lowest. 

859

55 21 12 6 5 4 2 2 1 1 1 1 1 1
0

100
200
300
400
500
600
700
800
900

1000

2-1
01

10
2-2

01

20
2-3

01

30
2-4

01

40
2-5

01

50
2-6

01

60
2-7

01

70
2-8

01

90
2-1

00
1

14
02

-15
01

15
02

-16
01

28
02

-29
01

33
02

-34
01

59
02

-60
01

71
02

-72
01

N
um

be
r o

f u
se

rs

Access frequency

Total number of users per range of access

Total



 236 

 
Figure 4: The number of users per multimedia access per subject. 

Multimedia Access per Subject per Form 

The access of multimedia elements was also grouped per year of study from Form I to Form IV, in 
order to determine which cohort had accessed mostly the multimedia elements. The study shows that 
Form II and Form III students had the highest access of multimedia elements throughout the three 
subjects (see Figure 5). Interestingly, only a few Form IV students accessed multimedia elements 
nearly 100 times. 

 
Figure 5: The number of users per multimedia access per class of study. 

 

 

 

1529

645

132

461

0

200

400

600

800

1000

1200

1400

1600

1800

Biology Chemistry Maths Physics

0
50

100

150
200
250

300
350
400

450
500

Form I Form II Form III Form IV

Biology Chemistry Maths Physics



 237 

Access per Month 

The level of user engagement can change over time depending on various factors.  The access 
frequency per month starting from when the system was launched were extracted and analysed.  The 
results show that the most users accessed the system in the month of January 2018, followed by 
November 2017. However, the trend shows that the number of users accessing the system has been 
decreasing towards the end of September 2018. In fact, August 2018 and September 2018 had the 
lowest number of users who accessed the system as shown in Figure 6. 

 
Figure 6: The number of users accessed the system per month. 

The trend indicates that many students tend to access the system between November and December. 
This finding could be due to the fact that that many students tend access the system close to the final 
exams. Final year exams are normally conducted between November and early December in the 
majority of secondary schools in Tanzania. Another possible explanation could be that students tend 
to have access to the Internet during the holidays and outside school premises (Mwakisole et al., 
2018).  The majority of students are normally in holidays in December and January each year in the 
Tanzanian education system. 

Web vs Mobile app Access 

The system’s mobile version was also developed to provide access to those with access to mobile 
phones. Therefore, we were interested to find out the media that students used the most when 
accessing the system. Interestingly, 71.7% of students accessed the system using the web while 28.3% 

0

2000

4000

6000

8000

10000

12000

14000

16000

18000

Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep
2017 2018

Access frequency per Month

Total



 238 

of students accessed the system via mobile app. This result may be explained by the fact that the use 
of mobile phones are strictly prohibited in many secondary schools in Tanzania (Kafyulilo, 2014; 
Kihwele & Bali, 2013) despite many students having access to them (Chambo, Laizer, Nkansah-
Gyekye, & Ndume, 2013; Malero, Ismail, & Manyilizu, 2015; Mwakisole et al., 2018; Tarimo & 
Kavishe, 2017). Therefore, the use of mobile phones could not make much difference in helping 
students to access the system. 

Access per Location 

Users were also categorised on geographical location in order to visualise the accessibility and usage 
of the system across the country. Therefore, the total number of records originating from each region 
was calculated. Generally, an IP address value was used to estimate the geographical location while 
the IPInfo (IPinfo, 2018) was used to convert IP addresses to specific regions. Figure 7 shows the 
distribution of the percentage of users accessing the system per region. 

 
Figure 7: The percentage of users accessing the system per region. 

The findings show that many students who accessed the the system are from big cities with good 
Internet connectivity — Dar es Salaam, Mwanza, and Arusha, in that order. The lowest accessed 
regions were those located in peripheral areas such as Kagera, Mara, and Shinyanga. This finding 
confirms the fact that access to reliable Internet in rural areas is still a problem. The government has 
been making considerable efforts to roll out fiber optical cable, including the East African Submarine 
Cable System, SEACOM, and the East African Marine System, in order to widen access to the Internet 
in the rural areas (Mtebe & Raphael, 2018). It seems, therefore, these initiatives have not benefited 
many users in rural areas. 

Possible Challenges Limiting Access to the Halostudy System 
The study has shown that the students’ access of the Halostudy system is moderate, with the majority 
of students having accessed the system in a range starting from 0-100 in nearly 14 months. It was also 
interesting to note that many students tend to access the system during the   holiday months. Some of 

11

67.1

0.3 0.3

19.9

0.3 0.3 0.8
0

10

20

30

40

50

60

70

80

Ar
us

ha

Da
r e

s S
ala

am
Ka

ge
ra

Ma
ra

Mw
an

za

Na
iro

bi 
Ar

ea

Sh
iny

an
ga

Za
nz

iba
r U

rba
n



 239 

the possible challenges that could have hindered the access and use of Halostudy system are as 
follows: 

Lack of Computers in Schools 

The government and its partners have been making considerable efforts towards equipping schools 
with computers and other ICT facilities. Recently, the government equipped approximately 31.4% of 
government secondary schools (out of 3,601) with computers ranging from 1 to 68 computers, with 
20.1% them being connected to the Internet (MoEST, 2017). Similarly, Halotel supported 400 schools 
and the Tigo firm supported 700 schools with computers connected to the Internet in selected regions 
of the country (Kazoka, 2016; Tanzania TELECOMS, 2016). Despite these efforts, and many others, 
many secondary schools in Tanzania do not have computers (Muhoza, Tedre, Aghaee, & Hansson, 
2014) which limits students from accessing the system. It should be noted that the Halostudy is an 
Internet based system and therefore schools need to have computers connected to the Internet.   

The Cost of Internet Connectivity  

The cost of Internet remains a major challenge to the access of eLearning systems in Tanzania. The cost 
of connecting 300 computers was estimated to be 4 million TShs (2140€ ≈ 3100$) per month for a 
dedicated 704kb/128kb satellite connection (Tedre, Ngumbuke, & Kemppainen, 2010). This is 
definitely unaffordable to many secondary schools in Tanzania, given the fact that they depend on 
government funds to run most of their services. Although use of the mobile Internet could be a 
solution to the majority of schools, the cost of Internet bundles provided by many mobile firms is still 
high (Mtebe & Raphael, 2018). For instance, the subscription of 10GB of Internet cost around US$ 25 
per month, which is expensive to the majority of students. In a study conducted in seven schools in 
Dar es Salaam, it was found that the majority of students were paying less than Tsh 1000/ = (US $ 0.5) 
for the Internet per week via their mobile devices. Despite the availability of special student bundles 
(1GB per week @ Tsh 1500 [0.6 USD]), many students cannot afford them (Ghasia, De Smet, 
Machumu, & Musabila, 2018). 

Attitudes on the Use of Mobile Phones 

Studies have shown that mobile phones can compensate for a lack of existing infrastructure and 
erratic Internet connections in sub-Saharan Africa and Tanzania, in particular (Chambo et al., 2013; 
Ghasia et al., 2018; Joyce-Gibbons et al., 2018). However, teachers’ and parents’ negative perceptions 
and attitudes towards students using mobile phones in schools continues to be a limiting factor. 
Teachers and parents believe that mobile phones have a detrimental effect on student performance 
and moral values (Kihwele & Bali, 2013) and that students tend to misuse them by watching 
pornographic and entertainment materials instead of studying (Kafyulilo, 2014). Therefore, while it 
might be possible for students to access these devices informally, they cannot bring them to school or 
use them regularly at home limiting the possibilities of using them for accessing eLearning systems.  

Inadequate ICT Skills 

The use of Halostudy requires students to have the skills of using computers and the Internet. 
Nonetheless, many students do not have adequate skills to use ICT facilities and the Internet, 
especially in rural areas (Barakabitze, Kitindi, Sanga, Kibirige, & Makwinya, 2015; Tedre et al., 2010). 
The low number of students who accessed the system could be partly due to the lack of ICT and 
Internet skills amongst students in Tanzania.  



 240 

Lack of Awareness of the System 

Another challenge that could have limited students’ access to the system is lack of awareness among 
students of the existence of the system. The college and Halotel mobile firm have been advertising this 
system via social media and some selected radios in Tanzania. Due to the large population size of 
students and limited advertisement budgets, it is unlikely that many students are aware of this 
system.  

Conclusion 
The adopting and use of eLearning systems in enhancing the quality of teaching and learning at 
various levels of education in Africa will continue to increase given the proliferation of mobile phones 
and the Internet. With these systems continuing to generate massive amounts of new data through the 
data log, it is important to help educators with tools that will help them to understand the status of 
students’ learning and finding ways of helping struggling students. The use of data mining tools can 
effectively utilise existing generated data in eLearning systems to provide feedback for instructors 
about the efficiency of education such as the quality of students’ postings, visualisation usage 
behaviors, and engagement levels. 

This study aimed to demonstrate how the existing data mining tools can be used to provide important 
information about students’ access in the system implemented in secondary schools in sub-Saharan 
Africa. To do so, the study utilised data from a Halostudy system log to investigate students’ usage 
patterns in the system implemented in secondary schools in Tanzania using WEKA and KEEL as data 
mining tools. Using a total of 68,827 individual records accessed in nearly 14 moths, the study was 
able to generate useful usage patterns that can help educators to make informed decisions in finding 
strategies that will maximise system usage.  

Generally, the usage of the system has been moderate and has been declining almost every month. 
Declining usage is an important indication that the anticipated benefits may not be realised. These 
findings call for immediate action in order to find ways of ensuring that users use the system. The 
findings of this study have shown that data mining tools can be used to show usage patterns of 
systems implemented in sub-Saharan Africa through the use of system log data. However, one 
notable weakness of this study is that the quantitative data do not provide explanations of such trends 
and patterns. For instance, reasons for multimedia access per subject were high for biology compared 
to other subjects that could not be revealed. A mixed study would have been appropriate in 
complementing the findings obtained from the quantitative data. A further study with more focus on 
qualitative data is therefore recommended. 

References 
Barakabitze, A. A., Kitindi, E. J., Sanga, C., Kibirige, G., & Makwinya, N. (2015). Exploring students’ skills and 

attitudes on effective use of ICTs: Case study of selected tanzanian public secondary schools. Universal 
Journal of Educational Research, 3(6), 407–425. https://doi.org/10.13189/ujer.2015.030609 

Bervell, B., & Umar, I. N. (2018). Blended learning or face-to-face? Does tutor anxiety prevent the adoption of 
Learning Management Systems for distance education in Ghana? Open Learning: The Journal of Open, Distance 
and e-Learning, 1-19. https://doi.org/10.1080/02680513.2018.1548964 

 



 241 

Chambo, F., Laizer, L., Nkansah-Gyekye, Y., & Ndume, V. (2013). Mobile Learning Model for Tanzania 
Secondary Schools: Case study of Kilimanjaro Region. In Pan African International Conference on Information 
Science, Computing and Telecommunications (2013) (Vol. 4, pp. 698–701). 
https://doi.org/10.1109/SCAT.2013.7055102 

CSSC. (2014). the performance enhancement by e-learning for secondary schools. Retrieved September 21, 2019, from 
https://cssc.or.tz/ 

Ernández, A. F., Uengo, J. L., & Errac, J. D. (2011). KEEL: A software tool to assess evolutionary algorithms for 
data mining problems (regression, classification, clustering, pattern mining and so on). Journal of Multiple-
Valued Logic and Soft Computing, 17, 255–287. Retrieved from http://sci2s.ugr.es/keel/dataset.php?cod=99 

Estacio, R. R., & Raga, R. C. (2017). Analyzing students online learning behavior in blended courses using 
Moodle. Asian Association of Open Universities Journal, 12(1), 52–68. https://doi.org/10.1108/AAOUJ-01-2017-
0016 

Filippidi, A., Tselios, N., & Komis, V. (2010). Impact of Moodle usage practices on students’ performance in the 
context of a blended learning environment. In Social applications for lifelong learning (pp. 1–6). Patra, Greece,. 
Retrieved from 
https://www.academia.edu/425665/Impact_of_Moodle_usage_practices_on_students_performance_in_the_c
ontext_of_a_blended_learning_environment 

Ghasia, M. A., De Smet, E., Machumu, H., & Musabila, A. (2018). Towards mobile learning deployment in higher 
learning institutions: A report on the qualitative inquiries conducted in four universities in Tanzania. Afrika 
Focus, 31(1). https://doi.org/10.21825/af.v31i1.9042 

Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., & Witten, I. H. (2009). The WEKA data mining 
software. ACM SIGKDD Explorations Newsletter, 11(1), 10. https://doi.org/10.1145/1656274.1656278 

Holmes, G., Donkin, A., & Witten, I. H. (1994). WEKA: A machine learning workbench. In Proceedings, Second 
Australia and New Zealand Conference on Intelligent Information Systems (pp. 27–29). 
https://doi.org/10.1177/003693300505000111 

Hung, J.-L., & Zhang, K. (2008). Revealing online learning behaviors and activity patterns and making 
predictions with data mining techniques in online teaching. MERLOT Journal of Online Learning and Teaching, 
4(4), 426–437. https://doi.org/10.1002/mma.4340 

IPinfo. (2018). IP data for all your business needs. Retrieved from https://ipinfo.io/ 
Jo, I.-H., Kim, D., & Yoon, M. (2014). Analyzing the log patterns of adult learners in LMS using learning 

analytics. In Proceedings of the Fourth International Conference on Learning Analytics and Knowledge - LAK ’14 
(pp. 183–187). New York, New York, USA: ACM Press. https://doi.org/10.1145/2567574.2567616 

Joyce-Gibbons, A., Galloway, D., Mollel, A., Mgoma, S., Pima, M., & Deogratias, E. (2018). Mobile phone use in 
two secondary schools in Tanzania. Education and Information Technologies, 23(1), 73–92. 
https://doi.org/10.1007/s10639-017-9586-1 

Kadoic, N., & Oreski, D. (2018). Analysis of student behavior and success based on logs in Moodle. 2018 41st 
International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 
2018 - Proceedings, (pp. 654–659). https://doi.org/10.23919/MIPRO.2018.8400123 

Kafyulilo, A. (2014). Access, use and perceptions of teachers and students towards mobile phones as a tool for 
teaching and learning in Tanzania. Education and Information Technologies, 19(1), 115–127. 
https://doi.org/10.1007/s10639-012-9207-y 

Kalinga,  A. E., Bagile, R. B. B., & Trojer, L. (2006). An Interactive e-Learning Management System ( e-LMS ): A 
solution to Tanzanian Secondary Schools’ education. International Journal of Human and Social Sciences, 1(4), 
252–255. Retrieved from https://waset.org/publications/14641/an-interactive-e-learning-management-
system-e-lms-a-solution-to-tanzanian-secondary-schools-education 



 242 

Kazoka, L. (2016, December 3). Tanzania: Halotel offers free Internet to 400 schools. Tanzania Daily News. 
Retrieved from http://allafrica.com/stories/201609150157.html 

Kihwele, J. E., & Bali, T. A. L. (2013). The perceptions of teachers, parents and students on the effects of mobile 
phone use on student learning in Tanzania. Journal of Education and Practice, 4(25), 101–107. Retrieved from 
http://www.iiste.org/Journals/index.php/JEP/article/view/9050/9277 

Kotsiantis, S., Tselios, N., Filippidi, A., & Komis, V. (2013). Using learning analytics to identify successful 
learners in a blended learning course. International Journal of Technology Enhanced Learning, 5(2), 133–150. 
https://doi.org/10.1504/IJTEL.2013.059088 

Lotsari, E., Verykios, V. S., Panagiotakopoulos, C., & Kalles, D. (2014). A learning analytics methodology for 
student profiling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence 
and Lecture Notes in Bioinformatics) (Vol. 8445 LNCS, pp. 300–312). https://doi.org/10.1007/978-3-319-07064-
3_24 

Lwoga, E. T. (2014). Critical success factors for adoption of web-based learning management systems in 
Tanzania. International Journal of Education and Development Using Information and Communication Technology 
(IJEDICT), 10(1), 4–21. Retrieved from https://eric.ed.gov/?id=EJ1071193 

Macfadyen, L. P., & Dawson, S. (2010). Mining LMS data to develop an “early warning system” for educators: A 
proof of concept. Computers & Education, 54(2), 588–599. https://doi.org/10.1016/j.compedu.2009.09.008 

Malero, A., Ismail, A., & Manyilizu, M. (2015). ICT usage readiness for private and public secondary schools in 
Tanzania, a case of Dodoma Municipality. International Journal of Computer Applications, 129(3), 2013–2016. 
https://doi.org/10.5120/ijca2015906791 

MoEST. (2017). Feasibility study exploring e-learning initiatives at secondary schools in Tanzania mainland. Dodoma, 
Tanzania. 

Mselle, L. J., & Kondo, T. S. (2013). Results on implementing personal learning environments in Tanzanian 
secondary schools. In 8th International Conference for Internet Technology and Secured Transactions, ICITST 2013 
(pp. 417–421). https://doi.org/10.1109/ICITST.2013.6750234 

Msoka, V. C., Mtebe, J. S., Kissaka, M. M., & Kalinga, E. C. (2015). Developing and piloting interactive physics 
experiments for secondary schools in Tanzania. Journal of Learning for Development - JL4D, 2(2). Retrieved 
from http://jl4d.org/index.php/ejl4d/article/view/121 

Mtebe, J. S., Kibga, E. Y., Mwambela, A. A., & Kissaka, M. M. (2015). Developing multimedia enhanced content 
to upgrade subject content knowledge of secondary school teachers in Tanzania. Journal of Learning for 
Development - JL4D, 2(3), 29–44. Retrieved from http://jl4d.org/index.php/ejl4d/article/view/101/111 

Mtebe, J. S., & Kissaka, M. M. (2015). Heuristics for evaluating usability of learning management systems in 
Africa. In P. Cunningham & M. Cunningham (Eds.), IST-Africa 2015 Conference Proceedings (pp. 1–13). 
Lilongwe, Malawi. https://doi.org/10.1109/ISTAFRICA.2015.7190521 

Mtebe, J. S., Kondoro, A., Kissaka, M. M., & Kibga, E. (2015). Using SMS mobile technology to assess the mastery 
of subject content knowledge of science and mathematics teachers of secondary schools in Tanzania. World 
Academy of Science, Engineering and Technology, 9(11), 3498–3506. Retrieved from 
http://waset.org/publications/10002939/using-sms-mobile-technology-to-assess-the-mastery-of-subject-
content-knowledge-of-science-and-mathematics-teachers-of-secondary-schools-in-tanzania 

Mtebe, J. S., Mbwilo, B., & Kissaka, M. M. (2016). Factors influencing teachers’ use of multimedia enhanced 
content in secondary schools in Tanzania. International Review of Research in Open and Distributed Learning, 
17(2), 65–84. https://doi.org/10.19173/irrodl.v17i2.2280 

Mtebe, J. S., & Raisamo, R. (2014). Investigating perceived barriers to the use of Open Educational Resources in 
higher education in Tanzania. International Review of Research in Open and Distance Learning, 15(2), 43–65. 
https://doi.org/10.19173/irrodl.v15i2.1803 



 243 

Mtebe, J. S., & Raphael, C. (2018). A critical review of elearning research trends in Tanzania. Journal of Learning 
for Development - JL4D, 5(2), 163–178. Retrieved from http://jl4d.org/index.php/ejl4d/article/view/269 

Muhoza, O. U., Tedre, M., Aghaee, N., & Hansson, H. (2014). Viewpoints to ICT practices and hindrances from 
in tanzanian secondary schools and teacher training colleges: Focus on classroom teachers. In Proceedings - 
2014 International Conference on Teaching and Learning in Computing and Engineering, LATICE 2014 (pp. 133–
140). https://doi.org/10.1109/LaTiCE.2014.31 

Mwakisole, K. F., Kissaka, M. M., & Mtebe, J. S. (2018). Feasibility of cloud computing implementation for 
eLearning in secondary schools in Tanzania. International Journal of Education and Development Using 
Information and Communication Technology (IJEDICT), 14(1), 91–102. Retrieved from 
http://ijedict.dec.uwi.edu/viewarticle.php?id=2396 

Mwalumbwe, I., & Mtebe, J. S. (2017). Using learning analytics to predict students’ performance in Moodle 
Learning Management Systems: A case of Mbeya University of Science and Technology. The Electronic 
Journal of Information Systems in Developing Countries, 79(1), 1–13. https://doi.org/10.1002/j.1681-
4835.2017.tb00577.x 

Naveh, G., Tubin, D., & Pliskin, N. (2012). Student satisfaction with learning management systems: a lens of 
critical success factors. Technology, Pedagogy and Education, 21(3), 337–350. 
https://doi.org/10.1080/1475939X.2012.720413 

Podgorelec, V., & Kuhar, S. (2011). Taking advantage of education data: Advanced data analysis and reporting 
in virtual learning environments. Elektronika Ir Elektrotechnika, 8(8), 111–116. 
https://doi.org/10.5755/j01.eee.114.8.708 

Romero, C., Espejo, P. G., Zafra, A., Romero, J. R., & Ventura, S. (2013). Web usage mining for predicting final 
marks of students that use Moodle courses. Computer Applications in Engineering Education, 21(1), 135–146. 
https://doi.org/10.1002/cae.20456 

Romero, C., & Ventura, S. (2013). Data mining in education. Wiley Interdisciplinary Reviews: Data Mining and 
Knowledge Discovery, 3(1), 12–27. https://doi.org/10.1002/widm.1075 

Romero, C., Ventura, S., & García, E. (2008). Data mining in course management systems: Moodle case study 
and tutorial. Computers and Education, 51(1), 368–384. https://doi.org/10.1016/j.compedu.2007.05.016 

ShuleDirect. (2019). Shuledirect elearning system. Retrieved from https://www.shuledirect.co.tz/ 
Ssekakubo, G., Suleman, H., & Marsden, G. (2011). Issues of adoption: Have e-Learning Management Systems 

fulfilled their potential in developing countries? In Proceedings of the South African Institute of Computer 
Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, 
Multidisciplinary Environment (pp. 231–238). Cape Town, South Africa: ACM New York, NY, USA ©2011. 
https://doi.org/0.1145/2072221.2072248 

Steinke, M., Huk, T., & Floto, C. (2003). Helping teachers develop computer animations for improving learning 
in science education. In Society for Information Technology and Teacher Education International Conference (pp. 
3022–3025). Chesapeake, VA: AACE. Retrieved from http://www.editlib.org/p/18622 

Tanzania TELECOMS. (2016, July 31). 700 schools in Tanzania to be connected to the Internet. Retrieved from 
http://tanzaniainvest.com/telecoms/schools-internet-connectivity-tigo 

Tarigan, J. (2011). Factors influencing user's satisfaction on eLearning systems. Jurnal Manajemen Dan 
Kewirausahaan, 13(2), 177–188. https://doi.org/10.9744/jmk.13.2.177-188 

Tarimo, R., & Kavishe, G. (2017). Internet access and usage by secondary school students in Morogoro 
Municipality, Tanzania. International Journal of Education and Development Using Information and 
Communication Technology (IJEDICT), 13(2), 56–69. Retrieved from 
http://ijedict.dec.uwi.edu/viewarticle.php?id=2338 



 244 

Tedre, M., Ngumbuke, F., & Kemppainen, J. (2010). Infrastructure, human capacity, and high hopes: A decade of 
development of e-Learning in a Tanzanian HEI. Redefining the Digital Divide in Higher Education, 7(1), 7–20. 
Retrieved from https://core.ac.uk/download/pdf/39015490.pdf 

Unwin, T., Kleessen, B., Hollow, D., Williams, J., Oloo, L. M., Alwala, J., … Muianga, X. (2010). Digital learning 
management systems in Africa: Myths and realities. Open Learning: The Journal of Open and Distance Learning, 
25(1), 5–23. https://doi.org/10.1080/02680510903482033 

Van Wel, L., & Royakkers, L. (2004). Ethical issues in web data mining. Ethics and Information Technology, 6(2), 
129–140. https://doi.org/10.1023/B:ETIN.0000047476.05912.3d 

Venter, P., Rensburg, M. J. van, & Davis, A. (2012). Drivers of learning management system use in a South 
African open and distance learning institution. Australasian Journal of Educational Technology, 28(2), 183–198. 
https://doi.org/10.14742/ajet.868 

Wen, M., & Rose, C. P. (2014). Identifying latent study habits by mining learner behavior patterns in massive 
open online courses. Proceedings of the 23rd ACM International Conference on Conference on Information and 
Knowledge Management - CIKM ’14, (Vi), 1983–1986. https://doi.org/10.1145/2661829.2662033 

Woodcock, A., Burns, J., Mount, S., Newman, R., & Gaura, E. (2005). Animating pervasive computing. 23rd 
Annual International Conference on Design of Communication: Documenting & Designing for Pervasive Information. 
Coventry, United Kingdom: ACM. https://doi.org/10.1145/1085313.1085341 

Yu, T., & Jo, I. (2014). Educational technology approach toward learning analytics : Relationship between student 
online behavior and learning performance in Higher Education. In Proceedings of the Fourth International 
Conference on Learning Analytics and Knowledge (pp. 269–270). Indianapolis, IN, USA. 
https://doi.org/10.1145/2567574.2567594 

Zorrilla, M., García, D., & Álvarez, E. (2010). A decision support system to improve e-learning environments. In 
Proceedings of the 1st International Workshop on Data Semantics - DataSem ’10 (p. 1). 
https://doi.org/10.1145/1754239.1754252 

Authors: 
Joel S. Mtebe, is a Senior Lecturer in Computer Science and the Director of the Center for Virtual Learning 
(CVL) at the University of Dar es Salaam, Tanzania. He received a B.Sc. in Computer Science and Statistics from 
the University of Dar es Salaam (UDSM), Dar es Salaam, Tanzania in 2002, a Master’s of Online Education from 
the University of Southern Queensland, Australia in 2004, and his doctoral degree in Interactive 
Technology/Human Computer Interaction from the University of Tampere in Finland in 2014. Email: 
jmtebe@gmail.com 
Aron W. Kondoro, received his Master of Science degree in Information and Communication Systems Security 
from the Royal Institute of Technology (KTH - Sweden) in 2012 and Bachelor of Science in Computer Science 
from the University of Dar es Salaam in 2007. He is an Assistant Lecturer in the Computer Science and 
Engineering (CSE) Department at the University of Dar-es-Salaam (UDSM), Tanzania. Currently, he is a PhD 
student at KTH/UDSM doing research in using smart grid technologies to design and implement more efficient, 
reliable and autonomous solar-driven microgrids for off-grid rural communities. His other research and 
consultancy activities are focused on analysing ICT systems security and using mobile technologies to design 
applications for educational and financial use-cases. Email: awkondoro@gmail.com 
 
 
Cite this paper as: Mtebe, J. S., & Kondoro, A. W. (2019). Mining Students’ Data to Analyse Usage Patterns in eLearning 
Systems of Secondary Schools in Tanzania. Journal of Learning for Development 6(3), 228-244.