Educational Data Mining Using Cluster Analysis Methods and Decision
Trees based on Log Mining
Dublin Core
Title
Educational Data Mining Using Cluster Analysis Methods and Decision
Trees based on Log Mining
Trees based on Log Mining
Subject
Analysis Cluster, Decision Tree, Educational Data Mining (EDM), Log, SPADA
Description
Higher education institutions store data keep growing every year. The data has important information, but it still not optimized
into knowledge. Data Mining (DM) can be used to process existing data in universities in order to obtain knowledge that can
be utilized further. Educational Data Mining (EDM) often appears to be applied in big data processing in the education sector.
One of the educational data that can be further processed with EDM is activity log data from an e-learning system used in
teaching and learning activities. The log activity can be further processed more specifically by using log mining. The purpose
of this study was to process log data from the Sebelas Maret University Online Learning System (SPADA UNS) to determine
student learning behavior patterns and their relationship to the final results obtained. The data mining method applied in this
research is cluster analysis with the K-means Clustering and Decision Tree algorithms. The clustering process is used to find
groups of students who have similar learning patterns. While the decision tree is used to model the results of the clustering in
order to enable the analysis and decision-making processes. Processing of 11,139 SPADA UNS log data resulted in 3 clusters
with a Davies Bouldin Index (DBI) value of 0.229. The results of these three clusters are modeled by using a Decision Tree.
The decision tree model in cluster 0 represents a group of students who have a low tendency of learning behavior patterns with
the highest frequency of access to course viewing activities obtained accuracy of 74.42% . In cluster 1, which contains groups
of students with high learning behavior patterns, have a high frequency of access to viewing discussion activities obtained
accuracy of 76.47%. While cluster 2 is a group of students who have a pattern of learning behavior that is having a high
frequency of access to the activity of sending assignments obtained accuracy of 90.00%
into knowledge. Data Mining (DM) can be used to process existing data in universities in order to obtain knowledge that can
be utilized further. Educational Data Mining (EDM) often appears to be applied in big data processing in the education sector.
One of the educational data that can be further processed with EDM is activity log data from an e-learning system used in
teaching and learning activities. The log activity can be further processed more specifically by using log mining. The purpose
of this study was to process log data from the Sebelas Maret University Online Learning System (SPADA UNS) to determine
student learning behavior patterns and their relationship to the final results obtained. The data mining method applied in this
research is cluster analysis with the K-means Clustering and Decision Tree algorithms. The clustering process is used to find
groups of students who have similar learning patterns. While the decision tree is used to model the results of the clustering in
order to enable the analysis and decision-making processes. Processing of 11,139 SPADA UNS log data resulted in 3 clusters
with a Davies Bouldin Index (DBI) value of 0.229. The results of these three clusters are modeled by using a Decision Tree.
The decision tree model in cluster 0 represents a group of students who have a low tendency of learning behavior patterns with
the highest frequency of access to course viewing activities obtained accuracy of 74.42% . In cluster 1, which contains groups
of students with high learning behavior patterns, have a high frequency of access to viewing discussion activities obtained
accuracy of 76.47%. While cluster 2 is a group of students who have a pattern of learning behavior that is having a high
frequency of access to the activity of sending assignments obtained accuracy of 90.00%
Creator
Safira Nury Safitri1
, Haryono Setiadi2
, Esti Suryani3
, Haryono Setiadi2
, Esti Suryani3
Publisher
Universitas Sebelas Maret
Date
30-06-2022
Contributor
Fajar bagus W
Format
PDF
Language
Indonesia
Type
Text
Files
Collection
Citation
Safira Nury Safitri1
, Haryono Setiadi2
, Esti Suryani3, “Educational Data Mining Using Cluster Analysis Methods and Decision
Trees based on Log Mining,” Repository Horizon University Indonesia, accessed June 5, 2025, https://repository.horizon.ac.id/items/show/9173.
Trees based on Log Mining,” Repository Horizon University Indonesia, accessed June 5, 2025, https://repository.horizon.ac.id/items/show/9173.