Students Performance Using Data Mining Techniques Education Essay

Published: Last Edited:

This essay has been submitted by a student. This is not an example of the work written by our professional essay writers.


Nowadays; The Internet has become a common medium that improves the education. E-learning is a process in which education is in digital learning method. E-learning mainly focuses on learner-centric training rather than teacher-centric training, which has been in practice in traditional teaching. The amount of data stored in educational database increasing rapidly without any benefit to the management These databases contain hidden information for improvement of students' performance. In our case study, we will use educational data mining to analyze the data available from the students' database and bring out the hidden knowledge from it, we will apply data mining techniques to discover association, classification, clustering and outlier detection rules. In each of these four tasks, we will extract knowledge that describes students' behavior.

Keywords: E-Learning, Students' Performance, Data Mining.


The Internet has become a pervasive medium that has changed completely the education environment, and the way of knowledge that shared throughout the world. It provides an easy way to search and access to any information that you need. E- Learning has made knowledge accessible to a large number of people. There are increasing research interests in using data mining in education. This new emerging field, called Educational Data Mining, Data mining concepts and techniques can be applied in E-Learning to discover knowledge that comes from educational environments. There are many Data Mining techniques such as Naïve Bayes, Neural Networks, Decision Trees, K- Nearest neighbor, and many others, to offer useful knowledge about the learning process for instructors.

Data mining known as Knowledge Discovery in Database. Data mining techniques are used to extract or "mining" knowledge and discover hidden patterns from large volumes of data, this knowledge is helpful in decision making, data mining is actually part of the knowledge discovery process Baradwaj, B. and Pal, S. (2011). Mining online learning events is becoming promising area for research and development, particularly when the business in education is growing impressively Margo Hanna (2004). Higher education is become a big business, with growing of IT technology supporting online learning. The world is going fast towards online learning, so we can see a lot of open universities that provide courses online through the Internet. So one can study and take the exam and get certified whenever and wherever he/she wants Margo Hanna (2004).

Romero, C. and Ventura, S. (2007), have a survey on educational data mining between 1995 and 2005. They concluded that there is growing interest in data mining and the evaluation of online educational systems, educational data mining a rising and promising area of research. Al-Radaideh, Q., Al-Shawakfa, E. and Al-Najjar, M. (2006). applied the data mining techniques to evaluate student data and study the main attributes that may affect the student performance in courses. The extracted classification rules are based on the decision tree as a classification method. It allows students to predict the final grade in a course under study. Baradwaj, B. and Pal, S. (2011), used the classification task to evaluate student' performance, they used decision tree method for classification. The goal of their study is to extract knowledge that describes students' performance in end semester examination. This study helps earlier in identifying the dropouts and students who need special attention and allow the teacher to provide appropriate advising.

Chandra and Nandhini Chandra, E. and Nandhini, K. (2010), applied the association rule on students' failed courses to identify the reason for students' failure patterns courses and suggest relevant causes of the failure to improve the low capacity students' performances. It also reveals some hidden patterns of students failed courses which could serve as base for academic planners in making academic decisions and an aid in the curriculum re-structuring and modification, it will help to improve students' performance and reducing failure rate

Mohammed M and Alaa M (2012), applied educational data mining to extract useful knowledge from graduate students data to improve graduate students' performance, and overcome the problem of low grads of graduate students.

Research problem

Students are the main assets of educational institutions. The students' performance is an important factor to produce the best quality graduates. Students have to place the greatest effort in their study to obtain a good grade to achieve their academic carrier. Many factors could act as barrier and catalyst to students, these factors will reflect on student to achieve a high grade or low grade.

Universiti Sains Islam Malaysia (USIM) does not use any knowledge discovery process approach to get knowledge about student' performance, decision making in educational system need to use these knowledge:

To overcome the problem of low grades of graduate students.

Identifying the dropouts and students who need special attention and allow the teacher to provide appropriate advising/counseling.

Scope Of Research

In this research we will extract knowledge from graduate student data collected from the Faculty of Science and Technology, USIM. The data include five years from 2007-2012. We will use data mining method to predict their performance to get helpful knowledge that will improve the graduate quality.

Research Objective

To study students‟ performance in e-learning using data mining methodology

Discover hidden information of students' performance.

To identify the weak students and help to score better mark and reduce fail ratio.

To improve the performance of the students.

Identify those students which needed special attention.

Research Outcome

Provide to the decision maker helpful constructive recommendation to overcome the problem of low grade of graduate students.

Improve students' academic performance.

Research Question

What is the current level of graduate students at USIM?

Do USIM' decision maker have a good knowledge about students' performance?

Do students need special attention to improve their performance?

Research Methodology

The methodology used in this research starts from the problem definition, then data collection and preprocessing, then we come to the data mining methods which are association, classification, clustering, and outlier detection, followed by the evaluation of results and patterns, finally the knowledge representation process.

Understanding the domain and problem definition.

Data collection and preprocessing: the data will be collected from Graduate Students data base, and the data will clean and transformed into a mineable format.

Apply data mining techniques: the data mining techniques will apply to discover and summarize knowledge about student performance.

Followed by the evaluation of results and patterns, process.

Finally the knowledge representation.


In this research we will discover hidden information from graduate student data collected from the Faculty of Science and Technology in USIM. The data include five years from 2007-2012. This data existing and available in the USIM database, particularly we will use association rules to discovering, Rule Induction, Neural network, Naïve bayes, and Decision Tree to predict grade of students. The result will contain helpful knowledge, this knowledge can provide to decision maker to improve the graduate quality.