Big Data Analytics: Student Performance Prediction Using Feature Selection and Machine Learning on Microsoft Azure Platform

Authors

  • Wattana Punlumjeak Department of Computer Engineering,Faculty of Engineering, Rajamangala University of Technology Thanyaburi, Pathumthani, Thailand.
  • Nachirat Rachburee Department of Computer Engineering,Faculty of Engineering, Rajamangala University of Technology Thanyaburi, Pathumthani, Thailand.
  • Jedsada Arunrerk Department of Computer Engineering,Faculty of Engineering, Rajamangala University of Technology Thanyaburi, Pathumthani, Thailand.

Keywords:

Big Data Analytics, Feature Selection, Microsoft Azure, Student Prediction,

Abstract

In recent years, big data analytics has been a new growing research area and the essence of cloud computing is used to support a shared pool of resources. In educational mining, the huge volume of student data needs analytics technologies to extract valuable knowledge. It has been recognized that a high performance accuracy of student prediction model will be helpful for student and stakeholders. In this experiment, feature selection methods were proposed to identify the most significant and intrinsic features before classification methods were used. Experiment was conducted to evaluate the performance of the prediction model. The result of the experiment showed that mutual information in feature selection method with neural network classifier gave the best overall accuracy at 90.60% for student’s data at Rajamangala University of Technology Thanyaburi. This experiment is extremely useful for students, teachers and management to find useful knowledge not only in identifying the problem areas and reasons that affect student’s performance, but also in understanding the feature selection and classification methods, which are the most effective way to analyze student’s performance on a cloud computing environment.

References

Chen, M., S. Mao, and Y. Liu, Y., “Big Data: A Survey”, Mobile Networks and Applications, vol.19, no.2, 2014, pp. 171-209.

Swan, M., “Philosophy of Big Data: Expanding the Human-Data Relation with Big Data Science Services”, IEEE First International Conference on Big Data Computing Service and Applications, 2015, pp. 468-477.

Meetali, “From Big Data to Big Values: A Big Science Leading to a Revolution”, 2015 2nd International Conference on Computing for Sustainable Global Development, 2015, pp.56-59.

Khurana, A., “Bringing Big Data Systems to the Cloud”, IEEE Cloud Computing, (3), 2014, pp.72-75.

Roloff, E., and et al., “Evaluating high performance computing on the windows azure platform”, 2012 IEEE 5th International Conference on Cloud Computing, 2012, pp.803-810.

Akinbi, A., E. Pereira, and C. Beaumont, “Evaluating Security Mechanisms Implemented On Public Platform-As-A-Service Cloud Environments Case Study: Windows Azure”, 2013 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013), 2013, pp.162-167.

Yi, J., and et al., “Cloud-Based Educational Big Data Application of Apriori Algorithm and K-Means Clustering Algorithm Based on Students' Information”, 2014 IEEE Fourth International Conference on Big Data and Cloud Computing, 2014, pp.151-158.

Li, X., and Y. Li., “The Reform of Vocational Colleges' Teaching

Method in the Age of Big Data--Based on PHP Programming”, 2015 IEEE Fifth International Conference on Big Data and Cloud Computing, 2015, pp. 99-103.

Cen, L., D. Ruta, and J. Ng., “Big education: Opportunities for Big Data analytics”, 2015 IEEE International Conference on Digital Signal Processing, 2015, pp.502-506.

Liu, W. and D. Chen., “Big Data Classification Based On Multi-View Method”, 2015 International Conference on Wavelet Analysis and Pattern Recognition, 2015, pp.165-170.

Mustafa, M. N., L. Chowdhury, and M. S. Kamal., “Students Dropout Prediction For Intelligent System From Tertiary Level In Developing Country”, 2012 International Conference on Informatics, Electronics & Vision, 2012, pp.113-118.

Alsaffar, A. and N. Omar., “Study on feature selection and machine learning algorithms for Malay sentiment classification”, 2014 International Conference on Information Technology and Multimedia,2014, pp.270-275.

Wolf, L. and S. Bileschi., “Combining variable selection with dimensionality reduction”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2, 2005, pp. 801-806.

Biesiada, J. and W. Duch., “Feature Selection For High-Dimensional Data: A Pearson Redundancy Based Filter”, Computer Recognition Systems 2. Springer Berlin Heidelberg, 2007, pp.242-249.

Peng, H., F. Long, and C. Ding., “Feature Selection Based On Mutual Information Criteria Of Max-Dependency, Max-Relevance, And MinRedundancy”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, no.8, 2005, pp.1226-1238.

Punlumjeak w. and N. Rachburee, “A Comparative Study of Feature Selection Techniques for Classify Student Performance”, IEEE 7th International Conference on Information Technology and Electrical Engineering, 2015, pp.425-429.

Downloads

Published

2017-03-15

How to Cite

Punlumjeak, W., Rachburee, N., & Arunrerk, J. (2017). Big Data Analytics: Student Performance Prediction Using Feature Selection and Machine Learning on Microsoft Azure Platform. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 9(1-4), 113–117. Retrieved from https://jtec.utem.edu.my/jtec/article/view/1791