Machine Learning Algorithms and Predictive Models for Undergraduate Student Retention

Jia, Ji-Wu; Mareboyana, Manohar
July 2013
Proceedings of the World Congress on Engineering & Computer Scie;2013, p1
Conference Proceeding
This paper presents results on undergraduate student retention obtained by classifying student data with machine learning algorithms. We also made improvements to the classification algorithms, including decision trees, Support Vector Machines (SVM), and neural networks, as supported by the Weka software toolkit. The experiments revealed that the main factors influencing student retention in the Historically Black Colleges and Universities (HBCU) are the cumulative grade point average (GPA) and total credit hours (TCH) taken. The target functions derived from the bare minimum decision tree and SVM algorithms were further revised to create a two-layer neural network and a regression model to predict retention. These new models improved the classification accuracy.
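
To make the idea of a minimal decision tree over GPA and TCH concrete, the following is a rough sketch of a one-split tree (a decision stump) in plain Python. It is not the authors' Weka model: the student records, the names STUDENTS, fit_stump, and predict, and the resulting threshold are all hypothetical and invented purely for illustration.

```python
from typing import List, Tuple

# Hypothetical toy records: (cumulative GPA, total credit hours, retained?).
# These values are invented for illustration and are NOT from the paper.
STUDENTS: List[Tuple[float, float, int]] = [
    (3.6, 30.0, 1),
    (3.1, 28.0, 1),
    (2.8, 24.0, 1),
    (2.4, 27.0, 1),
    (1.9, 12.0, 0),
    (1.5, 15.0, 0),
    (2.1, 9.0, 0),
    (1.2, 20.0, 0),
]

FEATURES = ("GPA", "TCH")


def fit_stump(rows: List[Tuple[float, float, int]]) -> Tuple[int, float, float]:
    """Return (feature_index, threshold, training_accuracy) of the best split.

    The rule predicts "retained" (1) when the feature value is >= threshold.
    Ties are resolved in favour of the first split found.
    """
    best = (0, 0.0, -1.0)
    for f in range(len(FEATURES)):
        values = sorted({r[f] for r in rows})
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            correct = sum(int(r[f] >= t) == r[2] for r in rows)
            acc = correct / len(rows)
            if acc > best[2]:
                best = (f, t, acc)
    return best


def predict(stump: Tuple[int, float, float], gpa: float, tch: float) -> int:
    """Apply the fitted one-split rule to a single student."""
    f, t, _ = stump
    return int((gpa, tch)[f] >= t)


stump = fit_stump(STUDENTS)
print(f"split on {FEATURES[stump[0]]} at {stump[1]:.2f}, "
      f"training accuracy {stump[2]:.2f}")
```

On this toy data the stump happens to split on GPA, but that outcome is a property of the invented records and says nothing about the paper's dataset or its reported accuracy.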


Related Articles

  • Guidance on and comparison of machine learning classifiers for Landsat-based land cover and land use mapping. Shih, Hsiao-chien; Stow, Douglas A.; Tsai, Yu Hsin // International Journal of Remote Sensing;Feb2019, Vol. 40 Issue 4, p1248 

    Remote sensing scientists are increasingly adopting machine learning classifiers for land cover and land use (LCLU) mapping, but model selection, a critical step of the machine learning classification, has usually been ignored in the past research. In this paper, step-by-step guidance (for...

  • Land Cover/Use Classification Based on Feature Selection. Zhang, Yuwei; Liu, Jinting; Wan, Luhe; Qi, Shaoqun // Journal of Coastal Research;Mar2015 Supplement, Vol. 73, p380 

    Zhang, Y.; Liu, J.; Wan, L., and Qi, S., 2015. Land cover/use classification based on feature selection. Based on Support Vector Machine (SVM) and decision trees (DTs) classification methods, in this paper, we take Harbin, Daqing, Suihua, and Qiqihar in Heilongjiang Province as study area for...

  • Joint wall clutter mitigation and data-driven model for through-wall detection. Wang, F. F.; Zhang, Y. R.; Zhang, H. M. // International Journal of Remote Sensing;Sep2016, Vol. 37 Issue 18, p4486 

    Wall effects such as the strong front wall return and the propagation of the signal as it travels through the wall must be taken into account in through-wall detection. First of all, compensation of through-wall propagation effects such as attenuation and dispersion should be considered in...

  • Evaluation of Different Machine Learning Methods for Caesarean Data Classification. Alsharif, O. S. S.; Elbayoudi, K. M.; Aldrawi, A. A. S.; Akyol, K. // International Journal of Information Engineering & Electronic Bu;Sep2019, Vol. 11 Issue 5, p19 

    Recently, a new dataset has been introduced about the caesarean data. In this paper, the caesarean data was classified with five different algorithms; Support Vector Machine, K Nearest Neighbours, Naïve Bayes, Decision Tree Classifier, and Random Forest Classifier. The dataset is retrieved...

  • Detection & Classification of Network Anomalies using SVM and Decision Tree. Nagar, Mayank; Pandit, Shraddha; Maurya, JayPrakash // International Journal of Computer Science & Information Technolo;2014, Vol. 5 Issue 2, p2338 

    Here in this paper a new technique of detecting network anomalies in the traffic is implemented using the concept of Support vector machine and decision tree. The idea is to first apply clustering of the data traffic using support vector machine and then classifying the network traffic using...

  • A new algorithm for forecasting optimization model. Jun Shi; Entao Huang; Min Zhu; Yun Hu // International Journal of Digital Content Technology & its Applic;Nov2012, Vol. 6 Issue 20, p446 

    Traditional gray system theory based prediction models often involve lots of factors and falls short in their forecast accuracy. By combining rough theory and SVM method in the account, this paper put forward an optimized prediction model. This approach first employs attribute deduction method...

  • An experimental comparison of classification algorithm performances for highly imbalanced datasets. Oreški, Goran; Oreški, Stjepan // Central European Conference on Information & Intelligent Systems;Sep2014, p4 

    Imbalanced learning data often emerges during the process of the knowledge discovery in data and presents a significant challenge for data mining methods. In this paper we investigate the influence of class imbalanced data on: artificial intelligence methods, i.e. neural networks and support...

  • Effect of fusing features from multiple DCNN architectures in image classification. Thangarajah Akilan; Wu, Qingming Jonathan; Hui Zhang // IET Image Processing;May2018, Vol. 12 Issue 7, p1102 

    Automatic image classification has become a necessary task to handle the rapidly growing digital image usage. It has branched out many algorithms and adopted new techniques. Among them, feature fusion-based image classification methods rely on hand-crafted features traditionally. However, it has...

  • Performance Comparison of Intrusion Detection System based Anomaly Detection using Artificial Neural Network and Support Vector Machine. Cahyo, Aditya Nur; Hidayat, Risanuri; Adhipta, Dani // AIP Conference Proceedings;2016, Vol. 1755 Issue 1, p070011-1 

    This study presents a comparison of the detection accuracy of ANN and SVM on the anomaly-based IDS and uses all the features in the dataset. The experiments were performed on two algorithms using KDDCup99 dataset, preprocessing performed on datasets for normalization and scaling attributes which...

