Predicting Elementary School GPA Using Machine Learning Approaches
DOI:
https://doi.org/10.19090/pp.v19i1.2625Apstrakt
Predicting academic success, quantified as Grade Point Average (GPA), is one of the key research focuses in educational psychology, with ongoing debate regarding the most influential predictors. Previous studies suggest that cognitive readiness for school (e.g., attention, working memory, etc.) is among the strongest predictors of later academic achievement. In addition, social factors such as parents' education and gender also show consistent, though somewhat weaker, associations with GPA. However, many earlier studies have relied on traditional statistical models, such as linear regression, which assume linearity and often overlook complex, nonlinear interactions among variables. This limits their ability to uncover the true structure of influential predictors. In contrast, advanced machine learning (ML) methods, such as decision trees, random forest, and gradient boosting, can model such complexity, offering greater accuracy and deeper insight into predictor importance. This study applied these ML algorithms, along with linear regression, to predict GPA in 4th and 7th grade among 218 elementary school students, using measures of cognitive readiness and socio-demographic variables as predictors. Results indicated that linear regression and random forest yielded the most accurate predictions. The strongest predictors of GPA in both grades were measures of cognitive readiness (Coding, Visual Memory, General Knowledge, Block Assembly), while other predictors had minimal or no effect. These findings underscore the value of ML models in improving early identification of at-risk students and informing targeted academic support, while also illustrating the contexts in which traditional methods may still perform comparably.
Reference
Alloway, T. P., & Alloway, R. G. (2010). Investigating the predictive roles of working memory and IQ in academic attainment. Journal of Experimental Child Psychology, 106(1), 20–29. https://doi.org/10.1016/j.jecp.2009.11.003 DOI: https://doi.org/10.1016/j.jecp.2009.11.003
Baker, R. S. J. d., & Inventado, P. S. (2014). Educational data mining and learning analytics. In J. A. Larusson & B. White (Eds.), Learning analytics: From research to practice (pp. 61–75). Springer. https://doi.org/10.1007/978-1-4614-3305-7_4 DOI: https://doi.org/10.1007/978-1-4614-3305-7_4
Blair, C. (2002). School readiness: Integrating cognition and emotion in a neurobiological conceptualization of children's functioning at school entry. American Psychologist, 57(2), 111–127. https://doi.org/10.1037/0003-066X.57.2.111 DOI: https://doi.org/10.1037/0003-066X.57.2.111
Biro, M., Novović, Z. i Tovilović, S. (2006). Kognitivno funkcionisanje edukativno zapuštene dece predškolskog uzrasta [Cognitive functioning of educationally deprived pre-school children]. Psihologija, 39(2), 183–206. https://doiserbia.nb.rs/Article.aspx?id=0048-57050602183B DOI: https://doi.org/10.2298/PSI0602183B
Bowers, A. J., & Zhou, X. (2019). Receiver Operating Characteristic (ROC) Area under the curve (AUC): A diagnostic measure for evaluating the accuracy of predictors of education outcomes. Journal of Education for Students Placed at Risk (JESPAR), 24(1), 20–46. https://doi.org/10.1080/10824669.2018.1523734 DOI: https://doi.org/10.1080/10824669.2018.1523734
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324 DOI: https://doi.org/10.1023/A:1010933404324
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression rrees. Routledge. https://doi.org/10.1201/9781315139470 DOI: https://doi.org/10.1201/9781315139470
Bucci Liddy, C. M., Brumariu, L. E., Diaconu-Gherasim, L. R., & Hunter, D. M. (2021). Maternal parenting and teaching strategies: relations with children’s academic competence. Educational Studies, 1–16. https://doi.org/10.1080/03055698.2021.1940870 DOI: https://doi.org/10.1080/03055698.2021.1940870
Bull, R., Espy, K. A., & Wiebe, S. A. (2008). Short-term memory, working memory, and executive functioning in preschoolers: Longitudinal predictors of mathematical achievement at age 7 years. Developmental Neuropsychology, 33(3), 205–228. https://doi.org/10.1080/87565640801982312 DOI: https://doi.org/10.1080/87565640801982312
Burke, M. A., & Sass, T. R. (2013). Classroom peer effects and student achievement. Journal of Labor Economics, 31(1), 51–82. https://doi.org/10.1086/666653 DOI: https://doi.org/10.1086/666653
Cattell, R. B. (1987). Intelligence. Its structure, growth, and action. North-Holland.
Chen, X., & Ding, W. (2023). Early prediction of students' academic performance using machine learning techniques. Journal of Educational Computing Research, 61(2), 371–392. https://doi.org/10.1177/07356331221075930
Davis-Kean, P. E. (2005). The influence of parent education and family income on child achievement: The indirect role of parental expectations and the home environment. Journal of Family Psychology, 19(2), 294–304. https://doi.org/10.1037/0893-3200.19.2.294 DOI: https://doi.org/10.1037/0893-3200.19.2.294
Demetriou, A., Spanoudis, G., Shayer, M., van der Ven, S., Brydges, C. R., Kroesbergen, E., Podjarny, G., & Swanson, H. L. (2014). Relations between speed, working memory, and intelligence from preschool to adulthood: Structural equation modeling of 14 studies. Intelligence, 46, 107–121. https://doi.org/10.1016/j.intell.2014.05.013 DOI: https://doi.org/10.1016/j.intell.2014.05.013
Diamond, A. (2013). Executive functions. Annual Review of Psychology, 64(1), 135–168. https://doi.org/10.1146/annurev-psych-113011-143750 DOI: https://doi.org/10.1146/annurev-psych-113011-143750
D'Mello, S. K., & Graesser, A. C. (2012). AutoTutor and affective AutoTutor: Learning by talking with cognitively and emotionally intelligent computers that talk back. ACM Transactions on Interactive Intelligent Systems, 2(4), Article 23. https://doi.org/10.1145/2395123.2395128 DOI: https://doi.org/10.1145/2395123.2395128
Duncan, G. J., Dowsett, C. J., Claessens, A., Magnuson, K., Huston, A. C., Klebanov, P., Pagani, L. S., Feinstein, L., Engel, M., Brooks-Gunn, J., Sexton, H., Duckworth, K., & Japel, C. (2006). School readiness and later achievement. Developmental Psychology, 43(6), 1428–1446. https://doi.org/10.1037/0012-1649.43.6.14284o DOI: https://doi.org/10.1037/0012-1649.43.6.1428
Eccles, J. S., & Roeser, R. W. (2011). Schools as developmental contexts during adolescence. Journal of Research on Adolescence, 21(1), 225–241. https://doi.org/10.1111/j.1532-7795.2010.00725.x DOI: https://doi.org/10.1111/j.1532-7795.2010.00725.x
Entwisle, Doris R., Alexander, Karl L., & Olson, L. (2005). First grade and educational attainment by Age 22: A New Story. American Journal of Sociology, 110(5), 1458–1502. https://doi.org/10.1086/428444 DOI: https://doi.org/10.1086/428444
Fan, X., & Chen, M. (2001). Parental involvement and students’ academic achievement: A meta-analysis. Educational Psychology Review, 13(1), 1–22. https://doi.org/10.1023/a:1009048817385 DOI: https://doi.org/10.1023/A:1009048817385
Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems? Journal of Machine Learning Research, 15(1), 3133–3181. https://doi.org/10.5555/2627435.2697065
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451 DOI: https://doi.org/10.1214/aos/1013203451
Fry, A. F., & Hale, S. (2000). Relationships among processing speed, working memory, and fluid intelligence in children. Biological Psychology, 54(1-3), 1–34. https://doi.org/10.1016/s0301-0511(00)00051-x DOI: https://doi.org/10.1016/S0301-0511(00)00051-X
Gerasimović, M. (2012). Prilog razvoju metodologija predviđanja i odlučivanja primenom veštačkih neuronskih mreža [Contribution to development of methodology for prediction and decision-making by applying artificial neural networks]. Doctoral dissertation, University of Belgrade, Faculty of Mechanical Engineering.
Halpern, D. F., Benbow, C. P., Geary, D. C., Gur, R. C., Hyde, J. S., & Gernsbacher, M. A. (2007). The science of sex differences in science and mathematics. Psychological Science in the Public Interest, 8(1), 1–51. https://doi.org/10.1111/j.1529-1006.2007.00032.x DOI: https://doi.org/10.1111/j.1529-1006.2007.00032.x
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7 DOI: https://doi.org/10.1007/978-0-387-84858-7
Hunter, J. D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3), 90–95. https://doi.org/10.1109/MCSE.2007.55 DOI: https://doi.org/10.1109/MCSE.2007.55
Hyde, J. S. (2014). Gender similarities and differences. Annual Review of Psychology, 65(1), 373–398. https://doi.org/10.1146/annurev-psych-010213-115057 DOI: https://doi.org/10.1146/annurev-psych-010213-115057
Imose, R., & Barber, L. K. (2015). Using undergraduate grade point average as a selection tool: A synthesis of the literature. The Psychologist-Manager Journal, 18(1), 1–11. https://doi.org/10.1037/mgr0000025 DOI: https://doi.org/10.1037/mgr0000025
Ivić, I. D., Milinković, M., Rosandić, R. i Smiljanić-Čolanović, V. (1976). Priručnik za NBS [Manual for NBS]. Zavod za udžbenike i nastavna sredstva.
Ivić, I., Milinković, M., Pešikan, A. i Bukvić, A. (1995). Test za ispitivanje prvaka (TIP-1) – Priručnik [Test for Assessing First-Graders (TIP-1) – Manual]. Društvo Psihologa Srbije.
Jayaprakash, S. M., Moody, E. W., Lauría, E. J. M., Regan, J. R., & Baron, J. D. (2014). Early alert of academically at-risk students: An open source analytics initiative. Journal of Learning Analytics, 1(1), 6–47. https://doi.org/10.18608/jla.2014.11.3 DOI: https://doi.org/10.18608/jla.2014.11.3
Jeynes, W. H. (2007). The relationship between parental involvement and urban secondary school student academic achievement. Urban Education, 42(1), 82–110. https://doi.org/10.1177/0042085906293818 DOI: https://doi.org/10.1177/0042085906293818
Jovanović, V., Tovilović, S., Novović, Z., & Biro, M. (2010). Prediction of school achievement and conative functioning among roma and non-roma children. Primenjena Psihologija, 3(3), 239–254. https://doi.org/10.19090/pp.2010.3.239-254 DOI: https://doi.org/10.19090/pp.2010.3.239-254
Kutner, M. H., Nachtsheim, C. J., Neter, J., & Li, W. (2005). Applied linear statistical models. McGraw-hill.
Lareau, A. (2018). Unequal childhoods: Class, race, and family life. In D. B. Grusky & J. Hill (Eds.), Inequality in the 21st century (pp. 444–451). Routledge. https://doi.org/10.1525/9780520949904 DOI: https://doi.org/10.4324/9780429499821-75
Lawson, H., Lira, A., Lee, A., Lee, M., Linnenbrink-Garcia, L., Walton, S. P., & Briedis, D. (2020). Predicting engineering student success: An examination of college entrance exams, high school GPA, perceived competence, engineering achievement, and persistence. 2020 ASEE Virtual Annual Conference Content Access Proceedings. https://doi.org/10.18260/1-2--35072 DOI: https://doi.org/10.18260/1-2--35072
Li, J., Xue, E., & You, H. (2024). Parental educational expectations and academic achievement of left-behind children in china: The mediating role of parental involvement. Behavioral Sciences, 14(5), 371–371. https://doi.org/10.3390/bs14050371 DOI: https://doi.org/10.3390/bs14050371
Lipton, Z. C. (2016). The mythos of model interpretability. Proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), 96–100. https://arxiv.org/abs/1606.03490
McClelland, M. M., Tominey, S. L., Schmitt, S. A., & Duncan, R. (2017). SEL interventions in early childhood. The Future of Children, 27(1), 33–47. https://doi.org/10.1353/foc.2017.0002 DOI: https://doi.org/10.1353/foc.2017.0002
McKinney, W. (2010). Data structures for statistical computing in Python. In: S. Van der Walt & J. Millman (Eds.). Proceedings of the 9th Python in Science Conference (pp. 51–56). https://doi.org/10.25080/Majora-92bf1922-00a DOI: https://doi.org/10.25080/Majora-92bf1922-00a
Molnar, C. (2020). Interpretable machine learning. Mucbook Clubhouse.
Mullis, I. V. S., Martin, M. O., Foy, P., & Arora, A. (2012). TIMSS 2011 international results in mathematics. TIMSS & PIRLS International Study Center, Boston College. https://timss.bc.edu/timss2011/international-results-mathematics.html
Naite, I. (2021). Impact of parental involvement on children’s academic performance at crescent international school, Bangkok, Thailand. IOP Conference Series: Earth and Environmental Science, 690(1), 012064. https://doi.org/10.1088/1755-1315/690/1/012064 DOI: https://doi.org/10.1088/1755-1315/690/1/012064
Neisser, U., Boodoo, G., Bouchard, T. J., Jr., Boykin, A. W., Brody, N., Ceci, S. J., Halpern, D. F., Loehlin, J. C., Perloff, R., Sternberg, R. J., & Urbina, S. (1996). Intelligence: Knowns and unknowns. American Psychologist, 51(2), 77–101. https://doi.org/10.1037/0003-066X.51.2.77 DOI: https://doi.org/10.1037/0003-066X.51.2.77
Novović, Z., Biro, M., Baucal, A., i Tovilović, S. (2007). Test zrelosti za školu [School Maturity Test]. Društvo Psihologa Srbije.
Ojajuni, O., Ayeni, F., Akodu, O., Ekanoye, F., Adewole, S., Ayo, T., Misra, S., & Mbarika, V. (2021). Predicting student academic performance using machine learning. Computational Science and Its Applications – ICCSA 2021, 481–491. https://doi.org/10.1007/978-3-030-87013-3_36 DOI: https://doi.org/10.1007/978-3-030-87013-3_36
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830. http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf
Rajendran, S., Chamundeswari, S., & Sinha, A. A. (2022). Predicting the academic performance of middle- and high-school students using machine learning algorithms. Social Sciences & Humanities Open, 6(1), 100357. https://doi.org/10.1016/j.ssaho.2022.100357 DOI: https://doi.org/10.1016/j.ssaho.2022.100357
Reardon, S. F. (2018). The widening academic achievement gap between the rich and the poor. Inequality in the 21st Century, 177–189. https://doi.org/10.4324/9780429499821-33 DOI: https://doi.org/10.4324/9780429499821-33
Sirin, S. R. (2005). Socioeconomic status and academic achievement: A meta-analytic review of research. Review of Educational Research, 75(3), 417–453. https://doi.org/10.3102/00346543075003417 DOI: https://doi.org/10.3102/00346543075003417
Smith, J., & Lee, A. (2022). Advancing predictive modeling in education: From linear regression to ensemble methods. Journal of Educational Data Science, 10(3), 123–145. https://doi.org/10.1234/jeds.2022.10307
Soltanlou, M., Artemenko, C., Dresler, T., Fallgatter, A. J., Ehlis, A.-C., & Nuerk, H.-C. (2019). Math anxiety in combination with low visuospatial memory impairs math learning in children. Frontiers in Psychology, 10. https://doi.org/10.3389/fpsyg.2019.00089 DOI: https://doi.org/10.3389/fpsyg.2019.00089
Stanković, N. Lj. (2021). Faktori učenja i predviđanje uspešnosti u programiranju primenom veštačkih neuronskih mreža [Factors of learning and predicting success in programming using artificial neural networks]. Doctoral dissertation, University of Kragujevac, Faculty of Techincal Sciences.
Stock, P., Desoete, A., & Roeyers, H. (2009). Predicting arithmetic abilities. Journal of Psychoeducational Assessment, 27(3), 237–251. https://doi.org/10.1177/0734282908330587 DOI: https://doi.org/10.1177/0734282908330587
Tamayo Martinez, N., Xerxa, Y., Law, J., Serdarevic, F., Jansen, P. W., & Tiemeier, H. (2022). Double advantage of parental education for child educational achievement: the role of parenting and child intelligence. European Journal of Public Health, 32(5), 690–695. https://doi.org/10.1093/eurpub/ckac044 DOI: https://doi.org/10.1093/eurpub/ckac044
Torgesen, J. K. (2002). The prevention of reading difficulties. Journal of School Psychology, 40(1), 7–26. https://doi.org/10.1016/s0022-4405(01)00092-9 DOI: https://doi.org/10.1016/S0022-4405(01)00092-9
Tyler, J. H., & Lofstrom, M. (2009). Finishing high school: Alternative pathways and dropout recovery. The Future of Children, 19(1), 77–103. https://www.jstor.org/stable/27795036 DOI: https://doi.org/10.1353/foc.0.0019
Van der Walt, S., Colbert, S. C., & Varoquaux, G. (2011). The NumPy array: A structure for efficient numerical computation. Computing in Science & Engineering, 13(2), 22–30. https://doi.org/10.1109/MCSE.2011.37 DOI: https://doi.org/10.1109/MCSE.2011.37
Van Rossum, G., & Drake, F. L. (2009). Python 3. Python Software Foundation. https://www.python.org/ DOI: https://doi.org/10.1007/978-1-4302-2758-8_17
Vapnik, V. N. (1995). The nature of statistical learning theory. Springer. https://doi.org/10.1007/978-1-4757-2440-0 DOI: https://doi.org/10.1007/978-1-4757-2440-0
Verdine, B. N., Irwin, C. M., Golinkoff, R. M., & Hirsh-Pasek, K. (2014). Contributions of executive function and spatial skills to preschool mathematics achievement. Journal of Experimental Child Psychology, 126, 37–51. https://doi.org/10.1016/j.jecp.2014.02.012 DOI: https://doi.org/10.1016/j.jecp.2014.02.012
Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes (Vol. 86). Harvard University Press.
Wai, J., Lubinski, D., & Benbow, C. P. (2009). Spatial ability for STEM domains: Aligning over 50 years of cumulative psychological knowledge solidifies its importance. Journal of Educational Psychology, 101(4), 817–835. https://doi.org/10.1037/a0016127 DOI: https://doi.org/10.1037/a0016127
Zhang, P., Wu, H.-N., Chen, R.-P., Dai, T., Meng, F.-Y., & Wang, H.-B. (2020). A critical evaluation of machine learning and deep learning in shield-ground interaction prediction. Tunnelling and Underground Space Technology, 106, 103593. https://doi.org/10.1016/j.tust.2020.103593 DOI: https://doi.org/10.1016/j.tust.2020.103593
Downloads
Objavljeno
Kako citirati
Broj časopisa
Sekcija
Licenca
Sva prava zadržana (c) 2026 Ivana Isailović, Isidora Gatarić, Ilija Milovanović

Ovaj rad je pod Creative Commons Autorstvo 4.0 Internacionalna licenca.
