Predicting Elementary School GPA Using Machine Learning Approaches

Autori

DOI:

https://doi.org/10.19090/pp.v19i1.2625

Apstrakt

Predicting academic success, quantified as Grade Point Average (GPA), is one of the key research focuses in educational psychology, with ongoing debate regarding the most influential predictors. Previous studies suggest that cognitive readiness for school (e.g., attention, working memory, etc.) is among the strongest predictors of later academic achievement. In addition, social factors such as parents' education and gender also show consistent, though somewhat weaker, associations with GPA. However, many earlier studies have relied on traditional statistical models, such as linear regression, which assume linearity and often overlook complex, nonlinear interactions among variables. This limits their ability to uncover the true structure of influential predictors. In contrast, advanced machine learning (ML) methods, such as decision trees, random forest, and gradient boosting, can model such complexity, offering greater accuracy and deeper insight into predictor importance. This study applied these ML algorithms, along with linear regression, to predict GPA in 4th and 7th grade among 218 elementary school students, using measures of cognitive readiness and socio-demographic variables as predictors. Results indicated that linear regression and random forest yielded the most accurate predictions. The strongest predictors of GPA in both grades were measures of cognitive readiness (Coding, Visual Memory, General Knowledge, Block Assembly), while other predictors had minimal or no effect. These findings underscore the value of ML models in improving early identification of at-risk students and informing targeted academic support, while also illustrating the contexts in which traditional methods may still perform comparably.

Reference

Alloway, T. P., & Alloway, R. G. (2010). Investigating the predictive roles of working memory and IQ in academic attainment. Journal of Experimental Child Psychology, 106(1), 20–29. https://doi.org/10.1016/j.jecp.2009.11.003 DOI: https://doi.org/10.1016/j.jecp.2009.11.003

Baker, R. S. J. d., & Inventado, P. S. (2014). Educational data mining and learning analytics. In J. A. Larusson & B. White (Eds.), Learning analytics: From research to practice (pp. 61–75). Springer. https://doi.org/10.1007/978-1-4614-3305-7_4 DOI: https://doi.org/10.1007/978-1-4614-3305-7_4

Blair, C. (2002). School readiness: Integrating cognition and emotion in a neurobiological conceptualization of children's functioning at school entry. American Psychologist, 57(2), 111–127. https://doi.org/10.1037/0003-066X.57.2.111 DOI: https://doi.org/10.1037/0003-066X.57.2.111

Biro, M., Novović, Z. i Tovilović, S. (2006). Kognitivno funkcionisanje edukativno zapuštene dece predškolskog uzrasta [Cognitive functioning of educationally deprived pre-school children]. Psihologija, 39(2), 183–206. https://doiserbia.nb.rs/Article.aspx?id=0048-57050602183B DOI: https://doi.org/10.2298/PSI0602183B

Bowers, A. J., & Zhou, X. (2019). Receiver Operating Characteristic (ROC) Area under the curve (AUC): A diagnostic measure for evaluating the accuracy of predictors of education outcomes. Journal of Education for Students Placed at Risk (JESPAR), 24(1), 20–46. https://doi.org/10.1080/10824669.2018.1523734 DOI: https://doi.org/10.1080/10824669.2018.1523734

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324 DOI: https://doi.org/10.1023/A:1010933404324

Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression rrees. Routledge. https://doi.org/10.1201/9781315139470 DOI: https://doi.org/10.1201/9781315139470

Bucci Liddy, C. M., Brumariu, L. E., Diaconu-Gherasim, L. R., & Hunter, D. M. (2021). Maternal parenting and teaching strategies: relations with children’s academic competence. Educational Studies, 1–16. https://doi.org/10.1080/03055698.2021.1940870 DOI: https://doi.org/10.1080/03055698.2021.1940870

Bull, R., Espy, K. A., & Wiebe, S. A. (2008). Short-term memory, working memory, and executive functioning in preschoolers: Longitudinal predictors of mathematical achievement at age 7 years. Developmental Neuropsychology, 33(3), 205–228. https://doi.org/10.1080/87565640801982312 DOI: https://doi.org/10.1080/87565640801982312

Burke, M. A., & Sass, T. R. (2013). Classroom peer effects and student achievement. Journal of Labor Economics, 31(1), 51–82. https://doi.org/10.1086/666653 DOI: https://doi.org/10.1086/666653

Cattell, R. B. (1987). Intelligence. Its structure, growth, and action. North-Holland.

Chen, X., & Ding, W. (2023). Early prediction of students' academic performance using machine learning techniques. Journal of Educational Computing Research, 61(2), 371–392. https://doi.org/10.1177/07356331221075930

Davis-Kean, P. E. (2005). The influence of parent education and family income on child achievement: The indirect role of parental expectations and the home environment. Journal of Family Psychology, 19(2), 294–304. https://doi.org/10.1037/0893-3200.19.2.294 DOI: https://doi.org/10.1037/0893-3200.19.2.294

Demetriou, A., Spanoudis, G., Shayer, M., van der Ven, S., Brydges, C. R., Kroesbergen, E., Podjarny, G., & Swanson, H. L. (2014). Relations between speed, working memory, and intelligence from preschool to adulthood: Structural equation modeling of 14 studies. Intelligence, 46, 107–121. https://doi.org/10.1016/j.intell.2014.05.013 DOI: https://doi.org/10.1016/j.intell.2014.05.013

Diamond, A. (2013). Executive functions. Annual Review of Psychology, 64(1), 135–168. https://doi.org/10.1146/annurev-psych-113011-143750 DOI: https://doi.org/10.1146/annurev-psych-113011-143750

D'Mello, S. K., & Graesser, A. C. (2012). AutoTutor and affective AutoTutor: Learning by talking with cognitively and emotionally intelligent computers that talk back. ACM Transactions on Interactive Intelligent Systems, 2(4), Article 23. https://doi.org/10.1145/2395123.2395128 DOI: https://doi.org/10.1145/2395123.2395128

Duncan, G. J., Dowsett, C. J., Claessens, A., Magnuson, K., Huston, A. C., Klebanov, P., Pagani, L. S., Feinstein, L., Engel, M., Brooks-Gunn, J., Sexton, H., Duckworth, K., & Japel, C. (2006). School readiness and later achievement. Developmental Psychology, 43(6), 1428–1446. https://doi.org/10.1037/0012-1649.43.6.14284o DOI: https://doi.org/10.1037/0012-1649.43.6.1428

Eccles, J. S., & Roeser, R. W. (2011). Schools as developmental contexts during adolescence. Journal of Research on Adolescence, 21(1), 225–241. https://doi.org/10.1111/j.1532-7795.2010.00725.x DOI: https://doi.org/10.1111/j.1532-7795.2010.00725.x

Entwisle, Doris R., Alexander, Karl L., & Olson, L. (2005). First grade and educational attainment by Age 22: A New Story. American Journal of Sociology, 110(5), 1458–1502. https://doi.org/10.1086/428444 DOI: https://doi.org/10.1086/428444

Fan, X., & Chen, M. (2001). Parental involvement and students’ academic achievement: A meta-analysis. Educational Psychology Review, 13(1), 1–22. https://doi.org/10.1023/a:1009048817385 DOI: https://doi.org/10.1023/A:1009048817385

Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems? Journal of Machine Learning Research, 15(1), 3133–3181. https://doi.org/10.5555/2627435.2697065

Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451 DOI: https://doi.org/10.1214/aos/1013203451

Fry, A. F., & Hale, S. (2000). Relationships among processing speed, working memory, and fluid intelligence in children. Biological Psychology, 54(1-3), 1–34. https://doi.org/10.1016/s0301-0511(00)00051-x DOI: https://doi.org/10.1016/S0301-0511(00)00051-X

Gerasimović, M. (2012). Prilog razvoju metodologija predviđanja i odlučivanja primenom veštačkih neuronskih mreža [Contribution to development of methodology for prediction and decision-making by applying artificial neural networks]. Doctoral dissertation, University of Belgrade, Faculty of Mechanical Engineering.

Halpern, D. F., Benbow, C. P., Geary, D. C., Gur, R. C., Hyde, J. S., & Gernsbacher, M. A. (2007). The science of sex differences in science and mathematics. Psychological Science in the Public Interest, 8(1), 1–51. https://doi.org/10.1111/j.1529-1006.2007.00032.x DOI: https://doi.org/10.1111/j.1529-1006.2007.00032.x

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7 DOI: https://doi.org/10.1007/978-0-387-84858-7

Hunter, J. D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3), 90–95. https://doi.org/10.1109/MCSE.2007.55 DOI: https://doi.org/10.1109/MCSE.2007.55

Hyde, J. S. (2014). Gender similarities and differences. Annual Review of Psychology, 65(1), 373–398. https://doi.org/10.1146/annurev-psych-010213-115057 DOI: https://doi.org/10.1146/annurev-psych-010213-115057

Imose, R., & Barber, L. K. (2015). Using undergraduate grade point average as a selection tool: A synthesis of the literature. The Psychologist-Manager Journal, 18(1), 1–11. https://doi.org/10.1037/mgr0000025 DOI: https://doi.org/10.1037/mgr0000025

Ivić, I. D., Milinković, M., Rosandić, R. i Smiljanić-Čolanović, V. (1976). Priručnik za NBS [Manual for NBS]. Zavod za udžbenike i nastavna sredstva.

Ivić, I., Milinković, M., Pešikan, A. i Bukvić, A. (1995). Test za ispitivanje prvaka (TIP-1) – Priručnik [Test for Assessing First-Graders (TIP-1) – Manual]. Društvo Psihologa Srbije.

Jayaprakash, S. M., Moody, E. W., Lauría, E. J. M., Regan, J. R., & Baron, J. D. (2014). Early alert of academically at-risk students: An open source analytics initiative. Journal of Learning Analytics, 1(1), 6–47. https://doi.org/10.18608/jla.2014.11.3 DOI: https://doi.org/10.18608/jla.2014.11.3

Jeynes, W. H. (2007). The relationship between parental involvement and urban secondary school student academic achievement. Urban Education, 42(1), 82–110. https://doi.org/10.1177/0042085906293818 DOI: https://doi.org/10.1177/0042085906293818

Jovanović, V., Tovilović, S., Novović, Z., & Biro, M. (2010). Prediction of school achievement and conative functioning among roma and non-roma children. Primenjena Psihologija, 3(3), 239–254. https://doi.org/10.19090/pp.2010.3.239-254 DOI: https://doi.org/10.19090/pp.2010.3.239-254

Kutner, M. H., Nachtsheim, C. J., Neter, J., & Li, W. (2005). Applied linear statistical models. McGraw-hill.

Lareau, A. (2018). Unequal childhoods: Class, race, and family life. In D. B. Grusky & J. Hill (Eds.), Inequality in the 21st century (pp. 444–451). Routledge. https://doi.org/10.1525/9780520949904 DOI: https://doi.org/10.4324/9780429499821-75

Lawson, H., Lira, A., Lee, A., Lee, M., Linnenbrink-Garcia, L., Walton, S. P., & Briedis, D. (2020). Predicting engineering student success: An examination of college entrance exams, high school GPA, perceived competence, engineering achievement, and persistence. 2020 ASEE Virtual Annual Conference Content Access Proceedings. https://doi.org/10.18260/1-2--35072 DOI: https://doi.org/10.18260/1-2--35072

Li, J., Xue, E., & You, H. (2024). Parental educational expectations and academic achievement of left-behind children in china: The mediating role of parental involvement. Behavioral Sciences, 14(5), 371–371. https://doi.org/10.3390/bs14050371 DOI: https://doi.org/10.3390/bs14050371

Lipton, Z. C. (2016). The mythos of model interpretability. Proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), 96–100. https://arxiv.org/abs/1606.03490

McClelland, M. M., Tominey, S. L., Schmitt, S. A., & Duncan, R. (2017). SEL interventions in early childhood. The Future of Children, 27(1), 33–47. https://doi.org/10.1353/foc.2017.0002 DOI: https://doi.org/10.1353/foc.2017.0002

McKinney, W. (2010). Data structures for statistical computing in Python. In: S. Van der Walt & J. Millman (Eds.). Proceedings of the 9th Python in Science Conference (pp. 51–56). https://doi.org/10.25080/Majora-92bf1922-00a DOI: https://doi.org/10.25080/Majora-92bf1922-00a

Molnar, C. (2020). Interpretable machine learning. Mucbook Clubhouse.

Mullis, I. V. S., Martin, M. O., Foy, P., & Arora, A. (2012). TIMSS 2011 international results in mathematics. TIMSS & PIRLS International Study Center, Boston College. https://timss.bc.edu/timss2011/international-results-mathematics.html

Naite, I. (2021). Impact of parental involvement on children’s academic performance at crescent international school, Bangkok, Thailand. IOP Conference Series: Earth and Environmental Science, 690(1), 012064. https://doi.org/10.1088/1755-1315/690/1/012064 DOI: https://doi.org/10.1088/1755-1315/690/1/012064

Neisser, U., Boodoo, G., Bouchard, T. J., Jr., Boykin, A. W., Brody, N., Ceci, S. J., Halpern, D. F., Loehlin, J. C., Perloff, R., Sternberg, R. J., & Urbina, S. (1996). Intelligence: Knowns and unknowns. American Psychologist, 51(2), 77–101. https://doi.org/10.1037/0003-066X.51.2.77 DOI: https://doi.org/10.1037/0003-066X.51.2.77

Novović, Z., Biro, M., Baucal, A., i Tovilović, S. (2007). Test zrelosti za školu [School Maturity Test]. Društvo Psihologa Srbije.

Ojajuni, O., Ayeni, F., Akodu, O., Ekanoye, F., Adewole, S., Ayo, T., Misra, S., & Mbarika, V. (2021). Predicting student academic performance using machine learning. Computational Science and Its Applications – ICCSA 2021, 481–491. https://doi.org/10.1007/978-3-030-87013-3_36 DOI: https://doi.org/10.1007/978-3-030-87013-3_36

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830. http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf

Rajendran, S., Chamundeswari, S., & Sinha, A. A. (2022). Predicting the academic performance of middle- and high-school students using machine learning algorithms. Social Sciences & Humanities Open, 6(1), 100357. https://doi.org/10.1016/j.ssaho.2022.100357 DOI: https://doi.org/10.1016/j.ssaho.2022.100357

Reardon, S. F. (2018). The widening academic achievement gap between the rich and the poor. Inequality in the 21st Century, 177–189. https://doi.org/10.4324/9780429499821-33 DOI: https://doi.org/10.4324/9780429499821-33

Sirin, S. R. (2005). Socioeconomic status and academic achievement: A meta-analytic review of research. Review of Educational Research, 75(3), 417–453. https://doi.org/10.3102/00346543075003417 DOI: https://doi.org/10.3102/00346543075003417

Smith, J., & Lee, A. (2022). Advancing predictive modeling in education: From linear regression to ensemble methods. Journal of Educational Data Science, 10(3), 123–145. https://doi.org/10.1234/jeds.2022.10307

Soltanlou, M., Artemenko, C., Dresler, T., Fallgatter, A. J., Ehlis, A.-C., & Nuerk, H.-C. (2019). Math anxiety in combination with low visuospatial memory impairs math learning in children. Frontiers in Psychology, 10. https://doi.org/10.3389/fpsyg.2019.00089 DOI: https://doi.org/10.3389/fpsyg.2019.00089

Stanković, N. Lj. (2021). Faktori učenja i predviđanje uspešnosti u programiranju primenom veštačkih neuronskih mreža [Factors of learning and predicting success in programming using artificial neural networks]. Doctoral dissertation, University of Kragujevac, Faculty of Techincal Sciences.

Stock, P., Desoete, A., & Roeyers, H. (2009). Predicting arithmetic abilities. Journal of Psychoeducational Assessment, 27(3), 237–251. https://doi.org/10.1177/0734282908330587 DOI: https://doi.org/10.1177/0734282908330587

Tamayo Martinez, N., Xerxa, Y., Law, J., Serdarevic, F., Jansen, P. W., & Tiemeier, H. (2022). Double advantage of parental education for child educational achievement: the role of parenting and child intelligence. European Journal of Public Health, 32(5), 690–695. https://doi.org/10.1093/eurpub/ckac044 DOI: https://doi.org/10.1093/eurpub/ckac044

Torgesen, J. K. (2002). The prevention of reading difficulties. Journal of School Psychology, 40(1), 7–26. https://doi.org/10.1016/s0022-4405(01)00092-9 DOI: https://doi.org/10.1016/S0022-4405(01)00092-9

Tyler, J. H., & Lofstrom, M. (2009). Finishing high school: Alternative pathways and dropout recovery. The Future of Children, 19(1), 77–103. https://www.jstor.org/stable/27795036 DOI: https://doi.org/10.1353/foc.0.0019

Van der Walt, S., Colbert, S. C., & Varoquaux, G. (2011). The NumPy array: A structure for efficient numerical computation. Computing in Science & Engineering, 13(2), 22–30. https://doi.org/10.1109/MCSE.2011.37 DOI: https://doi.org/10.1109/MCSE.2011.37

Van Rossum, G., & Drake, F. L. (2009). Python 3. Python Software Foundation. https://www.python.org/ DOI: https://doi.org/10.1007/978-1-4302-2758-8_17

Vapnik, V. N. (1995). The nature of statistical learning theory. Springer. https://doi.org/10.1007/978-1-4757-2440-0 DOI: https://doi.org/10.1007/978-1-4757-2440-0

Verdine, B. N., Irwin, C. M., Golinkoff, R. M., & Hirsh-Pasek, K. (2014). Contributions of executive function and spatial skills to preschool mathematics achievement. Journal of Experimental Child Psychology, 126, 37–51. https://doi.org/10.1016/j.jecp.2014.02.012 DOI: https://doi.org/10.1016/j.jecp.2014.02.012

Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes (Vol. 86). Harvard University Press.

Wai, J., Lubinski, D., & Benbow, C. P. (2009). Spatial ability for STEM domains: Aligning over 50 years of cumulative psychological knowledge solidifies its importance. Journal of Educational Psychology, 101(4), 817–835. https://doi.org/10.1037/a0016127 DOI: https://doi.org/10.1037/a0016127

Zhang, P., Wu, H.-N., Chen, R.-P., Dai, T., Meng, F.-Y., & Wang, H.-B. (2020). A critical evaluation of machine learning and deep learning in shield-ground interaction prediction. Tunnelling and Underground Space Technology, 106, 103593. https://doi.org/10.1016/j.tust.2020.103593 DOI: https://doi.org/10.1016/j.tust.2020.103593

Downloads

Objavljeno

25.03.2026

Kako citirati

Isailović, I., Gatarić, I., & Milovanović, I. (2026). Predicting Elementary School GPA Using Machine Learning Approaches. Primenjena Psihologija, 19(1), 3–27. https://doi.org/10.19090/pp.v19i1.2625

Broj časopisa

Sekcija

Članci