Özdemir Ş.Çınar F.Coşkun Küçüközmen C.Merih K.2024-03-132024-03-1320182213-8684https://doi.org/10.1007/978-3-319-64554-4_36https://hdl.handle.net/20.500.12662/3002The rate of chronic absenteeism is important in assessing the validity of current educational practices conditions. Every student who exhibits this behavior faces the risk of failing to progress to higher level of education and/or dropping out/leaving the school. Students in this risk group represent not only a problem from an educational standpoint but also a potential and multifaceted problem with respect to participation in the economy, the development of a skilled labor force, and the ability to become well integrated into society. In the literature for Turkey, the framework of this problem was constructed using statistical methods, and it is important to analyze this problem in greater depth. The main objective of this study is therefore to employ educational data mining methods to predict cases of chronic absenteeism at high school level. The data, compiled from 2,495 students from different districts of Istanbul, was prepared for data mining operations based on the CRISP-EDM steps. The analysis process was conducted using R language and R language packages due to their flexibility and strength. The study results revealed that the random forest algorithm is able to establish a more successful model, while the C4.5 algorithm more accurately describes the problem in terms of decision rules. © 2018, Springer International Publishing AG, part of Springer Nature.eninfo:eu-repo/semantics/closedAccessChronic absenteeismCRISP-EDM (cross-industry standard process for educational data mining)Educational data miningMachine learningRPredicting chronic absenteeism using educational data mining methodsBook Chapter10.1007/978-3-319-64554-4_362-s2.0-85059097061526N/A511