Abstract
In this chapter we present a case study that illustrates how educational data mining (EDM) is able to support the implementation of government policies and assist the labor of public institutions. Specifically, we highlight the current educational reforms in Mexico and focus on one of its main goals: to enhance the education quality. In response, a valuable data source is mined to discover interesting findings what students think about education, family, teachers, and their surroundings. Thus, a brief description of the legal and social context is given, as well as a profile of the students opinions expressed in a national survey is shaped. Moreover, a framework to build an EDM approach is outlined and a sample of the mined results is stated. As a result of the findings generated by the EDM approach, an interpretation is provided to tailor a conceptual view of the observations made by students, as well as some initiatives to deal with the findings. The work concludes with an exposition of the reasons for presenting this kind of work, a comment on the research fulfilled, a viewpoint of the education in Mexico, and some suggestions to support State polices to enhance education.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Some acronyms maintain the initials written in Spanish to preserve their national identity.
- 2.
The symbol/separates the category from its subcategory.
- 3.
Waikato Environment for Knowledge Analysis.
Abbreviations
- AIWBES:
-
Adaptive and intelligent web-based educational systems
- CBIS:
-
Computer-based information systems
- CNTE:
-
National Coordination of Workers of the Education
- CPEUM:
-
Politic Constitution of the Mexican United States
- CV:
-
Confidence value
- DK:
-
Domain knowledge
- DM:
-
Data mining
- EDM:
-
Educational data mining
- EM:
-
Expectation maximization
- ENLACE:
-
National Evaluation of the Academic Achievement of Scholar Centers
- ES:
-
Educational systems
- EXCALE:
-
Exams for the Quality and Educative Achievements
- INEE:
-
National Institute for Educative Evaluation
- ITS:
-
Intelligent tutoring systems
- KDD:
-
Knowledge discovery in databases
- LCMS:
-
Learning content management systems
- LMS:
-
Learning management systems
- PISA:
-
Program for International Student Assessment
- SAV:
-
Statistical Package for the Social Sciences data document
- SEP:
-
Secretary of Public Education
- SNTE:
-
National Union of Workers of the Education
- SPS:
-
Statistical Package for the Social Sciences syntax
- SPSS:
-
Statistical Package for the Social Sciences
- SQL:
-
Structured Query Language
- TXT:
-
Text data file
- USA:
-
United States of America
- WBC:
-
Web-based courses
- XLSX:
-
Excel extended
References
Vlahos, G.E., Ferratt, T.W., Knoepfle, G.: The use of computer-based information systems by German managers to support decision making. J. Inf. Manag 41(6), 763–779 (2004)
Lucey, T.: Management Information, 9th edn. Thomson, London (2005)
Power, D.J., Sharda, R.: Decision support systems. In: Shimon, N.Y. (ed.) Springer Handbook of Automation, pp. 1539–1546. Springer, Heidelberg (2009)
Mazón, J.N., Trujillo, J.: An MDA approach for the development of data warehouses. J. Decis. Support Syst. 45(1), 41–58 (2008)
Alkharouf, N.W., Curtis-Jamison, D., Matthews, B.F.: Online analytical processing: A fast and effective data mining tool for gene expression databases. J. Biomed. Biotechnol. 2005(2), 181–188 (2005)
Turban, E., Sharda, R., Delen, D.: Decision Support and Business Intelligence Systems, 9th edn. Prentice Hall, New Jersey (2010)
Wille, R.: Why can concept lattices support knowledge discovery in databases? J. Exp. Theor. Artif. Intell. 14(2–3), 81–92 (2002)
Dunham, M.H.: Data Mining Introductory and Advanced Topics. Pearson Education, Harlow (2006)
Han, J., Kamber, M., Pei, J.: Data Mining Concepts and Techniques, 2nd edn. Morgan Kaufmann, California (2006)
Hill, T., Lewicki, P.: Statistics: Methods and Applications: A Comprehensive Reference for Science, Industry, and Data Mining. StatSoft, Oklahoma (2006)
Karegar, M., Isazadeh, A., Fartash, F., Saderi, T., Navin, A.H.: Data-mining by probability-based patterns. In: Luzar-stiffler, V. (ed.) 30th International Conference on Information Technology Interfaces, pp. 353–360. IEEE, Cavtat (2008)
Soukup, T., Davidson, I.: Visual Data Mining: Techniques and Tools for Data Visualization and Mining. Wiley, New Jersey (2002)
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann, California (2011)
McCarthy, P.M., Boonthum-Denecke, C.H. (eds.): Applied Natural Language: Identification, Investigation and Resolution. Information Science Reference, Pennsylvania (2011)
Peng, Y., Kou, G., Shi, Y., Chen, Z.: A descriptive framework for the field of data mining and knowledge discovery. Int. J. Inf. Technol. Decis. Making 7(4), 639–682 (2008)
Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Massachusetts (2001)
Berkhin, P.: Survey of clustering data mining techniques. In: Kogan, J., Nicholas, C., Teboulle, M. (eds.) Grouping Multidimensional Data, pp. 25–71. Springer, Heidelberg (2006)
Hardoon, D.R., Shawe-Taylor, J., Szedmak, S.: Canonical correlation analysis: An overview with application to learning methods. J. Neural Comput. 16(12), 2639–2664 (2004)
Hong, T.P., Lin, K.Y., Wang, S.L.: Fuzzy data mining for interesting generalized association rules. J. Fuzzy Sets syst. 138(2), 255–269 (2003)
Pinto, H., Han, J., Pei, J., Wang, K., Chen, Q., Umeshwar, D.: Multidimensional sequential pattern mining. In: Paques, H., Liu, L., Grossman, D. (eds.) 10th International Conference on Information and Knowledge Management, pp. 81–88. ACM, New York (2001)
Chau, M., Cheng, R., Kao, B., Ng, J.: Uncertain data mining: An example in clustering location data. In: Ng, W.K., Kitsuregawa, M., Li, J., Chang, K. (eds.) Advances in Knowledge Discovery and Data Mining, LNCS, vol. 3918, pp. 199–204. Springer, Heidelberg (2006)
Wu, Z., Li, C.H.: L0-constrained regression for data mining. In: Ng, W.K., Kitsuregawa, M., Li, J., Chang, K. (eds.) Advances in Knowledge Discovery and Data Mining, LNCS, vol. 4426, pp. 981–988. Springer, Heidelberg (2007)
Corteza, P., Cerdeirab, A., Almeidab, F., Matos, T., Reisa, J.: Modeling wine preferences by data mining from physicochemical properties. J. Decis. Support Syst. 47(4), 547–553 (2009)
Du, W., Han, Y.S., Chen, S.: Privacy-preserving multivariate statistical analysis: Linear regression and classification. In: 4th SIAM International Conference on Data Mining, pp. 222–233. Society for Industrial and Applied Mathematics, Lake Buena Vista (2004)
Komarek, P.: Logistic Regression for Data Mining and High-dimensional Classification. Carnegie Mellon University, Pennsylvania (2004)
Pang, S., Ozawa, S., Kasabov, N.: Incremental linear discriminant analysis for classification of data streams. J. IEEE Trans. Syst. Man Cybern. B Cybern. 35(5), 905–914 (2005)
Wang, L. (ed.): Support Vector Machines: Theory and Applications. Springer, Heidelberg (2005)
Glick, M., Klon, A.E., Acklin, P., Davies, J.W.: Enrichment of extremely noisy high-throughput screening data using a naive Bayes classifier. J. Biomol. Screen. 9(1), 32–36 (2004)
Rogan, J., Miller, J., Stow, D., Franklin, J., Levien, L., Fisher, C.: Land-cover change monitoring with classification trees using Landsat TM and ancillary data. J. Photogram. Eng. Remote Sens. 69(7), 793–804 (2003)
Alves, R.T., Delgado, M.R., Lopes, H.S., Freitas, A.A.: An artificial immune system for fuzzy-rule induction in data mining. In: Yao, X., Burke, E.K., Lozano, J.A., Smith, J., Merelo-Guervós, J.J., Bullinaria, J.A., Rowe, M.J.E., Tiňo, P., Kabán, A., Schwefel, H.P. (eds.) Parallel Problem Solving from Nature-PPSN VIII, LNCS, vol. 3342, pp. 1011–1020. Springer, Heidelberg (2004)
Brighton, H., Mellish, C.: Advances in instance selection for instance-based learning algorithms. J. Data Min. Knowl. Disc 6, 153–172 (2002)
Wu, X., Kumar, V., Quinlan, J.S., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G.J., Ng, A., Liu, B., Yu, P.S., Zhou, Z.H., Steinbach, M., Hand, D.J., Steinberg, D.: Top 10 algorithms in data mining. J. Knowl. Inf. Syst. 14(1), 1–37 (2008)
Williams, N., Zander, S., Armitage, G.: A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classification. ACM SIGCOMM Comput. Commun. Rev. 36(5), 5–16 (2006)
McLachlan, G.J., Krishnan, T.: The EM Algorithm and Extension, 2nd edn. Wiley, New Jersey (2007)
Yao, Z., Liu, P., Lei, L.,Yin, Y.: R-C4.5 decision tree model and its applications to health care dataset. In: International Conference on Services Systems and Services Management, vol. 2, pp. 1099–1103. IEEE, Chongqing (2005)
Waheed, T., Bonnell, R.B., Prasher, S.O., Paulet, E.: Measuring performance in precision agriculture: CART—a decision tree approach. J. Agric. Water Manag. 84(1–2), 173–185 (2006)
Lavrac, N., Kavsek, B., Flach, P., Todorovski, L.: Subgroup discovery with CN2-SD. J. Mach. Learn. Res. 5, 153–188 (2004)
Lazcorreta, E., Botella, F., Fernández-Caballero, A.: Towards personalized recommendation by two-step modified Apriori data mining algorithm. J. Expert Syst. Appl. 35(3), 1422–1429 (2008)
Zhang, M.L., Zhou, Z.H.: ML-KNN: A lazy learning approach to multi-label learning. J. Pattern Recognit. 40(7), 2038–2048 (2007)
Jain, A.K.: Data clustering: 50 years beyond K-means. J. Pattern Recognit. Lett. 31(8), 651–666 (2010)
Burdick, D., Calimlim, M., Flannick, J., Johannes, G., Tomi, Y.: MAFIA: A maximal frequent item set algorithm. IEEE Trans. Knowl. Data Eng. 17(11), 1490–1504 (2005)
Dwork, C.: Differential privacy: A survey of results. In: Agrawal, M., Zu, D., Duan, Z., Li, A. (eds.) Theory and Applications of Models of Computation, LNAI, vol. 4978, pp. 1–19. Springer, Heidelberg (2008)
Lamma, E., Mello, P., Riguzzi, F., Storari, S.: Applying inductive logic programming to process mining. In: Blockeel, H., Ramon, J., Shavlik, J., Tadepalli, P. (eds.) Inductive Logic Programming, LNCS, vol. 4894, pp. 132–146. Springer, Heidelberg (2008)
Evans, M.P.: Analysing Google rankings through search engine optimization data. J. Internet Res. 17(1), 21–37 (2007)
Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitter rank: Finding topic-sensitive influential twitters. In: 3rd International Conference on Web Search and Data Mining, pp. 261–270 (2010)
Vialardi-Sacin, C., Bravo-Agapito, J., Shafti, L., Ortigosa, A.: Recommendation in higher education using data mining techniques. In: Barnes, T., Desmarais, M., Romero, R., Ventura, S. (eds.) 2nd International Conference on Educational Data Mining, pp. 190–199. International Educational Data Mining Society, Cordoba (2009)
Anjewierden, A., Kollöffel, B., Hulshof, C.: Towards educational data mining: Using data mining methods for automated chat analysis to understand and support inquiry learning processes. In: International Workshop on Applying Data Mining in e-Learning, EC-TEL 2007, pp. 23–32. EC-TEL, Crete (2007)
Luan, J.: Data mining and its applications in higher education. J. New Dir. Inst. Res. 113, 17–36 (2002)
Romero, C., Ventura, S.: Educational data mining: A survey from 1995 to 2005. Expert Syst. Appl. 33(1), 135–146 (2007)
Romero, C., Ventura, S.: Educational data mining: A review of the state of the art. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 40(6), 601–618 (2010)
Peña-Ayala, A.: Educational data mining: A survey and a data mining-based analysis of recent works. Expert systems with applications 41(4), doi:10.1016/j.eswa.2013.08.042 (2014)
Marsh, H.W., Roche, L.A.: Effects of grading leniency and low workload on students’ evaluations of teaching: Popular myth, bias, validity, or innocent bystanders? J. Educ. Psychol. 92(1), 202–228 (2000)
About the Students Evaluations of Teaching (USE): Institute for Teaching and Learning, University of Sidney. http://www.itl.usyd.edu.au/use/
Kim, S.M., Calvo, R.A.: Sentiment analysis in student experiences of learning. In: Baker, R.S.J.D., Merceron, A., Pavlik Jr, P.I. (eds.) 3rd International Conference on Educational Data Mining, pp. 111–120. International Educational Data Mining Society, Pittsburgh (2010)
Champaign, J., Cohen, R.: An annotations approach to peer tutoring. In: Baker, R.S.J.D., Merceron, A., Pavlik Jr, P.I. (eds.) 3rd International Conference on Educational Data Mining, pp. 231–240. International Educational Data Mining Society, Pittsburgh (2010)
Koprinska, I.: Mining assessment and teaching evaluation data of regular and advanced stream students. In: Pechenizkiy, M., Calders, T., Conati, C., Ventura, S., Romero, C., Stamper, J. (eds.) 4th International Conference on Educational Data Mining, pp. 359–360. International Educational Data Mining Society, Eindhoven (2011)
Barracosa, J., Antunes, C.: Anticipating teachers’ performance. In: KDD 2011 Workshop: Knowledge Discovery in Educational Data, pp. 77–82. San Diego (2011)
Leong, C.K., Lee, Y.H., Mark, W.K.: Mining sentiments in SMS texts for teaching evaluation. Expert Syst. Appl. 39(3), 2584–2589 (2012)
Gates, K., Wilkins, D., Conlon, S., Mossing, S., Eftink, M.: Maximizing the value of student ratings through data mining. In: Peña-Ayala, A. (ed.) Educational Data Mining: Applications and Trends. Studies in Computational Intelligence. Springer, Heidelberg (2013, in press)
Instituto Nacional de Estadística y Geografía: Population in 2010. http://www.inegi.org.mx/
Instituto Nacional de Estadística y Geografía: Population in 2010 organized by age ranges from 3 up to 19 year old. http://www3.inegi.org.mx/sistemas/sisept/default.aspx?t=medu01&s=est&c=21778
Constitución Política de los Estados Unidos Mexicanos: Constitution published by the Official Diary of the Federation on 5 Feb 1917, Deputy Chamber of the Honorarium Congress of the Union, Mexico. http://www.diputados.gob.mx/LeyesBiblio/pdf/1.pdf
Instituto Nacional de Estadística y Geografía: Students roll in 2011–2012 organized by academic level from pre-primary up to high-school. http://www3.inegi.org.mx/sistemas/sisept/default.aspx?t=medu17&s=est&c=21788
Instituto Nacional de Estadística y Geografía: Teachers inventory in 2011–2012 organized by academic level from pre-primary up to high-school. http://www3.inegi.org.mx/sistemas/sisept/default.aspx?t=medu03&s=est&c=21819
Instituto Nacional de Estadística y Geografía: Schools inventory in 2011–2012 organized by academic level from pre-primary up to high-school. http://www3.inegi.org.mx/sistemas/sisept/default.aspx?t=medu57&s=est&c=21815
Presidencia de la República de los Estados Unidos Mexicanos: Educational Reform News. http://www.presidencia.gob.mx/reforma-educativa-construyendo-la-gran-obra-colectiva-que-es-mexico/
Secretaría de Educación Pública: Constitutional Reform in Educative Subject. http://www.sep.gob.mx/
Diario Oficial de la Federación: Educational reform decree, addition of the fraction IX to the article 3 of the Politic Constitution of the Mexican United States. http://www.sep.gob.mx/work/models/sep1/pdf/promulgacion_dof_26_02_13.pdf
Cámara de Diputados del Honorable Congreso de la Unión: Bulletin number 1937 to approve the verdict to found the General Law of Education. http://www3.diputados.gob.mx/camara/005_comunicacion/a_boletines/2013_2013/agosto_agosto/22_22/1936_aprueba_camara_de_diputados_reformas_a_la_ley_general_de_educacion
Cámara de Diputados del Honorable Congreso de la Unión: Bulletin number 1937 to approve the verdict to found the Law of the National Institute for Educative Evaluation. http://www3.diputados.gob.mx/camara/005_comunicacion/a_boletines/2013_2013/agosto_agosto/22_22/1937_aprueban_diputados_dictamen_que_crea_la_ley_del_instituto_nacional_para_la_evaluacion_de_la_educacion
Sindicato Nacional de Trabajadores de la Educación: Workers of the Education. http://www.snte.org.mx/index2.php
El Universal Newspaper: SNTE surveys Workers of the Education. http://www.eluniversal.com.mx/primera/38857.html
Coordinadora Nacional de Trabajadores de la Educación: National Coordination of Workers of the Education. http://cntrabajadoresdelaeducacion.blogspot.mx/
Instituto Mexicano para la Competitividad: Initiative improve your School. http://www.mejoratuescuela.org
Consejo Mexicano de Investigación Educativa: Home page. https://www.comie.org.mx
Secretaría de Educación Pública: Parents of Family Associations and Scholar Councils for Social Participation. http://csep.sepdf.gob.mx/wcsep/default.asp?id=5
Secretaría de Educación Pública: National Evaluation of the Academic Achievement of Scholar Centers. http://www.enlace.sep.gob.mx/que_es_enlace/
Instituto Nacional para la Evaluación de la Educación. Exams for the Quality and Educative Achievements. http://www.inee.edu.mx/index.php/bases-de-datos/bases-de-datos-excale/marcos-de-referencia
Organization for Economic Co-operation and Development: Program for International Student Assessment. http://www.oecd.org/pisa/
Fayers, P.: Item response theory for psychologists. Qual. Life Res. 13(3), 715–716 (2004)
Diario Oficial de la Federación: Decree to reform the article 3 of the Politic Constitution of the Mexican United States. http://www.dof.gob.mx/nota_detalle.php?codigo=5288919&fecha=26/02/2013
Cámara de Diputados del Honorable Congreso de la Unión: General Law of Education. http://www.diputados.gob.mx/LeyesBiblio/pdf/137.pdf
El Universal Newspaper: Law of the National Institute for Educative Evaluation. http://www.dstjal.com.mx/est92/images/docs/biblioteca/Ley_Evaluacion.pdf
El Universal Newspaper: Law of Professional Academic Service. http://www.dstjal.com.mx/est92/images/docs/biblioteca/leyservicioprofesionaldocente.pdf
Noticiario de José Cárdenas Informa: Radio Fórmula 103.3 FM. CNTE is founded 1979, radio news provided on 27 Aug 2013 at 19:20. http://www.radioformula.com.mx/estaciones.asp?idEs=tele&iPos=2
Instituto Nacional para la Evaluación de la Educación: Home page. http://www.inee.edu.mx/
Instituto Nacional para la Evaluación de la Educación: Databases. http://www.inee.edu.mx/index.php/bases-de-datos
Instituto Nacional para la Evaluación de la Educación: EXCALE, middle-school third grade evaluation made in 2007–2008. http://www.inee.edu.mx/index.php/bases-de-datos/bases-de-datos-excale/excale-09-ciclo-2007-2008
Pallant, J.: SPSS Survival Manual: A Step by Step Guide to Data Analysis Using SPSS. McGraw-Hill International, Berkshire (2010)
Skulschus, M., Tittel, J., Wiederstein, M.: MS SQL Server 2012 (4): Data Mining, Analyse und Multivariate Verfahren. Comelio GmbH, Munchen (2013)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explor 11(1), 10–18 (2009)
El Universal Newspaper: Census reports 33.6 millions of Mexicans living in USA. http://archivo.e-consulta.com/2013/index.php/nacionales/item/reporta-censo-336-millones-de-mexicanos-en-estados-unidos
Noticiario de Joaquín López Dóriga: Radio Fórmula 103.3 FM. 1.3 millions of students without classes, radio news provided on 26 Aug 2013 at 14:00. http://www.radioformula.com.mx/estaciones.asp?idEs=103-3-fm&iPos=0
Acknowledgments
The first author gives testimony of the strength given by his Father, Brother Jesus, and Helper, as part of the research projects of World Outreach Light to the Nations Ministries (WOLNM). Moreover, a mention is given to Mr. Lawrence Whitehill Waterson, a native British English speaker who tunes the manuscript, as well as Aldo Ramírez Arellano for his edition support. In addition, a special mention is given to the Instituto Nacional para la Evaluación de la Educación for the publication of the source data used in this case study for exclusively research purposes. Finally, this research holds a partial support from grants given by: CONACYT-SNI-36453, CONACYT 118862, CONACYT 118962-162727, IPN-PIFI/20131093-28/1398, CONACYT 360532/289763. and IPN-COFAA-SIBE DEBEC/647-391/2013.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Peña-Ayala, A., Cárdenas, L. (2014). How Educational Data Mining Empowers State Policies to Reform Education: The Mexican Case Study. In: Peña-Ayala, A. (eds) Educational Data Mining. Studies in Computational Intelligence, vol 524. Springer, Cham. https://doi.org/10.1007/978-3-319-02738-8_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-02738-8_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02737-1
Online ISBN: 978-3-319-02738-8
eBook Packages: EngineeringEngineering (R0)