Skip to main content

Advertisement

Log in

A practical, iterative framework for secondary data analysis in educational research

  • Published:
The Australian Educational Researcher Aims and scope Submit manuscript

Abstract

Secondary data analysis in educational research has been an established research method for many years. Yet, few publications outline the “how to” of undertaking the process. This paper presents an analysis framework suitable for undertaking secondary data analysis within the field of education. The framework is a modification and an application of a pre-existing data mining research process known as Knowledge Discovery in Databases (KDD). The KDD process is interactive and generative and involves a series of sequential steps and decision-making processes. The modified KDD process is described to show how it supports secondary data analysis and provides an example of how the modified KDD process was applied across a secondary analysis in mathematics education. This paper provides educational researchers with a practical and iterative framework through which to undertake secondary analysis that enhances flexibility and encourages depth and saturation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

(adapted from Fayyad et al. 1996, p. 41)

Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Attwood, G., & Croll, P. (2011). Attitudes to school and intentions for educational participation: An analysis of data from the Longitudinal Survey of Young People in England. International Journal of Research and Method in Education,34(3), 269–287.

    Article  Google Scholar 

  • Australian Government. (2013). Australian Research Council Discovery Projects—Instructions to applicants for funding commencing in 2015. Resource document. Australian Research Council. Retreived from http://www.arc.gov.au/pdf/DP15/DP15_ITA.pdf.

  • Bishop, L., & Kuula-Luumi, A. (2016). Revisiting qualitative data reuse: A decade on. SAGE Open. https://doi.org/10.1177/2158244016685136.

    Article  Google Scholar 

  • Bobis, J., Shore, S., Bennett, D., Bennett, S., Chan, P., Harrison, N., et al. (2013). Educational research in Australia: Where is it conducted? Australian Educational Researcher,40(4), 453–471.

    Article  Google Scholar 

  • Borgman, C. L. (2012). The conundrum of sharing research data. Journal of the American Society for Information Science and Technology,63(6), 1059–1078.

    Article  Google Scholar 

  • Brewer, E. W. (2007). Secondary data analysis. In N. J. Salkind & K. Rasmussen (Eds.), Encyclopedia of measurement and statistics (pp. 870–877). Thousand Oaks: SAGE Publications Inc.

    Google Scholar 

  • Burstein, L. (1978). Secondary analysis: An important resource for educational research and evaluation. Educational Researcher,7(5), 9–12.

    Article  Google Scholar 

  • Cabena, P., Hadjinian, P., Stadler, R., Verhees, J., & Zanasi, A. (1998). Discovering data mining: From concept to implementation. Englewood Cliffs, NJ: Prentice Hall.

    Google Scholar 

  • Carter, J., Noble, S., Russell, A., & Swanson, E. (2011). Developing statistical literacy using real-world data: Investigating socioeconomic secondary data resources used in research and teaching. International Journal of Research and Method in Education,34(3), 223–240.

    Article  Google Scholar 

  • Chen, M. S., Han, J., & Yu, P. S. (1996). Data mining: An overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering, 8(6), 866–883.

    Article  Google Scholar 

  • Cheng, H. G., & Phillips, M. R. (2014). Secondary analysis of existing data: Opportunities and implementation. Shanghai Archives of Psychiatry,26(6), 371–375.

    Google Scholar 

  • Cheshire, L. (2009). Archiving qualitative data: Prospects and challenges of data preservation and sharing among Australian qualitative researchers. Discussion Paper. http://www.assda.edu.au/forms/AQuAQualitativeArchiving_DiscussionPaper_FinalNov09.pdf.

  • Colonna, L. (2013). Taxonomy and classification of data mining. SMU Science and Technology Law Review,16, 309–369.

    Google Scholar 

  • Corti, L., & Fielding, N. (2017). Opportunities from the digital revolution: Implications for researching, publishing and consuming qualitative research. SAGE Open. https://doi.org/10.1177/2158244016678912.

    Article  Google Scholar 

  • Devine, P. (2003). Secondary data analysis. In R. L. Miller & J. D. Brewer (Eds.), The A-Z of social research (pp. 286–289). London: SAGE Publications Ltd.

    Google Scholar 

  • Economic and Social Research Council (ESRC). (2010). ESRC Research Data Policy (updated 2013). Resource document. Retrieved from http://www.esrc.ac.uk/_images/Research_Data_Policy_2010_tcm8-4595.pdf.

  • Fayyad, U. M. (1996). Data mining and knowledge discovery: Making sense out of data. IEEE Intelligent Systems,11(5), 20–25.

    Google Scholar 

  • Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine,17(3), 37–54.

    Google Scholar 

  • Goodwin, J. (2012). Editor’s introduction: Secondary sources and secondary analysis. In J. Goodwin (Ed.), SAGE secondary data analysis (Vol. 1, pp. xix–xxxvii). London: SAGE.

    Google Scholar 

  • Hakim, C. (1982). Secondary analysis and the relationship between official and academic social research. Sociology,16(1), 12–28.

    Article  Google Scholar 

  • Hammersley, M. (2010). Can we re-use qualitative data via secondary analysis? Notes on some terminology and substantive issues. Sociological Research Online. https://doi.org/10.5153/sro.2076.

    Article  Google Scholar 

  • Hand, D. J. (1998). Data mining: Statistics and more? The American Statistician,52(2), 112–118.

    Google Scholar 

  • Hofferth, S. L. (2005). Secondary data analysis in family research. Journal of Marriage and Family,67(4), 891–907.

    Article  Google Scholar 

  • Houghton, J., & Gruen, N. (2014). Open research data: Report to the Australian National Data Service (ANDS). Resource document. Retrieved from http://ands.org.au/resource/open-research-data-report.pdf.

  • Irwin, S., Bornat, J., & Winterton, M. (2012). Timescapes secondary analysis: Comparison, context and working across data sets. Qualitative Research,12(1), 66–80.

    Article  Google Scholar 

  • Kolb, B. (2008). Marketing research: A practical approach. London: SAGE Publishing.

    Book  Google Scholar 

  • Logan, T. (2015). Starting a conversation about open data in mathematics education research. In M. Marshman, V. Geiger & A. Bennison (Eds.), Mathematics education in the margins. Proceedings of the 38th annual conference of the mathematics education research group of Australasia (pp. 373–380). Sunshine Coast, Australia: MERGA Inc.

  • Logan, T., & Lowrie, T. (2017). Gender perspectives on spatial tasks in a national assessment: A secondary data analysis. Research in Mathematics Education, 19(2), 199–216. https://doi.org/10.1080/14794802.2017.1334577.

    Article  Google Scholar 

  • Logan, T., & Woodland, K. (2015). Digital games and mathematics learning: The state of play. In T. Lowrie & R. Jorgensen (Eds.), Digital games and mathematics learning: Potential, promises and pitfalls (pp. 277–304). The Netherlands: Springer.

    Chapter  Google Scholar 

  • Logan, T., Lowrie, T., & Diezmann, C. M. (2014). Co-thought gestures: Supporting students to successfully navigate map tasks. Educational Studies in Mathematics, 87, 87–102. https://doi.org/10.1007/s10649-014-9546-2.

    Article  Google Scholar 

  • Lowrie, T. (2014). An educational practice framework: The empowerment of the teaching profession. Journal of Education for Teaching,40(1), 34–46.

    Article  Google Scholar 

  • Mariscal, G., Marbán, O., & Fernández, C. (2010). A survey of data mining and knowledge discovery process models and methodologies. The Knowledge Engineering Review,25(2), 137–166.

    Article  Google Scholar 

  • McNeill, D. (1992). Hand and mind: What gestures reveal about thought. Chicago, IL: University of Chicago Press.

    Google Scholar 

  • Miles, M. B., & Huberman, A. M. (1984). Drawing meaning from qualitative data: Toward a shared craft. Educational Researcher,13(5), 20–30.

    Article  Google Scholar 

  • Morris, D., & Ecclesfield, N. (2011). A new computer-aided technique for qualitative document analysis. International Journal of Research and Method in Education,34(3), 241–254.

    Article  Google Scholar 

  • National Health and Medical Research Council. (2018). Australian code for responsible conduct of research. Canberra, Australia: Author. Retrieved from https://nhmrc.gov.au/about-us/publications/australian-code-responsible-conduct-research-2018#block-views-block-file-attachments-content-block-1.

  • National Science Foundation (NSF). (2010). NSF Data Sharing Policy. Resource document. Retrieved from http://www.nsf.gov/bfa/dias/policy/dmp.jsp.

  • Olesen, S. (2014). Publishing and sharing sensitive data. Australian National Data Service Guides. Resource document. Retrieved from http://ands.org.au/guides/sensitivedata.html.

  • Onwuegbuzie, A. J., & Johnson, R. B. (2006). The validity issue in mixed research. Research in the Schools,13(1), 48–63.

    Google Scholar 

  • Organisation for Economic Co-operation and Development (OECD). (2007). OECD principles and guidelines for access to research data from public funding. Paris, OECD. Resource document. Retrieved from http://www.oecd.org/sti/sci-tech/38500813.pdf.

  • Payne, G., & Payne, J. (2004). Key concepts in social research. London: SAGE Publications.

    Book  Google Scholar 

  • Pyle, D. (1999). Data preparation for data mining. San Francisco, CA: Morgan Kaufmann.

    Google Scholar 

  • Queensland School Curriculum Council. (2001). 2001 Queensland year 5 test: Aspects of numeracy (p. 2). Brisbane, Australia: Queensland School Curriculum Council.

    Google Scholar 

  • Raven, J., Raven, J. C., & Court, J. H. (1998). Manual for Raven’s standard progressive matrices. Oxford: Oxford Psychologists Press.

    Google Scholar 

  • Rew, L., Koniak-Griffin, D., Lewis, M. A., Miles, M., & O’Sullivan, A. (2000). Secondary data analysis: New perspectives for adolescent research. Nursing Outlook,48(5), 223–229.

    Article  Google Scholar 

  • Romero, C., & Ventura, S. (2013). Data mining in education. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery,3(1), 12–27.

    Google Scholar 

  • Rowan, M., & Ramsay, E. (2018). Educational inequality in Tasmania: Evidence and explanations. Australian Educational Researcher,45(3), 277–295.

    Article  Google Scholar 

  • Smith, E. (2008a). Pitfalls and promises: The use of secondary data analysis in educational research. British Journal of Educational Studies,56(3), 323–339.

    Article  Google Scholar 

  • Smith, E. (2008b). Using secondary data in educational and social research. Maidenhead: McGraw Hill/Open University Press.

    Google Scholar 

  • Smith, E. (2011). Special issue on using secondary data analysis in educational research. International Journal of Research and Method in Education,34(3), 219–221.

    Article  Google Scholar 

  • Smith, A. K., Ayanian, J. Z., Cobinsky, K. E., Landon, B. E., McCarthy, E. P., Wee, C. C., et al. (2011). Conducting high-value secondary dataset analysis: An introductory guide and resources. Journal of General Internal Medicine,26(8), 920–929.

    Article  Google Scholar 

  • Thompson, P. (2000). Re-using qualitative research data: A personal account. Forum: Qualitative Social Research. https://doi.org/10.17169/fqs-1.3.1044.

    Article  Google Scholar 

  • Wellcome Trust. (2010). Policy on data management and sharing. Resource document. Retrieved from http://www.wellcome.ac.uk/About-us/Policy/Policy-and-position-statements/WTX035043.htm.

  • Zhang, S., Zhang, C., & Yang, Q. (2003). Data preparation of data mining. Applied Artificial Intelligence,17, 375–381.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tracy Logan.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Logan, T. A practical, iterative framework for secondary data analysis in educational research. Aust. Educ. Res. 47, 129–148 (2020). https://doi.org/10.1007/s13384-019-00329-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13384-019-00329-z

Keywords

Navigation