Abstract
Research in the field of educational videos and the contribution of data mining to education can affect the instructors’ approach to learning. This particular study focuses on online educational videos and more specifically on their speakers. Initially a survey is conducted related to the popularity of educational videos on the YouTube which are then divided into two categories the more popular and the less popular. Then the characteristics related to language are extracted from the transcript of the speakers and after a clustering procedure the differences between the two categories are stated. The characteristics related to the language of the speakers of the popular videos present very interesting results. That is, the pace of speaking is faster and the complexity off the sentences is higher than the ones in the less popular videos.
Chapter PDF
Similar content being viewed by others
References
Moel de, E.L.: Expanding the usability of recorded lectures (2010), http://purl.utwente.nl/essays/59431
Waldrop, M.M., Nature Magazine: Massive open online courses, aka MOOCs, transform higher education and science (2014)
Gibbon, D.C., Liu, Z.: Introduction to Video Search Engines. Springer (2008)
Cutting, D.R., Pedersen, J.O., Karger, D.R., Tukey, J.W.: Scatter/gather: a cluster-based approach to browsing large document collections. In: SIGIR, pp. 318–329 (1992)
Yang, J., Li, Q., Wenyin, L., Zhuang, Y.: Searching for flash movies on the web: A content and context based framework. World Wide Web 8(4), 495–517 (2005)
Cheung, S.C.S., Zakhor, A.: Efficient video similarity measurement with video signature. IEEE Trans. Circuits Syst. Video Technol. 13(1), 59–74 (2003)
Hindle, A., Shao, J., Lin, D., Lu, J., Zhang, R.: Clustering web video search results based on integration of multiple features. World Wide Web 14(1), 53–73 (2011)
Amine, A., Elberrichi, Z., Simonet, M.: Evaluation of text clustering methods using wordnet. Int. Arab J. Inf. Technol. 7(4), 349–357 (2010)
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. The Journal of Machine Learning Research 12, 2493–2537 (2011)
Friederici, A.D.: The brain basis of language processing: from structure to function. Physiological Reviews 91(4), 1357–1392 (2011)
Jin, M., Murakami, M.: Authors’ characteristic writing styles as seen through their use of commas. Behaviormetrika 20(1), 3–76 (1992)
Shehata, S., Karray, F., Kamel, M.S.: An efficient concept-based mining model for enhancing text clustering. IEEE Transactions on Knowledge and Data Engineering 22(10), 1360–1371 (2010)
Passini, C., Luiza, M., Estébanez, K.B., Figueredo, G.P., Ebecken, F., Nelson, F.: A Strategy for Training Set Selection in Text Classification Problems. International Journal of Advanced Computer Science & Applications 4(6) (2013)
Kiritchenko, S., Matwin, S.: Email classification with co-training. In: Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research, pp. 301–312 (2011)
Filippova, K., Hall, K.B.: Improved video categorization from text metadata and user comments. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 835–842 (2011)
Gilroy, M.: Higher education migrates to YouTube and social networks. Education Digest 75(7), 18–22 (2010)
Selwyn, N.: Social media in higher education. The Europa World of Learning (2012)
Chatzopoulou, G., Sheng, C., Faloutsos, M.: A first step towards understanding popularity in youtube. In: INFOCOM IEEE Conference on Computer Communications Workshops, pp. 1–6 (2010)
Figueiredo, F., Almeida, J.M., Gonçalves, M.A., Benevenuto, F.: On the Dynamics of Social Media Popularity: A YouTube Case Study. arXiv preprint arXiv:1402.1777 (2014)
Liu, B.: Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5(1), 1–167 (2012)
Chen, H., Zimbra, D.: AI and opinion mining. IEEE Intelligent Systems 25(3), 74–80 (2010)
Wattenhofer, M., Wattenhofer, R., Zhu, Z.: The YouTube Social Network. In: ICWSM (2012)
Moran, M., Seaman, J., Tinti-Kane, H.: Teaching, Learning, and Sharing: How Today’s Higher Education Faculty Use Social Media. Babson Survey Research Group (2011)
Padilla, A., DeFields, A.: Beginning Zend Framework. Apress (2009)
Weka.: Data Mining Software in Java. University of Waikato (2014), http://www.cs.waikato.ac.nz/ml/weka/
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C., Silverman, R., Wu, A.Y.: The analysis of a simple k-means clustering algorithm. In: Proceedings of the Sixteenth Annual Symposium on Computational Geometry, pp. 100–109 (2000)
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: An efficient k-means clustering algorithm: Analysis and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), 881–892 (2002)
Vora, P., Oza, B.: A Survey on K-mean Clustering and Particle Swarm Optimization. International Journal of Science and Modern Engineering (IJISME), 24-26 (2013)
Färber, I., Günnemann, S., Kriegel, H.P., Kröger, P., Müller, E., Schubert, E., et al.: On using class-labels in evaluation of clusterings. In: MultiClust: 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings Held in Conjunction with KDD (2010)
Wiegand, N.: Creating complex sentence structure. In: Proceedings of the Annual Meeting of the Berkeley Linguistics Society vol. 10 (2011)
Beliakov, G., Bustince, H., Fernandez, J.: The median and its extensions. Fuzzy Sets and Systems 175(1), 36–47 (2011)
Grzybek, P., Stadlober, E., Kelih, E.: The relationship of word length and sentence length: the inter-textual perspective. In: Advances in Data Analysis, pp. 611–618. Springer (2007)
Mahowald, K., Fedorenko, E., Piantadosi, S.T., Gibson, E.: Info/information theory: Speakers choose shorter words in predictive contexts. Cognition 126(2), 313–318 (2013)
Hill, R.L., Murray, W.S.: Commas and spaces: The point of punctuation. In: 11th Annual CUNY Conference on Human Sentence Processing (1998)
Palmer, D.D.: Tokenisation and sentence segmentation, chapter 2. In: Dale, R., Moisi, H., Somers, H. (eds.) Handbook of Natural Language Processing. Marcel Dekker (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 IFIP International Federation for Information Processing
About this paper
Cite this paper
Kravvaris, D., Kermanidis, K.L. (2014). Speakers’ Language Characteristics Analysis of Online Educational Videos. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds) Artificial Intelligence Applications and Innovations. AIAI 2014. IFIP Advances in Information and Communication Technology, vol 436. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44654-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-662-44654-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44653-9
Online ISBN: 978-3-662-44654-6
eBook Packages: Computer ScienceComputer Science (R0)