Using Automatic Metadata Extraction to Build a Structured Syllabus Repository

Yu, Xiaoyan; Tungare, Manas; Fan, Weiguo; Pérez-Quiñones, Manuel; Fox, Edward A.; Cameron, William; Cassel, Lillian

doi:10.1007/978-3-540-77094-7_43

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4822)

Abstract

Syllabi are important documents created by instructors for students. Gathering syllabi that are freely available, and creating useful services on top of the collection, will yield a digital library of value for the educational community. However, gathering and building a repository of syllabi is complicated by the unstructured nature of syllabus representation and the lack of a unified vocabulary for syllabus construction. In this paper, we propose an intelligent approach to automatically annotate freely-available syllabi from the Web to benefit the educational community through supporting services such as semantic search. We discuss our detailed process for converting unstructured syllabi to structured representations through entity recognition, segmentation, and association. Our evaluation results demonstrate the effectiveness of our extractor and also suggest improvements. We hope our work will benefit not only users of our services but also people who are interested in building other genre-specific repositories.
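
As a rough illustration of the three steps named in the abstract (segmentation, entity recognition, and association), the following Python sketch splits a plain-text syllabus into sections at known headings, finds a few entities with regular expressions, and attaches each one to a field according to the section it was found in. The heading vocabulary, the regular expressions, the field names, and the sample syllabus are all hypothetical and are not taken from the paper; the authors' extractor may work quite differently.

```python
import re

# Hypothetical heading vocabulary (an assumption, not from the paper).
HEADINGS = ("instructor", "description", "textbook", "grading", "schedule")

# Illustrative entity patterns only (assumptions, not from the paper).
COURSE_CODE = re.compile(r"\b[A-Z]{2,4}\s?\d{3,4}\b")
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def segment(text):
    """Segmentation: start a new section whenever a line is a known heading."""
    sections = {"preamble": []}
    current = "preamble"
    for line in text.splitlines():
        key = line.strip().rstrip(":").lower()
        if key in HEADINGS:
            current = key
            sections[current] = []
        elif line.strip():
            sections[current].append(line.strip())
    return {name: " ".join(body) for name, body in sections.items()}


def extract(text):
    """Entity recognition and association: look for each entity in the
    section where it is expected and store it under a named field."""
    sections = segment(text)
    code = COURSE_CODE.search(sections["preamble"])
    email = EMAIL.search(sections.get("instructor", ""))
    return {
        "course_code": code.group(0) if code else None,
        "instructor_email": email.group(0) if email else None,
        "description": sections.get("description"),
    }


# Made-up sample syllabus for demonstration.
SAMPLE = """CS 1114 Introduction to Programming
Instructor:
Jane Doe, jdoe@example.edu
Description:
Fundamentals of problem solving with programs.
"""

record = extract(SAMPLE)
print(record)
```

A keyword-and-regex approach like this is only a baseline; real syllabi vary widely in layout and vocabulary, which is the difficulty the abstract points to.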

Author information

Authors and Affiliations

Virginia Tech, Blacksburg VA 24061, USA
Xiaoyan Yu, Manas Tungare, Weiguo Fan, Manuel Pérez-Quiñones & Edward A. Fox
Villanova University, Villanova PA 19085, USA
William Cameron & Lillian Cassel

Editor information

Dion Hoe-Lian Goh, Tru Hoang Cao, Ingeborg Torvik Sølvberg, Edie Rasmussen

Cite this paper

Yu, X. et al. (2007). Using Automatic Metadata Extraction to Build a Structured Syllabus Repository. In: Goh, D.HL., Cao, T.H., Sølvberg, I.T., Rasmussen, E. (eds) Asian Digital Libraries. Looking Back 10 Years and Forging New Frontiers. ICADL 2007. Lecture Notes in Computer Science, vol 4822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77094-7_43

DOI: https://doi.org/10.1007/978-3-540-77094-7_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77093-0
Online ISBN: 978-3-540-77094-7
eBook Packages: Computer Science, Computer Science (R0)
