SASL: A Semantic Annotation System for Literature

  • Pingpeng Yuan
  • Guoyin Wang
  • Qin Zhang
  • Hai Jin
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5854)


Due to ambiguity, search engines for scientific literatures may not return right search results. One efficient solution to the problems is to automatically annotate literatures and attach the semantic information to them. Generally, semantic annotation requires identifying entities before attaching semantic information to them. However, due to abbreviation and other reasons, it is very difficult to identify entities correctly. The paper presents a Semantic Annotation System for Literature (SASL), which utilizes Wikipedia as knowledge base to annotate literatures. SASL mainly attaches semantic to terminology, academic institutions, conferences, and journals etc. Many of them are usually abbreviations, which induces ambiguity. Here, SASL uses regular expressions to extract the mapping between full name of entities and their abbreviation. Since full names of several entities may map to a single abbreviation, SASL introduces Hidden Markov Model to implement name disambiguation. Finally, the paper presents the experimental results, which confirm SASL a good performance.


Semantic Annotation Name Disambiguation 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Reeve, L., Han, H.: Survey of Semantic Annotation Platforms. In: Proceedings of the 2005 ACM Symposium on Applied Computing, New Mexico, USA, pp. 1634–1638 (2005)Google Scholar
  2. 2.
    Popov, B., Kiryakov, A., Ognyanoff, D., Manov, D., Kirilov, A.: KIM: a Semantic Platform for Information Extraction and Retrieval. Natural Language Engineering 10(3-4), 375–392 (2004)CrossRefGoogle Scholar
  3. 3.
    Bontcheva, K., Kiryakov, A., Cunningham, H., Popov, B., Dimitrov, M.: Semantic Web Enabled, Open Source Language Technology. In: Proceedings of EACL 2003 Workshop on Language Technology and the Semantic Web, Budapest, Hungary, pp. 335–343 (2003)Google Scholar
  4. 4.
    Collier, N., Takeuchi, K., Kawazoe, A.: Open Ontology Forge: an Environment for Text Mining in a Semantic Web World. In: Proceedings of International Workshop on Semantic Web Foundations and Application Technologies, Nara, Japan, March 12, pp. 128–140 (2003)Google Scholar
  5. 5.
  6. 6.
    Dill, S., Eiron, N., Gibson, D., Gruhl, D., Guha, R., Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A., Tomlin, J.A., Zien, J.Y.: SemTag and Seeker: Bootstrapping the Semantic Web via Automated Semantic Annotation. In: Proceedings of the 12th International Conference on World Wide Web, Budapest, Hungary, pp. 178–186 (2003)Google Scholar
  7. 7.
    Cimiano, P., Ladwig, G., Staab, S.: Gimme’ The Context: Context-driven Automatic Semantic Annotation with C-PANKOW. In: Proceedings of the Fourteenth International World Wide Web Conference, Chiba, Japan, May 10-14, pp. 332–341 (2005)Google Scholar
  8. 8.
    Cimiano, P., Handschuh, S., Staab, S.: Towards the Self-Annotating Web. In: Proceedings of the 13th International Conference on World Wide Web, New York, USA, pp. 462–471 (2004)Google Scholar
  9. 9.
    Lih, A.: The Wikipedia Revolution: How a Bunch of Nobodies Created the World’s Greatest Encyclopedia. In: Hyperion, 1st edn., March 17 (2009)Google Scholar
  10. 10.
    Gabrilovich, E., Markovitch, S.: Overcoming the Brittleness Bottleneck Using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge. In: Proceedings of the Twenty-First National Conference on Artificial Intelligence, Boston, MA, pp. 1301–1306 (2006)Google Scholar
  11. 11.
    Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)CrossRefGoogle Scholar
  12. 12.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Pingpeng Yuan
    • 1
  • Guoyin Wang
    • 1
  • Qin Zhang
    • 1
  • Hai Jin
    • 1
  1. 1.Service Computing Technology and System Lab Cluster and Grid Computing Lab School of Computer Science and TechnologyHuazhong University of Science and TechnologyWuhanChina

Personalised recommendations