Abstract
The BBC has a very large archive of programmes, covering a wide range of topics. This archive holds a significant part of the BBC’s institutional memory and is an important part of the cultural history of the United Kingdom and the rest of the world. These programmes, or parts of them, can help provide valuable context and background for current news events. However, the BBC’s archive catalogue is not a complete record of everything that was ever broadcast. For example, it excludes the BBC World Service, which has been broadcasting since 1932. This makes content in the uncatalogued parts of the archive very difficult to discover. In this paper, we describe a system based on Semantic Web technologies that helps us quickly locate content related to current news events within those parts of the BBC’s archive that have little or no pre-existing metadata. The system is driven by automated interlinking of archive content with the Semantic Web, user validation of the resulting data, and topic extraction from live BBC News subtitles. The resulting interlinks between live news subtitles and the BBC’s archive feed a dynamic visualisation that enables users to quickly locate relevant content. Journalists and editors can then use this content to provide historical context, background information and supporting material around current affairs.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Raimond, Y., Smethurst, M., McParland, A., Lowis, C. (2013). Using the Past to Explain the Present: Interlinking Current Affairs with Archives via the Semantic Web. In: Alani, H., et al. The Semantic Web – ISWC 2013. ISWC 2013. Lecture Notes in Computer Science, vol 8219. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41338-4_10
DOI: https://doi.org/10.1007/978-3-642-41338-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41337-7
Online ISBN: 978-3-642-41338-4
eBook Packages: Computer Science (R0)