Iberoamerican Congress on Pattern Recognition

CIARP 2005: Progress in Pattern Recognition, Image Analysis and Applications, pp 1047–1054

Language Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

  • G. Bordel,
  • A. Ezeiza,
  • K. Lopez de Ipina,
  • J. M. López,
  • M. Peñagarikano &
  • E. Zulueta

Part of the Lecture Notes in Computer Science book series (LNIP, volume 3773)

Abstract

Automatic indexing of broadcast news is a developing research area that has attracted great recent interest [1]. This paper describes the development steps for designing an automatic index system of broadcast news for both Basque and Spanish. The application requires appropriate language resources to design all the components of the system. Large, well-defined resources are now available for the most widely used languages, but much work remains to be done for minority languages. Although Spanish has far more resources than Basque, this work pursues parallel efforts for both languages. The two languages were chosen because they are co-official in the Basque Autonomous Community and are used in many of the Community's mass media, including the Basque Public Radio and Television, EITB [2].

Keywords

  • Basque Country
  • Minority Language
  • Textual Sample
  • Language Resource
  • Vocabulary Size

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. Vandecatseye, A., Martens, J.P., Neto, J., Meinedo, H., Garcia-Mateo, C., Dieguez, F.J., Mihelic, F., Zibert, J., Nouza, J., David, P., Pleva, M., Cizmar, A., Papageorgiou, H., Alexandris, C.: The COST278 pan-European Broadcast News Database. In: Proceedings of LREC 2004, Lisbon, Portugal (2004)

  2. EITB Basque Public Radio and Television, http://www.eitb.com/

  3. Euskaltzaindia, http://www.euskaltzaindia.net/

  4. Alegria, I., Artola, X., Sarasola, K., Urkia, M.: Automatic morphological analysis of Basque. In: Literary & Linguistic Computing, vol. 11(4), pp. 193–203. Oxford Univ. Press, Oxford (1996)

  5. Peñagarikano, M., Bordel, G., Varona, A., de Ipina, L.: Using non-word Lexical Units in Automatic Speech Understanding. In: Proceedings of IEEE, ICASSP 1999, Phoenix, Arizona (1999)

  6. Lopez de Ipiña, K., Graña, M., Ezeiza, N., Hernández, M., Zulueta, E., Ezeiza, A., Tovar, C.: Selection of Lexical Units for Continuous Speech Recognition of Basque. Progress in Pattern Recognition, Speech and Image Analysis, 244–250 (2003)

  7. Lopez de Ipina, K., Ezeiza, N., Bordel, N., Graña, M.: Automatic Morphological Segmentation for Speech Processing in Basque. In: IEEE TTS Workshop, Santa Monica, USA (2002)

  8. Egunkaria, Euskaldunon Egunkaria, the only newspaper in Basque, which has been recently replaced by Berria, online, at http://www.berria.info/

  9. GARA, local Basque Country newspaper in Spanish, online, at http://www.gara.net/

  10. Barras, C., Geoffrois, E., Wu, Z., Liberman, M.: Transcriber: a Free Tool for Segmenting, Labeling and Transcribing Speech. In: First International Conference on Language Resources and Evaluation, LREC 1998 (1998)

  11. Linguistic Data Consortium, Design Specifications for the Transcription of Spoken Language, available online, at http://www.ldc.upenn.edu/Projects/Corpus_Cookbook

Author information

Authors and Affiliations

  1. University of the Basque Country, Elektrizitate eta Elektronika Saila, Leioa

    G. Bordel & M. Peñagarikano

  2. Ixa taldea. Sistemen Ingeniaritza eta Automatika Saila, Donostia

    A. Ezeiza

  3. Sistemen Ingeniaritza eta Automatika Saila, Gasteiz

    K. Lopez de Ipina, J. M. López & E. Zulueta

Editor information

Editors and Affiliations

  1. Dept. System Engineering and Automation, Universitat Politècnica de Catalunya (UPC) Barcelona, Spain

    Alberto Sanfeliu

  2. Pattern Recognition Group, ICIMAF, Havana, Cuba

    Manuel Lazo Cortés

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bordel, G., Ezeiza, A., de Ipina, K.L., López, J.M., Peñagarikano, M., Zulueta, E. (2005). Language Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish. In: Sanfeliu, A., Cortés, M.L. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2005. Lecture Notes in Computer Science, vol 3773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11578079_107

  • DOI: https://doi.org/10.1007/11578079_107

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29850-2

  • Online ISBN: 978-3-540-32242-9

  • eBook Packages: Computer Science, Computer Science (R0)

Published in cooperation with the International Association for Pattern Recognition, http://www.iapr.org/