Skip to main content

Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text

  • Conference paper
  • First Online:
Text, Speech, and Dialogue (TSD 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9924))

Included in the following conference series:

Abstract

This paper describes a summarization system that was developed in order to summarize news delivered orally. The system generates text summaries from input audio using three independent components: an automatic speech recognizer, a syntactic analyzer, and a summarizer. The absence of sentence boundaries in the recognized text complicates the summarization process. Therefore, we use a syntactic analyzer to identify continuous segments in the recognized text.

We used 50 reference articles to perform our evaluation. The data are publicly available at http://nlp.ite.tul.cz/sumarizace. The results of the proposed system were compared with the results of sentence summarization in the reference articles. The evaluation was performed using co-occurrence of n-grams in the reference and generated summaries, and by readers mark-ups. The readers marked two aspects of the summaries: readability and information relevance. Experiments confirm that the generated summaries have the same information value as the reference summaries. However, readers state that phrase summaries are hard to read without the whole sentence context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Sentences are used as segments when the text is summarized.

References

  1. Mateju, L., Cerva, P., Zdansky, J.: Investigation into the use of deep neural networks for LVCSR of Czech. In: 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM), pp. 1–4. IEEE (2015)

    Google Scholar 

  2. Nouza, J., Zdansky, J., Cerva, P., Silovsky, J.: Challenges in speech processing of Slavic languages (case studies in speech recognition of Czech and Slovak). In: Esposito, A., Campbell, N., Vogel, C., Hussain, A., Nijholt, A. (eds.) Second COST 2102. LNCS, vol. 5967, pp. 225–241. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  3. Bohac, M., Blavka, K., Kucharova, M., Skodova, S.: Post-processing of the recognized speech for web presentation of large audio archive. In: 2012 35th International Conference on Telecommunications and Signal Processing (TSP), pp. 441–445 (2012)

    Google Scholar 

  4. Kovář, V., Horák, A., Jakubíček, M.: Syntactic analysis using finite patterns: a new parsing system for Czech. In: Vetulani, Z. (ed.) LTC 2009. LNCS, vol. 6562, pp. 161–171. Springer, Heidelberg (2011)

    Google Scholar 

  5. Hori, C., Furui, S., Malkin, R., Yu, H., Waibel, A.: Automatic speech summarization applied to English broadcast news speech. In: 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I-9–I-12 (2002)

    Google Scholar 

  6. Chen, Y.T., Chen, B., Wang, H.M.: A probabilistic generative framework for extractive broadcast news speech summarization. IEEE Trans. Audio Speech Lang. Process. 17, 95–106 (2009)

    Article  Google Scholar 

  7. Furui, S., Kikuchi, T., Shinnaka, Y., Hori, C.: Speech-to-text and speech-to-speech summarization of spontaneous speech. IEEE Trans. Speech Audio Process. 12, 401–408 (2004)

    Article  Google Scholar 

  8. Straková, J., Straka, M., Hajič, J.: Open-source tools for morphology, lemmatization, POS tagging and named entity recognition. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 13–18. Association for Computational Linguistics, Baltimore, Maryland (2014)

    Google Scholar 

  9. Rott, M., Červa, P.: SummEC: a summarization engine for Czech. In: Habernal, I. (ed.) TSD 2013. LNCS, vol. 8082, pp. 527–535. Springer, Heidelberg (2013)

    Google Scholar 

  10. Michal, R.: The initial study of term vector generation methods for news summarization. In: Proceedings of the Ninth Workshop on Recent Advances in Slavonic Natural Languages Processing, pp. 23–30. Tribun EU, Brno (2015)

    Google Scholar 

  11. Vanderwende, L., Suzuki, H., Brockett, C., Nenkova, A.: Beyond sumbasic: task-focused summarization with sentence simplification and lexical expansion. Inf. Process. Manag. 43, 1606–1618 (2007). Text Summarization

    Article  Google Scholar 

  12. Lin, C.Y.: Rouge: a package for automatic evaluation of summaries, pp. 25–26 (2004)

    Google Scholar 

  13. Dahl, G.E., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Trans. Audio Speech Lang. Process. 20, 30–42 (2012)

    Article  Google Scholar 

  14. Svendsen, T., Hamar, J.B.: Combining NDHMM and phonetic feature detection for speech recognition. In: 2015 23rd European Signal Processing Conference (EUSIPCO), pp. 1666–1670 (2015)

    Google Scholar 

Download references

Acknowledgement

This paper was supported by the Technology Agency of the Czech Republic (Project No. TA04010199) and by the Student Grant Scheme 2016 (SGS) at the Technical University of Liberec.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michal Rott .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Rott, M., Červa, P. (2016). Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2016. Lecture Notes in Computer Science(), vol 9924. Springer, Cham. https://doi.org/10.1007/978-3-319-45510-5_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45510-5_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45509-9

  • Online ISBN: 978-3-319-45510-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics