An Integrated AMIS Prototype for Automated Summarization and Translation of Newscasts and Reports
Conference paper
Abstract
In this paper we present the results of the integration work on a system designed for automated summarization and translation of newscasts and reports. We show the proposed system architectures and list the available software modules. Thanks to well-defined interfaces, the software modules can be used as building blocks, allowing easy experimentation with different summarization scenarios.
Keywords
Integration, Video summarization, Speech recognition, Machine translation, Text boundary segmentation, Text summarization
Notes
Acknowledgements
This research was funded by the National Science Centre, Poland, on the basis of decision number DEC-2015/16/Z/ST7/00559, under the CHIST-ERA AMIS project.
Copyright information
© Springer Nature Switzerland AG 2019