Abstract
In a world emphasizing multilingual interactions in a meeting, this study showcases advanced audio and text processing for accurate multilingual transcription, enhancing international collaboration, ensuring clear understanding across diverse linguistic backgrounds. Harnessing the capabilities of DPTNet, this study achieves superior sound source separation, isolating speech from ambient noise. The pyannote toolkit excels in speaker diarization, segmenting audio based on speaker identities. The SpeechRecognition module showcases its prowess in transcribing dialogue with unparalleled accuracy. Highlighting advancements in textual summarization, the research underscores the synergistic power of the TextRank algorithm and the BART model in distilling extensive narratives into succinct and abstractive summaries. The Hugging Face Transformers, especially the MarianMTModel and MarianTokenizer, provide exemplary translation from audio transcripts. Collectively, these methodologies present a comprehensive blueprint for navigating and deciphering multilingual meetings with precision and clarity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Li H, Chen K, Wang L, Liu J, Wan B, Zhou B (2022) Sound source separation mechanisms of different deep networks explained from the perspective of auditory perception. Appl Sci 12(2)
Asteroid: The pytorch-based audio source separation toolkit for researchers, pp 2637–2641 (2020)
Khoma V, Khoma Y, Brydinskyi V, Konovalov A (2023) Development of supervised speaker diarization system based on the pyannote audio processing library. Sensors 23(4)
Barkovska O (2022) Research into speech-to-text tranfromation module in the proposed model of a speaker’s automatic speech annotation. Innovative Technol Sci Solutions Ind 5–13
Chen X, Li S, Liu S, Fowler R, Wang X (2023) Meetscript: designing transcript-based interactions to support active participation in group video meetings. Proc ACM Hum-Comput Interact 7CSCW2):1–32
Dewan A, Ziemski M, Meylan H, Concina L, Pouliquen B (2023) Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution. arXiv preprint arXiv:2309.15609
Majeed M, Kala MT (2023) Comparative study on extractive summarization using sentence ranking algorithm and text ranking algorithm. In: 2023 International conference on power, instrumentation, control and computing (PICC), pp 1–5
Banu S, Ummayhani S (2023) Text summarisation and translation across multiple languages. J Sci Res Technol 242–247
Pham N-Q, Waibel A, Niehues J (2022) Adaptive multilingual speech recognition with pretrained models. arXiv preprint arXiv:2205.12304
Stanik C, Haering M, Maalej W (2019) Classifying multilingual user feedback using traditional machine learning and deep learning
Srinivasan K, Raman K, Chen J, Bendersky M, Najork M (2021) Wit: Wikipedia-based image text dataset for multimodal multilingual machine learning. In: Proceedings of the 44th international ACM SIGIR conference on research and development in information retrieval, pp 2443–2449
Khanna D, Bhushan R, Goel K, Juneja S (2023) Summarizeai-summarization of the podcasts. Available at SSRN 4628657
Faria FTJ, Moin MB, Wase AA, Ahmmed M, Sani MR, Muhammad T (2023) Vashantor: a large-scale multilingual benchmark dataset for automated translation of bangla regional dialects to bangla language. arXiv preprint arXiv:2311.11142
Posey J, Aiken M (2015) Large-scale, distributed, multilingual, electronic meetings: a pilot study of usability and comprehension. Int J Comput Technol 14:5578–5585
Wairagala EP, Mukiibi J, Tusubira JF, Babirye C, Nakatumba-Nabende J, Katumba A, Ssenkungu I (2022) Gender bias evaluation in Luganda-English machine translation. In: Association for machine translation in the americas, pp 274–286
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mehendale, G., Kale, C., Khatri, P., Goswami, H., Shah, H., Bagul, S. (2024). Multilingual Meeting Management with NLP: Automated Minutes, Transcription, and Translation. In: Sharma, H., Shrivastava, V., Tripathi, A.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2023. Lecture Notes in Networks and Systems, vol 968. Springer, Singapore. https://doi.org/10.1007/978-981-97-2079-8_24
Download citation
DOI: https://doi.org/10.1007/978-981-97-2079-8_24
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-2078-1
Online ISBN: 978-981-97-2079-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)