Multilingual Meeting Management with NLP: Automated Minutes, Transcription, and Translation

  • Conference paper
Communication and Intelligent Systems (ICCIS 2023)

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 968)

Abstract

As meetings increasingly span multiple languages, this study applies audio and text processing to produce accurate multilingual transcripts, supporting international collaboration and clear understanding among participants from diverse linguistic backgrounds. Using DPTNet, the study performs sound source separation, isolating speech from ambient noise. The pyannote toolkit handles speaker diarization, segmenting the audio by speaker identity. The SpeechRecognition module transcribes the dialogue with high accuracy. For summarization, the research combines the TextRank algorithm with the BART model to distill lengthy text into succinct, abstractive summaries. Hugging Face Transformers, specifically MarianMTModel and MarianTokenizer, translate the audio transcripts. Together, these methods offer a comprehensive blueprint for processing multilingual meetings with precision and clarity.
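To make the extractive half of the summarization stage concrete, the following is a minimal sketch of TextRank-style sentence ranking in plain Python. It is not the authors' implementation: the function names (`split_sentences`, `textrank_scores`, `summarize`), the naive sentence splitter, the damping factor of 0.85, and the fixed iteration count are all assumptions chosen for illustration. Sentences are nodes in a graph, edges are weighted by word overlap, and a PageRank-style iteration scores each sentence.

```python
import math
import re


def split_sentences(text):
    """Naive splitter (an assumption): break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercased set of word tokens in a sentence."""
    return set(re.findall(r"[a-z0-9']+", sentence.lower()))


def similarity(a, b):
    """Shared-word count normalised by the log of each sentence's size."""
    if len(a) < 2 or len(b) < 2:
        return 0.0  # avoids a zero denominator for one-word sentences
    return len(a & b) / (math.log(len(a)) + math.log(len(b)))


def textrank_scores(sentences, damping=0.85, iterations=50):
    """Score sentences with a weighted PageRank-style iteration."""
    tokens = [tokenize(s) for s in sentences]
    n = len(sentences)
    # Symmetric edge weights between distinct sentences.
    weights = [
        [similarity(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            # Each neighbour j passes on a share of its score proportional
            # to the weight of edge (j, i) among all of j's edges.
            rank = sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            updated.append((1 - damping) + damping * rank)
        scores = updated
    return scores


def summarize(text, k=2):
    """Return the k highest-scoring sentences in their original order."""
    sentences = split_sentences(text)
    scores = textrank_scores(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(transcript, k=3)` on a meeting transcript would return its three most central sentences. In a pipeline like the one the abstract describes, such an extract could then be passed to an abstractive model such as BART, although how the two are combined in the paper is not stated in the abstract.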

Author information

Corresponding author

Correspondence to Gautam Mehendale.

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Mehendale, G., Kale, C., Khatri, P., Goswami, H., Shah, H., Bagul, S. (2024). Multilingual Meeting Management with NLP: Automated Minutes, Transcription, and Translation. In: Sharma, H., Shrivastava, V., Tripathi, A.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2023. Lecture Notes in Networks and Systems, vol 968. Springer, Singapore. https://doi.org/10.1007/978-981-97-2079-8_24
