Skip to main content

Speech Corpus Development for Voice-Controlled MAV

  • Conference paper
  • First Online:
Emerging Trends in Photonics, Signal Processing and Communication Engineering

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 649))

  • 491 Accesses

Abstract

The main idea of this paper is to bring out the uniqueness of the speech corpus required for development of command and control applications such as voice-controlled MAV. Since MAV finds application in adverse environments, the effect of noise degrades ASR performance. Since English words uttered are greatly influenced by user’s mother tongue, there is a necessity to create a customized speech corpus. The corpus creation is accomplished by a NALVoiceCorpus tool, which is designed to capture the specific requirements of the corpus. The tool is quite generic in nature and it can find application in development of any ASR system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kang Y, Yuan M (2009) Software design for mini-type ground control station of UAVICEMI’09. In: 9th International conference on electronic measurement and instruments IEEE

    Google Scholar 

  2. Draper M et al (2003) Manual versus speech input for unmanned aerial vehicle control station operations. In: Proceedings of the human factors and ergonomics society annual meeting, vol 47(1). CA: SAGE Publications, Sage CA: Los Angeles

    Google Scholar 

  3. Shrishrimal PP, Deshmukh RR, Waghmare VB (2012) Indian language speech database: a review. Int J Comput Appl 47(5):17–21

    Google Scholar 

  4. Google cloud speech-to-text documentation page. https://cloud.google.com/speech/docs/

  5. Speech recognition in IOS. https://developer.apple.com/documentation/speech

  6. Robust Automatic Transcription of Speech (RATS), for Information Processing Techniques Office (IPTO), Defense Advanced Research Projects Agency (DARPA), DARPA-BAA-10-34, (2012)

    Google Scholar 

  7. Hofbauer K, Petrik S, Hering H (2008) The ATCOSIM corpus of non-prompted clean air traffic control speech. In: LREC

    Google Scholar 

  8. Ayres Tony, Nolan Brian (2006) Voice activated command and control with speech recognition over WiFi. Sci Comput Program 59(1-2):109–126

    Article  MathSciNet  Google Scholar 

  9. Li A-J, Yin Z-G (2007) Standardization of speech corpus. Data Sci J 6:806–812

    Article  Google Scholar 

  10. Mission planner overview, ardupilot.org/planner/docs/mission-planner-overview.html

    Google Scholar 

  11. Paliwal KK, Yao K (2010) Robust speech recognition under noisy ambient conditions. In: Human-centric interfaces for ambient intelligence, 135–162

    Google Scholar 

  12. Benzeghiba M et al (2007) Automatic speech recognition and speech variability: a review. Speech Commun 49(10–11):763–786

    Google Scholar 

  13. Flight gear simulator homepage, home.flightgear.org/

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank SIGMA panel, AR & DB for funding this activity.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Veena .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rahul, D.K., Veena, S., Lokesha, H., Lakshmi, P. (2020). Speech Corpus Development for Voice-Controlled MAV. In: Kadambi, G., Kumar, P., Palade, V. (eds) Emerging Trends in Photonics, Signal Processing and Communication Engineering. Lecture Notes in Electrical Engineering, vol 649. Springer, Singapore. https://doi.org/10.1007/978-981-15-3477-5_11

Download citation

Publish with us

Policies and ethics