Design and research of multimedia information publishing system based on speech recognition technology

Li, Zhuoran; Wang, Yafei; Wang, Cong

doi:10.1007/s11082-023-05926-y

Design and research of multimedia information publishing system based on speech recognition technology

Published: 29 December 2023

Volume 56, article number 327, (2024)
Cite this article

Optical and Quantum Electronics Aims and scope Submit manuscript

Zhuoran Li¹,
Yafei Wang¹ &
Cong Wang²

79 Accesses
Explore all metrics

Abstract

Internet, also known as the Internet, refers to the huge Internet connected between the LAN and the LAN, which connects a huge Internet from a set of common protocols. Today, accurate real-time multimedia information publishing technology has been widely used in many fields. In order to achieve the purpose of information disclosure, the multimedia information disclosure system uses multimedia resources as an object, and the resources are displayed by the user’s editing of the management party multimedia resource. With the increased workload of management and multimedia’s maintenance information, the current multimedia information management and control software cannot meet the needs of the market. In order to ensure the real-time, accuracy of the multimedia information disclosure, it is necessary to check if the speech recognition technology can accurately collect voice information. Techniques are particularly important for speech recognition, so it is also called automatic voice recognition; its purpose is to convert the words content contained in human speech to computer recognition content. It is different from the speaker’s identification, the speaker is different, the latter tries to identify or confirm that the person who makes a voice, rather than identifying or confirming the vocabulary included. However, voice recognition technology requires a very perfect system design, and the system design can use system science ideas and methods, according to systematic analysis results, it is also possible to design a new system. Therefore, this paper uses the voice system on the Internet to provide a solution to the inaccuracicity in speech recognition technology, and use speech recognition techniques to disclose multimedia information.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Innovative application of voice real time system based on embedded system in e-commerce management

Article 01 July 2023

Development of music teaching system by using speech recognition and intelligent mobile remote device

Article 10 June 2023

Interactive voice system and legal education innovation based on the background of 5G converged media

Article 28 June 2023

Data availability

The data will be available upon request.

References

Abernethy, M.A., Bouwens, J., Van Lent, L.: Leadership and control system design. Manag. Account. Res. 21(1), 2–16 (2010)
Article Google Scholar
Becchi, G., Bertini, M., Del Bimbo, A., et al.: A distributed system for multimedia monitoring, publishing and retrieval. Procedia Comput. Sci. 38, 100–107 (2014)
Article Google Scholar
Bonawitz, K., Eichner, H., Grieskamp, W., et al.: Towards federated learning at scale: system design. Proc. Mach. Learn. Syst. 1, 374–388 (2019)
Google Scholar
Chen, C.Y., Chang, B.R., Huang, P.S.: Multimedia augmented reality information system for museum guidance. Pers. Ubiquit. Comput. 18(2), 315–322 (2014)
Article Google Scholar
Elimat, A.K., AbuSeileek, A.F.: Automatic speech recognition technology as an effective means for teaching pronunciation. Jalt Call J. 10(1), 21–47 (2014)
Article Google Scholar
Fontan, L., Ferrané, I., Farinas, J., et al.: Automatic speech recognition predicts speech intelligibility and comprehension for listeners with simulated age-related hearing loss. J. Speech Lang. Hear. Res. 60(9), 2394–2405 (2017)
Article PubMed Google Scholar
Kazancioglu, H.O., Dahhan, A.S., Acar, A.H.: How could multimedia information about dental implant surgery effects patients’ anxiety level? Med. Oral Patol. Oral Cir. Bucal 22(1), e102–e107 (2017)
PubMed Google Scholar
Li, X., Lin, L., Liu, X., et al.: STB based multimedia information publication system. J. Netw. 6(9), 1305–1312 (2011)
Google Scholar
Liu, H.C., Chuang, H.H.: An examination of cognitive processing of multimedia information based on viewers’ eye movements. Interact. Learn. Environ. 19(5), 503–517 (2011)
Article Google Scholar
Mrva-Montoya, A.: Beyond the monograph: Publishing research for multimedia and multiplatform delivery. J. Sch. Publ. 46(4), 321–342 (2015)
Article Google Scholar
Pereira, M.H., de Souza, C.L., Pádua, F.L., et al.: SAPTE: A multimedia information system to support the discourse analysis and information retrieval of television programs. Multimed. Tools Appl. 74(23), 10923–10963 (2015)
Article Google Scholar
Qin, Z., Yu, J., Cong, Y., et al.: Topic correlation model for cross-modal multimedia information retrieval. Pattern Anal. Appl. 19(4), 1007–1022 (2016)
Article MathSciNet Google Scholar
Toda, S., Kobayashi, K., Saito, Y., et al.: Know-Live: a farm information Web disclosure system with subjective information. Agric. Inf. Res. 22(1), 12–23 (2013)
Google Scholar
Vincent, E., Watanabe, S., Nugraha, A.A., et al.: An analysis of environment, microphone and data simulation mismatches in robust speech recognition. Comput. Speech Lang. 46, 535–557 (2017)
Article Google Scholar
Vrugt, J.A.: Markov chain Monte Carlo simulation using the DREAM software package: theory, concepts, and MATLAB implementation. Environ Model Softw. 75, 273–316 (2016)
Article Google Scholar
Wu, T.J., Tai, Y.N.: Effects of multimedia information technology integrated multi-sensory instruction on students’ learning motivation and outcome. Eurasia J. Math. Sci. Technol. Educ. 12(4), 1065–1074 (2016)
Article Google Scholar

Download references

Funding

This paper was supported by University-level project: Research on Digital Resource Construction of Situational Teaching in the New Liberal Arts Era of Aesthetic Education (22rczx001).

Author information

Authors and Affiliations

School of Fine Arts, Anshan Normal University, Anshan, 114007, Liaoning, China
Zhuoran Li & Yafei Wang
College of Arts and Design, Ningbo University of Finance and Economics, Ningbo, 315175, Zhejiang, China
Cong Wang

Authors

Zhuoran Li
View author publications
You can also search for this author in PubMed Google Scholar
Yafei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The first version was written by ZL, at the same time, YW and CW has done the simulations. All authors have contributed to the paper’s analysis, discussion, writing, and revision.

Corresponding author

Correspondence to Zhuoran Li.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, Z., Wang, Y. & Wang, C. Design and research of multimedia information publishing system based on speech recognition technology. Opt Quant Electron 56, 327 (2024). https://doi.org/10.1007/s11082-023-05926-y

Download citation

Received: 16 October 2023
Accepted: 22 November 2023
Published: 29 December 2023
DOI: https://doi.org/10.1007/s11082-023-05926-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Design and research of multimedia information publishing system based on speech recognition technology

Abstract

Access this article

Similar content being viewed by others

Innovative application of voice real time system based on embedded system in e-commerce management

Development of music teaching system by using speech recognition and intelligent mobile remote device

Interactive voice system and legal education innovation based on the background of 5G converged media

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Design and research of multimedia information publishing system based on speech recognition technology

Abstract

Access this article

Similar content being viewed by others

Innovative application of voice real time system based on embedded system in e-commerce management

Development of music teaching system by using speech recognition and intelligent mobile remote device

Interactive voice system and legal education innovation based on the background of 5G converged media

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation