Abstract
Internet, also known as the Internet, refers to the huge Internet connected between the LAN and the LAN, which connects a huge Internet from a set of common protocols. Today, accurate real-time multimedia information publishing technology has been widely used in many fields. In order to achieve the purpose of information disclosure, the multimedia information disclosure system uses multimedia resources as an object, and the resources are displayed by the user’s editing of the management party multimedia resource. With the increased workload of management and multimedia’s maintenance information, the current multimedia information management and control software cannot meet the needs of the market. In order to ensure the real-time, accuracy of the multimedia information disclosure, it is necessary to check if the speech recognition technology can accurately collect voice information. Techniques are particularly important for speech recognition, so it is also called automatic voice recognition; its purpose is to convert the words content contained in human speech to computer recognition content. It is different from the speaker’s identification, the speaker is different, the latter tries to identify or confirm that the person who makes a voice, rather than identifying or confirming the vocabulary included. However, voice recognition technology requires a very perfect system design, and the system design can use system science ideas and methods, according to systematic analysis results, it is also possible to design a new system. Therefore, this paper uses the voice system on the Internet to provide a solution to the inaccuracicity in speech recognition technology, and use speech recognition techniques to disclose multimedia information.
Similar content being viewed by others
Data availability
The data will be available upon request.
References
Abernethy, M.A., Bouwens, J., Van Lent, L.: Leadership and control system design. Manag. Account. Res. 21(1), 2–16 (2010)
Becchi, G., Bertini, M., Del Bimbo, A., et al.: A distributed system for multimedia monitoring, publishing and retrieval. Procedia Comput. Sci. 38, 100–107 (2014)
Bonawitz, K., Eichner, H., Grieskamp, W., et al.: Towards federated learning at scale: system design. Proc. Mach. Learn. Syst. 1, 374–388 (2019)
Chen, C.Y., Chang, B.R., Huang, P.S.: Multimedia augmented reality information system for museum guidance. Pers. Ubiquit. Comput. 18(2), 315–322 (2014)
Elimat, A.K., AbuSeileek, A.F.: Automatic speech recognition technology as an effective means for teaching pronunciation. Jalt Call J. 10(1), 21–47 (2014)
Fontan, L., Ferrané, I., Farinas, J., et al.: Automatic speech recognition predicts speech intelligibility and comprehension for listeners with simulated age-related hearing loss. J. Speech Lang. Hear. Res. 60(9), 2394–2405 (2017)
Kazancioglu, H.O., Dahhan, A.S., Acar, A.H.: How could multimedia information about dental implant surgery effects patients’ anxiety level? Med. Oral Patol. Oral Cir. Bucal 22(1), e102–e107 (2017)
Li, X., Lin, L., Liu, X., et al.: STB based multimedia information publication system. J. Netw. 6(9), 1305–1312 (2011)
Liu, H.C., Chuang, H.H.: An examination of cognitive processing of multimedia information based on viewers’ eye movements. Interact. Learn. Environ. 19(5), 503–517 (2011)
Mrva-Montoya, A.: Beyond the monograph: Publishing research for multimedia and multiplatform delivery. J. Sch. Publ. 46(4), 321–342 (2015)
Pereira, M.H., de Souza, C.L., Pádua, F.L., et al.: SAPTE: A multimedia information system to support the discourse analysis and information retrieval of television programs. Multimed. Tools Appl. 74(23), 10923–10963 (2015)
Qin, Z., Yu, J., Cong, Y., et al.: Topic correlation model for cross-modal multimedia information retrieval. Pattern Anal. Appl. 19(4), 1007–1022 (2016)
Toda, S., Kobayashi, K., Saito, Y., et al.: Know-Live: a farm information Web disclosure system with subjective information. Agric. Inf. Res. 22(1), 12–23 (2013)
Vincent, E., Watanabe, S., Nugraha, A.A., et al.: An analysis of environment, microphone and data simulation mismatches in robust speech recognition. Comput. Speech Lang. 46, 535–557 (2017)
Vrugt, J.A.: Markov chain Monte Carlo simulation using the DREAM software package: theory, concepts, and MATLAB implementation. Environ Model Softw. 75, 273–316 (2016)
Wu, T.J., Tai, Y.N.: Effects of multimedia information technology integrated multi-sensory instruction on students’ learning motivation and outcome. Eurasia J. Math. Sci. Technol. Educ. 12(4), 1065–1074 (2016)
Funding
This paper was supported by University-level project: Research on Digital Resource Construction of Situational Teaching in the New Liberal Arts Era of Aesthetic Education (22rczx001).
Author information
Authors and Affiliations
Contributions
The first version was written by ZL, at the same time, YW and CW has done the simulations. All authors have contributed to the paper’s analysis, discussion, writing, and revision.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, Z., Wang, Y. & Wang, C. Design and research of multimedia information publishing system based on speech recognition technology. Opt Quant Electron 56, 327 (2024). https://doi.org/10.1007/s11082-023-05926-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11082-023-05926-y