Design and research of multimedia information publishing system based on speech recognition technology

Optical and Quantum Electronics

Abstract

The Internet is a vast network of interconnected local area networks (LANs) that communicate through a common set of protocols. Today, accurate, real-time multimedia information publishing technology is widely used in many fields. To publish information, a multimedia information publishing system treats multimedia resources as its objects and displays them after users on the management side have edited them. As the workload of managing and maintaining multimedia information grows, current multimedia information management and control software cannot meet market needs. To ensure that multimedia information is published accurately and in real time, it is necessary to verify that speech recognition technology can capture voice information accurately. Speech recognition, also called automatic speech recognition, aims to convert the words contained in human speech into content that a computer can process. It differs from speaker identification, which tries to identify or confirm the person who is speaking rather than the words being spoken. However, speech recognition technology requires a well-developed system design. System design can draw on the ideas and methods of systems science, and a new system can also be designed from the results of systematic analysis. Therefore, this paper uses an Internet-based voice system to address inaccuracy in speech recognition and applies speech recognition techniques to publish multimedia information.

Data availability

The data will be available upon request.

Funding

This work was supported by the university-level project "Research on Digital Resource Construction of Situational Teaching in the New Liberal Arts Era of Aesthetic Education" (22rczx001).

Author information

Contributions

ZL wrote the first draft, while YW and CW carried out the simulations. All authors contributed to the paper’s analysis, discussion, writing, and revision.

Corresponding author

Correspondence to Zhuoran Li.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Li, Z., Wang, Y. & Wang, C. Design and research of multimedia information publishing system based on speech recognition technology. Opt Quant Electron 56, 327 (2024). https://doi.org/10.1007/s11082-023-05926-y

  • DOI: https://doi.org/10.1007/s11082-023-05926-y
