Abstract
This paper presents a cloud-based Chinese language query system which provides n-gram and skip n-gram frequencies retrieved from the Chinese Web 5-gram corpus. Language learners can learn frequently co-occurring context words or other contextual information from the retrieval results. The system was implemented using a MySQL relational database and PHP script language. Experimental results show the retrieval time for some sample queries which reveals that the system achieves high retrieval efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cheng, C. C. (2004). Word-focused extensive reading with guidance. Proceedings of the 13th International Symposium on English Teaching (pp. 24–32).
Inkpen, D. (2007). A statistical model of near-synonym choice. ACM Transactions on Speech and Language Processing, 4(1), 1–17.
Ouyang, S., Gao, H. H., & Koh, S. N. (2009). Developing a computer-facilitated tool for acquiring near-synonyms in Chinese and english. Proceedings of IWCS-09 (pp. 316–319).
Lam, Y. C. (2010). Managing the google web 1T 5-gram with relational database. Journal of Education, Informatics and Cybernetics, 2(2), 1–6.
Yu, L. C., Wu, C. H., Chang, R. Y., Liu, C. H., & Hovy, E. H. (2010). Annotation and verification of sense pools in ontoNotes. Information Processing and Management, 46(4), 436–447.
Yu, L. C. & Hsu, K. H. (2012). Developing and evaluating a computer-assisted near-synonym learning system. Proceedings of the 24th International Conference on Computational Linguistics (pp. 509–516).
Yu, L. C., & Chien, W. N. (2013). Independent component analysis for near-synonym choice. Decision Support Systems, 55(1), 146–155.
Wu, C. H., Liu, C. H., Matthew, H., & Yu, L. C. (2010). Sentence correction incorporating relative position and parse template language models. IEEE Transactions on Audio, Speech and Language Processing, 18(6), 1170–1181.
Manning, C., & Schütze, H. (1999). Foundations of statistical natural language processing. Cambridge: MIT Press.
Acknowledgments
This work was partially supported by National Science Council, Taiwan, under Grant No. NSC102-2221-E-155-029-MY3, and Bureau of Energy, Ministry of Economic Affairs, Taiwan, under Grant No. 102-E0616. The authors would like to thank the anonymous reviewers for their constructive comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Lim, C.S., Yu, LC. (2014). A Cloud-Based Chinese Web 5-Gram Query System. In: Juang, J., Chen, CY., Yang, CF. (eds) Proceedings of the 2nd International Conference on Intelligent Technologies and Engineering Systems (ICITES2013). Lecture Notes in Electrical Engineering, vol 293. Springer, Cham. https://doi.org/10.1007/978-3-319-04573-3_49
Download citation
DOI: https://doi.org/10.1007/978-3-319-04573-3_49
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04572-6
Online ISBN: 978-3-319-04573-3
eBook Packages: EngineeringEngineering (R0)