Abstract
Lexical simplification under a given vocabulary scope for specified communities would potentially benefit many applications such as second language learning and cognitive disabilities education. This paper proposes a new concise ranking strategy for incorporating semantic and context for lexical simplification to a restricted scope. Our approach utilizes WordNet-based similarity calculation for semantic expansion and ranking. It then uses Part-of-Speech tagging and Google 1T 5-gram corpus for context-based ranking. Our experiments are based on a publicly available data sets. Through the comparison with baseline methods including Google Word2vec and four-step method, our approach achieves best F1 measure as 0.311 and Oot F1 measure as 0.522, respectively, demonstrating its effectiveness in combining semantic and context for English lexical simplification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Biran, O., Brody, S., Elhadad, N.: Putting it simply: a context-aware approach to lexical simplification. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, pp. 496–501. Association for Computational Linguistics, Stroudsburg (2011)
Coster, W., Kauchak, D.: Learning to simplify sentences using Wikipedia. In: Proceedings of the Workshop on Monolingual Text-to-Text Generation, pp. 1–9. Association for Computational Linguistics (2011)
Coster, W., Kauchak, D.: Simple English Wikipedia: a new text simplification task. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, vol. 2, pp. 665–669. ACL (2011)
Dagan, I., Glickman, O., Gliozzo, A., Marmorshtein, E., Strapparava, C.: Direct word sense matching for lexical substitution. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the ACL, ACL-44, pp. 449–456. ACL, Stroudsburg (2006)
Glavaš, G., Štajner, S.: Simplifying lexical simplification: do we need simplified corpora. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 63–68 (2015)
Horn, C., Manduca, C., Kauchak, D.: Learning a lexical simplifier using Wikipedia. In: ACL, vol. 2, pp. 458–463 (2014)
Kajiwara, T., Matsumoto, H., Yamamoto, K.: Selecting proper lexical paraphrase for children. In: Proceedings of the Twenty-Fifth Conference on Computational Linguistics and Speech Processing (ROCLING 2013) (2013)
Kauchak, D.: Improving text simplification language modeling using unsimplified text data. In: ACL, vol. 1, pp. 1537–1546 (2013)
McCarthy, D., Navigli, R.: Semeval-2007 task 10: English lexical substitution task. In: Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval 2007, pp. 48–53. Association for Computational Linguistics, Stroudsburg (2007). http://dl.acm.org/citation.cfm?id=1621474.1621483
Paetzold, G.H., Specia, L.: Unsupervised lexical simplification for non-native speakers. In: Thirtieth AAAI Conference on Artificial Intelligence (2016)
Paetzold, G.H.: Reliable lexical simplification for non-native speakers. In: NAACL-HLT 2015 Student Research Workshop (SRW), p. 9 (2015)
Paetzold, G.H., Specia, L.: Lexenstein: a framework for lexical simplification. In: ACL-IJCNLP 2015, vol. 1, no. 1, p. 85 (2015)
Saggion, H., Bott, S., Rello, L.: Simplifying words in context. Experiments with two lexical resources in Spanish. Comput. Speech Lang. 35, 200–218 (2016)
Shirzadi, S.: Syntactic and lexical simplification: the impact on EFL listening comprehension at low and high language proficiency levels. J. Lang. Teach. Res. 5(3), 566–571 (2014)
Specia, L., Jauhar, S.K., Mihalcea, R.: Semeval-2012 task 1: English lexical simplification. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics, pp. 347–355. Association for Computational Linguistics (2012)
The Education Bureau, Curriculum Development Institute: Enhancing English Vocabulary Learning and Teaching at Primary Level. Technical report, The Hong Kong Special Administrative Region (2016)
Wolf, L., Hanani, Y., Bar, K., Dershowitz, N.: Joint word2vec networks for bilingual semantic representations. Int. J. Comput. Linguist. Appl. 5(1), 27–44 (2014)
Yakovets, N., Agrawal, A.: Simple: lexical simplification using word sense disambiguation (2013)
Zhao, S., Zhao, L., Zhang, Y., Liu, T., Li, S.: Hit: web based scoring method for English lexical substitution. In: Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval 2007, pp. 173–176. ACL, Stroudsburg (2007)
Zhu, Z., Bernhard, D., Gurevych, I.: A monolingual tree-based translation model for sentence simplification. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 1353–1361. Association for Computational Linguistics (2010)
Acknowledgments
This work was supported by the Innovation and Technology Fund (Ref: ITS/132/15) of the Innovation and Technology Commission, the Government of the Hong Kong Special Administrative Region, National Natural Science Foundation of China (No. 61772146 & No. 61403088), and Innovative School Project in Higher Education of Guangdong Province (No. YQ2015062).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Hao, T., Xie, W., Lee, J. (2018). A Semantic-Context Ranking Approach for Community-Oriented English Lexical Simplification. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_68
Download citation
DOI: https://doi.org/10.1007/978-3-319-73618-1_68
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)