An Extended Graph-Based Label Propagation Method for Readability Assessment

Jiang, Zhiwei; Sun, Gang; Gu, Qing; Yu, Lixia; Chen, Daoxu

doi:10.1007/978-3-319-25255-1_40

Zhiwei Jiang¹⁸,
Gang Sun¹⁸,
Qing Gu¹⁸,
Lixia Yu¹⁸ &
…
Daoxu Chen¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9313))

Included in the following conference series:

Asia-Pacific Web Conference

2818 Accesses
1 Citations

Abstract

Readability assessment is to evaluate the reading difficulty of a document, which can be quantified as reading levels. In this paper, we propose an extended graph-based label propagation method for readability assessment. We employ three vector space models (VSMs) to compute edges and weights for the graphs, along with three graph sparsification techniques. By incorporating the pre-classification results, we develop four strategies to reinforce the graphs before label propagation to capture the ordinal relation among the reading levels. The reinforcement includes recomputing weights for the edges, and filtering out edges linking nodes with big level difference. Experiments are conducted systematically on datasets of both English and Chinese. The results demonstrate both effectiveness and potential of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Benjamin, R.G.: Reconstructing readability: Recent developments and recommendations in the analysis of text difficulty. Educational Psychology Review 24(1), 63–88 (2012)
Article Google Scholar
Blei, D.M.: Probabilistic topic models. Communications of the ACM 55(4), 77–84 (2012)
Article Google Scholar
Collins-Thompson, K., Bennett, P.N., White, R.W., de la Chica, S., Sontag, D.: Personalizing web search results by reading level. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 403–412. ACM (2011)
Google Scholar
Collins-Thompson, K., Callan, J.P.: A language modeling approach to predicting reading difficulty. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 193–200 (2004)
Google Scholar
Daitch, S.I., Kelner, J.A., Spielman, D.A.: Fitting a graph to vector data. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 201–208. ACM (2009)
Google Scholar
Feng, L., Jansche, M., Huenerfauth, M., Elhadad, N.: A comparison of features for automatic readability assessment. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 276–284 (2010)
Google Scholar
François, T., Fairon, C.: An ai readability formula for french as a foreign language. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 466–477 (2012)
Google Scholar
Jameel, S., Qian, X., Lam, W.: N-gram fragment sequence based unsupervised domain-specific document readability. In: COLING, pp. 1309–1326 (2012)
Google Scholar
Jebara, T., Wang, J., Chang, S.F.: Graph construction and b-matching for semi-supervised learning. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 441–448. ACM (2009)
Google Scholar
Jiang, Z., Sun, G., Gu, Q., Chen, D.: An ordinal multi-class classification method for readability assessment of chinese documents. In: Buchmann, R., Kifor, C.V., Yu, J. (eds.) KSEM 2014. LNCS, vol. 8793, pp. 61–72. Springer, Heidelberg (2014)
Google Scholar
Kim, D.S., Verma, K., Yeh, P.Z.: Joint extraction and labeling via graph propagation for dictionary construction. In: Twenty-Seventh AAAI Conference on Artificial Intelligence (2013)
Google Scholar
Kincaid, J.P., Fishburne Jr., R.P., Rogers, R.L., Chissom, B.S.: Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel, Tech. rep., DTIC Document (1975)
Google Scholar
Ma, Y., Fosler-Lussier, E., Lofthus, R.: Ranking-based readability assessment for early primary children’s literature. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 548–552 (2012)
Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)
Article Google Scholar
Subramanya, A., Petrov, S., Pereira, F.: Efficient graph-based semi-supervised learning of structured tagging models. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 167–176 (2010)
Google Scholar
Tanaka-Ishii, K., Tezuka, S., Terada, H.: Sorting texts by readability. Computational Linguistics 36(2), 203–227 (2010)
Article Google Scholar
Vogel, M., Washburne, C.: An objective method of determining grade placement of children’s reading material. The Elementary School Journal 28(5), 373–381 (1928)
Article Google Scholar
Xie, W., Peng, Y., Xiao, J.: Semantic graph construction for weakly-supervised image parsing. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Québec City, Québec, Canada, July 27-31, pp. 2853–2859 (2014)
Google Scholar
Zeng, X., Wong, D.F., Chao, L.S., Trancoso, I.: Graph-based semi-supervised model for joint chinese word segmentation and part-of-speech tagging. In: Proceeding of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 770–779 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, China
Zhiwei Jiang, Gang Sun, Qing Gu, Lixia Yu & Daoxu Chen

Authors

Zhiwei Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Gang Sun
View author publications
You can also search for this author in PubMed Google Scholar
Qing Gu
View author publications
You can also search for this author in PubMed Google Scholar
Lixia Yu
View author publications
You can also search for this author in PubMed Google Scholar
Daoxu Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhiwei Jiang .

Editor information

Editors and Affiliations

University of Hong Kong, Hong Kong, China
Reynold Cheng
Computer Science, Peking University, Beijing, China
Bin Cui
Advanced Digital Sciences Center (ADSC), Singapore, Singapore
Zhenjie Zhang
University of Technology, Guangzhou, China
Ruichu Cai
Guangxi University, Guangxi, China
Jia Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, Z., Sun, G., Gu, Q., Yu, L., Chen, D. (2015). An Extended Graph-Based Label Propagation Method for Readability Assessment. In: Cheng, R., Cui, B., Zhang, Z., Cai, R., Xu, J. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science(), vol 9313. Springer, Cham. https://doi.org/10.1007/978-3-319-25255-1_40

Download citation

DOI: https://doi.org/10.1007/978-3-319-25255-1_40
Published: 13 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25254-4
Online ISBN: 978-3-319-25255-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics