Skip to main content

An Extended Graph-Based Label Propagation Method for Readability Assessment

  • Conference paper
  • First Online:
Web Technologies and Applications (APWeb 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9313))

Included in the following conference series:

Abstract

Readability assessment is to evaluate the reading difficulty of a document, which can be quantified as reading levels. In this paper, we propose an extended graph-based label propagation method for readability assessment. We employ three vector space models (VSMs) to compute edges and weights for the graphs, along with three graph sparsification techniques. By incorporating the pre-classification results, we develop four strategies to reinforce the graphs before label propagation to capture the ordinal relation among the reading levels. The reinforcement includes recomputing weights for the edges, and filtering out edges linking nodes with big level difference. Experiments are conducted systematically on datasets of both English and Chinese. The results demonstrate both effectiveness and potential of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benjamin, R.G.: Reconstructing readability: Recent developments and recommendations in the analysis of text difficulty. Educational Psychology Review 24(1), 63–88 (2012)

    Article  Google Scholar 

  2. Blei, D.M.: Probabilistic topic models. Communications of the ACM 55(4), 77–84 (2012)

    Article  Google Scholar 

  3. Collins-Thompson, K., Bennett, P.N., White, R.W., de la Chica, S., Sontag, D.: Personalizing web search results by reading level. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 403–412. ACM (2011)

    Google Scholar 

  4. Collins-Thompson, K., Callan, J.P.: A language modeling approach to predicting reading difficulty. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 193–200 (2004)

    Google Scholar 

  5. Daitch, S.I., Kelner, J.A., Spielman, D.A.: Fitting a graph to vector data. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 201–208. ACM (2009)

    Google Scholar 

  6. Feng, L., Jansche, M., Huenerfauth, M., Elhadad, N.: A comparison of features for automatic readability assessment. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 276–284 (2010)

    Google Scholar 

  7. François, T., Fairon, C.: An ai readability formula for french as a foreign language. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 466–477 (2012)

    Google Scholar 

  8. Jameel, S., Qian, X., Lam, W.: N-gram fragment sequence based unsupervised domain-specific document readability. In: COLING, pp. 1309–1326 (2012)

    Google Scholar 

  9. Jebara, T., Wang, J., Chang, S.F.: Graph construction and b-matching for semi-supervised learning. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 441–448. ACM (2009)

    Google Scholar 

  10. Jiang, Z., Sun, G., Gu, Q., Chen, D.: An ordinal multi-class classification method for readability assessment of chinese documents. In: Buchmann, R., Kifor, C.V., Yu, J. (eds.) KSEM 2014. LNCS, vol. 8793, pp. 61–72. Springer, Heidelberg (2014)

    Google Scholar 

  11. Kim, D.S., Verma, K., Yeh, P.Z.: Joint extraction and labeling via graph propagation for dictionary construction. In: Twenty-Seventh AAAI Conference on Artificial Intelligence (2013)

    Google Scholar 

  12. Kincaid, J.P., Fishburne Jr., R.P., Rogers, R.L., Chissom, B.S.: Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel, Tech. rep., DTIC Document (1975)

    Google Scholar 

  13. Ma, Y., Fosler-Lussier, E., Lofthus, R.: Ranking-based readability assessment for early primary children’s literature. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 548–552 (2012)

    Google Scholar 

  14. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  15. Subramanya, A., Petrov, S., Pereira, F.: Efficient graph-based semi-supervised learning of structured tagging models. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 167–176 (2010)

    Google Scholar 

  16. Tanaka-Ishii, K., Tezuka, S., Terada, H.: Sorting texts by readability. Computational Linguistics 36(2), 203–227 (2010)

    Article  Google Scholar 

  17. Vogel, M., Washburne, C.: An objective method of determining grade placement of children’s reading material. The Elementary School Journal 28(5), 373–381 (1928)

    Article  Google Scholar 

  18. Xie, W., Peng, Y., Xiao, J.: Semantic graph construction for weakly-supervised image parsing. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Québec City, Québec, Canada, July 27-31, pp. 2853–2859 (2014)

    Google Scholar 

  19. Zeng, X., Wong, D.F., Chao, L.S., Trancoso, I.: Graph-based semi-supervised model for joint chinese word segmentation and part-of-speech tagging. In: Proceeding of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 770–779 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhiwei Jiang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Jiang, Z., Sun, G., Gu, Q., Yu, L., Chen, D. (2015). An Extended Graph-Based Label Propagation Method for Readability Assessment. In: Cheng, R., Cui, B., Zhang, Z., Cai, R., Xu, J. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science(), vol 9313. Springer, Cham. https://doi.org/10.1007/978-3-319-25255-1_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25255-1_40

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25254-4

  • Online ISBN: 978-3-319-25255-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics