Readability Applied to Information Retrieval

  • Lorna Kane
  • Joe Carthy
  • John Dunnion
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3936)


Readability refers to all characteristics of a document that contribute to its ‘ease of understanding or comprehension due to the style of writing’ [1]. The readability of a text is dependent on a number of factors, including but not constrained to; its legibility, syntactic difficulty, semantic difficulty and the organization of the text [2]. As many as 228 variables were found to influence the readability of a text in Gray and Leary’s seminal study [2]. These variables were classified as relating to document content, style, format or, features of organization.


Latent Semantic Analysis Information Seeker Syntactic Complexity Textual Coherence Annual International Conference 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Klare, G.: The Measurement of Readability, Iowa State University Press (1963)Google Scholar
  2. 2.
    Chall, J.: Readability: An Appraisal of Research and Application. Number 34 in Bureau of Educational Research Monographs, The Bureau of Educational Research Ohio State University (1958)Google Scholar
  3. 3.
    Entin, E., Klare, G.: Relationships of measures of interest, prior knowledge, and readabilty to comprehension of expository passages. Advances in reading/language research 3, 9–38 (1985)Google Scholar
  4. 4.
    Schiefele, U.: Topic interest and free recall of expository text. Learning and Individual Differences 8(2), 141–160 (1996)CrossRefGoogle Scholar
  5. 5.
    Borlund, P.: The concept of relevance in ir. JASIST 54(10), 913–925 (2003)CrossRefGoogle Scholar
  6. 6.
    Cool, C., Belkin, N., Frieder, O., Kantor, P.: Characteristics of text affecting relevance judgments. In: Proceedings of the 14th Natioal Online Meeting, pp. 74–84. Learned Information, Inc (1993)Google Scholar
  7. 7.
    Barry, C.L., Schamber, L.: Users’ criteria for relevance evaluation: a crosssituational comparison. Information Processing Management 34(2-3), 219–236 (1998)CrossRefGoogle Scholar
  8. 8.
    Kintsch, W.: Learning from text. American Psychologist 49, 294–303 (1994)CrossRefGoogle Scholar
  9. 9.
    Flesch, R.: A new readability yardstick. Journal of Applied Psychology 32, 221–233 (1948)CrossRefGoogle Scholar
  10. 10.
    Dale, E., Chall, J.: A formula for predicting readability. Educational Research Bulletin 27, 11–20 (1948)Google Scholar
  11. 11.
    Gunning, R.: The Technique of Clear Writing, 2nd edn. McGraw-Hill, New York (1968)Google Scholar
  12. 12.
    Fry, E.: Fry’s readability graph: Clarifications, validity, and extension to level 17. Journal of Reading 21(3), 242–252 (1977)Google Scholar
  13. 13.
    Wolfe, M., Schreiner, M., Rehder, B., Laham, D., Kinstch, W., Landauer, T.: Learning from text: Matching readers and texts by latent semantic analysis. Discourse Processes 25, 309–336 (1998)CrossRefGoogle Scholar
  14. 14.
    Dufty, D.F., McNamara, D., Louwerse, M., Cai, Z., Graesser, A.C.: Automatic evaluation of aspects of document quality. In: SIGDOC 2004: Proceedings of the 22nd annual international conference on Design of communication, pp. 14–16. ACM Press, New York (2004)Google Scholar
  15. 15.
    Collins-Thompson, K., Callan, J.: Predicting reading difficulty with statistical language models: Research articles. Journal of the American Society for Information Science and Technology 56(13), 1448–1462 (2005)CrossRefGoogle Scholar
  16. 16.
    Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc, San Francisco (1993)Google Scholar
  17. 17.
    Finn, A., Kushmeric, N.: Learning to classify documents according to genre. In: IJCAI 2003 Workshop on Computational Approaches to Style Analysis and Synthesis (2003)Google Scholar
  18. 18.
    Talburt, J.: The flesch index: An easily programmable readability analysis algorithm. In: SIGDOC 1985: Proceedings of the 4th annual international conference on Systems documentation, pp. 114–122. ACM Press, New York (1985)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Lorna Kane
    • 1
  • Joe Carthy
    • 1
  • John Dunnion
    • 1
  1. 1.Intelligent Information Retrieval Group, School of Computer Science and InformaticsUniversity College DublinBelfield, Dublin 4Ireland

Personalised recommendations