Readability Applied to Information Retrieval
Readability refers to all characteristics of a document that contribute to its ‘ease of understanding or comprehension due to the style of writing’ . The readability of a text is dependent on a number of factors, including but not constrained to; its legibility, syntactic difficulty, semantic difficulty and the organization of the text . As many as 228 variables were found to influence the readability of a text in Gray and Leary’s seminal study . These variables were classified as relating to document content, style, format or, features of organization.
KeywordsLatent Semantic Analysis Information Seeker Syntactic Complexity Textual Coherence Annual International Conference
Unable to display preview. Download preview PDF.
- 1.Klare, G.: The Measurement of Readability, Iowa State University Press (1963)Google Scholar
- 2.Chall, J.: Readability: An Appraisal of Research and Application. Number 34 in Bureau of Educational Research Monographs, The Bureau of Educational Research Ohio State University (1958)Google Scholar
- 3.Entin, E., Klare, G.: Relationships of measures of interest, prior knowledge, and readabilty to comprehension of expository passages. Advances in reading/language research 3, 9–38 (1985)Google Scholar
- 6.Cool, C., Belkin, N., Frieder, O., Kantor, P.: Characteristics of text affecting relevance judgments. In: Proceedings of the 14th Natioal Online Meeting, pp. 74–84. Learned Information, Inc (1993)Google Scholar
- 10.Dale, E., Chall, J.: A formula for predicting readability. Educational Research Bulletin 27, 11–20 (1948)Google Scholar
- 11.Gunning, R.: The Technique of Clear Writing, 2nd edn. McGraw-Hill, New York (1968)Google Scholar
- 12.Fry, E.: Fry’s readability graph: Clarifications, validity, and extension to level 17. Journal of Reading 21(3), 242–252 (1977)Google Scholar
- 14.Dufty, D.F., McNamara, D., Louwerse, M., Cai, Z., Graesser, A.C.: Automatic evaluation of aspects of document quality. In: SIGDOC 2004: Proceedings of the 22nd annual international conference on Design of communication, pp. 14–16. ACM Press, New York (2004)Google Scholar
- 16.Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc, San Francisco (1993)Google Scholar
- 17.Finn, A., Kushmeric, N.: Learning to classify documents according to genre. In: IJCAI 2003 Workshop on Computational Approaches to Style Analysis and Synthesis (2003)Google Scholar
- 18.Talburt, J.: The flesch index: An easily programmable readability analysis algorithm. In: SIGDOC 1985: Proceedings of the 4th annual international conference on Systems documentation, pp. 114–122. ACM Press, New York (1985)Google Scholar