Skip to main content

Informality Judgment at Sentence Level and Experiments with Formality Score

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNTCS,volume 6609)

Abstract

Formality and its converse, informality, are important dimensions of authorial style that serve to determine the social background a particular document is coming from, and the potential audience it is targeted to. In this paper we explored the concept of formality at the sentence level from two different perspectives. One was the Formality Score (F-score) and its distribution across different datasets, how they compared with each other and how F-score could be linked to human-annotated sentences. The other was to measure the inherent agreement between two independent judges on a sentence annotation task. It gave us an idea how subjective the concept of formality was at the sentence level. Finally, we looked into the related issue of document readability and measured its correlation with document formality.

Keywords

  • News Article
  • Readability Test
  • Sentence Level
  • Academic Paper
  • Formality Score

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-642-19437-5_37
  • Chapter length: 12 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   84.99
Price excludes VAT (USA)
  • ISBN: 978-3-642-19437-5
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   109.99
Price excludes VAT (USA)

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Biber, D.: Variation Across Speech and Writing. Cambridge University Press, Cambridge (1988)

    CrossRef  Google Scholar 

  2. Brants, T., Skut, W., Uszkoreit, H.: Syntactic annotation of a German newspaper corpus. In: Proceedings of the ATALA Treebank Workshop, Paris, France, pp. 69–76 (1999)

    Google Scholar 

  3. Brooke, J., Wang, T., Hirst, G.: Automatic acquisition of lexical formality. In: Proceedings of the 23rd International Conference on Computational Linguistics (COLING) (2010)

    Google Scholar 

  4. Chambers, J.K., Schilling-Estes, N., Trudgill, P.: The handbook of language variation and change. Blackwell, Malden (2006)

    Google Scholar 

  5. Halliday, M.: Comparison and translation. In: Halliday, M., McIntosh, M., Strevens, P. (eds.) The linguistic sciences and language teaching. Longman, Harlow (1964)

    Google Scholar 

  6. Herring, S.C., Scheidt, L.A., Wright, E., Bonus, S.: Weblogs as a bridging genre. IT & People 18(2), 142–171 (2005)

    CrossRef  Google Scholar 

  7. Heylighen, F., Marc Dewaele, J.: Formality of language: definition, measurement and behavioral determinants. Tech. rep. (1999)

    Google Scholar 

  8. Hudson, R.: About 37% of word-tokens are nouns. Language 70(2), 331–339 (1994)

    CrossRef  Google Scholar 

  9. Karlgren, J.: Stylistic experiments for information retrieval (2000)

    Google Scholar 

  10. Leckie-Tarry, H., Birch, D.: Language and context: a functional linguistic theory of register. In: Birch, D. (ed.) Pinter Publishers, London (1995)

    Google Scholar 

  11. Levelt, W.J.M.: Speaking: From Intention to Articulation. MIT Press, Cambridge (1989)

    Google Scholar 

  12. Likert, R.: A technique for the measurement of attitudes. Archives of Psychology 22(140), 1–55 (1932)

    Google Scholar 

  13. Mao, Y., Lebanon, G.: Isotonic Conditional Random Fields and Local Sentiment Flow. In: Advances in Neural Information Processing Systems (2007)

    Google Scholar 

  14. McLaughlin, H.G.: SMOG grading - a new readability formula. Journal of Reading, 639–646 (May 1969)

    Google Scholar 

  15. Nowson, S., Oberlander, J., Gill, A.J.: Weblogs, genres and individual differences. In: Proceedings of the 27th Annual Conference of the Cognitive Science Society, pp. 1666–1671 (2005)

    Google Scholar 

  16. Phan, X.H.: CRFTagger: CRF English POS Tagger (2006), http://crftagger.sourceforge.net/

  17. Reid, T.B.: Linguistics, structuralism, philology. Archivum Linguisticum 8

    Google Scholar 

  18. Ure, J.N.: Lexical density and register differentiation. In: Perren, G.E., Trim, J.L.M. (eds.) Applications of Linguistics: Selected Papers of the 2nd International Congress of Linguistics, Cambridge 1969. Cambridge University Press, Cambridge (1971)

    Google Scholar 

  19. Zampolli, A.: Statistique linguistique et dépouillements automatiques. Lexicologie, 325–358

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lahiri, S., Mitra, P., Lu, X. (2011). Informality Judgment at Sentence Level and Experiments with Formality Score. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-19437-5_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19436-8

  • Online ISBN: 978-3-642-19437-5

  • eBook Packages: Computer ScienceComputer Science (R0)