Skip to main content

A Factorial Technique for Analysing Textual Data with External Information

  • Conference paper
Advances in Classification and Data Analysis

Abstract

The paper aims at proposing a method allowing to take into account information on context, when we are analysing lexical tables by means of factorial techniques, as Correspondence Analysis. Such a kind of information, external to the main data structure, can concern where and how words are used, but, moreover, can concern their (syntactical, grammatical, etc.) role inside the corpus. Here a methodological tool has been proposed: a technique based on projections on subspaces spanned by two sets of variables related to fragments and words. The matrix to be analysed, called inter-reference matrix, measures the importance of the association between the external information on words and fragments. The final outputs are graphical representations that enrich the results of textual data analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Balbi, S. (1995), Non symmetrical correspondence analysis of textual data and confidence regions for graphical forms, in S. Bolasco, L. Lebart, A. Salem (eds.), JADT’95, vol. 2, Roma, CISU, 5–12.

    Google Scholar 

  • Becue M., Peiro R. (1993), Les quasi-segments pour une classification automatique des reponses ouvertes, in S. J. Anastex (ed.), JADT’93, Paris, Telecom, 310–325.

    Google Scholar 

  • Bolasco S. (1993), Choix de lemmatisation en vue de reconshuctions syntagmatiques du texte par l’analyse des correspondence, in S. J. Anastex (ed.), JADT‘93, Paris, Telecom, 399–414.

    Google Scholar 

  • Gabriel K. R. (1971), The biplot graphical display of matrices with application to principal component analysis, Biometrika, 58, 453–467.

    Article  Google Scholar 

  • Gabriel K. R. (1981), Biplot display of multivariate matrices for inspection of data and diagnosis, in V. Barnett (ed.), Interpreting Multivariate Data, Chichester, Wiley, 147–174.

    Google Scholar 

  • Giordano G., Scepi G. (1998), La progettazione della qualità attraverso l’analisi di strutture informative differenti, Atti della XXXIX Riunione Scientifica SIS, vol. II ( CD-ROM ), Sorrento, 707–714.

    Google Scholar 

  • Greenacre, M. (1984), Theory and Applications of Correspondence Analysis, London, Academic Press.

    Google Scholar 

  • Lauro N.C., D’Ambra L. (1984), L’analyse non symétrique des correspondences, in Data Analysis and Informatics, III (E. Diday et al. eds.), Amsterdam, NorthHolland, 433–446.

    Google Scholar 

  • Lebart L. (1981), Vers l’analyse automatique des textes: le traitement des réponses libres aux questions ouvertes d’une enquête, in: J. P. Benzécri & collaborateurs, Pratique de l’Analyse des Données. Linguistique & Lexicologie, Paris, Dunod, 414–419.

    Google Scholar 

  • Lebart L., Salem A. (1994), Statistique Textuelle, Paris, Dunod.

    Google Scholar 

  • Lebart, L., Salem A., Berry, L. (1998), Exploring Textual Data, Dordrecht, Kluwer Academic Publishers.

    Google Scholar 

  • Salem A. (1984), La typologie des segmentes répétés dens un corpus, fondée sur l’analyse d’un tableau croisant mots et textes, Les Cahiers de l’Analyse des données, 9, 4, 489–500.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Balbi, S., Giordano, G. (2001). A Factorial Technique for Analysing Textual Data with External Information. In: Borra, S., Rocci, R., Vichi, M., Schader, M. (eds) Advances in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59471-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-59471-7_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41488-9

  • Online ISBN: 978-3-642-59471-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics