Abstract
The paper aims at proposing a method allowing to take into account information on context, when we are analysing lexical tables by means of factorial techniques, as Correspondence Analysis. Such a kind of information, external to the main data structure, can concern where and how words are used, but, moreover, can concern their (syntactical, grammatical, etc.) role inside the corpus. Here a methodological tool has been proposed: a technique based on projections on subspaces spanned by two sets of variables related to fragments and words. The matrix to be analysed, called inter-reference matrix, measures the importance of the association between the external information on words and fragments. The final outputs are graphical representations that enrich the results of textual data analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Balbi, S. (1995), Non symmetrical correspondence analysis of textual data and confidence regions for graphical forms, in S. Bolasco, L. Lebart, A. Salem (eds.), JADT’95, vol. 2, Roma, CISU, 5–12.
Becue M., Peiro R. (1993), Les quasi-segments pour une classification automatique des reponses ouvertes, in S. J. Anastex (ed.), JADT’93, Paris, Telecom, 310–325.
Bolasco S. (1993), Choix de lemmatisation en vue de reconshuctions syntagmatiques du texte par l’analyse des correspondence, in S. J. Anastex (ed.), JADT‘93, Paris, Telecom, 399–414.
Gabriel K. R. (1971), The biplot graphical display of matrices with application to principal component analysis, Biometrika, 58, 453–467.
Gabriel K. R. (1981), Biplot display of multivariate matrices for inspection of data and diagnosis, in V. Barnett (ed.), Interpreting Multivariate Data, Chichester, Wiley, 147–174.
Giordano G., Scepi G. (1998), La progettazione della qualità attraverso l’analisi di strutture informative differenti, Atti della XXXIX Riunione Scientifica SIS, vol. II ( CD-ROM ), Sorrento, 707–714.
Greenacre, M. (1984), Theory and Applications of Correspondence Analysis, London, Academic Press.
Lauro N.C., D’Ambra L. (1984), L’analyse non symétrique des correspondences, in Data Analysis and Informatics, III (E. Diday et al. eds.), Amsterdam, NorthHolland, 433–446.
Lebart L. (1981), Vers l’analyse automatique des textes: le traitement des réponses libres aux questions ouvertes d’une enquête, in: J. P. Benzécri & collaborateurs, Pratique de l’Analyse des Données. Linguistique & Lexicologie, Paris, Dunod, 414–419.
Lebart L., Salem A. (1994), Statistique Textuelle, Paris, Dunod.
Lebart, L., Salem A., Berry, L. (1998), Exploring Textual Data, Dordrecht, Kluwer Academic Publishers.
Salem A. (1984), La typologie des segmentes répétés dens un corpus, fondée sur l’analyse d’un tableau croisant mots et textes, Les Cahiers de l’Analyse des données, 9, 4, 489–500.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Balbi, S., Giordano, G. (2001). A Factorial Technique for Analysing Textual Data with External Information. In: Borra, S., Rocci, R., Vichi, M., Schader, M. (eds) Advances in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59471-7_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-59471-7_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41488-9
Online ISBN: 978-3-642-59471-7
eBook Packages: Springer Book Archive