Abstract
Text Mining (TM) is a competitive statistical technology to extract relevant information from huge textual unstructured databases (document warehousing). In this paper, from an immense linguistic archive such as that coming of 10 years of daily “La Repubblica”, we describe several examples on the language productivity and the changes of language in the Nineties, with a particular attention of the use evolution of declining of verb mood, tense and person.
The present research was funded by MIUR 2002 - C26A022374. The paper is a combined job of the two authors: paragraphs 1, 3.2, 4 were written by S. Bolasco, and paragraphs 2, 3.1, 3.3 were written by A. Canzonetti.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BALBI, S., BOLASCO, S., VERDE R. (2002): Text Mining on Elementary Forms in Complex Lexical Structures. In A. Morin and P. Sebillot (eds.) (2002), vol. 1, 89–100.
BOLASCO, S. (2002): Integrazione statistico-linguistica nell'analisi del contenuto. In: B. Mazzara (ed.): Metodi qualitativi in psicologia sociale. Carocci Ed., Roma, 329–342.
BOLASCO, S. and CANZONETTI, A. (2003): Text mining (TM) as key to reading the Nineties' language in the daily “La Repubblica”. SIS Scientific Convention, Napoli.
BOLASCO, S., VERDE, R., BALBI, S. (2002): Outils de Text Mining pour l'analyse de structures lexicales à éléments variables. In: A. Morin e P. Sebillot (eds.) (2002), vol. 1, 197–208.
CHIARI, I. (2002): Ridondanza e linguaggio. Carocci, Roma.
DELLARATTA, F. (2003): Automatic texts' classification on the basis of evaluative dictionary. In AA.VV.: Book of short papers. Cladag2003, CLUEB, Bologna, 155–158.
GAETA, L. (2001): Sviluppi recenti dell'analisi quantitativa della produttivit morfologica. Seminary at Institute of Psycology, CNR, Roma.
GIORDANO, R. and VOGHERA, M. (2002): Verb system and verb usage in spoken and written Italian. In A. Morin e P. Sebillot (eds), vol. 1, 289–300.
Koppel, M., Argamon, S., Shimoni, (2002): Automatic Categorization of Written Text by Author Gender. Literary and Linguistic Computing, vol 17, n 4, 401–412
MORIN, A. and SEBILLOT, P. (eds.) (2002): JADT 2002. St Malo, IRISA-INRIA, 2 voll.
SULLIVAN, D. (2001): Document Warehousing and Text Mining: Techniques for Improving Business Operations, Marketing, and Sales. Wiley, N.Y.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Bolasco, S., Canzonetti, A. (2005). Some Insights into the Evolution of 1990s' Standard Italian Using Text Mining Techniques and Automatic Categorization. In: Bock, HH., et al. New Developments in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27373-5_35
Download citation
DOI: https://doi.org/10.1007/3-540-27373-5_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23809-6
Online ISBN: 978-3-540-27373-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)