Abstract
Stable coordinate pairs (SCPs) such as comentarios y sugerencias ‘comments and suggestions’ or sano y salvo ‘safe and sound’ are rather frequent in Spanish texts, though the language has only a few thousand of them. We characterize SCPs statistically by a numerical Stable Connection Index and reveal its unimodal distribution. We also propose lexical, morphological, syntactic, and semantic categories for the structural description of SCPs, both for a whole SCP and for its components. We argue that a database containing a set of categorized SCPs facilitates several tasks of automatic NLP. The research is based on a set of ca. 2200 Spanish coordinate pairs.
Work done under partial support of the Mexican Government (CONACyT, SNI) and CGEPI-IPN, Mexico.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolshakov, I.A., Galicia-Haro, S.N. (2005). Stable Coordinate Pairs in Spanish: Statistical and Structural Description. In: Sanfeliu, A., Cortés, M.L. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2005. Lecture Notes in Computer Science, vol 3773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11578079_51
DOI: https://doi.org/10.1007/11578079_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29850-2
Online ISBN: 978-3-540-32242-9
eBook Packages: Computer Science, Computer Science (R0)