Abstract
Modeling component interactions is a major challenge in designing component systems. In most cases, the components in such systems interact via the results they produce. This gives rise to two conflicting requirements. On the one hand, the interfaces between the components must be specified exactly. On the other hand, the component interfaces should not be excessively restricted, as this might force the data produced by the components to be converted into the system's data format. Such a conversion is difficult when complex data types (e.g., graphs or matrices) have to be stored, because they often require non-trivial access methods that a general data format does not support.
The approach introduced in this paper tries to overcome this dilemma by meeting both demands: a role system provides a generic mechanism through which text processing components can produce highly specific results. The role concept described in this paper has been adopted by the Tesla (Text Engineering Software Laboratory) framework.
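The idea of a role can be illustrated with a minimal sketch. It is not taken from the Tesla code base: the class names `TokenRole`, `GraphRole`, `Tokenization`, `CooccurrenceGraph` and the function `build_graph` are hypothetical, and Python is used only for illustration. The sketch assumes that a role names a kind of result together with its access methods, so that a consuming component depends on the role rather than on a shared data format.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from dataclasses import dataclass, field


class Role(ABC):
    """Hypothetical base class: a role names a kind of result and how to access it."""


class TokenRole(Role):
    """Role for results that can be read as a token sequence."""

    @abstractmethod
    def tokens(self) -> list[str]: ...


class GraphRole(Role):
    """Role for graph-shaped results, with a graph-specific access method."""

    @abstractmethod
    def neighbours(self, node: str) -> set[str]: ...


@dataclass
class Tokenization(TokenRole):
    """A simple whitespace tokenizer result that plays TokenRole."""

    text: str

    def tokens(self) -> list[str]:
        return self.text.split()


@dataclass
class CooccurrenceGraph(GraphRole):
    """An undirected adjacency structure that plays GraphRole."""

    edges: dict[str, set[str]] = field(default_factory=dict)

    def add_edge(self, a: str, b: str) -> None:
        self.edges.setdefault(a, set()).add(b)
        self.edges.setdefault(b, set()).add(a)

    def neighbours(self, node: str) -> set[str]:
        return self.edges.get(node, set())


def build_graph(source: TokenRole) -> GraphRole:
    """Hypothetical component: consumes any TokenRole result, produces a GraphRole result."""
    if not isinstance(source, TokenRole):
        raise TypeError("expected a result playing TokenRole")
    graph = CooccurrenceGraph()
    toks = source.tokens()
    # Link every pair of adjacent tokens.
    for left, right in zip(toks, toks[1:]):
        graph.add_edge(left, right)
    return graph


if __name__ == "__main__":
    graph = build_graph(Tokenization("the cat saw the dog"))
    print(sorted(graph.neighbours("the")))
```

In this sketch the interface between components is exact (a consumer states which role it needs), while the graph result keeps its own access method instead of being flattened into a general format.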
Acknowledgements
We would like to thank Maryia Fedzechkina and Sonja Subicin for their help.
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hermes, J., Schwiebert, S. (2009). Classification of Text Processing Components: The Tesla Role System. In: Fink, A., Lausen, B., Seidel, W., Ultsch, A. (eds) Advances in Data Analysis, Data Handling and Business Intelligence. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01044-6_26
DOI: https://doi.org/10.1007/978-3-642-01044-6_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01043-9
Online ISBN: 978-3-642-01044-6
eBook Packages: Mathematics and Statistics (R0)