Converting between Various Sequence Representations

Ritschard, Gilbert; Gabadinho, Alexis; Studer, Matthias; Müller, Nicolas S.

doi:10.1007/978-3-642-02190-9_8

Gilbert Ritschard⁴,
Alexis Gabadinho⁴,
Matthias Studer⁴ &
…
Nicolas S. Müller⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 223))

663 Accesses
3 Citations
3 Altmetric

Abstract

This chapter is concerned with the organization of categorical sequence data. We first build a typology of sequences distinguishing for example between chronological sequences and sequences without time content. This permits to identify the kind of information that the data organization should preserve. Focusing then mainly on chronological sequences, we discuss the advantages and limits of different ways of representing time stamped event and state sequence data and present solutions for automatically converting between various formats, e.g., between horizontal and vertical presentations but also from state sequences into event sequences and reciprocally. Special attention is also drawn to the handling of missing values in these conversion processes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Time in Data Models

Temporal Data Management – An Overview

Sequences and Operations with Sequences

References

Aassve, A., Billari, F., Piccarreta, R.: Strings of adulthood: A sequence analysis of young British women’s work-family trajectories. European Journal of Population 23(3), 369–388 (2007)
Article Google Scholar
Blossfeld, H.P., Golsch, K., Rohwer, G.: Event History Analysis with Stata. Lawrence Erlbaum, Mahwah (2007)
Google Scholar
Brock, G.N., Shaffer, J.R., Blakesley, R.E., Lotz, M.J., Tseng, G.C.: Which missing value imputation method to use in expression profiles: A comparative study and two selection schemes. BMC Bioinformatics 9, 12 (2008)
Article Google Scholar
Gabadinho, A., Ritschard, G., Studer, M., Müller, N.S.: Mining sequence data in R with TraMineR: A user’s guide for version 1.1. Technical report, Department of Econometrics and Laboratory of Demography, University of Geneva, Geneva (2009), http://mephisto.unige.ch/traminer
Gauthier, J.A., Widmer, E.D., Bucher, P., Notredame, C.: Multichannel sequence analysis applied to social science data, University of Lausanne (2007) (manuscript) (under review)
Google Scholar
Hobbs, J.R., Pan, F.: An ontology of time for the semantic web. ACM Transactions on Asian Language Information Processing 3(1), 66–85 (2004)
Article Google Scholar
Karweit, N., Kertzer, D.: Data organization and conceptualization. In: Giele, J.Z., Elder, G.H. (eds.) Methods of Life Course Research: Qualitative and Quantitative Approaches, pp. 81–97. Sage, Thousand Oaks (1998)
Google Scholar
Little, R.J.A.: Modeling the drop-out mechanism in repeated-measures studies. Journal of the American Statistical Association 90(431), 1112–1121 (1995), http://www.jstor.org/stable/2291350
Article MATH MathSciNet Google Scholar
Ritschard, G., Oris, M.: Life course data in demography and social sciences: Statistical and data mining approaches. In: Levy, R., Ghisletta, P., Le Goff, J.M., Spini, D., Widmer, E. (eds.) Towards an Interdisciplinary Perspective on the Life Course, Advances in Life Course Research, vol. 10, pp. 289–320. Elsevier, Amsterdam (2005)
Google Scholar
Yamaguchi, K.: Event history analysis. In: ASRM 28. Sage, Newbury Park (1991)
Google Scholar
Zaki, M.J.: SPADE: An efficient algorithm for mining frequent sequences. Machine Learning 42(1/2), 31–60 (2001)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Econometrics and Laboratory of Demography, University of Geneva, Switzerland
Gilbert Ritschard, Alexis Gabadinho, Matthias Studer & Nicolas S. Müller

Authors

Gilbert Ritschard
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Gabadinho
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Studer
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas S. Müller
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

College of Computing and Informatics, University of North Carolina at Charlotte, 28223, Charlotte, N.C., USA
Zbigniew W. Ras
Wydzial Informatyki, Politechnika Bialostocka, ul.Wiejska 45a, 15-351, Bialystok, Poland
Agnieszka Dardzinska

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ritschard, G., Gabadinho, A., Studer, M., Müller, N.S. (2009). Converting between Various Sequence Representations. In: Ras, Z.W., Dardzinska, A. (eds) Advances in Data Management. Studies in Computational Intelligence, vol 223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02190-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-02190-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02189-3
Online ISBN: 978-3-642-02190-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Converting between Various Sequence Representations

Abstract

Access this chapter

Preview

Similar content being viewed by others

Time in Data Models

Temporal Data Management – An Overview

Sequences and Operations with Sequences

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Converting between Various Sequence Representations

Abstract

Access this chapter

Preview

Similar content being viewed by others

Time in Data Models

Temporal Data Management – An Overview

Sequences and Operations with Sequences

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation