Abstract
OBJECTIVES: Visualization in state sequence data has been developed extensively through an R package described in Gabadinho et al. (2011). Graphics depicting states prevalent at each cross section in time can be generated for all data as well as for covariate level sets for datasets with a large number of subjects with state transitions over time. Special longitudinal sequence sets can also be carved out using similarity measures across sequences. In our work, we believed there may be latent informative images inherent in data on changes in cancer states (degrees of response , progressions, and deaths). We obtain a longitudinal as well as a cross-sectional informative image through a novel heuristic grounded in the framework of hierarchical clustering . METHODS: We used iconic known images, stripped them of all ordering information, and attempted to recover the known latent image underlying the randomly permuted data using our heuristic as well as other alternative methods such as those in Sakai et al. (2014). RESULTS: Results validate our methods. The method is demonstrated through a visual representation of changes in cancer states for two induction therapies in a cancer trial. A further application to a two-way ordering of gene sample heat maps are also presented. CONCLUSIONS: When cancer state transition graphics for competing therapies are juxtaposed, there can be a quick read of early versus late response to therapy, the depth and duration of response as well as a rough gauge of events such as progression and death. This is a good complement to quantitative inferences. For gene expression data, we hope that our methods will bring out finer distinctions in addition to presenting gross patterns in the data like those seen using prevalent methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Anderson, T. W. (1958). Wiley publications in statistics. An introduction to multivariate statistical analysis. Hoboken, NJ, US: Wiley.
Bar-Joseph, Z., Gifford, D. K., & Jaakkola, T. S. (2001). Fast optimal leaf ordering for hierarchical clustering. Bioinformatics, 17(Suppl 1), S22–S29.
Buchta, C., Hornik, K., & Hahsler, M. (2008). Getting things in order: An introduction to the R package seriation. Journal of Statistical Software 25(3).
Elder, G. H., & Kirkpatrick Johnson, M., & Crosnoe, R. (2003). The emergence and development of life course theory. In Handbook of the life course (pp. 3–19). https://doi.org/10.1007/978-0-306-48247-2_1.
Gabadinho, A., et al. (2013, October 11). Workshop on sequence analysis. New York.
Gabadinho, A., Ritschard, G., Studer, M., & Müller, N. S. (2010). Mining sequence data in R with the TraMineR package: A user’s guide. University of Geneva. http://mephisto.unige.ch/traminer.
Gabadinho, A., Ritschard, G., Müller, N. S., & Studer, M. (2011). Analyzing and visualizing state sequences in R with TraMineR. Journal of Statistical Software, 40(4), 1–37. http://www.jstatsoft.org/v40/i04.
Gabadinho, A., Studer, M., Müller, N. S., Bürgin, R., & Ritschard, G. (2016). Trajectory Miner (TraMineR): A Toolbox for Exploring and Rendering Sequences. R package version 1.8-13.
Giele, J. Z., & Elder, G. H., Jr. (Eds.). (1998). Methods of life course research: Qualitative and quantitative approaches. Sage Publications. ISBN 0 76191437 4.
Gruvaeus, G., & Wainer, H. (1972). Two additions to hierarchical cluster analysis. Journal of Mathematical and Statistical Psychology, 25(2), 200–206.
Hamming, R. W. (1950). Error detecting and error correcting codes. Bell System Technical Journal, 29, 147–160.
Johnson, R. A., & Wichern, D. W. (1992). Applied multivariate statistical analysis. Englewood Cliffs, N.J: Prentice Hall.
Levenshtein, V. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10, 707–710.
McVicar, D., & Anyadike-Danes, M. (2002). Predicting successful and unsuccessful transitions from school to work using sequence methods. Journal of the Royal Statistical Society A, 165(2), 317–334.
Resource Tepee. http://www.resourcetepee.com/.
R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
Sakai, R. (2015). dendsort: Modular Leaf Ordering Methods for Dendrogram Nodes. R package version 0.3.3. http://CRAN.R-project.org/package=dendsort.
Sakai, R., Winand, R., & Verbeiren, T. et al. (2014). dendsort: Modular leaf ordering methods for dendrogram representations in R. F1000Research, 3, 177.
The Cancer Genome Atlas Research Network. (2014). Comprehensive molecular characterization of gastric adenocarcinoma. Nature.
Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Cheshire, Connecticut: Graphics Press.
https://rdrr.io/cran/dendsort/f/vignettes/example_figures.Rmd.
Acknowledgements
The authors would like to thank Arlene Swern and Janice Grecko for their leadership and support of this necessary endeavor to support a tool for the global assessment of response patterns to cancer therapy and for helping in uncovering useful patterns in gene expression data.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method
*R Core Team 2013.
Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Srinivasan, S.S., Yue, L.H., Soong, R., He, M., Banerjee, S., Kotey, S. (2018). Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps. In: Peace, K., Chen, DG., Menon, S. (eds) Biopharmaceutical Applied Statistics Symposium . ICSA Book Series in Statistics. Springer, Singapore. https://doi.org/10.1007/978-981-10-7820-0_19
Download citation
DOI: https://doi.org/10.1007/978-981-10-7820-0_19
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7819-4
Online ISBN: 978-981-10-7820-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)