Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps

Srinivasan, Shankar S.; Yue, Li Hua; Soong, Rick; He, Mia; Banerjee, Sibabrata; Kotey, Stanley

doi:10.1007/978-981-10-7820-0_19

Shankar S. Srinivasan⁶,
Li Hua Yue⁶,
Rick Soong⁷,
Mia He⁷,
Sibabrata Banerjee⁶ &
…
Stanley Kotey⁶

Part of the book series: ICSA Book Series in Statistics ((ICSABSS))

626 Accesses

Abstract

OBJECTIVES: Visualization in state sequence data has been developed extensively through an R package described in Gabadinho et al. (2011). Graphics depicting states prevalent at each cross section in time can be generated for all data as well as for covariate level sets for datasets with a large number of subjects with state transitions over time. Special longitudinal sequence sets can also be carved out using similarity measures across sequences. In our work, we believed there may be latent informative images inherent in data on changes in cancer states (degrees of response , progressions, and deaths). We obtain a longitudinal as well as a cross-sectional informative image through a novel heuristic grounded in the framework of hierarchical clustering . METHODS: We used iconic known images, stripped them of all ordering information, and attempted to recover the known latent image underlying the randomly permuted data using our heuristic as well as other alternative methods such as those in Sakai et al. (2014). RESULTS: Results validate our methods. The method is demonstrated through a visual representation of changes in cancer states for two induction therapies in a cancer trial. A further application to a two-way ordering of gene sample heat maps are also presented. CONCLUSIONS: When cancer state transition graphics for competing therapies are juxtaposed, there can be a quick read of early versus late response to therapy, the depth and duration of response as well as a rough gauge of events such as progression and death. This is a good complement to quantitative inferences. For gene expression data, we hope that our methods will bring out finer distinctions in addition to presenting gross patterns in the data like those seen using prevalent methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Anderson, T. W. (1958). Wiley publications in statistics. An introduction to multivariate statistical analysis. Hoboken, NJ, US: Wiley.
Google Scholar
Bar-Joseph, Z., Gifford, D. K., & Jaakkola, T. S. (2001). Fast optimal leaf ordering for hierarchical clustering. Bioinformatics, 17(Suppl 1), S22–S29.
Article Google Scholar
Buchta, C., Hornik, K., & Hahsler, M. (2008). Getting things in order: An introduction to the R package seriation. Journal of Statistical Software 25(3).
Google Scholar
Elder, G. H., & Kirkpatrick Johnson, M., & Crosnoe, R. (2003). The emergence and development of life course theory. In Handbook of the life course (pp. 3–19). https://doi.org/10.1007/978-0-306-48247-2_1.
Gabadinho, A., et al. (2013, October 11). Workshop on sequence analysis. New York.
Google Scholar
Gabadinho, A., Ritschard, G., Studer, M., & Müller, N. S. (2010). Mining sequence data in R with the TraMineR package: A user’s guide. University of Geneva. http://mephisto.unige.ch/traminer.
Gabadinho, A., Ritschard, G., Müller, N. S., & Studer, M. (2011). Analyzing and visualizing state sequences in R with TraMineR. Journal of Statistical Software, 40(4), 1–37. http://www.jstatsoft.org/v40/i04.
Gabadinho, A., Studer, M., Müller, N. S., Bürgin, R., & Ritschard, G. (2016). Trajectory Miner (TraMineR): A Toolbox for Exploring and Rendering Sequences. R package version 1.8-13.
Google Scholar
Giele, J. Z., & Elder, G. H., Jr. (Eds.). (1998). Methods of life course research: Qualitative and quantitative approaches. Sage Publications. ISBN 0 76191437 4.
Google Scholar
Gruvaeus, G., & Wainer, H. (1972). Two additions to hierarchical cluster analysis. Journal of Mathematical and Statistical Psychology, 25(2), 200–206.
Article Google Scholar
Hamming, R. W. (1950). Error detecting and error correcting codes. Bell System Technical Journal, 29, 147–160.
Article MathSciNet Google Scholar
Johnson, R. A., & Wichern, D. W. (1992). Applied multivariate statistical analysis. Englewood Cliffs, N.J: Prentice Hall.
MATH Google Scholar
Levenshtein, V. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10, 707–710.
MathSciNet Google Scholar
McVicar, D., & Anyadike-Danes, M. (2002). Predicting successful and unsuccessful transitions from school to work using sequence methods. Journal of the Royal Statistical Society A, 165(2), 317–334.
Article MathSciNet Google Scholar
Resource Tepee. http://www.resourcetepee.com/.
R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
Sakai, R. (2015). dendsort: Modular Leaf Ordering Methods for Dendrogram Nodes. R package version 0.3.3. http://CRAN.R-project.org/package=dendsort.
Sakai, R., Winand, R., & Verbeiren, T. et al. (2014). dendsort: Modular leaf ordering methods for dendrogram representations in R. F1000Research, 3, 177.
Google Scholar
The Cancer Genome Atlas Research Network. (2014). Comprehensive molecular characterization of gastric adenocarcinoma. Nature.
Google Scholar
Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Cheshire, Connecticut: Graphics Press.
Google Scholar
https://rdrr.io/cran/dendsort/f/vignettes/example_figures.Rmd.

Download references

Acknowledgements

The authors would like to thank Arlene Swern and Janice Grecko for their leadership and support of this necessary endeavor to support a tool for the global assessment of response patterns to cancer therapy and for helping in uncovering useful patterns in gene expression data.

Author information

Authors and Affiliations

Department of Biostatistics, Celgene Corporation, Summit, NJ, USA
Shankar S. Srinivasan, Li Hua Yue, Sibabrata Banerjee & Stanley Kotey
Department of Statistical Programming, Celgene Corporation, Summit, NJ, USA
Rick Soong & Mia He

Authors

Shankar S. Srinivasan
View author publications
You can also search for this author in PubMed Google Scholar
Li Hua Yue
View author publications
You can also search for this author in PubMed Google Scholar
Rick Soong
View author publications
You can also search for this author in PubMed Google Scholar
Mia He
View author publications
You can also search for this author in PubMed Google Scholar
Sibabrata Banerjee
View author publications
You can also search for this author in PubMed Google Scholar
Stanley Kotey
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shankar S. Srinivasan .

Editor information

Editors and Affiliations

Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, GA, USA
Karl E. Peace
School of Social Work and Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
Ding-Geng Chen
Boston University, Cambridge, MA, USA
Sandeep Menon

Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method

*R Core Team 2013.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Srinivasan, S.S., Yue, L.H., Soong, R., He, M., Banerjee, S., Kotey, S. (2018). Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps. In: Peace, K., Chen, DG., Menon, S. (eds) Biopharmaceutical Applied Statistics Symposium . ICSA Book Series in Statistics. Springer, Singapore. https://doi.org/10.1007/978-981-10-7820-0_19

Download citation

DOI: https://doi.org/10.1007/978-981-10-7820-0_19
Published: 01 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7819-4
Online ISBN: 978-981-10-7820-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps

Abstract

Access this chapter

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method

Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation