Skip to main content

Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps

  • Chapter
  • First Online:
Biopharmaceutical Applied Statistics Symposium

Part of the book series: ICSA Book Series in Statistics ((ICSABSS))

  • 626 Accesses

Abstract

OBJECTIVES: Visualization in state sequence data has been developed extensively through an R package described in Gabadinho et al. (2011). Graphics depicting states prevalent at each cross section in time can be generated for all data as well as for covariate level sets for datasets with a large number of subjects with state transitions over time. Special longitudinal sequence sets can also be carved out using similarity measures across sequences. In our work, we believed there may be latent informative images inherent in data on changes in cancer states (degrees of response , progressions, and deaths). We obtain a longitudinal as well as a cross-sectional informative image through a novel heuristic grounded in the framework of hierarchical clustering . METHODS: We used iconic known images, stripped them of all ordering information, and attempted to recover the known latent image underlying the randomly permuted data using our heuristic as well as other alternative methods such as those in Sakai et al. (2014). RESULTS: Results validate our methods. The method is demonstrated through a visual representation of changes in cancer states for two induction therapies in a cancer trial. A further application to a two-way ordering of gene sample heat maps are also presented. CONCLUSIONS: When cancer state transition graphics for competing therapies are juxtaposed, there can be a quick read of early versus late response to therapy, the depth and duration of response as well as a rough gauge of events such as progression and death. This is a good complement to quantitative inferences. For gene expression data, we hope that our methods will bring out finer distinctions in addition to presenting gross patterns in the data like those seen using prevalent methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Anderson, T. W. (1958). Wiley publications in statistics. An introduction to multivariate statistical analysis. Hoboken, NJ, US: Wiley.

    Google Scholar 

  • Bar-Joseph, Z., Gifford, D. K., & Jaakkola, T. S. (2001). Fast optimal leaf ordering for hierarchical clustering. Bioinformatics, 17(Suppl 1), S22–S29.

    Article  Google Scholar 

  • Buchta, C., Hornik, K., & Hahsler, M. (2008). Getting things in order: An introduction to the R package seriation. Journal of Statistical Software 25(3).

    Google Scholar 

  • Elder, G. H., & Kirkpatrick Johnson, M., & Crosnoe, R. (2003). The emergence and development of life course theory. In Handbook of the life course (pp. 3–19). https://doi.org/10.1007/978-0-306-48247-2_1.

  • Gabadinho, A., et al. (2013, October 11). Workshop on sequence analysis. New York.

    Google Scholar 

  • Gabadinho, A., Ritschard, G., Studer, M., & Müller, N. S. (2010). Mining sequence data in R with the TraMineR package: A user’s guide. University of Geneva. http://mephisto.unige.ch/traminer.

  • Gabadinho, A., Ritschard, G., Müller, N. S., & Studer, M. (2011). Analyzing and visualizing state sequences in R with TraMineR. Journal of Statistical Software, 40(4), 1–37. http://www.jstatsoft.org/v40/i04.

  • Gabadinho, A., Studer, M., Müller, N. S., Bürgin, R., & Ritschard, G. (2016). Trajectory Miner (TraMineR): A Toolbox for Exploring and Rendering Sequences. R package version 1.8-13.

    Google Scholar 

  • Giele, J. Z., & Elder, G. H., Jr. (Eds.). (1998). Methods of life course research: Qualitative and quantitative approaches. Sage Publications. ISBN 0 76191437 4.

    Google Scholar 

  • Gruvaeus, G., & Wainer, H. (1972). Two additions to hierarchical cluster analysis. Journal of Mathematical and Statistical Psychology, 25(2), 200–206.

    Article  Google Scholar 

  • Hamming, R. W. (1950). Error detecting and error correcting codes. Bell System Technical Journal, 29, 147–160.

    Article  MathSciNet  Google Scholar 

  • Johnson, R. A., & Wichern, D. W. (1992). Applied multivariate statistical analysis. Englewood Cliffs, N.J: Prentice Hall.

    MATH  Google Scholar 

  • Levenshtein, V. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10, 707–710.

    MathSciNet  Google Scholar 

  • McVicar, D., & Anyadike-Danes, M. (2002). Predicting successful and unsuccessful transitions from school to work using sequence methods. Journal of the Royal Statistical Society A, 165(2), 317–334.

    Article  MathSciNet  Google Scholar 

  • Resource Tepee. http://www.resourcetepee.com/.

  • R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.

  • Sakai, R. (2015). dendsort: Modular Leaf Ordering Methods for Dendrogram Nodes. R package version 0.3.3. http://CRAN.R-project.org/package=dendsort.

  • Sakai, R., Winand, R., & Verbeiren, T. et al. (2014). dendsort: Modular leaf ordering methods for dendrogram representations in R. F1000Research, 3, 177.

    Google Scholar 

  • The Cancer Genome Atlas Research Network. (2014). Comprehensive molecular characterization of gastric adenocarcinoma. Nature.

    Google Scholar 

  • Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Cheshire, Connecticut: Graphics Press.

    Google Scholar 

  • https://rdrr.io/cran/dendsort/f/vignettes/example_figures.Rmd.

Download references

Acknowledgements

The authors would like to thank Arlene Swern and Janice Grecko for their leadership and support of this necessary endeavor to support a tool for the global assessment of response patterns to cancer therapy and for helping in uncovering useful patterns in gene expression data.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shankar S. Srinivasan .

Editor information

Editors and Affiliations

Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method

*R Core Team 2013.

Appendix: R* Code to Generate Ordered Sequences of Rows and Columns Using the Edge Clustering Method

figure g
figure h
figure i

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Srinivasan, S.S., Yue, L.H., Soong, R., He, M., Banerjee, S., Kotey, S. (2018). Some Methods for Longitudinal and Cross-Sectional Visualization with Further Applications in the Context of Heat Maps. In: Peace, K., Chen, DG., Menon, S. (eds) Biopharmaceutical Applied Statistics Symposium . ICSA Book Series in Statistics. Springer, Singapore. https://doi.org/10.1007/978-981-10-7820-0_19

Download citation

Publish with us

Policies and ethics