Skip to main content

Inferring Metabolic States from Single Cell Transcriptomic Data via Geometric Deep Learning

  • Conference paper
  • First Online:
Research in Computational Molecular Biology (RECOMB 2024)

Abstract

The ability to measure gene expression at single-cell resolution has elevated our understanding of how biological features emerge from complex and interdependent networks at molecular, cellular, and tissue scales. As technologies have evolved that complement scRNAseq measurements with things like single-cell proteomic, epigenomic, and genomic information, it becomes increasingly apparent how much biology exists as a product of multimodal regulation. Biological processes such as transcription, translation, and post-translational or epigenetic modification impose both energetic and specific molecular demands on a cell and are therefore implicitly constrained by the metabolic state of the cell. While metabolomics is crucial for defining a holistic model of any biological process, the chemical heterogeneity of the metabolome makes it particularly difficult to measure, and technologies capable of doing this at single-cell resolution are far behind other multiomics modalities. To address these challenges, we present GEFMAP (Gene Expression-based Flux Mapping and Metabolic Pathway Prediction), a method based on geometric deep learning for predicting flux through reactions in a global metabolic network using transcriptomics data, which we ultimately apply to scRNAseq. GEFMAP leverages the natural graph structure of metabolic networks to learn both a biological objective for each cell and estimate a mass-balanced relative flux rate for each reaction in each cell using novel deep learning models.

H. R. Steach and S. Viswanath—Co-lead author.

M. Perlmutter and S. Krishnaswamy—Co-senior author.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Brunk, E., et al.: Recon3D enables a three-dimensional view of gene variation in human metabolism. Nat. Biotechnol. 36(3), 272–281 (2018)

    Google Scholar 

  2. Coifman, R.R., Maggioni, M.: Diffusion wavelets. Appl. Comput. Harmon. Anal. 21(1), 53–94 (2006)

    Article  MathSciNet  Google Scholar 

  3. Cucuringu, M., Li, H., Sun, H., Zanetti, L.: Hermitian matrices for clustering directed graphs: insights and applications. In: International Conference on Artificial Intelligence and Statistics, pp. 983–992. PMLR (2020)

    Google Scholar 

  4. Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: Advances in Neural Information Processing Systems, vol. 29, pp. 3844–3852 (2016)

    Google Scholar 

  5. Fabregat, A., et al.: The reactome pathway knowledgebase. Nucleic Acids Res. 46(D1), D649–D655 (2017). https://doi.org/10.1093/nar/gkx1132

  6. Feist, A.M., Palsson, B.O.: The biomass objective function. Curr. Opin. Microbiol. 13(3), 344–349 (2010). https://doi.org/10.1016/j.mib.2010.03.003

  7. Gama, F., Ribeiro, A., Bruna, J.: Diffusion scattering transforms on graphs. In: 7th International Conference on Learning Representations, ICLR 2019 (2019)

    Google Scholar 

  8. Gao, F., Wolf, G., Hirn, M.: Geometric scattering for graph data analysis. In: International Conference on Machine Learning, pp. 2122–2131. PMLR (2019)

    Google Scholar 

  9. García-Rodríguez, D., Giménez-Cassina, A.: Ketone bodies in the brain beyond fuel metabolism: from excitability to gene expression and cell signaling. Front. Mol. Neurosci. 14, 732120 (2021)

    Article  Google Scholar 

  10. Heirendt, L., et al.: Creation and analysis of biochemical constraint-based models using the COBRA toolbox v.3.0. Nat. Protoc. 14(3), 639–702 (2019). https://doi.org/10.1038/s41596-018-0098-2

  11. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, 24–26 April 2017, Conference Track Proceedings (2017)

    Google Scholar 

  12. Krishnaswamy, S., et al.: Conditional density-based analysis of t cell signaling in single-cell data. Science 346(6213) (2014). https://doi.org/10.1126/science.1250689

  13. Kyoto Encyclopedia of Genes and Genomes (KEGG): Kyoto encyclopedia of genes and genomes. https://www.genome.jp/kegg/

  14. Levie, R., Monti, F., Bresson, X., Bronstein, M.M.: Cayleynets: graph convolutional neural networks with complex rational spectral filters. IEEE Trans. Signal Process. 67(1), 97–109 (2019). https://doi.org/10.1109/TSP.2018.2879624

    Article  MathSciNet  Google Scholar 

  15. Li, F., Chen, Y., Anton, M., Nielsen, J.: Gotenzymes: an extensive database of enzyme parameter predictions. Nucleic Acids Res. 51(D1), D583–D586 (2022). https://doi.org/10.1093/nar/gkac831

  16. Lieb, E.H., Loss, M.: Fluxes, Laplacians, and Kasteleyn’s theorem. In: Nachtergaele, B., Solovej, J.P., Yngvason, J. (eds.) Statistical Mechanics, pp. 457–483. Springer, Heidelberg (1993). https://doi.org/10.1007/978-3-662-10018-9_28

    Chapter  Google Scholar 

  17. Maffezzini, C., Calvo-Garrido, J., Wredenberg, A., Freyer, C.: Metabolic regulation of neurodifferentiation in the adult brain. Cell. Mol. Life Sci. 77, 2483–2496 (2020)

    Article  Google Scholar 

  18. Mattson, M.P., Moehl, K., Ghena, N., Schmaedick, M., Cheng, A.: Intermittent metabolic switching, neuroplasticity and brain health. Nat. Rev. Neurosci. 19(2), 81–94 (2018). https://doi.org/10.1038/nrn.2017.156

  19. MetaboAnalyst: Metaboanalyst: A comprehensive tool suite for metabolomic data analysis. https://www.metaboanalyst.ca/

  20. Min, Y., Wenkel, F., Perlmutter, M., Wolf, G.: Can hybrid geometric scattering networks help solve the maximum clique problem? In: NeurIPS (2022). http://papers.nips.cc/paper_files/paper/2022/hash/8ec88961d36d9a87ac24baf45402744f-Abstract-Conference.html

  21. Mohar, B.: A new kind of Hermitian matrices for digraphs. Linear Algebra Appl. 584, 343–352 (2020)

    Article  MathSciNet  Google Scholar 

  22. Moon, K.R., et al.: Visualizing structure and transitions in high-dimensional biological data. Nat. Biotechnol. 37(12), 1482–1492 (2019). https://doi.org/10.1038/s41587-019-0336-3

  23. Orth, J.D., Fleming, R.M.T., Palsson, B.Ø.: Reconstruction and use of microbial metabolic networks: the core Escherichia coli metabolic model as an educational guide. EcoSal Plus 4(1) (2010). https://doi.org/10.1128/ecosalplus.10.2.1

  24. Orth, J.D., Thiele, I., Palsson, B.Ø.: What is flux balance analysis? Nat. Biotechnol. 28(3), 245–248 (2010). https://doi.org/10.1038/nbt.1614

    Article  Google Scholar 

  25. Palsson, B.O.: Systems Biology: Properties of Reconstructed Networks. Cambridge University Press, New York (2006)

    Book  Google Scholar 

  26. Sahu, A., Blätke, M.A., Szymański, J.J., Töpfer, N.: Advances in flux balance analysis by integrating machine learning and mechanism-based models. Comput. Struct. Biotechnol. J. 19, 4626–4640 (2021). https://doi.org/10.1016/j.csbj.2021.08.004

  27. Sastry, A.V., et al.: The Escherichia coli transcriptome mostly consists of independently regulated modules. Nat. Commun. 10(1) (2019). https://doi.org/10.1038/s41467-019-13483-w

  28. Schuetz, R., Zamboni, N., Zampieri, M., Heinemann, M., Sauer, U.: Multidimensional optimality of microbial metabolism. Science 336(6081), 601–604 (2012). https://doi.org/10.1126/science.1216882

  29. Subramanian, A., et al.: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102(43), 15545–15550 (2005). https://doi.org/10.1073/pnas.0506580102

    Article  Google Scholar 

  30. Wagner, A., et al.: Metabolic modeling of single TH17 cells reveals regulators of autoimmunity. Cell 184(16), 4168–4185.e21 (2021). https://doi.org/10.1016/j.cell.2021.05.045

  31. Xu, K., Hu, W., Leskovec, J., Jegelka, S.: How powerful are graph neural networks? In: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, 6–9 May 2019 (2019)

    Google Scholar 

  32. Yudkoff, M., et al.: Response of brain amino acid metabolism to ketosis. Neurochem. Int. 47(1–2), 119–128 (2005)

    Article  Google Scholar 

  33. Zhang, X., He, Y., Brugnone, N., Perlmutter, M., Hirn, M.: Magnet: a neural network for directed graphs. Adv. Neural. Inf. Process. Syst. 34, 27003–27015 (2021)

    Google Scholar 

  34. Zou, D., Lerman, G.: Graph convolutional neural networks via scattering. Appl. Comput. Harmon. Anal. 49(3), 1046–1074 (2020). https://doi.org/10.1016/j.acha.2019.06.003

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

Yixuan He is supported by a Clarendon scholarship from the University of Oxford. Michael Perlmutter and Smita Krishnaswamy were partially funded by NSF DMS 2327211. Additionally, Smita Krishnaswamy was partially supported by NSF Career Grant 2047856, by NIH 1R01GM130847-01A1, and by NIH 1R01GM135929-01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Smita Krishnaswamy .

Editor information

Editors and Affiliations

A Supplemental Information

A Supplemental Information

1.1 A.1 Supplemental Methods

Source code, GEM files, datasets, and other resources are openly available at our Github repository (https://github.com/KrishnaswamyLab/metabolic_GNN).

Transcriptomic Data. Bulk RNA sequencing data was obtained from the Precision RNA-seq Expression Compendium for Independent Signal Exploration (PRECISE) [27] that was collected from 15 studies conducted in the same laboratory using a standardized protocol and includes information on experimental conditions and measured growth rates. The original data contains over 250 samples and approximately 150 experimental conditions, which we augmented to over 3000 samples by adding Gaussian noise to gene expression values.

Single cell RNA sequencing data from human embryoid body cells was obtained from [22]; filtering and normalization was done identically to the original publication.

Objective Function Prediction Network. The graph neural network used to infer the objective c takes in torch geometric graph objects as input and utilizes the RMSProp optimizer. For all experiments, a learning rate \(lr = 0.0001\) trained on 3 epochs with a batch size of 80 as training parameters with a (60-20-20) train-test-validation split. For E. coli data the parameter \(\beta \) in the loss function was set to 0.1, and for human EB data was set to 0.25.

Objective Function Solver Networks. All four flux prediction/estimation models utilize the Adam optimizer with a learning rate \(lr = 0.01\) trained on 100 epochs. A train/test split of 80-20 is applied to the dataset. In order to compute the training losses, we use the mean squared error between the predicted and ground truth values. The performance metrics are computed by taking the mean and standard deviation of the Pearson Correlation Coefficient (PCC) between the predicted and ground truth values over 5 runs of each model.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Steach, H.R. et al. (2024). Inferring Metabolic States from Single Cell Transcriptomic Data via Geometric Deep Learning. In: Ma, J. (eds) Research in Computational Molecular Biology. RECOMB 2024. Lecture Notes in Computer Science, vol 14758. Springer, Cham. https://doi.org/10.1007/978-1-0716-3989-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-1-0716-3989-4_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-1-0716-3988-7

  • Online ISBN: 978-1-0716-3989-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics