Skip to main content
Log in

TransMiner: Mining transitive associations among biological objects from text

  • Original Paper
  • Published:
Journal of Biomedical Science

Abstract

Associations among biological objects such as genes, proteins, and drugs can be discovered automatically from the scientific literature. TransMiner is a system for finding associations among objects by mining the Medline database of the scientific literature. The direct associations among the objects are discovered based on the principle of co-occurrence in the form of an association graph. The principle of transitive closure is applied to the association graph to find potential transitive associations. The potential transitive associations that are indeed direct are discovered by iterative retrieval and mining of the Medline documents. Those associations that are not found explicitly in the entire Medline database are transitive associations and are the candidates for hypothesis generation. The transitive associations were ranked based on the sum of weight of terms that cooccur with both the objects. The direct and transitive associations are visualized using a graph visualization applet. TransMiner was tested by finding associations among 56 breast cancer genes and among 24 objects in the calpain signal transduction pathway. TransMiner was also used to rediscover associations between magnesium and migraine.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Baasiri RA, Glasser SR, Steffen DL, Wheeler DA. The Breast Cancer Gene Database: A collaborative information resource. Oncogene 18:7958–7965;1999. http://tyrosine.biomed-comp.com/4d.acgi$tsrchname?Name = &topic = BCIR.

    Article  PubMed  Google Scholar 

  2. Bao JJ, Le XF, Wang RY, Yuan J, Wang L, Atkinson EN, LaPushin R, Andreeff M, Fang B, Yu Y, Bast, RC Jr. Reexpression of the tumor suppressor gene ARHI induces apoptosis in ovarian and breast cancer cells through a caspase-independent calpain dependent pathway. Cancer Res 62:7264–7272;2002.

    PubMed  Google Scholar 

  3. Cuevas BD, Abell AN, Witowsky JA, Yujiri T, Johnson NL, Kesavan K, Ware M, Jones PL, Weed SA, DeBiasi RL, Oka Y, Tyler KL, Johnson GL. MEKK1 regulates calpain-dependent proteolysis of focal adhesion proteins for rearend detachment of migrating fibroblasts. EMBO J 22:3346–3355;2003.

    Article  PubMed  Google Scholar 

  4. Daraselia N, Yuryev A, Egorov S, Novichkova S, Nikitin A, Mazo I. Extracting human protein interactions from MEDLINE using a full-sentence parser. Bioinformatics 20:604–611;2004.

    Article  PubMed  Google Scholar 

  5. Demirkaya S, Vural O, Dora B, Topcuoglu MA. Efficacy of intravenous magnesium sulfate in the treatment of acute migraine attacks. Headache 41:171–177;2001.

    Article  PubMed  Google Scholar 

  6. Friedman C, Kra P, Yu H, Krauthammer M, Rzhetsky A. GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles. Bioinformatics 17(suppl 1):S74-S82;2001.

    PubMed  Google Scholar 

  7. Honderich T. The Oxford Companion to Philosophy, Oxford University Press. 1995 http://www.xrefer.com/entry/553381.

  8. Hristovski D, Dzeroski S, Peterlin B, Rozic-Hristovski A. Supporting Discovery in Medicine by Association Rule Mining of Bibliographic Databases. Proc Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases. Berlin, Springer, 446–451;2000.

    Google Scholar 

  9. Jenssen TK, Laegreid A, Komorowski J, Hovig E. A literature network of human genes for high-throughput analysis of gene expression. Nature Genetics 28:21–28;2000.

    Article  Google Scholar 

  10. Mathiasen IS, Sergeev IN, Bastholm L, Elling F, Norman AW, Jaattela M. Calcium and calpain as key mediators of apoptosis-like death induced by vitamin D compounds in breast cancer cells. J Biol Chem 277:30738–30745;2002.

    Article  PubMed  Google Scholar 

  11. Mrowka R. A Java applet for visualizing protein-protein interaction. Bioinformatics 17:669–671;2001.

    Article  PubMed  Google Scholar 

  12. Mukhopadhyay S, Mostafa J, Palakal M, Lam W, Xue L, Hudli A. An Adaptive Multi-level Information Filtering System. Proceedings of the Fifth International Conference on User Modeling, 21–28; 1996.

  13. Palakal M, Mukhopadhyay S, Mostafa J, Raje R, N'Cho M, Mishra S. An intelligent biological information management system. Bioinformatics 18:1283–1288;2002.

    Article  PubMed  Google Scholar 

  14. Pink JJ, Wuerzberger-Davis S, Tagliarino C, Planchon SM, Yang X, Froelich CJ, Boothman DA. Activation of a cysteine protease in MCF-7 and T47D breast cancer cells during beta-lapachone-mediated apoptosis. Exp Cell Res 255:144–155;2000.

    Article  PubMed  Google Scholar 

  15. Pratt W, Yetisgen-Yildiz M. LitLinker: capturing connections across the biomedical literature. Proceedings of the International Conference on Knowledge Capture 105–112;2003.

  16. Rebhan M, Chalifa-Caspi V, Prilusky J, Lancet D. GeneCards: encyclopedia for genes, proteins and diseases. Weizmann Institute of Science, Bioinformatics Unit and Genome Center (Rehovot, Israel) 1997. http://bioinformatics.weizmann.ac.il/cards.

  17. Salton G. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Reading, Addison-Wesley, 1989.

    Google Scholar 

  18. Stapley BJ, Benoit G. Biobibliometrics: information retrieval and visualization from cooccurrences of gene names in Medline abstracts. Pac Symp Biocomput 5:529–540;2000.

    Google Scholar 

  19. Stephens M, Palakal M, Mukhopadhyay S, Raje R, Mostafa J. Detecting gene relations from Medline abstracts. Pac Symp Biocomput 483–495;2001.

  20. Sun Microsystems, Inc. Graph.java demonstration software. Santa Clara, Sun Microsystems, 1995. http://java.sun.com/applets/jdk/1.0/demo/GraphLayout/index.html.

  21. Swanson DR. Fish oil, Raynaud's syndrome, and undiscovered public knowledge. Perspect Biol Med 30:7–18;1986.

    PubMed  Google Scholar 

  22. Swanson DR. Migraine and magnesium: eleven neglected connections. Perspect Biol Med 31:526–557;1988.

    PubMed  Google Scholar 

  23. Swanson DR, Smalheiser NR. An interactive system for finding complementary literatures: a stimulus to scientific discovery. Artif Intell 91:183–203;1997.

    Article  Google Scholar 

  24. Tagliarino C, Pink JJ, Dubyak GR, Nieminen AL, Boothman DA. Calcium is a key signaling molecule in beta-lapachone-mediated cell death. J Biol Chem 276:19150–19159;2001.

    Article  PubMed  Google Scholar 

  25. Tagliarino C, Pink JJ, Reinicke KE, Simmers SM, Wuerzberger-Davis SM, Boothman DA. Mu-calpain activation in beta-lapachone-mediated apoptosis. Cancer Biol Ther 2:141–152;2003.

    PubMed  Google Scholar 

  26. Warshall S. A theorem on boolean matrices. J ACM 9:11–12;1962.

    Article  Google Scholar 

  27. Wong PK, Whitney P, Thomas J. Visualizing Association Rules for Text Mining. Proceedings of IEEE Information Visualization, 120–123;1999.

  28. Wu WJ, Tu S, Cerione RA. Activated Cdc42 sequesters c-Cbl and prevents EGF receptor degradation. Cell 114:715–725;2003.

    Article  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Narayanasamy, V., Mukhopadhyay, S., Palakal, M. et al. TransMiner: Mining transitive associations among biological objects from text. J Biomed Sci 11, 864–873 (2004). https://doi.org/10.1007/BF02254372

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02254372

Key Words

Navigation