Skip to main content

SiBIC: A Tool for Generating a Network of Biclusters Captured by Maximal Frequent Itemset Mining

  • Protocol
  • First Online:
Data Mining for Systems Biology

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1807))

Abstract

Biclustering extracts coexpressed genes under certain experimental conditions, providing more precise insight into the genetic behaviors than one-dimensional clustering. For understanding the biological features of genes in a single bicluster, visualizations such as heatmaps or parallel coordinate plots and tools for enrichment analysis are widely used. However, simultaneously handling many biclusters still remains a challenge. Thus, we developed a web service named SiBIC, which, using maximal frequent itemset mining, exhaustively discovers significant biclusters, which turn into networks of overlapping biclusters, where nodes are gene sets and edges show their overlaps in the detected biclusters. SiBIC provides a graphical user interface for manipulating a gene set network, where users can find target gene sets based on the enriched network. This chapter provides a user guide/instruction of SiBIC with background of having developed this software. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/sibic/faces/index.jsp.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Jiang D, Tang C, Zhang A (2004) Cluster analysis for gene expression data: a survey. IEEE Trans Knowl Data Eng 16(11):1370–1386

    Google Scholar 

  2. A Ben-Dor, Shamir R, Yakhini Z (1999) Clustering gene expression patterns. J Comput Biol 6(3–4):281–297

    Google Scholar 

  3. Madeira SC, Oliveira AL (2004) Biclustering algorithms for biological data analysis: a survey. IEEE Trans Comput Biol Bioinf 1(1):24–45

    Google Scholar 

  4. Saber HB, Elloumi M (2015) DNA microarray data analysis: a new survey on biclustering. Int J Comput Biol 4(1):21–37

    Google Scholar 

  5. Barkow S, Bleuler S, Prelic A, Zimmermann P, Zitzler E (2006) BicAT: biclustering analysis toolbox. Bioinformatics 22:1282–1283

    Google Scholar 

  6. Cheng KO, Law NF, Siu WC, Lau TH (2007) BiVisu: software tool for bicluster detection and visualization. Bioinformatics 23(17):2342–2344

    Google Scholar 

  7. Grothaus GA, Mufti A, Murali TM (2006) Automatic layout and visualization of biclusters. Algorithms Mol Biol 1:15

    Google Scholar 

  8. Heinrich J, Seifert R, Burch M, Weiskopf D (2011) Bicluster viewer: a visualization tool for analyzing gene expression data. In: Advances in visual computing, pp 641–652. Springer, Berlin

    Google Scholar 

  9. Streit M, Gratzl S, Gillhofer M, Mayr A, Mitterecker A, Hochreiter S (2014) Furby: fuzzy force-directed bicluster visualization. BMC Bioinf 15(Suppl 6):4

    Google Scholar 

  10. Santamaria R, Theron R, Quintales L (2008) BicOverlapper: a tool for bicluster visualization. Bioinformatics 24(9):1212–1213

    Google Scholar 

  11. Takahashi K, Takigawa I, Mamitsuka H (2013) SiBIC: a web server for generating gene set networks based on biclusters obtained by maximal frequent itemset mining. PLoS One 8(12):e82890

    Google Scholar 

  12. Han J, Cheng H, Xin D, Yan X (2007) Frequent pattern mining: current status and future directions. Data Min Knowl Disc 15:55–86

    Google Scholar 

  13. Burdick D, Calimlim M, Flannick J, Gehrke J, Yiu T (2005) MAFIA: a maximal frequent itemset algorithm. IEEE Trans Knowl Data Eng 17:1490–1504

    Google Scholar 

  14. Edgar R, Domrachev M, Lash AE (2002) Gene expression omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30:207–210

    Google Scholar 

  15. Sayers E, Wheeler D (2004) Building customized data pipelines using the entrez programming utilities (eUtils). NCBI

    Google Scholar 

  16. Huang DW, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4:44–57

    Google Scholar 

  17. Madadhain J, Fisher D, Smyth P, White S, Boey Y (2005) Analysis and visualization of network data using JUNG. J Stat Soft 10:1–35

    Google Scholar 

  18. Kamada T, Kawai S (1989) An algorithm for drawing general undirected graphs. Inf Process Lett 31:7–15

    Google Scholar 

  19. Fruchterman TMJ, Reingold EM (1991) Graph drawing by force-directed placement. Softw-Pract Exp 21:1129–1164

    Google Scholar 

  20. Meyer B (1998) Self-organizing graphs-a neural network perspective of graph layout. In: Graph drawing symposium, August 1998

    Google Scholar 

Download references

Acknowledgements

Part of this research has been supported by MEXT KAKENHI #16H02868 and #17H01783, ACCEL (JPMJAC1503) and PRESTO of JST, FiDiPro of Tekes and AIPSE programme, Academy of Finland.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kei-ichiro Takahashi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Takahashi, Ki., duVerle, D.A., Yotsukura, S., Takigawa, I., Mamitsuka, H. (2018). SiBIC: A Tool for Generating a Network of Biclusters Captured by Maximal Frequent Itemset Mining. In: Mamitsuka, H. (eds) Data Mining for Systems Biology. Methods in Molecular Biology, vol 1807. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8561-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-8561-6_8

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-8560-9

  • Online ISBN: 978-1-4939-8561-6

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics