Skip to main content

LabBase: Data and Workflow Management for Large Scale Biological Research

  • Chapter
Book cover Bioinformatics: Databases and Systems

Conclusion

The LabBase and LabFlow systems described in this chapter are the latest in a series of systems we have built to tackle the problems of data management and workflow management for large scale biological research projects. We and our colleagues have used the predecessor systems for several projects at the Whitehead/MIT Center for Genome Research and are beginning to use the current systems for projects there and elsewhere. Though the software is incomplete in many ways, it is proven technology that in our hands, at least, greatly reduces the work of creating laboratory informatics systems.

While we welcome others to use our software, we believe there is greater value in the ideas. The totalquantity ofcodeis modest, comprising about 10,000 lines of Perl5. It would not be hard to reproduce the ideas in other contexts, and we encourage others to do so.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sargent, R., D. Fuhrman, T. Critchlow, T.D. Sera, R. Mecklenburg, G. Lindstrom, and P. Cartwright. The Design and Implementation of a Database For Human Genome Research. in Eighth International Conference on Scientific and Statistical Database Management. 1996. Stockholm, Sweden: IEEE Computer Society Press.

    Google Scholar 

  2. Kerlavage, A.R., M. Adams, J.C. Kelley, M. Dubnick, J. Powell, P. Shanmugam, J.C. Venter, and C. Fields. Analysis and Management of Data from High Throughput Sequence Tag Projects. in 26th Annual Hawaii International Conference on System Sciences. 1993: IEEE Computer Society Press.

    Google Scholar 

  3. Clark, S.P., G.A. Evans, and H.R. Garner, Informatics and Automation Used in Physical Mapping of the Genome, in Biocomputing: Informatics and Genome Projects, D. Smith, Editor. 1994, Academic Press: New York. p. 13–49.

    Google Scholar 

  4. Rozen, S., L.D. Stein, and N. Goodman, LabBase: A Database to Manage Laboratory Data in a Large-Scale Genome-Mapping Project. IEEE Transactions on Engineering in Medicine and Biology, 1995. 14: p. 702–709.

    Article  Google Scholar 

  5. Stein, L.D., S. Rozen, and N. Goodman. Managing Laboratory Workflow With LabBase. in 1994 Conference on Computers in Medicine (CompMed94). 1994.

    Google Scholar 

  6. Stein, L., A. Marquis, E. Dredge, M.P. Reeve, M. Daly, S. Rozen, and N. Goodman. Splicing UNIX into a Genome Mapping Laboratory. in USENIX Summer 1994 Technical Conference. 1994: USENIX.

    Google Scholar 

  7. Goodman, N., S. Rozen, and L.D. Stein. Building a Laboratory Information System Around a C++-based Object-Oriented DBMS. in 20th International Conference on Very Large Data Bases. 1994. Santiago de Chile, Chile: The Very Large Data Bases (VLDB) Endowment Inc.

    Google Scholar 

  8. Mohan, C., G. Alonso, R. Guenthoer, M. Kamath, and B. Reinwald. An Overview ofthe Exotica Research Project on Workflow Management Systems. in 6th International Workshop on High Performance Transaction Systems. 1995. Asilomar, CA.

    Google Scholar 

  9. Mohan, C., Tutorial: State ofthe Art in Workflow Management System Research and Products, http://www.almaden.ibm.com/cs/exotica/sigmod96.eps. 1996, IBM Almaden Research Center.

  10. Hollingsworth, D., The Workflow Reference Model, http://www.aiai.ed.ac.uk:80/project/wfmc/. 1994, Workflow Management Coalition.

  11. Fayad, M. and D.C. Schmidt, Object-Oriented Application Frameworks. Communications of the ACM, 1997. 40(10): p. 32–28.

    Article  Google Scholar 

  12. Durbin, R. and J.T. Mieg, A C. elegans Database, Documentation, code and data available from anonymous FTP servers at lirmm.lirmm.fr, cele.mrc-Imb.cam.ac.uk and ncbi.nlm.nih.gov. 1991

    Google Scholar 

  13. Chen, I.-M.A. and V.M. Markowitz, An Overview of the Object-Protocol Model (OPM) and OPM Data Management Tools. Information Systems, 1995. 20(5): p. 393–418.

    Article  Google Scholar 

  14. McHugh, J., S. Abiteboul, R. Goldman, D. Quass, and J. Widom, Lore: A Database Management System for Semistructured Data. SIGMOD Record, 1997. 26(3): p. 54–66.

    Article  Google Scholar 

  15. Buneman, P., S. Davidson, G. Hillebrand, and D. Suciu. A Query Language and Optimization Techniques for Unstructured Data. in ACM Conference on Management of Data (SIGMOD). 1996. Montreal Quebec.

    Google Scholar 

  16. Altschul, S.F., W. Gish, W. Miller, and D.J. Lipman, Basic Local Alignment Search Tool. Journal ofMolecular Biology, 1990. 215: p. 403–410.

    Article  CAS  Google Scholar 

  17. Cattell, R., Object Data Management Revised Edition: Object-Oriented and Extended Relational Database Systems. 1994, Reading, MA: Addison-Wesley.

    Google Scholar 

  18. Orfali, R., D. Harkey, and J. Edwards, The Essential Distributed Objects Survival Guide. 1996, New York: John Wiley & Sons.

    Google Scholar 

  19. Green, P., PHRAP Documentation, http://www.mbt.washington.edu/phrap.docs/phrap.html. 1996, University of Washington.

  20. Sutton, G., 0. White, M.D. Adams, and A.R. Kerlavage, TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects. Genome Science & Technology, 1995. 1:p.9–19.

    CAS  Google Scholar 

  21. Pearson, W.R., Rapid and Sensitive Sequence Comparison with FASTP and FASTA. Methods in Enzymology, 1990. 183: p. 63–98.

    Article  PubMed  CAS  Google Scholar 

  22. Green, P., SWAT, CROSS_MATCH Documentation, http://www.mbt.washington.edu/phrap.docs/general.html. 1996, University of Washington.

  23. http://www.mbt.washington.edu/phrap.docs/phred.html. 1996, University of Washington.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Stanley Letovsky

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Kluwer Academic Publishers

About this chapter

Cite this chapter

Goodman, N., Rozen, S., Stein, L. (2002). LabBase: Data and Workflow Management for Large Scale Biological Research. In: Letovsky, S. (eds) Bioinformatics: Databases and Systems. Springer, Boston, MA. https://doi.org/10.1007/0-306-46903-0_24

Download citation

  • DOI: https://doi.org/10.1007/0-306-46903-0_24

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-7923-8573-8

  • Online ISBN: 978-0-306-46903-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics