Conclusion
The LabBase and LabFlow systems described in this chapter are the latest in a series of systems we have built to tackle the problems of data management and workflow management for large scale biological research projects. We and our colleagues have used the predecessor systems for several projects at the Whitehead/MIT Center for Genome Research and are beginning to use the current systems for projects there and elsewhere. Though the software is incomplete in many ways, it is proven technology that in our hands, at least, greatly reduces the work of creating laboratory informatics systems.
While we welcome others to use our software, we believe there is greater value in the ideas. The totalquantity ofcodeis modest, comprising about 10,000 lines of Perl5. It would not be hard to reproduce the ideas in other contexts, and we encourage others to do so.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sargent, R., D. Fuhrman, T. Critchlow, T.D. Sera, R. Mecklenburg, G. Lindstrom, and P. Cartwright. The Design and Implementation of a Database For Human Genome Research. in Eighth International Conference on Scientific and Statistical Database Management. 1996. Stockholm, Sweden: IEEE Computer Society Press.
Kerlavage, A.R., M. Adams, J.C. Kelley, M. Dubnick, J. Powell, P. Shanmugam, J.C. Venter, and C. Fields. Analysis and Management of Data from High Throughput Sequence Tag Projects. in 26th Annual Hawaii International Conference on System Sciences. 1993: IEEE Computer Society Press.
Clark, S.P., G.A. Evans, and H.R. Garner, Informatics and Automation Used in Physical Mapping of the Genome, in Biocomputing: Informatics and Genome Projects, D. Smith, Editor. 1994, Academic Press: New York. p. 13–49.
Rozen, S., L.D. Stein, and N. Goodman, LabBase: A Database to Manage Laboratory Data in a Large-Scale Genome-Mapping Project. IEEE Transactions on Engineering in Medicine and Biology, 1995. 14: p. 702–709.
Stein, L.D., S. Rozen, and N. Goodman. Managing Laboratory Workflow With LabBase. in 1994 Conference on Computers in Medicine (CompMed94). 1994.
Stein, L., A. Marquis, E. Dredge, M.P. Reeve, M. Daly, S. Rozen, and N. Goodman. Splicing UNIX into a Genome Mapping Laboratory. in USENIX Summer 1994 Technical Conference. 1994: USENIX.
Goodman, N., S. Rozen, and L.D. Stein. Building a Laboratory Information System Around a C++-based Object-Oriented DBMS. in 20th International Conference on Very Large Data Bases. 1994. Santiago de Chile, Chile: The Very Large Data Bases (VLDB) Endowment Inc.
Mohan, C., G. Alonso, R. Guenthoer, M. Kamath, and B. Reinwald. An Overview ofthe Exotica Research Project on Workflow Management Systems. in 6th International Workshop on High Performance Transaction Systems. 1995. Asilomar, CA.
Mohan, C., Tutorial: State ofthe Art in Workflow Management System Research and Products, http://www.almaden.ibm.com/cs/exotica/sigmod96.eps. 1996, IBM Almaden Research Center.
Hollingsworth, D., The Workflow Reference Model, http://www.aiai.ed.ac.uk:80/project/wfmc/. 1994, Workflow Management Coalition.
Fayad, M. and D.C. Schmidt, Object-Oriented Application Frameworks. Communications of the ACM, 1997. 40(10): p. 32–28.
Durbin, R. and J.T. Mieg, A C. elegans Database, Documentation, code and data available from anonymous FTP servers at lirmm.lirmm.fr, cele.mrc-Imb.cam.ac.uk and ncbi.nlm.nih.gov. 1991
Chen, I.-M.A. and V.M. Markowitz, An Overview of the Object-Protocol Model (OPM) and OPM Data Management Tools. Information Systems, 1995. 20(5): p. 393–418.
McHugh, J., S. Abiteboul, R. Goldman, D. Quass, and J. Widom, Lore: A Database Management System for Semistructured Data. SIGMOD Record, 1997. 26(3): p. 54–66.
Buneman, P., S. Davidson, G. Hillebrand, and D. Suciu. A Query Language and Optimization Techniques for Unstructured Data. in ACM Conference on Management of Data (SIGMOD). 1996. Montreal Quebec.
Altschul, S.F., W. Gish, W. Miller, and D.J. Lipman, Basic Local Alignment Search Tool. Journal ofMolecular Biology, 1990. 215: p. 403–410.
Cattell, R., Object Data Management Revised Edition: Object-Oriented and Extended Relational Database Systems. 1994, Reading, MA: Addison-Wesley.
Orfali, R., D. Harkey, and J. Edwards, The Essential Distributed Objects Survival Guide. 1996, New York: John Wiley & Sons.
Green, P., PHRAP Documentation, http://www.mbt.washington.edu/phrap.docs/phrap.html. 1996, University of Washington.
Sutton, G., 0. White, M.D. Adams, and A.R. Kerlavage, TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects. Genome Science & Technology, 1995. 1:p.9–19.
Pearson, W.R., Rapid and Sensitive Sequence Comparison with FASTP and FASTA. Methods in Enzymology, 1990. 183: p. 63–98.
Green, P., SWAT, CROSS_MATCH Documentation, http://www.mbt.washington.edu/phrap.docs/general.html. 1996, University of Washington.
http://www.mbt.washington.edu/phrap.docs/phred.html. 1996, University of Washington.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2002 Kluwer Academic Publishers
About this chapter
Cite this chapter
Goodman, N., Rozen, S., Stein, L. (2002). LabBase: Data and Workflow Management for Large Scale Biological Research. In: Letovsky, S. (eds) Bioinformatics: Databases and Systems. Springer, Boston, MA. https://doi.org/10.1007/0-306-46903-0_24
Download citation
DOI: https://doi.org/10.1007/0-306-46903-0_24
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-8573-8
Online ISBN: 978-0-306-46903-9
eBook Packages: Springer Book Archive