Web Information Systems Engineering – WISE 2005 Workshops

Volume 3807 of the series Lecture Notes in Computer Science pp 207-216

Towards Automatic Generation of Semantic Types in Scientific Workflows

  • Shawn BowersAffiliated withUC Davis Genome Center
  • , Bertram LudäscherAffiliated withDepartment of Computer Science, University of California

* Final gross prices may vary according to local VAT.

Get Access


Scientific workflow systems are problem-solving environments that allow scientists to automate and reproduce data management and analysis tasks. Workflow components include actors (e.g., queries, transformations, analyses, simulations, visualizations), and datasets which are produced and consumed by actors. The increasing number of such components creates the problem of discovering suitable components and of composing them to form the desired scientific workflow. In previous work we proposed the use of semantic types (annotations relative to an ontology) to solve these problems. Since creating semantic types can be complex and time-consuming, scalability of the approach becomes an issue. In this paper we propose a framework to automatically derive semantic types from a (possibly small) number of initial types. Our approach propagates the given semantic types through workflow steps whose input and output data structures are related via query expressions. By propagating semantic types, we can significantly reduce the effort required to annotate datasets and components and even derive new “candidate axioms” for inclusion in annotation ontologies.