Biological engineering

In recent years striking biological circuits have been fabricated, often resembling devices from electrical engineering. Examples include a genetic toggle switch[1], oscillators[2], a rewritable memory unit[3], but also more complex features such as a DNA-guided assembly line[4]. These sorts of devices provide a starting ground for further engineering. However, next to the development of stable and streamlined microbial chassis[57], biological engineering requires accessible, modular, standard parts with reliable characteristics. Even though several repositories have been constructed[8], the most well-known library is the Registry of Standard Biological Parts (http://partsregistry.org). It is widely used in the international Genetically Engineered Machine (iGEM) competition[9] aimed at teaching synthetic biology to undergraduate students[10],[11], but also outside of this competition[12]. Users can design new devices from so-called BioBrick parts, the standard genetic parts, and are encouraged to submit new parts and devices to this Registry, which has grown to approximately 20,000 entries in 2012.

However, the vast number of entries in this Registry as well as the numerous parts with an unconfirmed status can frustrate the straightforward design of new genetic circuits. Furthermore, often several variants of parts and devices are available in the databases, which makes it difficult to find the shortest and most reliable strategy to clone new gene circuits. We found a need to automate querying the Registry in order to find the most straightforward cloning strategy for any pre-designed genetic circuit. This would simultaneously reduce time spent on finding appropriate parts in the Registry. We therefore set out to code the Constructor, a web-based application that recommends the smallest number of biological parts for a user-defined gene circuit, reducing both dry-lab and wet-lab time.

The Constructor

The web interface of the Constructor accepts user-defined genetic circuits built from individual transcription units (TUs). A TU consists of a promoter, a ribosomal binding site (RBS), a coding sequence (CDS) and a terminator (see also Figure 1). Arrangements with multiple RBSs and CDSs between a promoter and terminator, i.e., an operon, are allowed. Next, users can give a complex genetic circuit consisting of a number of TUs as input. To our knowledge, there is no evidence that the physical location of TUs on a plasmid has any relevance for the function of the genetic circuit. Therefore the Constructor permutes all possible arrangements of the different TUs making up a circuit. For example, if the genetic circuit consists of three separate TUs (in the order A-B-C, where each letter signifies a TU), the Constructor uses all six possible arrangements in its query (A-B-C, B-A-C, B-C-A,…).

Figure 1
figure 1

Schematic representation of the functionality of the Constructor. The user fills the codes of the parts of the query circuit (based on the vocabulary of the Registry). Next, the Constructor finds all arrangements of the Transcription Units (TUs). The functionality of the gene circuit is supposed to be independent of the order of the different TUs, and the Constructor tries to build all possible arrangements from all BioBrick parts in the Registry. Finally, the Constructor recommends up to three sets of BioBrick parts with which the genetic circuit can be made, based on the smallest number of parts, aiming for the fewest cloning steps. The symbols of the various parts (promoter, ribosomal binding site (RBS), coding sequence (CDS) and terminator) are explained in the lower panel.

For each circuit arrangement, an SQL database containing information of ~20,000 parts (assembly standard 10 compatible) is queried. Since each circuit arrangement has a different sequence of genetic elements, each arrangement could collect a few arrangement-specific BioBrick parts. Through the combination of these specific BioBrick parts and the unique arrangement of genetic elements, some circuit arrangements could be constructed with fewer parts than others. An extra filter can be set if required on the availability, validity and quality of selected BioBrick parts by The Constructor.

Of all these possible combinations, the top three arrangements that could be assembled with the smallest number of BioBrick parts are shown as a recommendation to the user (Figure 1).

Validation

We tested several proposed cloning strategies from five iGEM 2011 team projects, and compared these to the cloning strategies recommended by the Constructor. We find that the number of cloning steps predicted by the Constructor with appropriate parts criteria is generally smaller than those that were undertaken by the teams (Table 1), indicating the usefulness of the Constructor in optimizing cloning strategies. Specifically, in four out of five cases, the Constructor recommends cloning strategies that involve fewer BioBrick parts, and therefore fewer cloning steps. Also, in most cases, the processing time of the Constructor is less than two minutes. The most complex genetic circuit, from the iGEM team UANL_Mty-Mexico, consisting of seven transcriptional units, took approximately ten hours to run. Still, this is expected to be less than manually querying the Registry for the optimal cloning strategy.

Table 1 Validation of the Constructor by comparing five 2011 iGEM team cloning approaches to the recommendation by the Constructor

Potential extensions

The Constructor facilitates the cloning strategy of complex pre-designed genetic circuits from elements of the Registry of Standard Biological Parts. Although the Constructor specifically focuses on this Registry, the same search algorithms for assembly optimization recommendations can be applied to other extensiveand well-defined parts libraries. Furthermore, theConstructor uses a straightforward transcriptional unit concept, which could be expanded by including different parts such as splicing signals. Finally, the web tool could be further optimised by suggesting alternatives for certain parts, like available reporter genes with another fluorescent ability, or different inducible promoters.

Availability

The Constructor is available athttp://www.systemsbiology.nl/the_Constructor. Help functions and a tutorial are provided with test cases of user defined gene circuits. All scripts are available from the authors upon request.