A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts

TR Number

Date

2007

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Motivation: The sequence of artificial genetic constructs is composed of multiple functional fragments, or genetic parts, involved in different molecular steps of gene expression mechanisms. Biologists have deciphered structural rules that the design of genetic constructs needs to follow in order to ensure a successful completion of the gene expression process, but these rules have not been formalized, making it challenging for non-specialists to benefit from the recent progress in gene synthesis.

Results: We show that context-free grammars (CFG) can formalize these design principles. This approach provides a path to organizing libraries of genetic parts according to their biological functions, which correspond to the syntactic categories of the CFG. It also provides a framework for the systematic design of new genetic constructs consistent with the design principles expressed in the CFG. Using parsing algorithms, this syntactic model enables the verification of existing constructs. We illustrate these possibilities by describing a CFG that generates the most common architectures of genetic constructs in Escherichia coli.

Availability: A web site allows readers to experiment with the algorithms presented in this article: www.genocad.org

Description

This data set corresponds to the grammar and parts list described in this article. This grammar can be download and imported in GenoCAD.

Keywords

synthetic biology, genetic parts, formal language

Citation

Cai Y, Hartnett B, Gustafsson C, Peccoud J. (2007) A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts. Bioinformatics. 23, 2760-7.