Sequencing project results in six-fold reduction of effective size of maize genome


Cost effective model for sequencing large complex genomes

A team of scientists that includes a Washington University in St. Louis biologist, has evaluated and validated a gene-enrichment strategy for genome sequencing and has reported a major advance in sequencing large genomes. The team showed a six-fold reduction of the effective size of the Zea mays (maize or corn) genome while creating a four-fold increase in the gene identification rate when compared to standard whole-genome sequencing methods.

The Maize Genomics Consortium reported their results in the December 19, 2003 issue of Science magazine. Karel R. Schubert, Ph.D., Washington University affiliate professor of biology in Arts & Sciences, was the principal investigator of the study. Schubert is vice president of technology management and science administration at the Donald Danforth Plant Science Center in St. Louis.

The Maize Genomics Consortium, consisting of The Donald Danforth Plant Science Center, The Institute for Genomic Research (TIGR), Purdue University, and Orion Genomics, was awarded a two-year, $6 million plant genome grant on September 20, 2002 by the National Science Foundation (NSF) to develop and evaluate high-throughput and robust strategies to isolate and sequence maize genes. The two gene-enrichment methods used in the research published in Science are methyl-filtration and high-Cot selection.

According to Schubert and W. Brad Barbazuk, Ph.D., senior bioinformatics specialist and assistant domain member, both at the Donald Danforth Plant Science Center, the overall goal of the pilot sequencing project in maize is to derive an effective strategy to sequence the maize genome. To meet this goal, the Maize Genomics Consortium will generate approximately 800,000 total sequence reads using the methyl-filtration and high-Cot methods, with the results published in Science describing the analysis of the first 200,000 sequence reads.

It is a challenging effort to sequence the maize genome, as its size and structure preclude using the standard whole-genome methods for sequence analysis and alignment. At about 2 to 3 billion base pairs, the maize genome is estimated to be 20 times larger than Arabidopsis, an experimental plant that is the first plant genome to be completely sequenced. However, maize probably has only twice as many genes as Arabidopsis. The rest of the maize genome is made up of a large amount of highly repetitive DNA including many mobile DNA elements. Unlike Arabidopsis genes, the maize genes are not spaced evenly throughout the genome but instead are clustered in "islands" floating in a large "sea" of repeat-sequence DNA.

To sequence these "islands", the Maize Consortium used two methods for gene-enrichment, methyl-filtration and high-Cot selection. The methyl-filtration method was developed at Cold Spring Harbor Laboratory in Long Island, New York, and has been exclusively licensed to St. Louis-based Orion Genomics. It is based on the finding that highly repetitive DNA is modified (methylated) while genes are largely free of such modification. The well-established high-Cot selection method was applied at Purdue University and exploits the fact that gene sequences are in relatively low abundance compared with the large amount of repeated non-genic sequences. These methods target overlapping, but non-identical fractions of the genome that are highly enriched for genes sequences.

Source: Eurekalert & others

Last reviewed: By John M. Grohol, Psy.D. on 21 Feb 2009
    Published on All rights reserved.



Never grow a wishbone, daughter, where your backbone ought to be.
-- Clementine Paddelford