Skip to main content
HomeAcademicsProgramsPeopleResourcesAbout Us
Synthetic Wheat
"Seeds of the Future"
Synthetic Wheat
Home : People : Profiles
Contact Information

David E. Matthews
Adjunct Associate Professor
Department of Plant Breeding & Genetics

409 Bradfield Hall
Cornell University
Ithaca, N.Y. 14853

Telephone: (607) 255-9951
Fax: (607) 255-6683
E-mail: dem3@cornell.edu

The Genome Data Project aims to create powerful, efficient, easy to use tools for consulting and interpreting genetic information and putting it to use for plant breeding.  On contract to the USDA Plant Genome Program, the project encompasses both construction of genome databases and development of specialized analysis software.  My role in the project is as Curator of the GrainGenes Database and its associated Internet Gopher server and bulletin board.  In addition I consult and advise on the other databases and software development projects, and administer the Unix computers.

The databases are:

   GrainGenes   Triticum, Hordeum, Avena, Secale, Saccharum
   RiceGenes     Oryza
   SolGenes       Lycopersicon, Solanum, Capsicum
   RoseDB         Malus
   BlastDB         Magnaporthe


Although these databases differ considerably in content, the general subject matter includes:

   genetic and cytogenetic maps
   genomic probes, nucleotide sequences
   genes, alleles and gene products
   phenotypes, quantitative traits and QTLs
   genotypes and pedigrees of cultivars, genetic stocks, and other germplasms
   pathologies and the corresponding pathogens, insects, and abiotic stresses
   taxonomy of the crops and related species
   addresses and research interests of colleagues
   relevant bibliographic citations


Each database is directed or co-directed by one of the faculty in the Field of Plant Breeding, and administered by a graduate student or a member of our staff.  In addition we cooperate with many scientists around the world who are authorities on particular datasets and curate their proper conversion to database format.

Our job at Cornell is to convert raw datasets contributed by researchers from their original software format, structure, syntax, assumptions, naming rules, and conventions to a form compatible and contextually consistent with our respective genome databases, while maintaining accuracy of fact and interpretation.  In addition we help users access and query the data, and cooperate with other groups to improve the software and data distribution infrastructure.