






David
E. Matthews
Adjunct Associate Professor
Department of Plant Breeding & Genetics
409 Bradfield Hall
Cornell University
Ithaca, N.Y. 14853
Telephone: (607) 255-9951
Fax: (607) 255-6683
E-mail: dem3@cornell.edu

The Genome Data Project aims to create powerful, efficient, easy to use tools for consulting and interpreting genetic information and putting it to use for plant breeding. On contract to the USDA Plant Genome Program, the project encompasses both construction of genome databases and development of specialized analysis software. My role in the project is as Curator of the GrainGenes Database and its associated Internet Gopher server and bulletin board. In addition I consult and advise on the other databases and software development projects, and administer the Unix computers.
The databases are:
GrainGenes Triticum, Hordeum, Avena, Secale, Saccharum
RiceGenes Oryza
SolGenes Lycopersicon, Solanum, Capsicum
RoseDB Malus
BlastDB Magnaporthe
Although these databases differ considerably in content, the
general subject matter includes:
genetic and cytogenetic maps
genomic probes, nucleotide sequences
genes, alleles and gene products
phenotypes, quantitative traits and QTLs
genotypes and pedigrees of cultivars, genetic stocks, and other germplasms
pathologies and the corresponding pathogens, insects, and abiotic stresses
taxonomy of the crops and related species
addresses and research interests of colleagues
relevant bibliographic citations
Each database is directed or co-directed by one of the faculty
in the Field of Plant Breeding, and administered by a graduate
student or a member of our staff. In addition we cooperate
with many scientists around the world who are authorities on
particular datasets and curate their proper conversion to database
format.
Our job at Cornell is to convert raw datasets contributed by researchers from their original software format, structure, syntax, assumptions, naming rules, and conventions to a form compatible and contextually consistent with our respective genome databases, while maintaining accuracy of fact and interpretation. In addition we help users access and query the data, and cooperate with other groups to improve the software and data distribution infrastructure.