List of sequenced eukaryotic genomes

From DrugPedia: A Wikipedia for Drug discovery

Revision as of 04:50, 17 September 2008 by Jasjit (Talk | contribs)
Jump to: navigation, search

This list of sequenced eukaryotic genomes contains all the eukaryotes known to have publicly available complete nuclear and organelle genome sequences that have been assembled, annotated and published; draft genomes are not included, nor are organelle only sequences.

DNA was first sequenced in 1977. The first free-living organism to have its genome completely sequenced was the bacterium Haemophilus influenzae, in 1995. In 1996 Saccharomyces cerevisiae (baker's yeast) was the first eukaryote genome sequence to be released and in 1998 the first genome sequence for a multicellular eukaryote, Caenorhabditis elegans, was released.

Contents

Protists

Chromista

The Chromista are a group of protists that contains the algal phyla Heterokontophyta, Haptophyta and Cryptophyta. Members of this group are mostly studied for evolutionary interest.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Guillardia theta Cryptomonad Model organism 551 kb
(nucleomorph genome only)
464 Canadian Institute of Advanced Research, Philipps-University Marburg and the University of British Columbia 2001
Thalassiosira pseudonana
Strain:CCMP 1335
Diatom 2.5 Mb 11,242 Joint Genome Institute and the University of Washington 2004

Alveolata

Alveolata are a group of protists which includes the Ciliophora, Apicomplexa and Dinoflagellata. Members of this group are of particular interest to science as the cause of serious human and livestock diseases.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Cryptosporidium hominis
Strain:TU502
Parasitic protozoan Human pathogen 10.4 Mb 3,994 Virginia Commonwealth University 2004
Cryptosporidium parvum
C- or genotype 2 isolate
Parasitic protozoan Human pathogen 16.5 Mb 3,807 UCSF and University of Minnesota 2004
Paramecium tetraurelia Ciliate Model organism 72 Mb 39,642 Genoscope 2006
Plasmodium falciparum
Clone:3D7
Parasitic protozoan Human pathogen (malaria) 22.9 Mb 5,268 Malaria Genome Project Consortium 2002
Plasmodium yoelii yoelii
Strain:17XNL
Parasitic protozoan Rodent pathogen (malaria) 23.1 Mb 5,878 TIGR and NMRC 2002
Theileria parva
Strain:Muguga
Parasitic protozoan Cattle pathogen (African east coast fever) 8.3 Mb 4,035 TIGR and the International Livestock Research Institute 2005
Theileria annulata
Ankara clone C9
Parasitic protozoan Cattle pathogen 8.3 Mb ? Sanger 2005
Tetrahymena thermophila Ciliate Model organism 104 Mb 27,000 2006

Excavata

Excavata is a group of related free living and symbiotic protists; it includes the Metamonada, Loukozoa, Euglenozoa and Percolozoa. They are researched for their role in human disease.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Leishmania major
Strain:Friedlin
Parasitic protozoan Human pathogen 32.8 Mb 8,272 Sanger Institute 2005
Trichomonas vaginalis Parasitic protozoan Human pathogen (Trichomoniasis) 160 Mb 59,681 TIGR 2007
Trypanosoma brucei
Strain:TREU927/4 GUTat10.1
Parasitic protozoan Human pathogen (Sleeping sickness) 26 Mb 9,068 Sanger Institute and TIGR 2005
Trypanosoma cruzi
Strain:CL Brener TC3
Parasitic protozoan Human pathogen (Chagas disease) 34 Mb 22,570 TIGR, Seattle Biomedical Research Institute and Uppsala University 2005

Amoebozoa

Amoebozoa are a group of motile amoeboid protists, members of this group move or feed by means of temporary projections, called pseudopods. The best known member of this group is the slime mold which has been studied for centuries; other members include the Archamoebae, Tubulinea and Flabellinea. Some Amoeboza cause disease.

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Dictyostelium discoideum
Strain:AX4
Slime mold Model organism 34 Mb 12,500 Consortium from University of Cologne, Baylor College of Medicine and the Sanger Centre 2005
Entamoeba histolytica
HM1:IMSS
Parasitic protozoan Human pathogen (amoebic dysentery) 23.8 Mb 9,938 TIGR, Sanger Institute and the London School of Hygiene and Tropical Medicine 2005

Plants

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Arabidopsis thaliana
Ecotype:Columbia
Wild mustard Model plant 120 Mb 25,498 Arabidopsis Genome Initiative 2000
Cyanidioschyzon merolae
Strain:10D
Red alga Simple eukaryote 16.5 Mb 5,331 University of Tokyo, Rikkyo University, Saitama University and Kumamoto University 2004
Oryza sativa
ssp indica
Rice Crop and model organism 420 Mb 32-50,000 Beijing Genomics Institute, Zhejiang University and the Chinese Academy of Sciences 2002
Oryza sativa
ssp japonica
Rice Crop and model organism 466 Mb 46,022-55,615 Syngenta and Myriad Genetics 2002
Ostreococcus tauri Green alga Simple eukaryote 12.6 Mb Laboratoire Arago 2006
Physcomitrella patens Bryophyte Model organism

early diverging land plant

500 Mb 39,458 US Department of Energy Office of Science Joint Genome Institute 2008
Populus trichocarpa Balsam poplar or Black Cottonwood Carbon sequestration, model tree, commercial use (timber), and comparison to A. thaliana 550 Mb 45,555 The International Poplar Genome Consortium 2006
Vitis vinifera Grapevine PN40024 Fruit crop 490 Mb 30,434 The French-Italian Public Consortium for Grapevine Genome Characterization 2007

Fungi

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Ashbya gossypii
Strain:ATCC 10895
Fungus Plant pathogen 9.2 Mb 4,718 SyngentaAG and University of Basel 2004
Aspergillus fumigatus
Strain:Af293
Fungus Human pathogen 29.4 Mb 9,926 Sanger Institute, University of Manchester, TIGR, Institut Pasteur, Nagasaki University, University of Salamanca and OpGen 2005
Aspergillus nidulans
Strain:FGSC A4
Fungus Model organism 30 Mb 9,500 2005
Aspergillus niger
Strain:CBS 513.88
Fungus Biotechnology - fermentation 33.9 Mb 14,165 2007
Aspergillus oryzae
Strain:RIB40
Fungus Used to ferment soy 37 Mb 12,074 National Institute of Technology and Evaluation 2005
Candida glabrata
Strain:CBS138
Fungus Human pathogen 12.3 Mb 5,283 2004
Cryptococcus (Filobasidiella) neoformans
JEC21
Fungus Human pathogen 20 Mb 6,500 TIGR and Stanford University 2005
Debaryomyces hansenii
Strain:CBS767
Yeast Cheese ripening 12.2 Mb 6,906 Génolevures Consortium 2004
Encephalitozoon cuniculi Microsporidium Human pathogen 2.9 Mb 1,997 Genoscope and Université Blaise Pascal 2001
Kluyveromyces lactis
Strain:CLIB210
Yeast 10-12 Mb 5,329 Génolevures Consortium 2004
Magnaporthe grisea Fungus Plant pathogen 37.8 Mb 11,109 2005
Neurospora crassa Fungus Model eukaryote 40 Mb 10,082 Broad Institute, Oregon Health and Science University, University of Kentucky, and the University of Kansas 2003
Saccharomyces cerevisiae
Strain:S288C
Baker's yeast Model eukaryote 12.1 Mb 6,294 International Collaboration for the Yeast Genome Sequencing 1996
Schizosaccharomyces pombe
Strain:972h
Yeast Model eukaryote 14 Mb 4,824 Sanger Institute and Cold Spring Harbor Laboratory 2002
Yarrowia lipolytica
Strain:CLIB99
Yeast Industrial uses 20 Mb 6,703 Génolevures Consortium 2004

Mammals

Organism Type Shotgun Coverage Genome size Number of genes predicted Organization Year of completion
Bos taurus Cow 6* 3.0 Gb Cattle Genome Sequencing International Consortium
Canis lupus familiaris Dog 7.6* 2.4 Gb 19,300 Broad Institute and Agencourt Bioscience 2005
Cavia porcellus Guinea Pig 2* 3.4 Gb The Genome Sequencing Platform, The Genome Assembly Team
Dasypus novemcinctus Nine-banded Armadillo 2* 3.0 Gb Broad Institute
Echinops telfairi Hedgehog-Tenrec 2* Broad Institute
Equus caballus Horse 6.8* 2.1 Gb Broad Institute et al. 2007
Erinaceus europaeus Western European Hedgehog 2* Broad Institute
Felis catus Cat 2* 3 Gb 20,285 The Genome Sequencing Platform, The Genome Assembly Team 2007
Homo sapiens Human 3.2 Gb 25,000 Human Genome Project Consortium and Celera Genomics Draft 2001
Complete 2006
Loxodonta africana African Elephant 2* 3 Gb Broad Institute
Macaca mulatta Rhesus Macaque 6* Macaque Genome Sequencing Consortium
Microcebus murinus Gray Mouse Lemur 2* The Genome Sequencing Platform, The Genome Assembly Team
Monodelphis domestica Gray Short-tailed Opossum 3.475 Gb
(only 10% in Genbank)
18 - 20,000
(protein coding)
Broad Institute et al. 2007
Mus musculus Mouse 2.5 Gb 24,174 International Collaboration for the Mouse Genome Sequencing 2002
Myotis lucifugus Little Brown Bat 2* Broad Institute
Ochotona princeps American Pika 2* Broad Institute
Ornithorhynchus anatinus Platypus 6* Washington University
Oryctolagus cuniculus Rabbit 2* 2.5 Gb Broad Institute et al.
Otolemur garnettii Small-eared Galago, or Bushbaby 2* Broad Institute
Pan troglodytes Chimpanzee 6* 3.1 Gb Chimpanzee Sequencing and Analysis Consortium 2005
Pongo pygmaeus Orangutan 3.0 Gb Institute for Molecular Biotechnology
Rattus norvegicus Rat 1.8* or better 2.8 Gb 21,166 Rat Genome Sequencing Project Consortium 2004
Sorex araneus European Shrew 2* 3.0 Gb The Genome Sequencing Platform, The Genome Assembly Team
Spermophilus tridecemlineatus Thirteen-lined Ground Squirrel 2* The Genome Sequencing Platform, The Genome Assembly Team
Tupaia belangeri Northern Tree Shrew 2* Broad Institute

Other Animals

Organism Type Relevance Genome size Number of genes predicted Organization Year of completion
Anopheles gambiae
Strain: PEST
Mosquito Vector of malaria 278 Mb 13,683 Celera Genomics and Genoscope 2002
Apis mellifera Honey bee 1.8 Gb 10,157 The Honeybee Genome Sequencing Consortium 2006<ref name = "The Honeybee Genome Sequencing Consortium"/>
Bombyx mori
Strain:p50T
Moth (domestic silk worm) Silk production 530 Mb University of Tokyo and National Institute of Agrobiological Sciences 2004
Caenorhabditis briggsae Nematode worm For comparison with C. elegans 104 Mb 19,500 Washington University, Sanger Institute and Cold Spring Harbor Laboratory 2003
Caenorhabditis elegans
Strain:Bristol N2
Nematode worm Model animal 97 Mb 19,000 Washington University and the Sanger Institute 1998
Ciona intestinalis Tunicate Simple chordate 116.7 Mb 16,000 Joint Genome Institute 2003
Ciona savignyi Tunicate 174 Mb Broad Institute 2007
Drosophila melanogaster Fruit fly Model animal 165 Mb 13,600 Celera, UC Berkeley, Baylor College of Medicine, European DGP 2000
Gallus gallus Chicken 1 Gb 20-23,000 International Chicken Genome Sequencing Consortium 2004
Strongylocentrotus purpuratus Sea urchin Model eukaryote 814 Mb 23,300 Sea Urchin Genome Sequencing Consortium 2006
Takifugu rubripes Puffer fish Vertebrate with small genome 390 Mb 22-29,000 International Fugu Genome Consortium 2002
Tetraodon nigroviridis Puffer fish Vertebrate with compact genome 340 Mb 22,400 Genoscope and the Broad Institute 2004

See also


External links