List of sequenced eukaryotic genomes
From DrugPedia: A Wikipedia for Drug discovery
This list of sequenced eukaryotic genomes covers the eukaryotes known to have publicly available complete nuclear and organelle genome sequences that have been assembled, annotated and published. Draft genomes are not included, nor are organelle-only sequences.
DNA was first sequenced in 1977. The first free-living organism to have its genome completely sequenced was the bacterium Haemophilus influenzae, in 1995. In 1996 Saccharomyces cerevisiae (baker's yeast) became the first eukaryote to have its genome sequence released, and in 1998 the first genome sequence for a multicellular eukaryote, Caenorhabditis elegans, was released.
Protists
Chromista
The Chromista are a group of protists that contains the algal phyla Heterokontophyta, Haptophyta and Cryptophyta. Members of this group are mostly studied for evolutionary interest.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Guillardia theta | Cryptomonad | Model organism | 551 kb (nucleomorph genome only) | 464 | Canadian Institute for Advanced Research, Philipps-University Marburg and the University of British Columbia | 2001 |
Thalassiosira pseudonana Strain:CCMP 1335 | Diatom | | 2.5 Mb | 11,242 | Joint Genome Institute and the University of Washington | 2004 |
Alveolata
Alveolata are a group of protists which includes the Ciliophora, Apicomplexa and Dinoflagellata. Members of this group are of particular interest to science as the cause of serious human and livestock diseases.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Cryptosporidium hominis Strain:TU502 | Parasitic protozoan | Human pathogen | 10.4 Mb | 3,994 | Virginia Commonwealth University | 2004 |
Cryptosporidium parvum C- or genotype 2 isolate | Parasitic protozoan | Human pathogen | 16.5 Mb | 3,807 | UCSF and University of Minnesota | 2004 |
Paramecium tetraurelia | Ciliate | Model organism | 72 Mb | 39,642 | Genoscope | 2006 |
Plasmodium falciparum Clone:3D7 | Parasitic protozoan | Human pathogen (malaria) | 22.9 Mb | 5,268 | Malaria Genome Project Consortium | 2002 |
Plasmodium yoelii yoelii Strain:17XNL | Parasitic protozoan | Rodent pathogen (malaria) | 23.1 Mb | 5,878 | TIGR and NMRC | 2002 |
Theileria parva Strain:Muguga | Parasitic protozoan | Cattle pathogen (African east coast fever) | 8.3 Mb | 4,035 | TIGR and the International Livestock Research Institute | 2005 |
Theileria annulata Ankara clone C9 | Parasitic protozoan | Cattle pathogen | 8.3 Mb | ? | Sanger | 2005 |
Tetrahymena thermophila | Ciliate | Model organism | 104 Mb | 27,000 | | 2006 |
Excavata
Excavata is a group of related free-living and symbiotic protists; it includes the Metamonada, Loukozoa, Euglenozoa and Percolozoa. Members of this group are studied mainly for their role in human disease.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Leishmania major Strain:Friedlin | Parasitic protozoan | Human pathogen | 32.8 Mb | 8,272 | Sanger Institute | 2005 |
Trichomonas vaginalis | Parasitic protozoan | Human pathogen (trichomoniasis) | 160 Mb | 59,681 | TIGR | 2007 |
Trypanosoma brucei Strain:TREU927/4 GUTat10.1 | Parasitic protozoan | Human pathogen (sleeping sickness) | 26 Mb | 9,068 | Sanger Institute and TIGR | 2005 |
Trypanosoma cruzi Strain:CL Brener TC3 | Parasitic protozoan | Human pathogen (Chagas disease) | 34 Mb | 22,570 | TIGR, Seattle Biomedical Research Institute and Uppsala University | 2005 |
Amoebozoa
Amoebozoa are a group of motile amoeboid protists. Members of this group move or feed by means of temporary projections called pseudopods. The best-known member of this group is the slime mold, which has been studied for centuries; other members include the Archamoebae, Tubulinea and Flabellinea. Some Amoebozoa cause disease.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Dictyostelium discoideum Strain:AX4 | Slime mold | Model organism | 34 Mb | 12,500 | Consortium from University of Cologne, Baylor College of Medicine and the Sanger Centre | 2005 |
Entamoeba histolytica HM1:IMSS | Parasitic protozoan | Human pathogen (amoebic dysentery) | 23.8 Mb | 9,938 | TIGR, Sanger Institute and the London School of Hygiene and Tropical Medicine | 2005 |
Plants
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Arabidopsis thaliana Ecotype:Columbia | Wild mustard | Model plant | 120 Mb | 25,498 | Arabidopsis Genome Initiative | 2000 |
Cyanidioschyzon merolae Strain:10D | Red alga | Simple eukaryote | 16.5 Mb | 5,331 | University of Tokyo, Rikkyo University, Saitama University and Kumamoto University | 2004 |
Oryza sativa ssp indica | Rice | Crop and model organism | 420 Mb | 32,000-50,000 | Beijing Genomics Institute, Zhejiang University and the Chinese Academy of Sciences | 2002 |
Oryza sativa ssp japonica | Rice | Crop and model organism | 466 Mb | 46,022-55,615 | Syngenta and Myriad Genetics | 2002 |
Ostreococcus tauri | Green alga | Simple eukaryote | 12.6 Mb | | Laboratoire Arago | 2006 |
Physcomitrella patens | Bryophyte | Model organism, early diverging land plant | 500 Mb | 39,458 | US Department of Energy Office of Science Joint Genome Institute | 2008 |
Populus trichocarpa | Balsam poplar or Black Cottonwood | Carbon sequestration, model tree, commercial use (timber), and comparison to A. thaliana | 550 Mb | 45,555 | The International Poplar Genome Consortium | 2006 |
Vitis vinifera | Grapevine PN40024 | Fruit crop | 490 Mb | 30,434 | The French-Italian Public Consortium for Grapevine Genome Characterization | 2007 |
Fungi
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Ashbya gossypii Strain:ATCC 10895 | Fungus | Plant pathogen | 9.2 Mb | 4,718 | Syngenta AG and University of Basel | 2004 |
Aspergillus fumigatus Strain:Af293 | Fungus | Human pathogen | 29.4 Mb | 9,926 | Sanger Institute, University of Manchester, TIGR, Institut Pasteur, Nagasaki University, University of Salamanca and OpGen | 2005 |
Aspergillus nidulans Strain:FGSC A4 | Fungus | Model organism | 30 Mb | 9,500 | | 2005 |
Aspergillus niger Strain:CBS 513.88 | Fungus | Biotechnology - fermentation | 33.9 Mb | 14,165 | | 2007 |
Aspergillus oryzae Strain:RIB40 | Fungus | Used to ferment soy | 37 Mb | 12,074 | National Institute of Technology and Evaluation | 2005 |
Candida glabrata Strain:CBS138 | Fungus | Human pathogen | 12.3 Mb | 5,283 | | 2004 |
Cryptococcus (Filobasidiella) neoformans JEC21 | Fungus | Human pathogen | 20 Mb | 6,500 | TIGR and Stanford University | 2005 |
Debaryomyces hansenii Strain:CBS767 | Yeast | Cheese ripening | 12.2 Mb | 6,906 | Génolevures Consortium | 2004 |
Encephalitozoon cuniculi | Microsporidium | Human pathogen | 2.9 Mb | 1,997 | Genoscope and Université Blaise Pascal | 2001 |
Kluyveromyces lactis Strain:CLIB210 | Yeast | | 10-12 Mb | 5,329 | Génolevures Consortium | 2004 |
Magnaporthe grisea | Fungus | Plant pathogen | 37.8 Mb | 11,109 | | 2005 |
Neurospora crassa | Fungus | Model eukaryote | 40 Mb | 10,082 | Broad Institute, Oregon Health and Science University, University of Kentucky, and the University of Kansas | 2003 |
Saccharomyces cerevisiae Strain:S288C | Baker's yeast | Model eukaryote | 12.1 Mb | 6,294 | International Collaboration for the Yeast Genome Sequencing | 1996 |
Schizosaccharomyces pombe Strain:972h | Yeast | Model eukaryote | 14 Mb | 4,824 | Sanger Institute and Cold Spring Harbor Laboratory | 2002 |
Yarrowia lipolytica Strain:CLIB99 | Yeast | Industrial uses | 20 Mb | 6,703 | Génolevures Consortium | 2004 |
Mammals
Organism | Type | Shotgun Coverage | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Bos taurus | Cow | 6× | 3.0 Gb | | Cattle Genome Sequencing International Consortium | |
Canis lupus familiaris | Dog | 7.6× | 2.4 Gb | 19,300 | Broad Institute and Agencourt Bioscience | 2005 |
Cavia porcellus | Guinea Pig | 2× | 3.4 Gb | | The Genome Sequencing Platform, The Genome Assembly Team | |
Dasypus novemcinctus | Nine-banded Armadillo | 2× | 3.0 Gb | | Broad Institute | |
Echinops telfairi | Hedgehog-Tenrec | 2× | | | Broad Institute | |
Equus caballus | Horse | 6.8× | 2.1 Gb | | Broad Institute et al. | 2007 |
Erinaceus europaeus | Western European Hedgehog | 2× | | | Broad Institute | |
Felis catus | Cat | 2× | 3 Gb | 20,285 | The Genome Sequencing Platform, The Genome Assembly Team | 2007 |
Homo sapiens | Human | | 3.2 Gb | 25,000 | Human Genome Project Consortium and Celera Genomics | Draft 2001, complete 2006 |
Loxodonta africana | African Elephant | 2× | 3 Gb | | Broad Institute | |
Macaca mulatta | Rhesus Macaque | 6× | | | Macaque Genome Sequencing Consortium | |
Microcebus murinus | Gray Mouse Lemur | 2× | | | The Genome Sequencing Platform, The Genome Assembly Team | |
Monodelphis domestica | Gray Short-tailed Opossum | | 3.475 Gb (only 10% in GenBank) | 18,000-20,000 (protein coding) | Broad Institute et al. | 2007 |
Mus musculus | Mouse | | 2.5 Gb | 24,174 | International Collaboration for the Mouse Genome Sequencing | 2002 |
Myotis lucifugus | Little Brown Bat | 2× | | | Broad Institute | |
Ochotona princeps | American Pika | 2× | | | Broad Institute | |
Ornithorhynchus anatinus | Platypus | 6× | | | Washington University | |
Oryctolagus cuniculus | Rabbit | 2× | 2.5 Gb | | Broad Institute et al. | |
Otolemur garnettii | Small-eared Galago, or Bushbaby | 2× | | | Broad Institute | |
Pan troglodytes | Chimpanzee | 6× | 3.1 Gb | | Chimpanzee Sequencing and Analysis Consortium | 2005 |
Pongo pygmaeus | Orangutan | | 3.0 Gb | | Institute for Molecular Biotechnology | |
Rattus norvegicus | Rat | 1.8× or better | 2.8 Gb | 21,166 | Rat Genome Sequencing Project Consortium | 2004 |
Sorex araneus | European Shrew | 2× | 3.0 Gb | | The Genome Sequencing Platform, The Genome Assembly Team | |
Spermophilus tridecemlineatus | Thirteen-lined Ground Squirrel | 2× | | | The Genome Sequencing Platform, The Genome Assembly Team | |
Tupaia belangeri | Northern Tree Shrew | 2× | | | Broad Institute | |
Other Animals
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Anopheles gambiae Strain: PEST | Mosquito | Vector of malaria | 278 Mb | 13,683 | Celera Genomics and Genoscope | 2002 |
Apis mellifera | Honey bee | | 1.8 Gb | 10,157 | The Honeybee Genome Sequencing Consortium | 2006 |
Bombyx mori Strain:p50T | Moth (domestic silk worm) | Silk production | 530 Mb | | University of Tokyo and National Institute of Agrobiological Sciences | 2004 |
Caenorhabditis briggsae | Nematode worm | For comparison with C. elegans | 104 Mb | 19,500 | Washington University, Sanger Institute and Cold Spring Harbor Laboratory | 2003 |
Caenorhabditis elegans Strain:Bristol N2 | Nematode worm | Model animal | 97 Mb | 19,000 | Washington University and the Sanger Institute | 1998 |
Ciona intestinalis | Tunicate | Simple chordate | 116.7 Mb | 16,000 | Joint Genome Institute | 2003 |
Ciona savignyi | Tunicate | | 174 Mb | | Broad Institute | 2007 |
Drosophila melanogaster | Fruit fly | Model animal | 165 Mb | 13,600 | Celera, UC Berkeley, Baylor College of Medicine, European DGP | 2000 |
Gallus gallus | Chicken | | 1 Gb | 20,000-23,000 | International Chicken Genome Sequencing Consortium | 2004 |
Strongylocentrotus purpuratus | Sea urchin | Model eukaryote | 814 Mb | 23,300 | Sea Urchin Genome Sequencing Consortium | 2006 |
Takifugu rubripes | Puffer fish | Vertebrate with small genome | 390 Mb | 22,000-29,000 | International Fugu Genome Consortium | 2002 |
Tetraodon nigroviridis | Puffer fish | Vertebrate with compact genome | 340 Mb | 22,400 | Genoscope and the Broad Institute | 2004 |
External links
- EMBL-EBI Eukaryotic Genomes
- UCSC Genome Browser
- International Sequencing Consortium - Large-scale Sequencing Project Database
- Ensembl: the Ensembl Genome Browser (includes draft and low coverage genomes)
- Fungal Genome Initiative (includes draft genomes)
- GOLD: Genomes OnLine Database v 2.0