List of sequenced eukaryotic genomes
This list of "sequenced" eukaryotic genomes contains all the eukaryotes known to have publicly available complete nuclear and organelle genome sequences that have been sequenced, assembled, annotated and published; draft genomes are not included, nor are organelle-only sequences.
DNA was first sequenced in 1977. The first free-living organism to have its genome completely sequenced was the bacterium Haemophilus influenzae, in 1995. In 1996 Saccharomyces cerevisiae (baker's yeast) was the first eukaryote genome sequence to be released and in 1998 the first genome sequence for a multicellular eukaryote, Caenorhabditis elegans, was released.
Protists
Following are the nine earliest sequenced genomes of protists. For a more complete list, see the List of sequenced protist genomes.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Guillardia theta | Cryptomonad | Model organism | 0.551 Mb (nucleomorph genome only) |
465,[1] 513, 598 (UniProt) | Canadian Institute of Advanced Research, Philipps-University Marburg and the University of British Columbia | 2001[1] |
Plasmodium falciparum Clone:3D7 |
Apicomplexan | Human pathogen (malaria) | 22.9 Mb | 5,268[2] | Malaria Genome Project Consortium | 2002[2] |
Plasmodium yoelii yoelii Strain:17XNL |
Apicomplexan | Rodent pathogen (malaria) | 23.1 Mb | 5,878[3] | TIGR and NMRC | 2002[3] |
Cryptosporidium hominis Strain:TU502 |
Apicomplexan | Human pathogen | 10.4 Mb | 3,994[4] | Virginia Commonwealth University | 2004[4] |
Cryptosporidium parvum C- or genotype 2 isolate |
Apicomplexan | Human pathogen | 16.5 Mb | 3,807[5] | UCSF and University of Minnesota | 2004[5] |
Thalassiosira pseudonana Strain:CCMP 1335 |
Diatom | Model organism | 34.5 Mb | 11,242[6] | Joint Genome Institute and the University of Washington | 2004[6] |
Trypanosoma cruzi Strain:CL-Brener |
Kinetoplastid | Human Pathogen | 67 Mb | 22,570[7] | The Institute for Genome Research (TIGR) and Karolinska Institutet (KI) and Seattle Biomedical Research Institute (SBRI) | 2005[7] |
Trypanosoma brucei Clone:TREU 927/4 |
Kinetoplastid | Human Pathogen | 26 Mb | 9,068[8] | Wellcome Trust Sanger Institute and The Institute for Genome Research (TIGR) | 2005[8] |
Leishmania major Strain: Friedlin |
Kinetoplastid | Human Pathogen | 32.8 Mb | 8,272[9] | Wellcome Trust Sanger Institute and Seattle Biomedical Research Institute (SBRI) | 2005[9] |
Plants
Following are the five earliest sequenced genomes of plants. For a more complete list, see the List of sequenced plant genomes.
Organism | Type | Relevance | Genome size | Number of chromosomes | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|---|
Arabidopsis thaliana Ecotype:Columbia |
Wild mustard Thale Cress | Model plant | 135 Mb[10] | 5 | 25,498,[11] 27,400,[12] 31,670 (UniProt) | Arabidopsis Genome Initiative[13] | 2000[11] |
Cyanidioschyzon merolae Strain:10D |
Red algae | Simple eukaryote | 16.5 Mb | 20 | 5,331[14] | University of Tokyo, Rikkyo University, Saitama University and Kumamoto University | 2004[14] |
Oryza sativa ssp indica |
Rice | Crop and model organism | 420 Mb | 12 | 32-50,000[15] | Beijing Genomics Institute, Zhejiang University and the Chinese Academy of Sciences | 2002[15] |
Ostreococcus tauri | Green algae | Simple eukaryote, small genome | 12.6 Mb | 7,969 (UniProt) | Laboratoire Arago | 2006[16] | |
Populus trichocarpa | Balsam poplar or Black Cottonwood | Carbon sequestration, model tree, commercial use (timber), and comparison to A. thaliana | 550 Mb | 19 | 45,555[17] | The International Poplar Genome Consortium | 2006[17] |
Fungi
Following are the five earliest sequenced genomes of fungi. For a more complete list, see the List of sequenced fungi genomes.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Saccharomyces cerevisiae Strain:S288C |
Saccharomycetes | Baker's Yeast; Model eukaryote | 12.1 Mb | 6,294[18] | International Collaboration for the Yeast Genome Sequencing[19] | 1996[18] |
Encephalitozoon cuniculi | Microsporidium | Human pathogen | 2.9 Mb | 1,997[20] | Genoscope and Université Blaise Pascal | 2001[20] |
Schizosaccharomyces pombe Strain:972h- |
Schizosaccharomycetes | Model eukaryote | 14 Mb | 4,824[21] | Sanger Institute and Cold Spring Harbor Laboratory | 2002[21] |
Neurospora crassa | Sordariomycetes | Model eukaryote | 40 Mb | 10,082[22] | Broad Institute, Oregon Health and Science University, University of Kentucky, and the University of Kansas | 2003[22] |
Phanerochaete chrysosporium Strain:RP78 |
Agaricomycetes | Wood rotting fungus, use in mycoremediation | 30 Mb | 11,777[23] | Joint Genome Institute | 2004[23] |
Animals
Following are the five earliest sequenced genomes of animals. For a more complete list, see the List of sequenced animal genomes.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Caenorhabditis elegans Strain:Bristol N2 |
Nematode | Model animal | 100 Mb | 19,000[24] | Washington University and the Sanger Institute | 1998[24] |
Drosophila melanogaster | Fruit fly | Model animal | 165 Mb | 13,600[25] | Celera, UC Berkeley, Baylor College of Medicine, European DGP | 2000[25] |
Anopheles gambiae Strain: PEST |
Mosquito | Vector of malaria | 278 Mb | 13,683[26] | Celera Genomics and Genoscope | 2002[26] |
Takifugu rubripes | Puffer fish | Vertebrate with small genome | 390 Mb | 22–29,000[27] | International Fugu Genome Consortium[28] | 2002[29] |
Homo sapiens | Human | 3.2 Gb[30] | 18,826 (CCDS consortium) | Human Genome Project Consortium and Celera Genomics | Draft 2001[31][32] Complete 2006[33] |
See also
References
- Douglas S, Zauner S, Fraunholz M, et al. (April 2001). "The highly reduced genome of an enslaved algal nucleus". Nature. 410 (6832): 1091–6. Bibcode:2001Natur.410.1091D. doi:10.1038/35074092. PMID 11323671.
- Gardner MJ, Hall N, Fung E, et al. (October 2002). "Genome sequence of the human malaria parasite Plasmodium falciparum". Nature. 419 (6906): 498–511. Bibcode:2002Natur.419..498G. doi:10.1038/nature01097. PMC 3836256. PMID 12368864.
- Carlton JM, Angiuoli SV, Suh BB, et al. (October 2002). "Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii". Nature. 419 (6906): 512–9. Bibcode:2002Natur.419..512C. doi:10.1038/nature01099. PMID 12368865.
- Xu P, Widmer G, Wang Y, et al. (October 2004). "The genome of Cryptosporidium hominis". Nature. 431 (7012): 1107–12. Bibcode:2004Natur.431.1107X. doi:10.1038/nature02977. PMID 15510150.
- Abrahamsen MS, Templeton TJ, Enomoto S, et al. (April 2004). "Complete genome sequence of the apicomplexan, Cryptosporidium parvum". Science. 304 (5669): 441–5. Bibcode:2004Sci...304..441A. doi:10.1126/science.1094786. PMID 15044751. S2CID 26434820.
- Armbrust EV, Berges JA, Bowler C, et al. (October 2004). "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism". Science. 306 (5693): 79–86. Bibcode:2004Sci...306...79A. CiteSeerX 10.1.1.690.4884. doi:10.1126/science.1101156. PMID 15459382. S2CID 8593895.
- El-Sayed NM, Myler P, Bartholomeu DC, et al. (July 2005). "The Genome Sequence of Trypanosoma cruzi, Etiologic Agent of Chagas Disease". Science. 309 (5733): 409–415. doi:10.1126/science.1112631. PMID 16020725. S2CID 3830267.
- Berriman M, Ghedin E, Hertz-Fowler CH, et al. (July 2005). "The genome of the African trypanosome Trypanosoma brucei". Science. 309 (5733): 416–422. doi:10.1126/science.1112642. PMID 16020726. S2CID 18649858.
- Ivens AC, Peacock CS, Worthey EA, et al. (July 2005). "The genome of the kinetoplastid parasite, Leishmania major". Science. 309 (5733): 436–442. doi:10.1126/science.1112680. PMC 1470643. PMID 16020728.
- "TAIR - Genome Assembly".
- The Arabidopsis Genome Initiative (December 2000). "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana". Nature. 408 (6814): 796–815. Bibcode:2000Natur.408..796T. doi:10.1038/35048692. PMID 11130711.
- Ensembl entry
- Arabidopsis Genome Initiative Archived 2006-02-07 at the Wayback Machine
- Matsuzaki M, Misumi O, Shin-I T, et al. (April 2004). "Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D". Nature. 428 (6983): 653–7. Bibcode:2004Natur.428..653M. doi:10.1038/nature02398. PMID 15071595.
- Goff SA, Ricke D, Lan TH, et al. (April 2002). "A draft sequence of the rice genome (Oryza sativa L. ssp. japonica)". Science. 296 (5565): 92–100. Bibcode:2002Sci...296...92G. doi:10.1126/science.1068275. PMID 11935018. S2CID 2960202.
- Derelle E, Ferraz C, Rombauts S, et al. (August 2006). "Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features". PNAS. 103 (31): 11647–52. Bibcode:2006PNAS..10311647D. doi:10.1073/pnas.0604795103. PMC 1544224. PMID 16868079.
- Tuskan GA, Difazio S, Jansson S, et al. (September 2006). "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)". Science. 313 (5793): 1596–604. Bibcode:2006Sci...313.1596T. doi:10.1126/science.1128691. PMID 16973872. S2CID 7717980.
- Goffeau A, Barrell BG, Bussey H, et al. (October 1996). "Life with 6000 genes". Science. 274 (5287): 546, 563–7. Bibcode:1996Sci...274..546G. doi:10.1126/science.274.5287.546. PMID 8849441. S2CID 16763139.
- International Collaboration for the Yeast Genome Sequencing Archived 2007-09-27 at the Wayback Machine
- Katinka MD, Duprat S, Cornillot E, et al. (November 2001). "Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi". Nature. 414 (6862): 450–3. Bibcode:2001Natur.414..450K. doi:10.1038/35106579. PMID 11719806.
- Wood V, Gwilliam R, Rajandream MA, et al. (February 2002). "The genome sequence of Schizosaccharomyces pombe". Nature. 415 (6874): 871–80. doi:10.1038/nature724. PMID 11859360.
- Galagan JE, Calvo SE, Borkovich KA, et al. (April 2003). "The genome sequence of the filamentous fungus Neurospora crassa". Nature. 422 (6934): 859–68. Bibcode:2003Natur.422..859G. doi:10.1038/nature01554. PMID 12712197.
- Martinez, Diego; Larrondo, Luis F; Putnam, Nik; Gelpke, Maarten D Sollewijn; Huang, Katherine; Chapman, Jarrod; Helfenbein, Kevin G; Ramaiya, Preethi; et al. (2004). "Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78". Nature Biotechnology. 22 (6): 695–700. doi:10.1038/nbt967. PMID 15122302.
- C. elegans Sequencing Consortium (December 1998). "Genome sequence of the nematode C. elegans: a platform for investigating biology". Science. 282 (5396): 2012–8. Bibcode:1998Sci...282.2012.. doi:10.1126/science.282.5396.2012. PMID 9851916.
- Adams MD, Celniker SE, Holt RA, et al. (March 2000). "The genome sequence of Drosophila melanogaster". Science. 287 (5461): 2185–95. Bibcode:2000Sci...287.2185.. CiteSeerX 10.1.1.549.8639. doi:10.1126/science.287.5461.2185. PMID 10731132.
- Holt RA, Subramanian GM, Halpern A, et al. (October 2002). "The genome sequence of the malaria mosquito Anopheles gambiae". Science. 298 (5591): 129–49. Bibcode:2002Sci...298..129H. CiteSeerX 10.1.1.149.9058. doi:10.1126/science.1076181. PMID 12364791. S2CID 4512225.H
- International Fugu Genome Consortium. Forth Genome Assembly Archived 2012-02-05 at the Wayback Machine
- International Fugu Genome Consortium Archived 2012-02-05 at the Wayback Machine
- Aparicio S, Chapman J, Stupka E, et al. (August 2002). "Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes". Science. 297 (5585): 1301–10. Bibcode:2002Sci...297.1301A. doi:10.1126/science.1072104. PMID 12142439. S2CID 10310355.
- Human Genome Sequencing Consortium, International (October 2004). "Finishing the euchromatic sequence of the human genome". Nature. 431 (7011): 931–45. Bibcode:2004Natur.431..931H. doi:10.1038/nature03001. PMID 15496913.
- McPherson JD, Marra M, Hillier L, et al. (February 2001). "A physical map of the human genome". Nature. 409 (6822): 934–41. Bibcode:2001Natur.409..934M. doi:10.1038/35057157. PMID 11237014.
- Venter JC, Adams MD, Myers EW, et al. (February 2001). "The sequence of the human genome". Science. 291 (5507): 1304–51. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID 11181995.
- Gregory SG, Barlow KF, McLay KE, et al. (May 2006). "The DNA sequence and biological annotation of human chromosome 1". Nature. 441 (7091): 315–21. Bibcode:2006Natur.441..315G. doi:10.1038/nature04727. PMID 16710414.
External links
- Diark - a resource for eukaryotic genome research
- EMBL-EBL Eukaryotic Genomes
- UCSC Genome Browser
- International Sequencing Consortium - Large-scale Sequencing Project Database
- Ensembl The Ensembl Genome Browser (includes draft and low coverage genomes)
- GOLD:Genomes OnLine Database v 3.0
- SUPERFAMILY comparative genomics database Includes genomes of all completely sequenced eukaryotes, and sophisticated datamining plus visualisation tools for analysis
- Rat Genome Database