About

Log in?

DTU users get better search results including licensed content and discounts on order fees.

Anyone can log in and get personalized features such as favorites, tags and feeds.

Log in as DTU user Log in as non-DTU user No thanks

DTU Findit

Journal article

Preserving Accuracy in GenBank

By Bidartondo, M.I.1; Bruns, T. D.; Blackwell, M.; Edwards, I.; Taylor, A.F.S.; Horton, T.; Zhang, N.; Koljalg, U.; May, G.; Kuyper, T.W.; Bever, J.D.; Gilbert, G.; Taylor, J.W.; DeSantis, T.Z.; Pringle, A.; Borneman, J.; Thorn, G.; Berbee, M.; Mueller, G.M.; Andersen, G.L.; Villinga, E.C.; Branco, S.; Anderson, I.; Dickie, I.A.; Avis, P.; Timonen, S.; Kjøller, R.; Lodge, D.J.; Bateman, R.M.; Purvis, A.; Crous, P.W.; Hawkes, C.; Barraclough, T.; Burt, A.; Nilsson, R.H.; Larsson, K.-H.; Alexander, I.; Moncalvo, J.M.; Berube, J.; Spatafora, J.; Lumbsch, H.T.; Blair, J.E.; Suh, S.-O.; Pfister, D.H.; Binder, M.; Boehm, E.W.; Kohn, I.; Mata, J.L.; Dyer, P.; Sung, G.-H.; Dentinger, B.; Simmons, E.G.; Baird, R.E.; Volk, T.J.; Perry, B.A.; Kerrigan, R.W.; Campbell, J.; Rajesh, J.; Reynolds, D.R.; Geiser, D.; Humber, R.A.; Hausmann, N.; Szaro, T.; Stajich, J.; Gathman, A.; Peay, K.G.; Henkel, T.; Robinson, C.H.; Pukkila, P.J.; Nguyen, N.H.; Villata, C.; Kennedy, P.; Bergemann, S.; Aime, M.C.; Kauff, F.; Porras-Alfaro, A.; Gueidan, C.; Beck, A.; Andersen, Birgitte2,3; Marek, S.; Crouch, J.A.; Kerrigan, J.; Ristaino, J.B.; Hodge, K.T.; Kuldau, G.; Samuels, G.J.; Raja, H.J.; Voglmayr, H.; Gardes, M.; Janos, D.P.; Rogers, J.D.; Cannon, P.; Woolfolk, S.W.; Kistler, H.C.; Castellano, M.A.; Maldonado-Ramírez, S.L.; Kirk, P.M.; Farrar, J.J.; Osmundson, T.; Currah, R.S.; Vujanovic, V.; Chen, W.; Korf, R.P.; Atallah, Z.K.; Harrison, K.J.; Guarro, J.; Bates, S.T.; Bonello, P.; Bridge, P.; Schell, W.; Rossi, W.; Stenlid, J.; Frisvad, Jens Christian2,3; Miller, R.M.; Baker, S.E.; Hallen, H.E.; Janso, J.E.; Wilson, A.W.; Conway, K.E.; Egerton-Warburton, L.; Wang, Z.; Eastburn, D.; Ho, W.W.H.; Kroken, S.; Stadler, M.; Turgeon, G.; Lichtwardt, R.W.; Stewart, E.L.; Wedin, M.; Li, D.-W.; Uchida, J.Y.; Jumpponen, A.; Deckert, J.; Beker, H.J.; Rogers, S.O.; Xu, J.; Johnston, P.; Shoemaker, R.A.; Liu, M.; Marques, G.; Summerell, B.; Sokolski, S.; Thrane, Ulf2,3; Widden, P.; Bruhn, J.N.; Bianchinotti, V.; Tuthill, D.; Baroni, T.J.; Barron, G.; Hosaka, K.; Jewell, K.; Piepenbring, M.; Sullivan, R.; Griffith, G.W.; Bradley, S.G.; Aoki, T.; Yoder, W.T.; Ju, Y.-M.; Berch, S.M.; Trappe, M.; Duan, W.; Bonito, G.; Taber, R.A.; Coelho, G.; Bills, G.; Ganley, A.; Agerer, R.; Nagy, L.; Roy, B.A.; Læssøe, T.; Hallenberg, N.; Tichy, H.-V.; Stalpers, J.; Langer, E.; Scholler, M.; Krueger, D.; Pacioni, G.; Pöder, R.; Pennanen, T.; Capelari, M.; Nakasone, K.; Tewari, J.P.; Miller, A.N.; Decock, C.; Huhndorf, S.; Wach, M.; Vishniac, H.S.; Yohalem, D.S.; Smith, M.E.; Glenn, A.E.; Spiering, M.; Lindner, D.L.; Schoch, C.; Redhead, S.A.; Ivors, K.; Jeffers, S.N.; Geml, J.; Okafor, F.; Spiegel, F.W.; Dewsbury, D.; Carroll, J.; Porter, T.M.; Pashley, C.; Carpenter, S.E.; Abad, G.; Voigt, K.; Arenz, B.; Methven, A.S.; Schechter, S.; Vance, P.; Mahoney, D.; Kang, S.; Rheeder, J.P.; Mehl, J.; Grief, M.; Ngala, G.N.; Ammirati, J.; Kawasaki, M.; Gwo-Fang, Y.; Matsumoto, T.; Smith, D.; Koenig, G.; Luoma, D.; May, T.; Leonardi, M.; Sigler, L.; Taylor, D.L.; Gibson, C.; Sharpton, T.; Hawksworth, D.L.; Dianese, J.C.; Trudell, S.A.; Paulus, B.; Padamsee, M.; Callac, P.; Lima, N.; White, M.; Barreau, C.; Juncai, M.A.; Buyck, B.; Rebeler, R.K.; Liles, M.R.; Estes, D.; Carter, R.; Herr Jr., J.M.; Chandler, G.; Kerekes, K.; Cruse-Sanders, J.; Márquez, R.G.; Horak, E.; Fitzsimons, M.; Döring, H.; Yao, S.; Hynson, N.; Ryberg, M.; Arnold, A.E.; Hughes, K. ...and 247 more

From

Imperial College London1

Center for Microbial Biotechnology, Department of Systems Biology, Technical University of Denmark2

Department of Systems Biology, Technical University of Denmark3

GenBank, the public repository for nucleotide and protein sequences, is a critical resource for molecular biology, evolutionary biology, and ecology. While some attention has been drawn to sequence errors (1), common annotation errors also reduce the value of this database. In fact, for organisms such as fungi, which are notoriously difficult to identify, up to 20% of DNA sequence records may have erroneous lineage designations in GenBank (2).

Gene function annotation in protein sequence databases is similarly error-prone (3, 4). Because identity and function of new sequences are often determined by bioinformatic analyses, both types of errors are propagated into new accessions, leading to long-term degradation of the quality of the database.

Currently, primary sequence data are annotated by the authors of those data, and can only be reannotated by the same authors. This is inefficient and unsustainable over the long term as authors eventually leave the field. Although it is possible to link third-party databases to GenBank records, this is a short-term solution that has little guarantee of permanence.

Similarly, the current third-party annotation option in GenBank (TPA) complicates rather than solves the problem by creating an identical record with a new annotation, while leaving the original record unflagged and unlinked to the new record. Since the origin of public zoological and botanical specimen collections, an open system of cumulative annotation has evolved, whereby the original name is retained, but additional opinion is directly appended and used for filing and retrieval.

This was needed as new specimens and analyses allowed for reevaluation of older specimens and the original depositors became unavailable. The time has come for the public sequence database to incorporate a community-curated, cumulative annotation process that allows third parties to improve the annotations of sequences when warranted by published peer-reviewed analyses (5).

Language: English
Publisher: American Association for the Advancement of Science
Year: 2008
Pages: 1616
ISSN: 10959203 and 00368075
Types: Journal article
DOI: 10.1126/science.319.5870.1616a
ORCIDs: Andersen, Birgitte , Frisvad, Jens Christian , Thrane, Ulf and 0000-0002-2027-4119

DTU users get better search results including licensed content and discounts on order fees.

Log in as DTU user

Access

Analysis