Missouri Botanical Garden Open Conference Systems, TDWG 2016 ANNUAL CONFERENCE

Font Size: 
High-Throughput Sequencing and Taxon Occurrence Databases
Urmas Kõljalg

Building: CTEC
Room: Auditorium
Date: 2016-12-09 10:00 AM – 10:15 AM
Last modified: 2016-10-16


There is major gap between taxon occurrence data, which are based on (including living) specimens or human observations and on DNA sequence data from biological samples like soil, air, water, etc. These data are often accumulating in different databases and are not compatible with each other partly because of missing knowledge about useful data standards. This is especially evident in taxa were most new occurrence data are produced in High-Throughput Sequencing (HTS) studies. For example, HTS analyses of freshwater, marine, soil, etc. samples are generating millions of occurrences which are currently difficult, or mostly impossible, to search through common portals which rely on taxon names (GBIF, GGBN, etc.) This restricts research in many other fields, including nature conservation. Bridging DNA-based taxon occurrence datasets with observational and specimen datasets will be discussed.

GGBN = Global Genome Biodiversity Network