Missouri Botanical Garden Open Conference Systems, TDWG 2014 ANNUAL CONFERENCE

Font Size: 
IPT Demo & Presentation
Kyle Braak

Building: Elmia Congress Centre, Jönköping
Room: Rum 10
Date: 2014-10-30 03:00 PM – 03:05 PM
Last modified: 2014-10-03

Abstract


The Integrated Publishing Toolkit (IPT) is a free open source software tool written in Java that is used to publish and share biodiversity datasets through the GBIF network. As of April 2014, there were 128 IPT installations installed across 42 countries, supporting publication of 220 million records coming from 872 occurrence datasets (http://www.gbif.org/ipt/stats).

The IPT was initially built to help simplify data publishing steps for publishers, compared with previous publishing systems such as DiGIR, TAPIR, and BioCASE. Designed for interoperability, it enables the publishing of content directly from databases, Microsoft Excel spreadsheets, or text files using open standards namely the Darwin Core and the Ecological Metadata Language.

The latest version of the IPT (2.1.1, as of April 2014) introduced data validation of record identifiers (if mapped by the user) ensuring that each published record has a unique identifier. This was done in order to transition to wider use of stable identifiers that will enable tracking individual records both within and outside of the GBIF network.

The next version (2.2, due in November 2014) will talk directly with DOI registration agencies (DataCite and EZID) making it possible to mint DOIs for datasets. The one caveat is that the publishing organization will require an account with DataCite or EZID. This version will also include the option to auto generate a DOI-based citation for a dataset that includes its DOI and version number. Using the combination of DOI and version number, the exact copy of a dataset used in a scientific study can always be downloaded, thereby enabling reproducibility of scientific results. By consistently citing datasets with DOIs in the citation, it is possible to track the number of times the dataset has been cited (e.g. via the Thomson Reuters Web of Science).

This presentation and demo will highlight these latest data validation and DOI handling features. The core development of the IPT happens at the GBIF Secretariat, but since the IPT is a community-driven tool anyone is welcome to provide code patches, documentation, internationalization, or simply feedback. The IPT is released under the Apache License, Version 2.0(ALv2), and can be freely downloaded and used.