Missouri Botanical Garden Open Conference Systems, TDWG 2016 ANNUAL CONFERENCE

Font Size: 
Results of IndexMed GRAIL Days 2016: How to use standards to build GRAphs and mIne data for environmentaL research
Romain David, Jean-Pierre Féral, Anne-Sophie Archambeau, David Auber, Nicolas Bailly, Cyrille Blanpain, Vincent Breton, Alrick Dias, Anna Cohen-Nabeiro, Aurélie Delavaud, Sophie Gachet, Robin Goffaux, Karina Gibert, Manuel Herrera, Dino Ienco, Romain Julliard, Julien Lecubin, Yannick Legre, Grégoire Loïs, Victor Méndez Muñoz, Jean-Charles Meunier, Isabelle Mougenot, Sophie Pamerlon, Geneviève Romier, Alison Specht, Christian Surace, Thierry Tatoni

Building: CTEC
Room: Auditorium
Date: 2016-12-08 11:45 AM – 12:00 PM
Last modified: 2016-10-15

Abstract


Data produced by biodiversity research projects that evaluate and monitor ‘Good Environmental Status’ have a high potential for use by stakeholders involved in [marine] environmental management. The lack of specific scientific objectives, poor organizational logic, and a characteristically disorganized collection of information leads to a decentralized data distribution, hampering environmental research. In such a heterogeneous system across different organizations and data formats, it is difficult to efficiently harmonize the outputs. There are few tools available to assist.

The task of the newly created consortium of IndexMeed is to index biodiversity data (and to provide an index of qualified existing open datasets) and make it possible to build graphs to assist in the analysis and development of new ways to mine data. Standards (including TDWG) and specific protocols can be applied to interconnect databases. Such semantic approaches greatly increase data interoperability.

The aim of this talk is to present the 2016 IndexMed workshop results (https://indexmed2016.sciencesconf.org) and recent actions of the consortium (renamed “IndexMeed - Indexing for Mining Ecological and Environmental Data”): new approaches to investigate complex research questions and support the emergence of new scientific hypotheses. With one day of plenary sessions and two days of practical workshops, this event was dedicated to the sharing of experience and expertise, the acquisition of practical methods to construct graphs and value data through metadata and "data papers". Recent developments in data mining based on graphs, the potential for important contributions to environmental research, particularly about strategic decision-making, and new ways of organizing data were also discussed at the workshop.

In particular, this workshop promoted decisions on how (i) to analyze heterogeneous distributed data spread in different databases, (ii) to create matches and incorporate some approximations, (iii) to identify statistical relationships between observed data and the emergence of contextual patterns, and (iv) to encourage openness and the sharing of data, in order to value data and their utilization.

The IndexMeed project participants are now exploring the ability of two scientific communities (ecology sensu lato and computer sciences) to work together. The uses of data from biodiversity research demonstrate the prototype functionalities and introduce new perspectives to analyze environmental and societal responses including decision-making. Output of the seminar lists scientific questions that can be resolved by the new data mining approaches and proposes new ways to investigate heterogeneous environmental data with graph mining.