Technical Documentations

From GFBio Public Wiki
Revision as of 16:02, 6 November 2019 by David Fichtmueller (Talk | contribs)

Jump to: navigation, search

The research and infrastructure software, the management and publication systems and the long-term archiving solutions described here in detail represent the technical profiles and portfolios of the GFBio data centers (GFBio archives).



Within GFBio we distinguish five major types of biological data. They are used for the "Service Description" of the individual Collection Data Centers.


  • Type 2 data are taxonomic (checklist) data, which are treated via the ABCD and DwC standards (primary identifier=taxon name according the rule of the three International Codes of Biological Nomenclature)
  • Type 3 data are environmental biological and ecological data, which are transferred into a highly structured format at data item level (e.g., single measurement) and associated with e.g. EML or ISO 19139 metadata. This type includes functional and phylogenetic trait data, the latter are subject of DELTA or SDD standards. (primary identifier= biological concept, e.g. OTU or OFU), with environmental (analysis, measurement) information as main secondary information or primary identifier=environmental event with biological information as main secondary information)

  • Type 4 data are non-molecular analysis data (data sets and/or data packages) in its original data file format (often RAW format). (This data are accepted if well documented, with a core set of standard-compliant metadata and appropriate for long-term archiving, without further data management required.)
  • Type 5 data are molecular sequence data, including MIxS-compliant metadata (primary identifier=molecular sequence with geo-information and time as main secondary information)







Data management software supported by GFBio as user service for data producers is described under Tools and Workbenches; see also services provided by the GFBio Terminology Service.