Technical documentation of PANGAEA management software

From GFBio Public Wiki
Jump to: navigation, search

PANGAEA

PANGAEA is a globally leading information system, long term archive and data publisher for geoscientific, biological and environmental data. Jointly hosted by the Centre for Marine Environmental Sciences (MARUM) at the Universität Bremen and the Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research (AWI), PANGAEA is laid out as a permanent facility, guaranteeing the long-term availability and accessibility of archived data and metadata in secure and machine readable formats. Data published by PANGAEA origins from a broad range of subdisciplines of the Biological Sciences, Chemistry, Physics with a special focus an Earth Sciences and Environmental Sciences. It is World Data Center (WDC-PANGAEA) accredited by the International Council for Science (ICSU) World Data System. Further, it holds mandates from the World Meteorological Organization (WMO) as a World Radiation Monitoring Center (WRMC). Archiving follows the Recommendations of the Commission on Professional Self Regulation in Science for safeguarding good scientific practice and the Organisation for Economic Co-operation and Development (OECD) Principles and Guidelines for Access to Research Data from Public Funding. PANGAEA is the designated archive for the journal Earth System Science Data (ESSD) and recommended data repository of several international scientific journals such as “Scientific Data” by the Nature publishing group. Essential services supplied by PANGAEA are data curation, long-term data archiving and data publication. Data curation includes quality control of metadata and the development of ontologies and vocabularies according to international protocols and standards. Metadata are extensive and each dataset can be cited using a universally unique Digital Object Identifier (DOI). The system is operated in the sense of the Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities which is a follow up to the Budapest Open Access Initiative.

Documentation

Database,system, Homepage https://www.pangaea.de
General Information Installations at GFBio collection data centers/ archives
Contact persons M. Diepenbroek, R. Huber, J. Felden, J. Weber
Developer group, country PANGAEA IT staff at MARUM: M. Diepenbroek & U. Schindler
Data domains Data published by PANGAEA origins from a broad range of subdisciplines of the Biological Sciences, Chemistry, Physics with a special focus an Earth Sciences and Environmental Sciences.
Organismic data all recent and fossil organisms groups
Software and Database System Operating System Server UNIX
User rights management Database: PostgreSQL; Editorial system: internal
Client 4D PANGAEA Editorial system; PANGAEA web frontends
GIS functionalities Gmap
GUIs for data import 4D PANGAEA Editorial system; Bulk upload via import sheets, txt, csv
GUIs for data export/ reports Dataset level: PANGAEA web frontend, HTTP download in various formats; Bulk/mass download: PANGAEA data warehouse
GUI language default: english
Open access yes
Open source no
Licenses creative commons
Information model online http://wiki.pangaea.de/wiki/Data_model
State of development stable
Code,language, developer platform Java, C, PHP & Javascript
User manual http://wiki.pangaea.de/wiki/Main_Page
Training Data curator training by PANGAEA staff at AWI and MARUM
Notes
Interfaces to export data in various schemas and standards xml according ABCD schema no
BioCASe Wrapper Version installed, description of dataflow no
xml according DarwinCore schema yes
xml according EML schema as part of DwC archives
xml according other schemas ISO19136, DIF, Dublin Core,etc..
txt, CSV export yes, DwC Archives
Notes community driven export scripts using PANGAEA APIs: https://github.com/ropensci/pangaear

Python library:

https://github.com/pangaea-data-publisher/pangaeapy

GFBio Integration

Submission and Ingestion of Data

Data providers submit their original research data and corresponding metadata via the GFBio Submission System to PANGAEA. The submission systems uses the GFBio Submission Brokerage System to trigger a data submission event via the PANGAEA internal ticket system. Each data submission is handled according to the [PANGAEA standard data curation and archiving process] guided by dedicated data curators.

PANGAEA GFBio workflow



Status: August 2021


Back to Technical Documentations