Data hacking: Recovering the long tail of soils data
Data Hackathon at AGU 2017 (time and location tbd)
Lead: K Todd-Brown
Organizations: International Soil Carbon Network (ISCN) and Permafrost Carbon Network (PCN)
Soil biogeochemistry is an extremely data rich field with a long standing societal interest in soil management from both agricultural and climate change standpoints. The field has generated vasts amounts of data, much of which is archived online, however harmonization of these data to address broad global issues has been challenging. Many data collection efforts attempt to address comparability by imposing templates on data providers, which frequently represents a difficult and time-consuming hurdle for data providers. General templates are frequently both too big and too small; awkwardly large for the data collected for the study but do not include appropriate fields for the data collected. The alternative approach is to create a customized script to automate translation of data from the original format to a standardized database.
In this hackathon, we will produce 1) customized ingestion scripts for soil carbon related data sets that are registered in existing repositories, and 2) scripts for output databases that conform to common templates. Participants will need to be familiar with reading data files into R and capable of writing an R function. Coders and other contributors will be acknowledged in project doi’s and there will be additional opportunities to participate in meta-analysis manuscripts.
We hope you can join us for this new community project!
All are welcome and graduate students and early-career scientists are especially encouraged to join!
Questions? Email firstname.lastname@example.org
Click here to for early registration (limited space)!