CAMELS-DE Processing Pipeline

CAMELS-DE

CAMELS-DE (Catchment Attributes and Meteorology for Large-sample Studies) is a hydro-meteorological dataset which offers data for 1555 catchments in Germany.
The time series data of hydro-meteorological variables is in daily resolution and spans up to 70 years, from January 1951 to December 2020. Additionally, the dataset includes a set of derived catchment attributes, such as soil properties, land cover, and topographic data as well as time series data of simulated discharge from a regional Long-Short Term Memory (LSTM) network and a conceptual hydrological model. The catchment boundaries are also provided in the dataset, which allows for the derivation of further catchment attributes.




Download CAMELS-DE: 10.5281/zenodo.12760336

Map of all 1555 gauging stations

Processing Pipeline

Description

The processing pipeline of CAMELS-DE is built in a modular way where each component is implemented in a seperate GitHub repository that builds its own Docker container. Each repository includes detailed documentation, guidelines for building and running the Docker containers, and instructions for accessing the required input data.
We want to make the origin as well as used methodologies / packages / functions of each data component of CAMELS-DE as traceable, comprehensible and reproducible as possible. We think that this is best achieved through the modular containerized design instead of one monolithic code repository to generate CAMELS-DE. In addition, this design simplifies possible future updates of individual components and the integration of new data sources.

Pipeline Components

The figure below visualises the CAMELS-DE processing pipeline, the processing containers are clickable, with links leading to the respective GitHub repositories. All repositories are released as version 1, which ensures that the code used for the generation of CAMELS-DE can be referred to in a stable manner. The final CAMELS-DE tile brings you to the published CAMELS-DE data set:

  1. camelsp: Processing and harmonization of raw discharge data and water level data from the federal agencies (Note that we currently cannot share the raw data from the federal agencies).
  2. camels_catchments: Derivation of catchment boundaries for each gauging station in CAMELS-DE by its latitude and longitude.
  3. camels_meteo: Derivation of meteorological time series data from the HYRAS data set from the German Weather Service (DWD).
  4. camels_soils: Derivation of soil properties for each catchment in CAMELS-DE from the SoilGrids data set.
  5. camels_hydrogeo: Derivation of hydrogeological properties for each catchment in CAMELS-DE from the HÜK250 from the Federal Institute for Geosciences and Natural Resources (BGR)
  6. camels_landcover: Derivation of land cover data for each catchment in CAMELS-DE from the CORINE Land Cover data set.
  7. camels_dem: Derivation of digital elevation model (DEM) data for each catchment in CAMELS-DE from the EU DEM GLO-30 from Copernicus.
  8. camels_human_influence: Derivation of human influence data for each catchment in CAMELS-DE from the Inventory of Dams in Germany (Speckhann et al 2021).
  9. camels_de: Integration of all derived data sets and simulation results (generated with the package Hy2DL) into the final CAMELS-DE data set and calculation of hydrologic and climatic attributes.