Pre-defined data catalogs#

This page contains a list of (global) datasets which can be used with various HydroMT models and workflows. Below are drop down lists with datasets per pre-defined data catalog for use with HydroMT. The summary per dataset contains links to the online source and available literature.

The deltares_data catalog is only available within the Deltares network. However a selection of this data for a the Piave basin (Northern Italy) is available online in the artifact_data archive and will be used if no data catalog is provided. Local or other datasets can also be included by extending the data catalog with new yaml data catalog files. We plan to provide more data catalogs with open data sources in the (near) future. See the data catalog changelog for recent updates on the pre-defined catalogs.

Using a predefined catalog#

From CLI#

To use a predefined catalog, you can specify the catalog name with the -d or --data option when running a HydroMT command. For example, to use the deltares_data catalog with the hydromt build command, you can run the following:

hydromt build MODEL -d deltares_data ...

Alternatively, deltares_data can also be accessed with the --dd option:

hydromt build MODEL --dd ...

You can specify a version of the catalog by adding the version number after the catalog name, e.g. deltares_data=v1.0.0.

hydromt build MODEL -d deltares_data=v1.0.0 ...

Once you have set the data catalog you can specify the data source(s) for each method in the HydroMT model workflow file as shown in the example below with the setup_precip_forcing method.

setup_region:
  region:
    bbox: [4.5, 51.5, 6.5, 53.5]

setup_maps_from_rasterdataset:
  raster_fn:
    source: 'eobs'
    version: 'v22.0e'

From Python#

To use a predefined catalog in Python, you can specify the catalog name with the data_libs argument when initializing a DataCatalog class. You can specify a data catalog version by adding the version number after the catalog name. You can then get data from the catalog using the DataCatalog.get_rasterdataset() or other :ref: DataCatalog methods.

from hydromt import DataCatalog
data_catalog = DataCatalog(data_libs=["deltares_data"])
# specify a data catalog version
data_catalog = DataCatalog(data_libs=["deltares_data=v2024.2"])
# get data from the catalog
ds = data_catalog.get_rasterdataset("eobs") # get the most recently added
ds = data_catalog.get_rasterdataset("eobs", version="22.0e") # get a specific
version