Fetchez Documentation#
Fetch geospatial data with ease.
Fetchez Les Données
Fetchez is a lightweight, modular, and highly extendable Python framework designed to orchestrate geospatial data engineering workflows.
Originally developed as the core fetching engine for the CUDEM project, Fetchez has evolved into a standalone ETL platform. It seamlessly retrieves Bathymetry, Topography, Imagery, and Oceanographic data from dozens of global repositories (NOAA, USGS, Copernicus, ESA) and processes it on the fly.
Quickstart#
Installation:
pip install fetchez
Command Line Interface:#
Fetch Copernicus topography and NOAA multibeam bathymetry for a specific bounding box in one command:
fetchez run -R loc:"Miami, FL" --global-hook audit copernicus multibeam
Python API:#
import fetchez
# Search
bathy_mods = fetchez.search("bathymetry")
# Get Data (Returns list of local file paths)
files = fetchez.get("nos_hydro", region=[-120, -118, 33, 34], min_year=2020)
# Fetch Electronic Nautical Chart data from NOAA
files = fetchez.get("charts", region=[-120, -118, 33, 34], hooks=['unzip', 'filename_filter:match=.000,stage="pre"', 'audit'])
Key Features#
Unified Interface: Access 70+ different modules using the exact same syntax.
Parallel Fetching: High-performance, multi-threaded downloading with automatic retry, timeout handling, and partial-download resumption.
Infrastructure as Code: Define complex data pipelines, cropping, and gridding workflows using CLI switches or simple YAML “Recipes”.
Pipeline Hooks: Transparently stream, filter, and process data as it is being downloaded.
Infinite Extensibility: Built on a modern plugin architecture. Drop custom Python scripts into a local folder, or install community extensions via
pipto add your own data sources and domain schemas.