Tile-based geodata processing.
Mapchete processes raster and vector geodata in digestable chunks.
Processing larger amounts of data requires chunking the input data into smaller tiles
and process them one by one. Python provides a lot of useful packages to process geodata
like shapely or numpy. From within your process code you will have access to the geodata
in the form of NumPy
arrays for raster data or GeoJSON-like feature dictionaries for
vector data.
Internally the processing job is split into tasks which can processed in parallel using either
concurrent.futures
or build task graphs and use dask to intelligently process either on
the local machine or on a cluster.
With the help of fiona and rasterio Mapchete takes care about resampling and reprojecting geodata, applying your Python code to the tiles and writing the output either into a single file or into a directory of files organized in a WMTS-like tile pyramid. Details on tiling scheme and available map projections are outlined in the tiling documentation.
You need a .mapchete
file for the process configuration. The configuration is based
on the YAML
syntax.
process: my_python_process.py # or a Python module path: mypythonpackage.myprocess
zoom_levels:
min: 0
max: 12
input:
dem: /path/to/dem.tif
land_polygons: /path/to/polygon/file.geojson
output:
format: PNG_hillshade
path: /output/path
pyramid:
grid: mercator
# process specific parameters
resampling: cubic_spline
You also need either a .py
file or a Python module path where you specify the process
itself.
def execute(mp, resampling="nearest"):
# Open elevation model.
with mp.open("dem") as src:
# Skip tile if there is no data available or read data into a NumPy array.
if src.is_empty(1):
return "empty"
else:
dem = src.read(1, resampling=resampling)
# Create hillshade using a built-in hillshade function.
hillshade = mp.hillshade(dem)
# Clip with polygons from vector file and return result.
with mp.open("land_polygons") as land_file:
return mp.clip(hillshade, land_file.read())
You can then interactively inspect the process output directly on a map in a
browser (first, install dependencies by pip install mapchete[serve]
go to
localhost:5000
):
$ mapchete serve hillshade.mapchete --memory
The serve
tool recognizes changes in your process configuration or in the
process file. If you edit one of these, just refresh the browser and inspect the
changes (note: use the --memory
flag to make sure to reprocess each tile and
turn off browser caching).
Once you are done with editing, batch process everything using the execute
tool.
$ mapchete execute hillshade.mapchete
There are many more options such as zoom-dependent process parameters, metatiling, tile buffers or interpolating from an existing output of a higher zoom level. For deeper insights, please go to the documentation.
Mapchete is used in many preprocessing steps for the EOX Maps layers:
- Merge multiple DEMs into one global DEM.
- Create a customized relief shade for the Terrain Layer.
- Generalize landmasks & coastline from OSM for multiple zoom levels.
- Extract cloudless pixel for Sentinel-2 cloudless.
via PyPi:
$ pip install mapchete
from source:
$ git clone git@github.com:ungarj/mapchete.git && cd mapchete
$ pip install .
To make sure Rasterio, Fiona and Shapely are properly built against your local GDAL and GEOS installations, don't install the binaries but build them on your system:
$ pip install --upgrade rasterio fiona shapely --no-binary :all:
To keep the core dependencies minimal if you install mapchete using pip
, some features
are only available if you manually install additional dependencies:
# for contour extraction:
$ pip install mapchete[contours]
# for dask processing:
$ pip install mapchete[dask]
# for S3 bucket reading and writing:
$ pip install mapchete[s3]
# for mapchete serve:
$ pip install mapchete[serve]
# for VRT generation:
$ pip install mapchete[vrt]
MIT License
Copyright (c) 2015 - 2022 EOX IT Services