logo

FOSS4G 2022 | Spatial data processing with workflow engines

FOSS4G

FOSS4G

10 followers

time1 yr agoview1 views

Workflow engines like Apache Airflow are commonly used in data engineering nowadays. They provide an infrastructure for setting up, executing and monitoring a defined sequence of tasks, arranged as a workflow application. Tasks and dependencies are defined in a declarative way or in a programming language like Python. Airflow established using directed acyclic graphs (DAGs) to manage workflow orchestration.

This talk compares a selected subset out of the huge number of available Open Source workflow engines, which are especially suited for workflows containing spatial data processing. It compares the well known Apache Airflow engine with Dagster, an other solution using DAGs and a BPMN-based workflow engine using Celery as distributed task queue.

In the same space there is the new OGC API - Processes standard which is a modern REST API for wrapping computational tasks into executable processes. This talk gives an overview of the API and shows possible integrations with available workflow engines.

Pirmin Kalberer

https://talks.osgeo.org/foss4g-2022/talk/US3PDH/

#foss4g2022 #generaltrack #UsecasesAndapplications

Loading comments...