Anyone had real-world pleasant experiences with some sort of task orchestration thing? I feel with data engineering, they are in vogue and I might find something useful to me.
I'm writing a Python (Django) project, that I currently deploy to Kubernetes. The process "scrapes" data (mostly public APIs and Git repos, so I don't have to deal with adversarial problems. But I kinda would like to parallelize a bit, get nice logs and analytics, etc. of runs, so I can address problems.
Frankly, k8s jobs are mostly fine, and I'm happy not having to run any queue system. If I could find something that triggers k8s jobs and records results to a database, that might be enough.
(I have some ClickHouse/OpenTelemetry capabilities, perhaps I should dig more into that anyway. It seems to be that this is the future and I should get more familiar with that.)