At LinkedIn we heavily use Azkaban for this. (Open source: https://azkaban.github.io/)
The Azkaban API can be used to launch offline computation jobs as needed. Azkaban handles monitoring, SLA alerting, restarting failed jobs, dependency management, etc.
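As a rough illustration of launching a flow through the API, here is a minimal Python sketch. It assumes the documented Azkaban AJAX endpoints (a login POST with `action=login`, then `/executor?ajax=executeFlow`). The host, credentials, project, and flow names are placeholders, and the helper function names are made up for this example.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_login_request(base_url, username, password):
    """Build the POST request that asks Azkaban for a session id."""
    body = urlencode(
        {"action": "login", "username": username, "password": password}
    ).encode("utf-8")
    return Request(
        base_url.rstrip("/") + "/",
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


def build_execute_flow_request(base_url, session_id, project, flow):
    """Build the GET request that triggers one execution of a flow."""
    query = urlencode(
        {
            "ajax": "executeFlow",
            "session.id": session_id,
            "project": project,
            "flow": flow,
        }
    )
    return Request(base_url.rstrip("/") + "/executor?" + query)


if __name__ == "__main__":
    # Placeholder values; replace with your own server and project.
    req = build_execute_flow_request(
        "https://localhost:8443", "my-session-id", "demo-project", "demo-flow"
    )
    print(req.get_method(), req.full_url)
```

To actually send these you would pass each request to `urllib.request.urlopen` and read `session.id` from the JSON body of the login response before building the execute request.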
superzamp|7 years ago
The documentation often mentions Hadoop and data jobs. Have you also used it for non-data things? Would you by chance have some workflow examples?
ameyamk|7 years ago
Note that the execution environment for such jobs is the Azkaban executor server itself, so you have to take care of resource management (e.g. one job taking all the RAM on the machine will affect other jobs running on the same machine).