top | item 35901179

Capillaries: Distributed data processing with Go and Cassandra

1 points| kleineshertz | 2 years ago |capillaries.io

1 comment

order

kleineshertz|2 years ago

Capillaries is a distributed data processing platform that: - works with structured row-based data - splits data into batches that can be processed as separate jobs on multiple machines in parallel - allows scenarios that involve human operator supervision and data validation - has ETL/ELT capabilities - has SQL-like join, grouping, and aggregation capabilities allows custom data processing plugins