top | item 41892964

(no title)

dmpetrov | 1 year ago

Right, DVC caches data for consistency and reproducibility.

If caching is not needed and streaming required, we've created a sister tool DataChain. It's even supports WebDataset and can stream from tar archives and filter images by metadata.

WebDataset example: https://github.com/iterative/datachain/blob/main/examples/mu...

discuss

order

notrealyme123|1 year ago

Thank you! Thats news to me. I will absolutely give it a try