Quantity of datasets doesn’t seem like the right metric. The library just needs the datasets you care about and both libraries have the popular ones. What’s more important is integration and if you’re training custom TF models then tfds will generally integrate more smoothly than huggingface.
Great resource. My experience has been that any data project is at least 1/3 data collection/preparation, 1/3 using the right tool the right way, and 1/3 asking the right questions and interpreting the outcome.
For computer vision, there are 100k+ open source classification, object detection, and segmentation datasets available on Roboflow Universe: https://universe.roboflow.com
So many of those have tiny datasets - like 30 images that are seemingly of low quality. I love roboflow, but those are really hard to work with. I wish there was an open platform for generating the datasets that was cost effective.
[+] [-] jamesblonde|3 years ago|reply
[+] [-] soraki_soladead|3 years ago|reply
[+] [-] xnx|3 years ago|reply
[+] [-] pj_mukh|3 years ago|reply
Would love a direct Google Photos style search method for especially the visual datasets.
[+] [-] AyyWS|3 years ago|reply
https://www.kaggle.com/datasets
[+] [-] yeldarb|3 years ago|reply
[+] [-] throwaway20222|3 years ago|reply