(no title)
MaheshNat | 1 year ago
Could also crop just the object detection regions of each image, run those cropped images through CLIP/SigLIP, then UMAP and HDBSCAN to view a 2 or 3 dimensional latent space clustering of office chair types.. might reveal some info as to what kinds of chairs exist in what geographical regions. Could use a VLM to auto-tag each cluster given a couple images from each one. Could run PCA on the CLIP embeddings and have some sliders for each principal component.. maybe the first is chair color or size or whatever
much data = much fun
myself248|1 year ago
I feel like they should be one database with object_type=car and object_type=firearm respectively. And then I can finally search by object_type=vacuum_cleaner and find out the wild-looking ball-shaped vacuum in that sci-fi movie whose name escapes me...
NikkiA|1 year ago
https://d3j17a2r8lnfte.cloudfront.net/mvh/2024/3/medium/bzSo...
My grandparents had one in the 70s, it always amazed me.
nomad86|1 year ago
pinoy420|1 year ago