matroid | 2 months ago | on: Simple 3D Packing
matroid's comments
matroid | 2 months ago | on: Simple 3D Packing
Over the holidays, I cleaned up the implementation (with the help of Claude Code, although this is not an advertisement for it) and released it on GitHub.
If anyone needs fast 3D packing in python, do give this a shot. Hopefully I have attributed all the code/ideas I have used from elsewhere properly (if not, please feel free to let me know).
matroid | 5 months ago | on: FaceLift [ICCV 2025]
matroid | 1 year ago | on: Weak supervision to isolate sign language communicators in crowded news videos
I do think these problems are much, much worse for ISL as you rightly noted.
I think I should have been careful when I said "solve" in my post. But that really came from a place of optimism/excitement.
matroid | 1 year ago | on: Weak supervision to isolate sign language communicators in crowded news videos
matroid | 1 year ago | on: Weak supervision to isolate sign language communicators in crowded news videos
From my understanding, they are quite dissimilar. A person who knows ISL will not understand ASL, for example.
matroid | 1 year ago | on: Weak supervision to isolate sign language communicators in crowded news videos
On a related note, this sort of explains why our model is struggling to fit on 500 hours of our current dataset (even on the training set). Even so, the current state of automatic translation for Indian Sign Language is that, in-the-wild, even individual words cannot be detected very well. We hope that what we are building might at least improve the state-of-the-art there.
> It's more of a bad and broken transliteration that if you struggle to think about you can parse out and understand.
Can you elaborate a bit more on this. Do you think if we make a system for bad/broken transliteration and funnel it through ChatGPT, it might give meaningful results? That is ChatGPT might be able to correct for errors as it is a strong language model.
matroid | 1 year ago | on: Harnessing Weak Supervision to Isolate Sign Language in Crowded News Videos
matroid | 1 year ago | on: Alternative clouds are booming as companies seek cheaper access to GPUs
matroid | 2 years ago | on: Segmenting comic book frames
matroid | 2 years ago | on: Segmenting comic book frames
matroid | 2 years ago | on: Segmenting comic book frames
matroid | 2 years ago | on: Segmenting comic book frames
As amazing as recent AI progress has been, we do overrate it a lot (I'm including myself in that).
matroid | 2 years ago | on: Segmenting comic book frames
Can you explain what you mean by motion comic generation? Sounds interesting!
matroid | 2 years ago | on: GALA3D: Towards Text-to-3D Complex Scene Generation
matroid | 2 years ago | on: Interactive Coloring with SD ControlNet
* Flash Attention, an efficient attention module which significantly speeds up training, only works on Ampere GPUs [1]
* Even if I bought a 3090, I would have to get a computer to go with it, along with a PSU and some cooling. Don't know where to start with that.
matroid | 2 years ago | on: Stable Cascade
matroid | 2 years ago | on: The engineering behind Figma's vector networks (2019)
P.S. I have no affiliation with his work, although I did contribute 10$ to his Kickstarter Campaign back in the day.
matroid | 2 years ago | on: Ask HN: How to get back to programming Python?
matroid | 2 years ago | on: Auto-unloading models using __init_subclass__ (Python)
I wanted a solution which was agnostic to the type of the model, with respect to loading and inferring.
So I made this AutoUnloadModel class that unloads the model if it hasn't been used for some period. I used __init_subclass__ to ensure that all the details regarding timers, locks etc are hidden from the subclass.
I found __init_subclass__ very cool for this job, which is the reason I'm sharing this. Thanks!