(no title)
jamesfly | 9 months ago
It’s the biggest and dirtiest dataset I’ve ever worked with, so it’s been interesting to figure out practical solutions to run things fast and generalize cleaning tasks. Of course it’ll be impossible to get every case (I can only match about half of the state licenses to national records at the moment), so I’ll have to figure out a user-edit/consensus system for the rest.
catwhatcat|9 months ago
1 - https://ibb.co/v6WZ7MFr
jamesfly|9 months ago