Show HN: VisaWhen – Data on US visa issuance backlogs
104 points| underyx | 4 years ago
<https://visawhen.com/consulates/bogota/h1b>
The code is source-available (not open source) at <https://github.com/underyx/visawhen>. It's my first time choosing a source-available license over MIT, mainly out of fear of existing immigration startups just gobbling this data and code up; frankly I didn't think the implications through though, I just threw a safe license on there.
The way the project works is:
- Use requests-html to find publicly available PDFs from government pages
- Use camelot to OCR the PDFs and extract data tables from them
- Since the previous step takes crazy long for my tastes (around 8000 pages at around 5 seconds each) I've used dask to split the work into chunks and parallel-process them across my laptop's CPUs.
- Do data cleanup and processing with pandas, and save all of it to a SQLite file.
- Take data from the SQLite file with next.js and generate a static HTML page for each possible embassy - visa type combination
- The pages use ECharts to visualize data, and Bulma as a CSS framework
- Build and host each commit via Netlify
- But proxy to Netlify from CloudFlare, which I believe has more edge locations in the free plan
- Collect any donations via Ko-Fi
- Use Google Analytics to have a general idea about visitor counts
- Use FullStory session recordings to find out about bugs – I've fixed quite a few and I think I'll probably remove this tracking after a bit of time
…and that's where I'm at now. I'm pretty happy about the results. Most pages load in less than 300ms, which is something I care about all too much. More importantly, I've shared the site with some immigration communities I'm part of, and the response has been very positive! Let me know what y'all think.
[+] [-] simonw|4 years ago|reply
Having done that...
http://127.0.0.1:8001/consulates/backlogs?_facet=Post+Slug&_...
Full page screenshot here: https://static.simonwillison.net/static/2021/consulate-backl...
Shows an interesting graph where the number of L1 visa issuances in London drops from around 500 a month to 0 around March/April 2020, eventually climbing to between 19 and 65 per month in the past few months up to today.
This is a really neat dataset, congrats!
[+] [-] jmercouris|4 years ago|reply
[+] [-] RileyJames|4 years ago|reply
I lucked out a drove there and back during a chinook, so roads were good, the drive was epic.
I recall I was able to see the bookings available at each consulate, but I think I’d been preliminarily approved, or paid something at that stage.
Nice work on the site, adding any opacity to beauracratic processes is a positive in my book.
[+] [-] lgats|4 years ago|reply
[+] [-] axaxs|4 years ago|reply
[+] [-] underyx|4 years ago|reply
[+] [-] Evidlo|4 years ago|reply
[+] [-] underyx|4 years ago|reply
[+] [-] ftyers|4 years ago|reply
[+] [-] ftyers|4 years ago|reply
[+] [-] simonw|4 years ago|reply
[+] [-] underyx|4 years ago|reply
This setup was lovely and worked quite well. Except it broke due to an absolutely obscure, insane, hopeless bug: https://github.community/t/cannot-resolve-travel-state-gov-h...
[+] [-] graton|4 years ago|reply
[+] [-] underyx|4 years ago|reply
[+] [-] estebandalelr|4 years ago|reply
[+] [-] jcims|4 years ago|reply
[+] [-] whoisjuan|4 years ago|reply
[+] [-] frakkingcylons|4 years ago|reply
[+] [-] underyx|4 years ago|reply
And also the repo: https://github.com/underyx/visawhen
They weren’t clickable before I added the < > either, I just thought that might fix it. Maybe I’ll have to ask dang.
[+] [-] estebandalelr|4 years ago|reply
[+] [-] snihalani|4 years ago|reply