sshumaker | 1 year ago | on: OpenAI and Microsoft Azure to deprecate GPT-4 32K
sshumaker's comments
sshumaker | 1 year ago | on: Context caching guide
You can also design an API optimized for batch workloads (say the same core prompt with different data for instruct-style reasoning) - that can result in large savings in those scenarios.
sshumaker | 1 year ago | on: Context caching guide
The simple version of this just involves saving off the KV cache in the attention layers, and restore it back instead of recomputing. It only requires small changes to inference and the attention layers.
The main challenge is being able to do this under scale, e.g. dump the weights out of GPU memory, persist them, and have a system to rapidly reload them as needed (or just regenerate).
sshumaker | 1 year ago | on: Context caching guide
There's even work that is more sophisticated in this domain that allows 'template' style partial caching: https://arxiv.org/abs/2311.04934
sshumaker | 1 year ago | on: Ask HN: Who is hiring? (May 2024)
We're a stealth startup building something unbelievably ambitious in the AI space that blends AI and gaming tech - co-founders are Andy Gavin [1] (co-founder of video game developer Naughty Dog) and myself (VP @ Microsoft, Credit Karma, previously Google and Naughty Dog). Venture-backed by top investors including First Round Capital and Battery. Get in on the ground floor and work directly alongside a living legend and a dream team of world-class talent.
We're looking for someone who has deep experience with CoreML and optimizing model inferencing for mobile usage (e.g. ANE on iOS).
If interested, reach out to me at: [email protected]
[1] Andy Gavin on the Making of Crash Bandicoot: https://www.youtube.com/watch?v=pSHj5UKSylk
sshumaker | 1 year ago | on: Descent 3 Source Code
sshumaker | 1 year ago | on: Bay Area workers charged for building secret apartments inside train stations
I’m not claiming democracy is any better - humans are notoriously bad at long-term thinking unfortunately.
sshumaker | 1 year ago | on: The Pentagon's Silicon Valley Problem
The Israelis know this by now, so the fact that Israel was goaded into a ground war speaks as much to the political situation as anything else, but either way it’s tragic.
sshumaker | 2 years ago | on: Bypassing Safari 17's advanced audio fingerprinting protection
sshumaker | 2 years ago | on: Ask HN: Who is hiring? (March 2024)
We're a stealth startup building something unbelievably ambitious in the AI space that blends AI and gaming tech - co-founders are Andy Gavin [1] (co-founder of video game developer Naughty Dog) and myself (VP @ Microsoft, Credit Karma, previously Google and Naughty Dog). Venture-backed by top investors including First Round Capital and Battery. Get in on the ground floor and work directly alongside a living legend and a small team of world-class talent.
Lead Mobile Engineer - Swift and C/C++, CoreML a plus
AI engineer - Finetuning/Retraining LLMs (LLama/Mixtral/etc), MLOps a plus
If interested, reach out to me at: [email protected]
[1] Andy Gavin on the Making of Crash Bandicoot: https://www.youtube.com/watch?v=pSHj5UKSylk
sshumaker | 2 years ago | on: Hallucination is inevitable: An innate limitation of large language models
I just tried exactly that with dalle-3 and it worked well.
More to the point, it’s pretty clear LLMs do form a model of the world, that’s exactly how they reason about things. There was some good experiments on this a while back - check out the Othello experiment.
sshumaker | 2 years ago | on: Generalized K-Means Clustering
sshumaker | 2 years ago | on: Generalized K-Means Clustering
That said, something like hdbscan doesn’t suffer from this problem.
sshumaker | 2 years ago | on: Cursorless is alien magic from the future
sshumaker | 2 years ago | on: Show HN: I rewrote the 1990's LambdaMOO server
sshumaker | 2 years ago | on: Doug Lenat has died
Response: There is no widely available historical information to suggest that Abraham Lincoln's maternal grandmother had an unusual number of thumbs. It would be reasonable to guess that she had the typical two thumbs, one on each hand, unless stated otherwise.
sshumaker | 2 years ago | on: Social media for AI bots: “No humans allowed”
From a pure measurement standpoint, could Jupiter fit in the space between the earth and moon?
The average distance from the Earth to the Moon is about 238,855 miles (384,400 kilometers). Jupiter, the largest planet in our solar system, has a diameter of about 86,881 miles (139,822 kilometers).
So, if you were to somehow place Jupiter in between the Earth and the Moon, it would fit with a significant amount of room to spare. However, it's important to note that this is a purely theoretical situation and not something that could actually happen without cataclysmic consequences due to gravitational forces and other factors.
sshumaker | 2 years ago | on: Why are there no antitrust claims vs. GitHub Copilot, when there is a precedent?
We’ll get engineers working long hours focused on it, consulting closely with our legal and trust teams. One of the first questions we ask legal when we suspect a privacy issue is “Is this a notifiable event?”
It’s not really about getting slapped by regulators - it’s the fact that much of Microsoft’s business is built by earning the trust of large companies and small ones. Many of them are in the EU of course, but we have strict compliance we apply broadly. It’s just not worth damaging our reputation (and hurting our business) for some shortcut somewhere, as trust takes a long time to build and is easily broken.
sshumaker | 2 years ago | on: I'm never investing in Google's smart home ecosystem again
sshumaker | 2 years ago | on: America Forgot About IBM Watson. Is ChatGPT Next?
In what way is this not true? Obviously there is no perfection here, only degrees of risk. But this is literally why people pick MSFT over others. They have by far the strongest culture around maintaining trust in the enterprise space.