granitepail
|
2 days ago
|
on: Cursor Composer 2 is just Kimi K2.5 with RL
granitepail
|
2 days ago
|
on: Cursor Composer 2 is just Kimi K2.5 with RL
"Just" Kimi K2.5 with RL—people really misunderstand how difficult it is to achieve these reults with RL. Cursor's research team is highly respected within the industry, and what they've done is quite impressive.
Before people go jumping to conclusions about model theft, it's worth considering the possibility that they did reach an agreement with Moonshot which their researchers were not aware of. That would certainly explain the deleted tweets. Until Moonshot makes an official statement, I'm not particularly concerned.
granitepail
|
2 months ago
|
on: Dead Internet Theory
To me, it’s very obvious that the problem is social media. To social media, AI slop is peak efficiency. The affordances and incentives of the network encourage its creation. I don’t care for the media slop, but eg television media has more or less been producing crap like this for a while.
I don’t think LLMs and video/image models are a negative at all. And it’s shocking to me that more people don’t share this viewpoint.
granitepail
|
4 months ago
|
on: Yann LeCun to depart Meta and launch AI startup focused on 'world models'
His JEPA family of models is a genuine step forward for SSL. Not the only approach, but a very insightful one. You’re very dismissive of his work.
granitepail
|
5 months ago
|
on: US airlines are pushing to remove protections for passengers and add more fees
If ya made it through all three of the sentences they wrote, you'd see the comment you replied to came around to it being reasonable to give families a break on group seating.
granitepail
|
7 months ago
|
on: OpenAI Leaks 120B Open Model on Hugging Face
In my case, I’m paying for inference on the original models from e.g. Fireworks. So it’s not a quantization problem. The Qwen3 I was using was the new 458B (i think that’s the size?) model that was their top performer for code.
I agree with other comments that there are productive uses for them. Just not on the scale of o4-mini/o3/claude 4 sonnet/opus.
So imo open weights larger models from big US labs is a big deal! Glad to see it. Gemma models, for example, are great for their size. They’re just quite small.
granitepail
|
7 months ago
|
on: OpenAI Leaks 120B Open Model on Hugging Face
While the benchmarks all say open source models Kimi and Qwen outpace proprietary models like GPT 4.1, GPT 4o, or even o3, my (and just about everyone I know's) boots on the ground experience suggests they're not even close. This is for tool calling agentic tasks, like coding, but also in other contexts (research, glue between services, etc). I feel like it's worth putting that out there--it's pretty clear there's a lot of benchmark hacking happening. I'm not really convinced it's purposeful/deceitful, but it's definitely happening. Qwen3 Coder, for example, is basically incompetent for any real coding tasks and frequently gets caught in death spirals of bad tool calls. I try all the OSS models regularly, because I'm really excited for them to get better. Right now Kimi K2 is the most usable one, and I'd rate it at a few ticks worse than GPT 4.1.
granitepail
|
1 year ago
|
on: Journal retracts all 23 articles in special issue
From the individual article summaries:
"The Editor-in-Chief and the publisher have retracted this article. The article was submitted to be part of a guest-edited issue. An investigation by the publisher found a number of articles, including this one, with a number of concerns, including but not limited to compromised editorial handling and peer review process, inappropriate or irrelevant references or not being in scope of the journal or guest-edited issue. Based on the investigation’s findings the Editor-in-Chief therefore no longer has confidence in the results and conclusions of this article."
granitepail
|
3 years ago
|
on: New insignia for Air Domain Intelligence has a UFO
More, I think, preying on the public's interest in UFOs to justify another massive military budget. All of the wild PR around UFOs with absolutely nothing of substance to back it seems to be a very effective PR campaign.
granitepail
|
4 years ago
|
on: Show HN: The Brutalist Report – A rolling snapshot of the day’s headlines
Maybe the backend was written in concrete.
granitepail
|
4 years ago
|
on: Ask HN: What's the best way to monetize actual programming?
Definitely. If you have the skills, working in a strong fund is the best way to get pay that scales with your contribution.
granitepail
|
4 years ago
|
on: Ask HN: Do you have to use LinkedIn to get hired?
I haven't updated my LinkedIn in around five years. Definitely not a requirement. I do keep an up-to-date CV and just pass that along. Could always try putting n/a in the field and writing a note, or finding a hiring manager or someone in HR to mail directly.
From the hiring side--I've done quite a bit--I never thought twice about whether someone had a LinkedIn or CV. I do think it's lazy not to provide anything on one's background, though. It helps me guide the conversation towards a candidate's strengths.
granitepail
|
5 years ago
|
on: The Haskell Elephant in the Room
Exactly. A cryptocurrency is driven by speculation. A treasury system helps manage flows of capital in and out of a business. Said company pays to use the proprietary system. Sure, they're both backed by a ledger, but Adjoint isn't turning profit by encouraging or even allowing people to speculate on the future value of their platform. They're selling a service.
granitepail
|
5 years ago
|
on: Is every game of Slay the Spire winnable?
I played a lot of M:tG at various times in my life, and games like Magic teach you a lot about these concepts. I would have never thought about it has I not had that experience! No shame in learning about the meta game!
granitepail
|
5 years ago
|
on: Wirecard files for insolvency after financial hole laid bare
Arbitrary precision arithmetic.
granitepail
|
5 years ago
|
on: Ask HN: What are some good resources to learn how electricity works?
The Art of Electronics is perhaps the best textbook (in any subject) that I have read. I’d highly recommend it. It’s exceedingly pragmatic and will discuss a mix of physical underpinnings as well as applications.
granitepail
|
5 years ago
|
on: One word in PostgreSQL unlocked a 9x performance improvement
I wonder how the naive timestamp approach works when multiple users are syncing at the same time?
granitepail
|
7 years ago
|
on: If San Francisco is so great, why is everyone I love leaving?
agreed. living in NYC, i’ve seen how extensive the problem is. we have lots of housing being built over a much broader area than san francisco and yet we still have the same problem. the number of people moving in to the city and getting high paying jobs (i’m a part of this problem) is so great that basically anyone else is pushed out. our affordable housing program has the right idea, but is so limited in its scope as to hardly make a dent. developers keep making luxury buildings with minimal affordable units and flipping them for big money. we need more regulation, more affordable housing.
granitepail
|
10 years ago
|
on: Gitfs
interesting to see this! the first thing that came to mind was irmin, from the good folks at MirageOS. unikernels provide an interesting opportunity to rethink block storage, as you're no longer bound the rest of your operating system.
curious to see how gitfs and irmin develop.
https://github.com/mirage/irmin
granitepail
|
12 years ago
|
on: Why Programming Language X Is Unambiguously Better than Programming Language Y
echoes in the echochamber