(no title)
Bjorkbat | 5 months ago
Really curious about this since people keep bringing it up on Twitter. They mention it pretty much off-handedly in their press release and doesn't show up at all in their system card. It's only through an article on The Verge that we get more context. Apparently they told it to build a Slack clone and left it unattended for 30 hours, and it built a Slack clone using 11,000 lines of code (https://www.theverge.com/ai-artificial-intelligence/787524/a...)
I have very low expectations around what would happen if you took an LLM and let it run unattended for 30 hours on a task, so I have a lot of questions as to the quality of the output
cowboy_henk|5 months ago
While it's impressive that you can now just have an llm build this, I wouldn't be surprised if the result of these 30 hours is essentially just a re-hash of one of those example Slack clones. Especially since all of these models have internet access nowadays; I honestly think 30 hours isn't even that fast for something like this, where you can realistically follow a tutorial and have it done.
In fact, I just did a quick google search and found this 15 hour course about building a slack clone: https://www.codewithantonio.com/projects/slack-clone
sigmoid10|5 months ago
Philpax|5 months ago
Your average dev can just use those.
ChadMoran|5 months ago
NaomiLehman|5 months ago
Imagine reviewing 30 hours of 2025-LLM code.
shanecp|5 months ago
Bjorkbat|5 months ago
Unless the main area of improvement was tools and scaffolding rather than the model itself.
gapeslape|5 months ago
Just to illustrate, say you are running on a slow machine that outputs 1 token per hour. At that speed you would produce approximately one sentence.
zelphirkalt|5 months ago
(First of all: Why would anyone in their right mind want a Slack clone? Slack is a cancer. The only people who want it are non-technical people, who inflict it upon their employees.)
Is it just a chat with a group or 1on1 chat? Or does it have threads, emojis, voice chat calls, pinning of messages, all the CSS styling (which probably already is 11k lines or more for the real Slack), web hooks/apps?
Also, of course it is just a BS announcement, without honesty, if they don't publish a reproducible setup, that leads to the same outcome they had. It's the equivalent of "But it worked on my machine!" or "scientific" papers that prove anti gravity with superconductors and perpetuum mobile infinite energy, that only worked in a small shed where some supposed physics professor lives.
mh-|5 months ago
> [..] left it unattended for 30 hours, and it built a Slack clone using 11,000 lines of code [..]
zmmmmm|5 months ago
it's going to be an issue I think, now that lots of these agents support computer use, we are at the point where you can install an app, tell the agent you want something that works exactly the same and just let it run until it produces it.
The software world may find it's got more in common with book authors than they thought sooner rather than later once full clones of popular apps are popping out of coding tools. It will be interesting to see if this results in a war of attrition with counter measures and strict ToU that prohibit use by AI agents etc.
stravant|5 months ago
walthamstow|5 months ago
What keeps people in are network effects and some dark patterns like vendor lock-in and data unportability.
technocrat8080|5 months ago
ChadMoran|5 months ago
osn9363739|5 months ago
haute_cuisine|5 months ago