top | item 46849409

Ask HN: The Next Big OS Leap

4 points| rafaelmdec | 29 days ago

After witnessing what is being said about the AI Botlers (like OpenClaw/Moltbot/Clawdbot), I believe UIs will start melting big time.

The point, click and type era is over.

Voice will take over as the primary interface.

UIs will be adaptive and enabled on demand.

There will be an AI agent layer on every single PC out there.

Since privacy will be an issue, "Shazam-like" filters will inhibit uncleared capture of voice.

Makes sense?

30 comments

order

adrianwaj|28 days ago

The next leap, I think will be when internet service providers are bought out by the hardware makers. Or vice versa? Everything tech will come in a bundle - and it will be global/multi-language and come with content. One ring to rule them all type stuff.

Then again, with the Agentic Economy, who knows, there could be a currency-based consolidation. Not sure what that all means for the OS. Maybe look at the payment and finance systems first and work backwards. You do some work for someone, how are they going to pay you and how are you going to spend the money?

codingdave|29 days ago

Nope, that sounds like a small iteration on UX, not a revolution, so it is not worth the massive cultural change to make it happen. After all, despite what tech folk think, most people really dislike change.

So we'll probably stick with what we've got until AI is truly empowered to change things, which we are probably a decade away from. At that point, it is far more likely that AI will be taking in full audio, video, and data from your environment, and will know you well enough that the mundane tasks will just happen, without need for any UX at all. Maybe a small device for you to tweak things and control non-standard tasks.

But again, that is a decade off, if not two. We're currently headed into the first downturn of the AI-driven world, when the hype dies, people really spell out the problems, platforms realize that most people don't want generative AI, and all of this quiets down, taking a back burner for 7-10 years while the research advances to move beyond today's problems and evolves into what people might actually want.

LargoLasskhyfv|28 days ago

Nope, because we already could have had that with VR/AR glasses, and while there are some (even impressive) options now, they aren't mainstream. Neither are the 'apps', nor the content interoperable, exchangable.

Furthermore I see nothing wrong with the desktop metaphor, it's just that we mostly only had a miserable magnifying glass, giving only a small viewport into a crammed childs toy, instead of real large high-resolution screens as can be had now, or sensible virtual desktops for more common sizes. To be expanded by "Metisse", an early 2.5D extension for FVWM, and later "User Interface Faćades". Maybe with some Zoomable UI sprinkled on top, like in https://eaglemode.sourceforge.net/ or whatever the clandestine weirdos from https://arcan-fe.com/ may come up with. (IF. EVER.)

crazyloglad|28 days ago

There's been a closed version of https://arcan-fe.com/2021/04/12/introducing-pipeworld/ that was VR centric as well as wilder 'layouters' to https://arcan-fe.com/2018/03/29/safespaces-an-open-source-vr... that is still around in my piles here somewhere.

For a handful of reasons (abusive and hostile actors being at the top) we focus elsewhere (https://www.divergent-desktop.org/blog/2026/01/26/a12web/ and https://arcan-fe.com/2025/01/27/sunsetting-cursed-terminal-e...).

AR/VR development in this space is a massive timesink for all the wrong reasons. Hardware vendors absolutely suck here. Everyone is openly or quietly dreaming of the vertically integrated 'app-store tax' being their real source of revenue rather than selling devices.

This means that if you don't want to fuzz around with half-baked proprietary SDKs that break more often than they do what they're supposed to, you get to sit around reverse engineering. As fun as that can be, it's much less so when that is not what you set out to be doing. Half my electronics 'donation boards bin' is discarded HMDs and input devices by now.

Even in the quirky missed opportunities like Tilt5 you have this situation.

rafaelmdec|28 days ago

Who said people want to wear goggles? I mean, seriously, what on earth is Apple Vision Pro??

Voice is natural, it is fluid, it conveys emotion, intent.

You cannot seriously be comparing metaverse immersion BS with voice commanded devices.

speakingmoistly|29 days ago

Does anyone actually ask for this? What problem is it solving other than following the hype?

One of the main things I've gotten out of the whole OpenClaw/Moltbot/Clawdbot situation is that the general public has a dangerously low grasp on information security. There's usefulness to that type of assistant, but I have yet to see a compelling, general consumer take on it.

rafaelmdec|28 days ago

I think that, for the first time in tech history, we have the tools to step away from ineffective app installs and menu cluttering and memorization and that is a rather big thing.

If you don't agree, take a step back and tell me how many people prefer navigating a terminal window using a keyboard instead of a graphic interface using a mouse.

The future belongs to a more frictionless, no keyboard, voice activated UI, IMHO.

mikewarot|28 days ago

The next big OS leap is a capabilities based security with a microkernel. The old model of assuming you wanted to share your authority with everything you run is unsustainable. It should have been a thing at least 20 years ago.

>>Please elaborate. How does this resonate with the average user who doesn't know anything about infosec?

Elaboration, with too much pop culture... ;-)

When you use cash, for example, you're using capabilities. You can hand out exactly $3.50 to the Loch Ness Monster[1], and no matter what, he's not going to be able to leverage that into taking out your entire bank balance, etc.

The current "ambient authority" system is like handing the Loch Ness Monster your wallet, and HOPING he only takes $3.50.

Another metaphor is power outlets, which limit how much of the power from the grid makes it to your device. The current system is much like the electric - i - cal, at the Douglass house in Green Acres.[2]

The point is, you can run any program you want, and give it only the files you want, and nothing else, by default in such a system. For the user, it really doesn't have to seem that different, they already use dialog boxes to select files to open and save things, they could use a "power box"[3] instead, which looks the same, except then the OS enforces their choices.

[1] https://www.quora.com/Why-does-the-Loch-Ness-monster-want-3-...

[2] https://youtu.be/EnGyq2JYrHk?si=c2iTB9BYxB0VwZ9u&t=184

[3] https://wiki.c2.com/?PowerBox

rafaelmdec|28 days ago

Please elaborate. How does this resonate with the average user who doesn't know anything about infosec.

nunobrito|29 days ago

Or maybe the next big OS leap is decentralization along with data sovereignity. Each person being their own server without so many dependencies to clouds and huge processing/database power inside their own pockets.

rafaelmdec|28 days ago

I have difficulty to see that, as it requires proper packaging and distribution for mainstream adoption.

Plus the average user doesn't care about data sovereignty, what they care about is UX and dopamine.

How many users you know of that are concerned with data collection by big tech? How much does that account for percent wise?

raw_anon_1111|28 days ago

Why does everyone think people want to talk to their computers? There are so many places where talking isn’t appropriate.

al2o3cr|29 days ago

    Since privacy will be an issue, "Shazam-like" filters will inhibit uncleared capture of voice.
So now the operating system will decide which recordings are "cleared" and which aren't? Fuck outta here with that nonsense

adrianwaj|27 days ago

Reminded me of the recent Q.ai acquisition by Apple. It's obvious that if you're using voice it should be as clear as possible.

I was going to suggest the next big leap will be some kind of "OmniLinux" that spreads across all devices, appliances and hardware that contains any kind of OS and enables interoperability, control and telemetry. Allows updating firmware from a central point, access control and power management. Will be used by humans first, then bots later. There might be some big retro movement to old world things as a result when people reject the idea of a "common dashboard" for the things they own. Might be useful for sharing and rentals though. Is a new OS needed for this though, why not some standards and protocols?

Had this vague thought that the OP is a bot. Does it matter?

rafaelmdec|28 days ago

I see it as a rather logical step with the advances in voice first AI wearables.

Think about it. Not everyone wants to be recorded as a bystander. Privacy will be an issue.

The technology for audio signature already exists and works fine.

It will be a matter of opt-in/opt-out from users, not an OS decision.