top | item 35070769

(no title)

vhanda | 3 years ago

The title seems quite disingenuous.

A better description would be "A PHP based web app which calls OpenAI's Whisper API to transcribe speech"

discuss

order

CSSer|3 years ago

I agree. Kudos to the author for sharing a working example of using the OpenAI's PHP Whisper client though. Digging a bit deeper into the organization that released this seems to provide more context: https://beyondco.de/. It appears this is Laravel oriented.

cinntaile|3 years ago

The main reason people add the tech stack is for marketing reasons.

The title describes what it does, I think you're making a mountain out of an anthill.

singularity2001|3 years ago

why php though, couldn't the whole thing not be completely running in the browser?

MrOwnPut|3 years ago

Many people on HN infamously called Dropbox just an rsync script, right?

It's usually all in the details and delivery (and ya'know we're lazy and lack time to setup stuff locally)

Though I wouldn't really knock anything free and open source either way.

blululu|3 years ago

The objection here is more structural than technical. The famous dropbox objection is 'anyone could do this' - even though they might not have the wherewithal to do so. The objection here is that the open source project is relying on a closed source paid service to do all the heavy lifting. Someone is going to need to foot the bill, which means this project will eventually have to answer some tough questions about funding, and what the project actually delivers.

phkahler|3 years ago

This is not open source. The wrapper may be, but it's using a non open source cloud service.

vubui|3 years ago

[deleted]

dang|3 years ago

Please don't break the site guidelines like this, regardless of how wrong someone is or you feel they are.

Rather, please make your substantive points thoughtfully and without name-calling or swipes.

https://news.ycombinator.com/newsguidelines.html

masukomi|3 years ago

It's disingenuous because literally none of the code transcribes or translates audio.

This is NOT an app that transcribes, or translates, audio.

This is a front end to another companies service.

In its defense, it is a useful front end, because getting whisper running locally was a pain in the butt because of py-torch's specific python requirements (not too old, not too new... juuuuust right).

This app also looks like it does very useful things with what whisper outputs.

But it is 100% disingenous because it does none of the things it markets itself as doing. I was expecting it to run whisper locally, not call out to a paid service.

max51|3 years ago

This is a really bad comparaison. Expedia didn't build their services in a way that makes the users think the hotel they are booking belongs to expedia. No one is going to buy an Air-France flight from them and expect the plane to be flown by Expedia employees

beernet|3 years ago

....only on HN

TylerE|3 years ago

I would expect “transcribe any audio” to mean music transcription, Personally.