top | item 41261765

(no title)

stevofolife | 1 year ago

Here’s the plan:

Run the following models:

- Speech-to-text - Text-to-text - Text-to-speech - Text-to-image - Image-to-text - Text-to-video - Video-to-text

Start by integrating third-party APIs, and later switch to open-source models.

Implement everything using your preferred backend language. After that, connect it to a frontend framework of your choice to create interactive interfaces.

You want use your own data? Put it in a database and connect it to your backend, and run these models on your database.

Once you’ve done this, you’ll have completed your full stack development training.

discuss

order

asp_hornet|1 year ago

I think this is a great take. Those problems have traditionally been hard to solve in engineering and you can get pretty reliable solutions from just an api call.