bing_dai | 1 year ago | on: In many cases we need something less like AI and more like a basic algorithm (2023)
bing_dai's comments
bing_dai | 2 years ago | on: Ask HN: What have you built with LLMs?
****
Project 2 - I also built a YouTube summarizer for individual video called Summary Cat (https://www.summarycat.com). It is not open source for now. The stack is very similar to project 1.
****
And yes I like summarizing YouTube videos:)
bing_dai | 2 years ago | on: Ask HN: What are you passionate about at the moment?
bing_dai | 2 years ago | on: Ask HN: What are you passionate about at the moment?
bing_dai | 2 years ago | on: Ask HN: Show me your half baked project
I made a YouTube summarizer, Summary Cat.
————————-
Technical Details
Goal: Generate brief and coherent summaries from YouTube video transcripts
Motivation of building this project: to save myself time watching videos that are too long.
Stack • Frontend: HTML/CSS, plus a tiny bit of Javascript for handling the button • Backend: Python/Flask • Flask served as the backend web framework, handling routes and integrating various services. • Front+Backend has about 500 lines in total • Hosting: AWS Free-tier Elastic Beanstalk • Database: AWS Free-tier RDS
APIs: • For grabbing YouTube's transcripts: I used youtube-transcript-api (https://pypi.org/project/youtube-transcript-api/) • For summarizing the transcripts: I used OpenAI's GPT-3.5-turbo-16k: https://platform.openai.com/docs/guides/gpt. • I used GPT-3.5 because GPT-4 is quite a lot more expensive (roughly 10X).
My Prompt (Super Simple!) • "please summarize the following text into a few paragraphs:" + the full transcript.
Thoughts about GPT-4 vs GPT-3.5-Turbo-16k or Summary Cat • GPT-4 was 20% better for "summary quality" • GPT-4 feels 50% faster • However, GPT-4 is about 10X as expensive as GPT-3.5
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
Your "use the summarizer to re-cut the videos" is fantastic!
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
Thanks for your suggestion about how to address it.
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
As for non-Youtube URL: I indeed do not handle that error right now :D. Thanks so much for bringing it up!
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
----------- My prompt is super simple. It is "please summarize the following text into a few paragraphs:" + the full transcript. -----------
Seriously that's it!
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
I would say, GPT-4 is - 20% better at "summary quality" - feels 50% faster - BUT, 10X as expensive.
So using GPT-3.5 was the right choice for me at this point.
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
Summary Cat doesn't work for videos where there's not a lot of speaking. I am hoping to build a bit more on text-rich videos first, so I do not plan to do actual video (image) processing any time soon.
I will keep that in mind!!
bing_dai | 2 years ago | on: Show HN: Summary Cat, a YouTube Video Summary Generator
So far, I found that each ~10 minute video uses around 1000 tokens. It costs me about 3 cents to summarize, which is not too bad as I don't have many users, and users haven't been requesting summaries for super long videos yet.
If this sites gets a lot of interest, I might start restricting something :)..
GPT 3.5 Pricing: https://openai.com/pricing
bing_dai | 4 years ago | on: Ask HN: What is your spiritual practice?
I am wondering what makes you think that?
bing_dai | 5 years ago | on: No meetings, no deadlines, no full-time employees
bing_dai | 5 years ago | on: To the brain, reading computer code is not the same as reading language
I have worked in both computer science and classical music, and found many crossover talents. Two previous Director of Music at my school (UBC in Canada) in the last few decades are both musicians and engineers/physicists.
bing_dai | 5 years ago | on: The global fertility rate is falling
I find it unfair that the society often consider people being "too young; may change mind later" when they say they do not want children; but we don't apply the same standards to young people who want children.