Launch HN: Golpo (YC S25) – AI-generated explainer videos
116 points| skar01 | 6 months ago |video.golpoai.com
We’ve always made videos to communicate any concept and felt like it was the clearest way to communicate. But making good videos was time-consuming and tedious. It required planning, scripting, recording, editing, syncing voice with visuals. Even a 2-minute video could take hours.
AI video tools are impressive at generating cinematic scenes and flashy content, but struggle to explain a product demo, walk through a complex workflow, or teach a technical topic. People still spend hours making explainer videos manually because existing AI tools aren’t built for learning or clarity.
Our solution is Golpo. Our video generation engine generates time-aligned graphics with spoken narration that are good for onboarding, training, product walkthroughs, and education. It’s fast, scalable, and built from the ground up to help people understand complex ideas through simple storytelling.
Here’s a demo: https://www.youtube.com/watch?v=C_LGM0dEyDA#t=7.
Golpo is built specifically for use cases involving explaining, learning, and onboarding. In our (obviously biased!) opinion, it feels authentic and engaging in a way no other AI video generator does.
Golpo can generate videos in over 190 languages. After it generates a video, you can fully customize its animations by just describing the changes you want to see in each motion graphic it generates in natural language.
It was challenging to get this to work! Initially, we used a code-generation approach with Manim, where we fine-tuned a language model to emit Python animation scripts directly from the input text. While promising for small examples, this quickly became brittle, and the generated code usually contained broken imports, unsupported transforms, and poor timing alignment between narration and visuals. Debugging and regenerating these scripts was often slower than creating them manually.
We also explored training a custom diffusion-based video model, but found it impractical for our needs. Diffusion could produce high-fidelity cinematic scenes, but generating coherent sequences beyond about 30 seconds was unreliable without complex stitching, making edits required regenerating large portions of the video, and visuals frequently drifted from the instructional intent, especially for abstract or technical topics. Also, we did not have the compute to scale this.
Existing state-of-the-art systems like Sora and Veo 3 face similar limitations: they are optimized for cinematic storytelling, not step-by-step educational content, and they lack both the deterministic control needed for time-aligned narration and the scalability for 5–10 minute explainers.
In the end, we took a different path of training a reinforcement learning agent to “draw” whiteboard strokes, step-by-step, optimized for clear, human-like explanations. This worked well because the action space was simple and the environment was not overly complex, allowing the agent to learn efficient, precise, and consistent drawing behaviors.
Here are some sample videos that Golpo generated:
https://www.youtube.com/watch?v=33xNoWHYZGA (Whiteboard Gym - the tech behind Golpo itself)
https://www.youtube.com/watch?v=w_ZwKhptUqI (How do RNNs work?)
https://www.youtube.com/watch?v=RxFKo-2sWCM (function pointers in C)
https://golpo-podcast-inputs.s3.us-east-2.amazonaws.com/file... (basic intro to Gödel's theorem)
You can try Golpo here: https://video.golpoai.com, and we will set you up with 2 credits. We’d love your feedback, especially on what feels off, what you’d want to control, and how you might use it. Comments welcome!
typs|6 months ago
Edit: I've used. It's amazing. I'm going to be using this a lot.
Masih77|6 months ago
skar01|6 months ago
metalliqaz|6 months ago
bangaladore|6 months ago
I agree. Rather than (what I assume is) E2E text -> video/audio output, it seems like training a model on how to utilize the community fork of manim which 3blue1brown uses for videos would produce a better result.
[1] https://github.com/ManimCommunity/manim/
delbronski|6 months ago
Congrats! Cool product.
Feedback: I tried making a product explainer video for a tree planting rover I’m working on. The rover looked different in every scene. I can imagine this kind of consistency may be more difficult to get right. Maybe if I had uploaded a photo of how the rover looks it may have helped. In one scene the rover looks like an actual rover, in the other it looks like a humanoid robot.
But still, super impressed!
skar01|6 months ago
torlok|6 months ago
grues-dinner|6 months ago
dtran|6 months ago
Signed up and waiting on a video :)
Edit: here's a 58s explainer video for the concept of body doubling: https://video.golpoai.com/share/448557cc-cf06-4cad-9fb2-f56b...
addandsubtract|6 months ago
albumen|6 months ago
Have you tried a "filled line" approach, rather than "outlined" strokes? Might feel more like individual marker strokes.
I made a demo video on the free tier and it did a great job explaining acoustic delay lines in an accessible fashion, after feeding it a catalog PDF with an overview of the historical artefact and photography of an example unit. Unfortunately the service invented its own idea of what the artefact looked like. Could you offer a storyboard view and let users erase the incorrect parts and sketch their own shapes? Or split the drawing up into logical elements and the user could redraw them as needed, which would then be reused where that element is used in other frames?
skar01|6 months ago
mclau157|6 months ago
skar01|6 months ago
giorgioz|6 months ago
Keep up refining the generated demo! Best of luck
fxwin|6 months ago
[1] https://www.youtube.com/@Aleph0
[2] https://www.youtube.com/@MinutePhysics
[3] https://www.youtube.com/@12tone
[4] https://www.youtube.com/@SimplilearnOfficial
raylad|6 months ago
Are there options to have the text appear differently?
dfee|6 months ago
> The Al needs to figure out not just what to draw, but precisely when to draw it
;)
adi4213|6 months ago
skar01|6 months ago
meistertigran|6 months ago
ceroxylon|6 months ago
skar01|6 months ago
tk90|6 months ago
I asked it about pointers in Rust. The transcript and images were great, very approachable!
"Do not let your computer sleep" -> is this using GPU on my machine or something?
skar01|6 months ago
UltraSane|6 months ago
drawnwren|6 months ago
Lienetic|6 months ago
reactordev|6 months ago
What I always wanted to do was to teach what I know but I lack the time commitment to get it out. This might be a way…
skar01|6 months ago
trenchpilgrim|6 months ago
Note: Your paywall for downloading the video is easily bypassed by Inspect Element :)
My main concern for you is that y'all will get Sherlocked by OpenAI/Anthropic/Google.
mkagenius|6 months ago
1. For example, I have built http://gitpodcast.com which can be run for free. Can also be self hosted using free tier of gemini and azure speech.
ishita159|6 months ago
I would love to add a link to my product docs, upload some images and have it generate an onboarding video of the platform.
skar02|6 months ago
skar01|6 months ago
cube2222|6 months ago
I’m mostly curious how it fairs with more complex topics and doing actually informative (rather than just “plain background”) illustrations.
Like a video explaining transformer attention in LLMs, to stay on the AI topic?
andhuman|6 months ago
metalliqaz|6 months ago
Wolf_Larsen|6 months ago
Lienetic|6 months ago
WasimBhai|6 months ago
skar02|6 months ago
sdotdev|6 months ago
OG_BME|6 months ago
The video UUID starts with "f5fbd6c7", hopefully that's sufficient to identify me!
skar02|6 months ago
futhey|6 months ago
But, white on black is really ugly. Even black on white or a simple inversion would be an improvement.
I think it could benefit from the ability to pause and see the transcript, and make edits before the video is generated.
Terretta|6 months ago
qwertytyyuu|6 months ago
achempion|6 months ago
p.s. the pricing section is unreadable under the 840px width
ActVen|6 months ago
skar01|6 months ago
mandeepj|6 months ago
If I may ask - how do you generate your audio?
poly2it|6 months ago
skar02|6 months ago
bangaladore|6 months ago
Seems like this is pretty useless unless you pay 200$ per month. Which may be a reasonable number for the clearly commercial / enterprise use case, but I'm just not certain what you can do wtih the lower tiers.
ing33k|6 months ago
android521|6 months ago
KaoruAoiShiho|6 months ago
empressplay|6 months ago
It created an 8 minute video explaining my Logo-based coding language using 50 sources and it was free.
https://www.youtube.com/watch?v=HZW75burwQc
BoorishBears|6 months ago
Straight vector paths?
nextworddev|6 months ago
skar02|6 months ago
CalRobert|6 months ago
One is reminded of smbc
https://www.seekpng.com/png/detail/213-2132749_gulpo-decal-f...
skar02|6 months ago
atleastoptimal|6 months ago
cool product though!
unknown|6 months ago
[deleted]
subhro|6 months ago
skar02|6 months ago
snowfield|6 months ago
ludicrousdispla|6 months ago
ayaros|6 months ago
In addition, the line-art style of the illustrations looks like that same cartoonish-AI-slop style I see everywhere now. I just can't take it seriously.
If this tool is widely deployed it's just going to get used to spread more misinformation. I'm sure it will be great for bad actors and spammers to have yet another tool in their toolbox to spread whatever weird content or messages they want. But for the rest of us, that means search engines and YouTube and other places will be filled with a million AI-generated half-baked inferior copies of Khan Academy. It's already hard enough to find good educational resources online if you don't know where to look, and this will only make the problem worse.
You'll just have to forgive me if I'm not really excited about this tool.
...also the name is a bit weird. It reminds me of "Gulpo, the fish who eats concepts" from that classic SMBC cartoon. (https://www.smbc-comics.com/comic/2010-12-15)
whitepaint|6 months ago
ks2048|6 months ago
I'm sure AI could help make good animations like this, but this looks like slop.
dangoboydango|6 months ago
[deleted]
orange-tourist|6 months ago
[deleted]
personjerry|6 months ago
wordpad|6 months ago