I asked it to create a story that described the modes of the major scale with a cartoon treble clef as the main character.
It created a 10 page story that stuck to the topic and was overall coherent. The main character changed color and style on every page, so no consistency there. The overall page layouts and animation style were reasonably consistent.
The metaphor it used was the character climbing a mountain and encountering other characters that represented each mode. Each supporting character was reasonably unique, although note motif was present on 3 or 4. The mountain also changed significantly and the character was frequently back at the bottom. However, in the end, he does reach the summit.
I can't say I am overly impressed but it does mostly do what they claim.
I tried this to and had the same experience on half the books I tried to create. A lot of products I've tried have this issue and I think it will get better. I've been using and will stick to KidsAIStory as it allows me to use the same characters across the books. Also my child would be sad that they can't read their favorite series when google kills off another product.
I just used ChatGPT (or Gemini, no idea) the other night to generate me a story.
I live abroad, so I don't have unlimited access to books in my native language and all the websites were crappy sites with dozens of ads on it, made it unusable.
I was fed up with searching, so I went to ChatGPT, told it to generate me a story in my native tongue about a boy named $MySonsName and his partner $FavoriteAnimalOfTheDay, who is doing $WhateverMySonDidThatDay. It was a good story, used phrases commonly used in children's books in my language, and all.
I think the aspect of being with my son, hugging him while reading something before going to sleep is much more important than who came up with the story. And as parents, after a day of full time work and constantly helping at the household, sleep deprivation, my stories would be two sentences before I run out of ideas.
I think it'd be amazing if I had the energy to make up improv bedtime stories every night. (We have a "King Dragon" improv series happening lately, which involves a lot of farts)
BUT, I don't always have that energy, and I already spend hours a day reading stories to my kids, so I am okay with them spending some fraction of time hearing stories from robots/screens/etc. (Lately, it's "Hey Google, tell a story" if mommy is too busy to read)
I hope we never stop paying amazing children's book illustrators though! I have so many books where I marvel at each page and the ingenuity of the illustrative style.
When I've seen parents amuse kids with AI slop, the kids ask for more slop. When I've seen parents amuse kids with improv, the kids participate. Kids love both, and like nutrition... kids love sugar.
> Is there something lost, when it's not the adult telling the child a bedtime improv story? (IME, kids love this.)
Kids use their imagination because they're encouraged to do so. It's somewhat of a challenge to find the cusp between what is plain and what is incomprehensible (think of the ZPD but for creativity).
gemini app is really funny because they ship ridiculously complicated features like this before fixing the basic ability to have a chat history with apps activity turned off
Imagine the meetings where they decide to add personal illustrated storybooks before fixing chat histories
Nobody gets promoted for fixing bugs. That is the sad state of big tech.
My theory is this misalignment of incentives is probably at the heart of most of our quality rot in software. Product managers are incentivized to create new features that boost the daily active users, while generally blind to the death by a thousand cuts caused by all the quality issues.
I think that’s on purpose (the chat history thing), because they actually keep the data (I’m the admin in a Workspace and even though we have Apps Activity turned off, everything still gets logged for compliance and I cannot disable it)
>Try it today in the Gemini app. Available globally on desktop and mobile
Not quite. Gemini isn't available in Hong Kong. Unfortunately instead of telling Pixel users that, they updated their phones to use Gemini instead of the functional assistant, and then whenever the assistant is accessed, it just spins forever with a "just a moment" prompt.
It's not even clear why it's disabled, since it works just fine if you pay them for workspace subscription.
"If only I could spend less time with my kids and instead more time on implementing a new feature until the end of the month" - an ideal employee through the eyes of the employer
Likewise, if you rent a book from the library, you're outsourcing your parenting to some random author.
Isn't the point of this you have a customized book in like 5 minutes, and you can spend time sharing it with your child? Presumably you aren't just throwing the book at them and telling them to read it. If you spent hours drawing a book, would that mean you can spend more time with your kids?
I asked it to create the kind of storybook my toddler would have asked for ("create a storybook about a music truck and an ice cream truck and a mailman and a carwash", inspired by his request for a story last night), and the results were certainly... interesting.
Obviously Gemini doesn't know that "music truck" is another name for "ice cream truck", but more concerningly, the illustrations it made for the trucks were this kind of eldritch amalgamation of Cars-movie style cars and people driving cars. The story was just OK, I don't think it would have kept my toddler's attention for the whole ten pages. Plus, the mailman is barely involved.
Not to sound hyperbolic and this is really just the beginning of significant AI, but will there be anything left for humans to do or create when all this is done?
I work on a product that uses AI to write interactive stories and I think it's a perfect description.
People hear my product "writes stories" and always ask why the site doesn't have any features to share a full story: because it wouldn't make sense.
It'd be like listening to a stream of every song a person has ever played for themselves. Maybe they didn't write the songs, but they chose them based on the moment. Sometimes they start a song and skip half way because they already got the emotion, sometimes they repeat the saddest part 10 times.
They weren't trying to build a playlist for others to consume, it was for them, and only they could have come up with it.
I can't seem to get it to work. It just summarizes whatever I plug in.
Edit: Even without giving it context, at best, just get a single picture and two paragraphs. Maybe they are slowly rolling the feature out. It doesn't seem to get it.
Try starting your prompt with "Create a storybook about..." - this specific phrasing seems to be the trigger phrase that activates the full storybook generation mode.
I am thinking about doing something similar as I learn Spanish. I know some - about at a B1 level.
Right now I’m using ChatGPT to create my own lessons and having it to draw pictures depicting sentences in Spanish and putting a caption in Spanish underneath.
It’s keeping me from having to go from Spanish -> English -> mental image directly to Spanish -> mental image
I made several attempts to try to get it to generate something more esoteric. Here is a story about a computer falling in love with a potato chip who becomes a sentient meth addict.
Damn... pretty good. Generated a 10 page booklet including high quality graphics and cohesive story right on point with my prompt. It would've taken me at least an hour mucking around with LLMs and image generators to get the same result that it spit out in ~30 seconds.
I gave it a properly developed story. Gemini altered it somewhat to fit it in 10 pages. It was more than acceptable. But the illustrations, asked for in the style of oil paining, could have been better. We will get there.
i made https://storyforu.com which generates stories for children, based on topics you select with vibrant graphics and an interactive and quiz mode.
it was fun to build it.
That is so cool! Thanks for the Gemini team for working on that, a great and innovative feature.
Just a heads up: as I tried to print several stories to PDF, most times one of the generated images did not appear on the PDF. It’s surely a bug of some sort, because regenerating stories eventually makes it go away. Hope these kinds of issues will be fixed soon.
[+] [-] stillpointlab|7 months ago|reply
It created a 10 page story that stuck to the topic and was overall coherent. The main character changed color and style on every page, so no consistency there. The overall page layouts and animation style were reasonably consistent.
The metaphor it used was the character climbing a mountain and encountering other characters that represented each mode. Each supporting character was reasonably unique, although note motif was present on 3 or 4. The mountain also changed significantly and the character was frequently back at the bottom. However, in the end, he does reach the summit.
I can't say I am overly impressed but it does mostly do what they claim.
[+] [-] niemyjski|7 months ago|reply
[+] [-] neilv|7 months ago|reply
Is there something lost, when it's not the adult telling the child a bedtime improv story? (IME, kids love this.)
Is something else gained by the generated storybook?
[+] [-] serial_dev|7 months ago|reply
I live abroad, so I don't have unlimited access to books in my native language and all the websites were crappy sites with dozens of ads on it, made it unusable.
I was fed up with searching, so I went to ChatGPT, told it to generate me a story in my native tongue about a boy named $MySonsName and his partner $FavoriteAnimalOfTheDay, who is doing $WhateverMySonDidThatDay. It was a good story, used phrases commonly used in children's books in my language, and all.
I think the aspect of being with my son, hugging him while reading something before going to sleep is much more important than who came up with the story. And as parents, after a day of full time work and constantly helping at the household, sleep deprivation, my stories would be two sentences before I run out of ideas.
[+] [-] pamelafox|7 months ago|reply
BUT, I don't always have that energy, and I already spend hours a day reading stories to my kids, so I am okay with them spending some fraction of time hearing stories from robots/screens/etc. (Lately, it's "Hey Google, tell a story" if mommy is too busy to read)
I hope we never stop paying amazing children's book illustrators though! I have so many books where I marvel at each page and the ingenuity of the illustrative style.
[+] [-] ants_everywhere|7 months ago|reply
Yeah, kids love creating stuff
[+] [-] boothby|7 months ago|reply
[+] [-] HKH2|7 months ago|reply
Kids use their imagination because they're encouraged to do so. It's somewhat of a challenge to find the cusp between what is plain and what is incomprehensible (think of the ZPD but for creativity).
[+] [-] arrosenberg|7 months ago|reply
The opportunity for low-effort, low-talent grifters to make a buck on Amazon?
[+] [-] bionhoward|7 months ago|reply
Imagine the meetings where they decide to add personal illustrated storybooks before fixing chat histories
[+] [-] XenophileJKO|7 months ago|reply
My theory is this misalignment of incentives is probably at the heart of most of our quality rot in software. Product managers are incentivized to create new features that boost the daily active users, while generally blind to the death by a thousand cuts caused by all the quality issues.
[+] [-] rmonvfer|7 months ago|reply
But yeah, it’s Google after all
[+] [-] Workaccount2|7 months ago|reply
[+] [-] qwertox|7 months ago|reply
[+] [-] baxtr|7 months ago|reply
[+] [-] PunchTornado|7 months ago|reply
[+] [-] addy34|7 months ago|reply
Not quite. Gemini isn't available in Hong Kong. Unfortunately instead of telling Pixel users that, they updated their phones to use Gemini instead of the functional assistant, and then whenever the assistant is accessed, it just spins forever with a "just a moment" prompt.
It's not even clear why it's disabled, since it works just fine if you pay them for workspace subscription.
[+] [-] nosioptar|7 months ago|reply
[+] [-] arnaudsm|7 months ago|reply
Just spend more time with your kids, they want connection!
[+] [-] missingdays|7 months ago|reply
[+] [-] IncreasePosts|7 months ago|reply
Isn't the point of this you have a customized book in like 5 minutes, and you can spend time sharing it with your child? Presumably you aren't just throwing the book at them and telling them to read it. If you spent hours drawing a book, would that mean you can spend more time with your kids?
[+] [-] dmonitor|7 months ago|reply
[+] [-] gherkinnn|7 months ago|reply
[+] [-] insane_dreamer|7 months ago|reply
[+] [-] slongfield|7 months ago|reply
Obviously Gemini doesn't know that "music truck" is another name for "ice cream truck", but more concerningly, the illustrations it made for the trucks were this kind of eldritch amalgamation of Cars-movie style cars and people driving cars. The story was just OK, I don't think it would have kept my toddler's attention for the whole ten pages. Plus, the mailman is barely involved.
[+] [-] hopelite|7 months ago|reply
[+] [-] disillusioned|7 months ago|reply
[+] [-] rahimnathwani|7 months ago|reply
[+] [-] a11r|7 months ago|reply
[+] [-] unknown|7 months ago|reply
[deleted]
[+] [-] aeontech|7 months ago|reply
That's... uh... a pretty bold description for a tool where you are in fact outsourcing the "imagination" part to the machine.
[+] [-] BoorishBears|7 months ago|reply
People hear my product "writes stories" and always ask why the site doesn't have any features to share a full story: because it wouldn't make sense.
It'd be like listening to a stream of every song a person has ever played for themselves. Maybe they didn't write the songs, but they chose them based on the moment. Sometimes they start a song and skip half way because they already got the emotion, sometimes they repeat the saddest part 10 times.
They weren't trying to build a playlist for others to consume, it was for them, and only they could have come up with it.
[+] [-] xnx|7 months ago|reply
[+] [-] bongodongobob|7 months ago|reply
Edit: Even without giving it context, at best, just get a single picture and two paragraphs. Maybe they are slowly rolling the feature out. It doesn't seem to get it.
[+] [-] xnx|7 months ago|reply
[+] [-] ethan_smith|7 months ago|reply
[+] [-] omegaworks|7 months ago|reply
[+] [-] scarface_74|7 months ago|reply
Right now I’m using ChatGPT to create my own lessons and having it to draw pictures depicting sentences in Spanish and putting a caption in Spanish underneath.
It’s keeping me from having to go from Spanish -> English -> mental image directly to Spanish -> mental image
[+] [-] lightyrs|7 months ago|reply
https://g.co/gemini/share/598cc68832a9
[+] [-] gigel82|7 months ago|reply
[+] [-] bongodongobob|7 months ago|reply
[+] [-] _giorgio_|7 months ago|reply
Finally it gives generated text and images some sort of coherence that makes everything immediately "usable".
It is easier to develop something from a lot of text and images than having to assemble everything from zero.
Hope that it's editable too?
[+] [-] mnewme|7 months ago|reply
So many startups in that space that now get killed. Oscar Stories is going to have a hard time
[+] [-] leopoldj|7 months ago|reply
[+] [-] jp1016|7 months ago|reply
[+] [-] thimabi|7 months ago|reply
Just a heads up: as I tried to print several stories to PDF, most times one of the generated images did not appear on the PDF. It’s surely a bug of some sort, because regenerating stories eventually makes it go away. Hope these kinds of issues will be fixed soon.