top | item 42412858

(no title)

smoores | 1 year ago

That’s a really interesting idea! The more I think about it, the more I like it.

A challenge I foresee is that the media overlays are only reusable if you have the exact same input EPUB file, and have processed it with Storyteller to mark up the sentence boundaries. EPUBs have unique identifiers, though, so maybe this would be fine! We’d need to add a new processing flow to Storyteller, but it should be doable.

Feel free to hit me up in the Storyteller chat if you want to discuss more! Thanks for sharing this idea!

discuss

smeej|1 year ago

It would be cool to do this with Project Gutenberg and LibriVox files, since they're all public domain works anyway.

The entire Great Books of Western Civilization are on both, and I know I'd make more progress on reading it if I could hand off between reading and listening more easily!

NoahKAndrews|1 year ago

You could require that the input files have the same sha256 hash, that would presumably be more robust than trusting an ID from the file itself

smoores|1 year ago

Yeah I was toying around with that, too… but folks often mess around with metadata in tools like Calibre and Audiobookshelf in ways that wouldn’t have an impact on Storyteller’s sync, but would change their hash. On the other hand, I don’t know how various publishers handle EPUB dc:identifiers and that may not be robust enough, either. We could try doing something like hashing only the contents of spine items (including their file names, since that’s how media overlays refer to content)