highlanderNJ | 1 year ago

Could we replicate NotebookLM's podcast feature as a customizable API?

Live demo: https://huggingface.co/spaces/thatupiso/Podcastfy.ai_demo
Open Source Python package: https://github.com/souzatharsis/podcastfy

Apache-2.0 license

Key Features:
- Generates conversational content from multiple sources (e.g. URLs, YouTube, and PDFs) and modalities (images+text)
- Customizes transcript and audio generation (e.g., style, language, structure, length)
- Provides multi-language support for global content creation
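To make the customization idea concrete, here is a rough sketch of how style, language, structure, and length could be folded into a single LLM instruction along with the source texts. The names `ConversationConfig` and `build_prompt` are hypothetical and made up for illustration; this is not the package's actual API or config schema.

```python
from dataclasses import dataclass, field


@dataclass
class ConversationConfig:
    """Hypothetical knobs for transcript generation (illustrative only)."""
    style: str = "casual"
    language: str = "English"
    structure: list[str] = field(
        default_factory=lambda: ["intro", "discussion", "wrap-up"]
    )
    max_words: int = 800


def build_prompt(sources: list[str], cfg: ConversationConfig) -> str:
    """Combine already-extracted source texts and the config into one prompt."""
    outline = " -> ".join(cfg.structure)
    # Number each source so the model can tell them apart.
    joined = "\n\n".join(
        f"[Source {i}]\n{text}" for i, text in enumerate(sources, start=1)
    )
    return (
        f"Write a two-host podcast dialogue in {cfg.language}.\n"
        f"Tone: {cfg.style}. Outline: {outline}. "
        f"Keep it under {cfg.max_words} words.\n\n"
        f"{joined}"
    )
```

The resulting string would then go to whichever LLM you have wired up; extracting text from URLs, YouTube, or PDFs is assumed to have happened before this step.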

Technical Highlights:
- Flexible LLM integration with LangChain, supporting both cloud-based and local models
- Support for advanced text-to-speech models (OpenAI, ElevenLabs, and Microsoft Edge)
- Seamless CLI and Python package integration for automated workflows
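For anyone curious how swappable text-to-speech backends might fit together, below is a minimal sketch of one possible design: a small interface that any provider can implement, plus a step that splits a speaker-tagged transcript into turns and sends each to a voice. Everything here (`TTSBackend`, `FakeTTS`, `parse_transcript`, `render`, and the `Speaker: text` transcript format) is an assumption for illustration, not how the package is actually implemented.

```python
from typing import Protocol


class TTSBackend(Protocol):
    """Hypothetical interface that a provider wrapper would satisfy."""
    def synthesize(self, text: str, voice: str) -> bytes: ...


class FakeTTS:
    """Stand-in backend that returns tagged bytes instead of real audio."""
    def synthesize(self, text: str, voice: str) -> bytes:
        return f"<{voice}>{text}".encode("utf-8")


def parse_transcript(transcript: str) -> list[tuple[str, str]]:
    """Split 'Speaker: text' lines into (speaker, text) turns."""
    turns = []
    for line in transcript.splitlines():
        # Split on the first colon only, so colons inside the text survive.
        speaker, sep, text = line.partition(":")
        if sep and text.strip():
            turns.append((speaker.strip(), text.strip()))
    return turns


def render(
    transcript: str, backend: TTSBackend, voices: dict[str, str]
) -> list[bytes]:
    """Synthesize one clip per turn, using the voice mapped to each speaker."""
    return [
        backend.synthesize(text, voices[speaker])
        for speaker, text in parse_transcript(transcript)
    ]
```

Swapping providers then means passing a different `backend` object; the transcript handling stays the same.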

NotebookLM's AI-generated voices remain unparalleled in quality (SoundStorm is awesome!). We would love additional contributors to help build this open source alternative!
