
Show HN: CastDop – Search Inside Podcasts

4 points | t0mkaka | 3 years ago | castdop.com

Hi fellow hackers,

I made a website to search inside full podcast episodes. You can search by terms/words, or list all episodes of a podcast and view their transcripts. (To list the available episodes, click on the podcast name in a search result.)

- Search in transcripts
- View full transcripts of episodes
- Highlight important sections of a podcast episode
- Currently only for English podcasts

There is another feature in transcripts which highlights important sentences in the podcast.

The website has around 2500 episodes transcribed from around 80 podcasts. I am working on adding around 1000 episodes per day, but which episodes get added is currently random.

If you name your favorite podcast in a comment or through the form on the website, I can prioritize it.

Note that the full transcript is not available for some episodes, but search still works for them. For full transcripts, look at the 10 podcasts listed on the right side of the page. (E.g., Lex Fridman has no search results for the latest 20 episodes, but the roughly 245 episodes before that have full transcripts and are searchable.)

A few questions:

1. I know that sooner or later big companies like Apple / Google / Spotify might add this feature to their podcast apps. Is it worth pursuing?

2. How can I make it self-sustaining enough to cover server costs?

Thanks to @wenbin and Hacker News for showing that one person can build a product.

Any help, comments, or feedback is appreciated. Thanks!

5 comments


mikece|3 years ago

Is this being done leveraging the podcast:namespace and features made available in Podcasting 2.0?

CrypticShift|3 years ago

  Podcast Index LLC is a software developer focused partnership that provides tools and data to anyone who aspires to create new and exciting Podcast experiences
Isn't it better to use more open standards [1]?

[1] https://www.w3.org/WAI/media/av/transcripts/

t0mkaka|3 years ago

No, I have just used the RSS feeds of the podcasts. The audio files are transcribed, processed, and ingested into a database.

I did not know about Podcasting 2.0 but will take a look.
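
The pipeline described above (RSS feed, then transcription, then a searchable database) could be sketched roughly as follows. This is only an illustration, not CastDop's actual code: the sample feed, the `transcribe` placeholder, the table layout, and the use of SQLite with a plain `LIKE` query are all assumptions made for the example. A real system would download the audio, run a speech-to-text model, and most likely use a proper full-text index.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Made-up RSS 2.0 feed standing in for a real podcast feed.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Podcast</title>
    <item>
      <title>Episode 1</title>
      <enclosure url="https://example.com/ep1.mp3" type="audio/mpeg" length="123"/>
    </item>
    <item>
      <title>Episode 2</title>
      <enclosure url="https://example.com/ep2.mp3" type="audio/mpeg" length="456"/>
    </item>
  </channel>
</rss>"""


def parse_feed(feed_xml):
    """Return (podcast_title, [(episode_title, audio_url), ...])."""
    channel = ET.fromstring(feed_xml).find("channel")
    podcast = channel.findtext("title", default="")
    episodes = []
    for item in channel.findall("item"):
        enclosure = item.find("enclosure")  # holds the audio file URL
        if enclosure is None:
            continue  # skip items without audio
        episodes.append((item.findtext("title", default=""), enclosure.get("url")))
    return podcast, episodes


def transcribe(audio_url):
    """Hypothetical placeholder: returns canned text instead of
    downloading the audio and running speech-to-text."""
    canned = {
        "https://example.com/ep1.mp3": "Today we talk about neural networks and search.",
        "https://example.com/ep2.mp3": "A conversation about podcast search engines.",
    }
    return canned.get(audio_url, "")


def ingest(conn, feed_xml):
    """Parse the feed, transcribe each episode, and store the results."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS episodes "
        "(podcast TEXT, title TEXT, audio_url TEXT, transcript TEXT)"
    )
    podcast, episodes = parse_feed(feed_xml)
    for title, url in episodes:
        conn.execute(
            "INSERT INTO episodes VALUES (?, ?, ?, ?)",
            (podcast, title, url, transcribe(url)),
        )
    conn.commit()


def search(conn, term):
    """Return titles of episodes whose transcript contains the term.
    Note: LIKE treats % and _ in the term as wildcards."""
    rows = conn.execute(
        "SELECT title FROM episodes WHERE transcript LIKE ? ORDER BY title",
        (f"%{term}%",),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    ingest(conn, SAMPLE_FEED)
    print(search(conn, "search"))
```

A substring scan like this is fine for a few thousand transcripts; at larger scale an inverted index or a dedicated full-text search engine would be the more usual choice.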