This is a brilliant and useful application of LLM technology; I'm impressed.
One question: on the backend, is it downloading each video's CC (closed-caption) transcript and feeding that into a tuned prompt? And what happens for videos where the transcript is missing? I ask because I've noticed CC is occasionally unavailable for some YouTube videos.
If you cared to have a fallback, a potentially interesting experiment / solution for such cases is to download the video, extract the audio to a WAV file, then run the audio through Whisper [1] to generate the transcript. On a CPU it will still be incredibly intensive and slow, generally not much faster than real time (e.g. a 5-minute clip will take on the order of ~5 minutes to transcribe). However, with Whisper running on a fancy GPU it is insanely faster, on the order of 100-200x, meaning even for long videos the transcript will be ready in only a few seconds.
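For concreteness, a minimal sketch of that pipeline might look like the following (assuming yt-dlp for the download and ffmpeg on PATH for the WAV conversion; the function name and model size are just illustrative):

    import yt_dlp
    import whisper

    def transcribe_fallback(url: str) -> str:
        # Download only the audio track and convert it to WAV
        # (yt-dlp delegates the conversion to ffmpeg, which must be on PATH)
        ydl_opts = {
            "format": "bestaudio/best",
            "outtmpl": "audio.%(ext)s",
            "postprocessors": [{
                "key": "FFmpegExtractAudio",
                "preferredcodec": "wav",
            }],
        }
        with yt_dlp.YoutubeDL(ydl_opts) as ydl:
            ydl.download([url])

        # "base" is a small, fast checkpoint; larger models are more
        # accurate but dramatically slower on CPU
        model = whisper.load_model("base")
        result = model.transcribe("audio.wav")
        return result["text"]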
Great job @aka_sh!
[1] https://github.com/openai/whisper
p.s. Is there any chance you'd open source your code? Or do you plan to turn this into a business? The code itself is exactly a huge moat, and it'd be cool to see how you did this. Cheers.
p.p.s. The stepify.tech app is currently crashing out to a Heroku error page when I try to submit a YT link.
Thank you!
I'm getting the transcript through an API and feeding it to GPT. For now, the fallback for videos with no captions is just to make something out of the video's description.
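Roughly, the flow looks like this (sketched with the youtube-transcript-api package as a stand-in, since the actual API isn't named here, and the helper name is made up):

    from youtube_transcript_api import (
        YouTubeTranscriptApi,
        NoTranscriptFound,
        TranscriptsDisabled,
    )

    def get_source_text(video_id: str, description: str) -> str:
        try:
            # Each caption segment is a dict with "text", "start", "duration"
            segments = YouTubeTranscriptApi.get_transcript(video_id)
            return " ".join(seg["text"] for seg in segments)
        except (TranscriptsDisabled, NoTranscriptFound):
            # No captions available: fall back to the video description
            return description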
I really appreciate the suggestion; I'll experiment with Whisper. As for open source vs. business, I don't really know yet. I'll probably lean towards the business side to cover the costs and see where this goes.
And sorry for the downtime! API credits ran out; it should be fixed by now.
Eek, so many typos in my comment - but the most egregious was where I meant to convey that the code itself is not a huge moat. Still, no worries if you don't want to give it away; I totally understand.
Definitely try Whisper on the extracted audio as a fallback, and don't forget there are alternative implementations like faster-whisper that might be slightly less accurate but are far less resource intensive; since you're not publishing the captions themselves, you don't need every word to be literally perfect.
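For example, a sketch with faster-whisper (the model size and compute type are placeholder values you'd want to tune):

    from faster_whisper import WhisperModel

    # int8 quantization keeps memory and CPU use low, at a small accuracy
    # cost that's acceptable when the captions aren't published verbatim
    model = WhisperModel("base", device="cpu", compute_type="int8")
    segments, info = model.transcribe("audio.wav")
    transcript = " ".join(segment.text for segment in segments)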
Here's an example implementation you may find interesting (it also includes snapshots and links back to the original video): https://github.com/Yannael/video2blogpost