Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I built a similar thing with the GPT-4 APIs a few weeks ago; thanks for the reminder that I must put it on GitHub at some point, as it's only about 30 lines of code.


I need this, but for Vimeo videos.


It should be doable. What I built only works for videos with transcripts, but I've been looking to improve it using OpenAI's Whisper for Speech-To-Text. I'm just lazy so I haven't gotten around to it (...which is why I spent an hour throwing together a 30-line script to summarize youtube videos for me)


For you or anyone else reading this I recently ran across this video documenting setting up and using whisper. It's probably a little overdetailed, but I found the github docs a little underdetailed so might be useful. Whisper is pretty powerful. One of the more useful open source ai tools available right now.

https://www.youtube.com/watch?v=XX-ET_-onYU

But as you implied in your comment, it should be possible to do it quite well with any video by transcripting with whisper and then sending the text to gpt or another LLM to summarize.


I’ve done something similar here https://github.com/mcdallas/summarize it feeds an audio file to whisper and then summarizes the transcript. You can easily wrap it with yt-dlp to download the audio portion of a video


I also did the same but its a web app, https://github.com/mkagenius/audioGPT (i also have it hosted but I am afraid if i post the link, it would eat through all my credits)


I’m currently working on this with the caveat that I want to do the work locally. Using whisper but the summarization portion if this task is not straightforward given the limited context size of models.

Does anyone have any additional insight into this problem?


I'll check it out (or maybe let my script check it out first), thanks.

From what I remember the Whisper API docs weren't too bad, but I didn't try actually implementing anything, so you could be right that they're underdetailed.


Maybe one for TikTok videos? Would be the scourge of influencers...


The GPT4 API is not generally available yet right? At least for the public?


You can apply for access, and you'll be added to a waiting list. I only have access to the GPT4 API with 8K context, not 32K.


There's a waiting list. I applied on the first day and only just got access.


No :(. I applied several weeks ago and I still don't have access


it’s accessible via the beta. they gave me access after a day or two of waiting, but I don’t know if that’s typical or not


Please do. Curious how you transform a YouTube video to text before sending to chatgpt?


Youtube offers transcripts / cc



Yep, it’s pretty much that except using GPT-4 and a wider context.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: