I built a similar thing with the GPT-4 APIs a few weeks ago; thanks for the reminder that I must put it on GitHub at some point, as it's only about 30 lines of code.
It should be doable. What I built only works for videos with transcripts, but I've been looking to improve it using OpenAI's Whisper for speech-to-text. I'm just lazy, so I haven't gotten around to it (...which is why I spent an hour throwing together a 30-line script to summarize YouTube videos for me)
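For anyone curious what a script like that looks like, here's a minimal sketch. It assumes the third-party `youtube-transcript-api` and `openai` packages and an `OPENAI_API_KEY` in the environment; the package names, model name, and prompt are my assumptions, not the commenter's actual code.

```python
# Sketch of a transcript-based YouTube summarizer.
# Assumes: pip install youtube-transcript-api openai
# (both package names and the "gpt-4" model are assumptions for illustration).

def join_transcript(segments):
    """Flatten the list of {'text': ...} transcript segments into one string."""
    return " ".join(seg["text"].strip() for seg in segments)

def summarize(video_id):
    # Imports are local so the helper above works without these packages installed.
    from youtube_transcript_api import YouTubeTranscriptApi
    import openai  # expects OPENAI_API_KEY in the environment

    text = join_transcript(YouTubeTranscriptApi.get_transcript(video_id))
    resp = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "Summarize this video transcript in a few bullet points."},
            {"role": "user", "content": text},
        ],
    )
    return resp.choices[0].message.content
```

This only works when YouTube has a transcript for the video, which is exactly the limitation mentioned above.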
For you or anyone else reading this: I recently ran across this video documenting setting up and using Whisper. It's probably a little overdetailed, but I found the GitHub docs a little underdetailed, so it might be useful. Whisper is pretty powerful; it's one of the more useful open-source AI tools available right now.
But as you implied in your comment, it should be possible to do this quite well with any video by transcribing it with Whisper and then sending the text to GPT or another LLM to summarize.
I've done something similar here: https://github.com/mcdallas/summarize. It feeds an audio file to Whisper and then summarizes the transcript. You can easily wrap it with yt-dlp to download just the audio portion of a video.
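The yt-dlp wrapping step might look something like this. The flags (`-x`, `--audio-format`) are standard yt-dlp options; the output filename and the shape of the wrapper are my assumptions for illustration, not part of the linked repo.

```python
# Wrapping a Whisper-based summarizer with yt-dlp: download only the
# audio track of a video, then hand the resulting file to the summarizer.
import subprocess

def yt_dlp_cmd(url, out="audio.mp3"):
    """Build the yt-dlp invocation that extracts just the audio as mp3."""
    # -x: extract audio; --audio-format mp3: transcode; -o: output template
    return ["yt-dlp", "-x", "--audio-format", "mp3", "-o", out, url]

def fetch_audio(url, out="audio.mp3"):
    """Run yt-dlp and return the path of the downloaded audio file."""
    subprocess.run(yt_dlp_cmd(url, out), check=True)
    return out
```

From there it's just `fetch_audio(video_url)` followed by whatever transcribe-and-summarize step you already have.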
I also did the same, but it's a web app: https://github.com/mkagenius/audioGPT (I also have it hosted, but I'm afraid that if I post the link it would eat through all my credits)
I'm currently working on this, with the caveat that I want to do the work locally. I'm using Whisper, but the summarization portion of this task is not straightforward given the limited context size of models.
Does anyone have any additional insight into this problem?
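Not a full answer, but one common workaround for the limited context size is map-reduce summarization: split the transcript into overlapping chunks, summarize each chunk, then summarize the concatenated summaries. A sketch, with the actual model call left as a callable you plug in (everything here is my assumption, not a known-good recipe for any particular local model):

```python
# Map-reduce summarization sketch for transcripts longer than the
# model's context window. chunk_text is pure Python; summarize_chunk
# is whatever str -> str call your local model exposes.

def chunk_text(text, max_chars=4000, overlap=200):
    """Split text into chunks of at most max_chars, with a small overlap
    so sentences at a boundary aren't lost. Requires overlap < max_chars."""
    chunks, start = [], 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        start = end - overlap
    return chunks

def summarize_long(text, summarize_chunk):
    """Summarize each chunk, then summarize the combined partial summaries."""
    partials = [summarize_chunk(c) for c in chunk_text(text)]
    if len(partials) == 1:
        return partials[0]
    return summarize_chunk("\n".join(partials))
```

The chunk size would need tuning to the model's actual context window (characters are a crude proxy for tokens), and a second reduce pass may be needed if there are many chunks, but this shape works with any summarizer.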
I'll check it out (or maybe let my script check it out first), thanks.
From what I remember, the Whisper API docs weren't too bad, but I didn't try actually implementing anything, so you could be right that they're underdetailed.