Copilot is telling me that this is doable in 30 minutes:
1. Grab a Creative Commons video URL
2. Run it through an off-the-shelf transcription API (Whisper)
3. Then feed that transcript into GPT-style summarization prompts
4. Wrap the result in simple markdown, and spit it out.
5. A minimal script or no-code workflow can be live in half an hour—though polishing the UI, API keys, and error handling will take longer.
* I will need to get the API and help with most of the process. Hope to finally put my first AI project together with the help of my friend, the forum and the course. Thanks for any input.
** I would like that for the user it would be only:
1. Enter the cc video url
2. Hit Convert
3. Copy result