This thesis addresses the challenge of efficiently summarizing and analyzing video content using artificial intelligence. Quickly extracting key information from lengthy videos is particularly important in educational and research contexts. The approach combines Whisper for speech-to-text conversion with GPT for content analysis. A web application was developed that automatically generates summaries, key points, and answers to questions about video content. In a test on a selected video, the application produced clear and accurate output. The key contribution is a solution for processing longer videos and improving subtitle quality, especially for Slovenian content, which extends the application's usability across language environments and video types.