This project is a quick video transcription pipeline for me (and you) to use in university. Input a folder of videos and get out transcriptions (plus subtitle files) with additional grammar checking done. Pipeline script file can be easily edited to grammar check subtitles too, output different formats or other adaptions. A focus was easy and working setup as opposed to some projects I got inspired by while developing this and struggled to set up.
The current second version of this tool utilizes the advanced speech recognition model called whisper built by OpenAI.
Clone the repo:
git clone https://github.com/DavidM42/Video-Transcription-Pipeline.git
virtualenv -p python3 venv
source venv/bin/activate
pip install -r ./requirements.txt
After running the setup and placing all your videos in the video folder run
source venv/bin/activate
python pipeline.py