Video Transcription Pipeline (Whisper, Languagetool, supports german)

This project is a quick video transcription pipeline for me (and you) to use in university. Input a folder of videos and get out transcriptions (plus subtitle files) with additional grammar checking done. Pipeline script file can be easily edited to grammar check subtitles too, output different formats or other adaptions. A focus was easy and working setup as opposed to some projects I got inspired by while developing this and struggled to set up.

The current second version of this tool utilizes the advanced speech recognition model called whisper built by OpenAI.

Setup (Linux)

Download

Clone the repo:

git clone https://github.com/DavidM42/Video-Transcription-Pipeline.git

Python environment setup

virtualenv -p python3 venv
source venv/bin/activate
pip install -r ./requirements.txt

Execute

After running the setup and placing all your videos in the video folder run

source venv/bin/activate
python pipeline.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
old		old
punctuatorModel		punctuatorModel
transcriptions		transcriptions
.gitignore		.gitignore
README.md		README.md
pipeline.py		pipeline.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Transcription Pipeline (Whisper, Languagetool, supports german)

Setup (Linux)

Download

Python environment setup

Execute

About

Releases

Packages

Languages

DavidM42/Video-Transcription-Pipeline

Folders and files

Latest commit

History

Repository files navigation

Video Transcription Pipeline (Whisper, Languagetool, supports german)

Setup (Linux)

Download

Python environment setup

Execute

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages