GitHub - Bergbok/Jerma-Subtitle-Search: Webpage for searching through 2000+ Jerma videos

Subtitles

Video Count    : 2002
Word Count     : 25,273,515
Duration       : 5385:16:33
Oldest Video   : 2011-06-11
Latest Video   : 2025-01-19

Subtitles were obtained using this Python script. Audio gets downloaded with yt-dlp, which gets transcribed using WhisperX (large-v3 model) and converted to LRC format with ffmpeg.

Relevant information gets written to a JSON file, which gets indexed and compressed using this JS script.

The Python script also supports downloading YouTube's auto-generated subtitles, and optionally only transcribing videos which don't have auto-generated subtitles available.

Webpage

Uses Mithril, MiniSearch, lite-youtube-embed and fflate.

screenshot of webpage search results for the query: "GitHub"

Running Locally

# feel free to substitute bun with npm/yarn/whatever
git clone https://github.com/Bergbok/Jerma-Subtitle-Search.git
cd Jerma-Subtitle-Search
git lfs install
git lfs pull
bun install
bun run dev

jermaHeart Twitch Emote

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
.github		.github
.vscode		.vscode
public		public
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
bun.lock		bun.lock
index.html		index.html
package.json		package.json
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Subtitles

Webpage

Running Locally

About

Contributors 3

Languages

License

Bergbok/Jerma-Subtitle-Search

Folders and files

Latest commit

History

Repository files navigation

Subtitles

Webpage

Running Locally

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 3

Languages