-
Reloadware
- Brisbane, Australia
Lists (1)
Sort Name ascending (A-Z)
Stars
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Userspace tool to disable middle mouse button paste in Xorg
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python3 library for downloading YouTube Videos.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Official implementation of Conflict-Free Inverse Gradients Method
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
SGLang is a fast serving framework for large language models and vision language models.
Vocal Remover using Deep Neural Networks
A script that shows warning messages to the user when the battery is almost empty. For i3wm users.
🔊 Text-Prompted Generative Audio Model
An Open Source text-to-speech system built by inverting Whisper.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A fast inference library for running LLMs locally on modern consumer-class GPUs
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
Faster Whisper transcription with CTranslate2
Inference and training library for high-quality TTS models.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Foundational model for human-like, expressive TTS
Instant voice cloning by MIT and MyShell. Audio foundation model.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production