  • Reloadware
  • Brisbane, Australia

Sponsors

@toddkaufmann
@HyukdongKim


LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,247 465 Updated Jan 22, 2025

Userspace tool to disable middle mouse button paste in Xorg

C 404 21 Updated Feb 10, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,128 1,212 Updated Jan 22, 2025

Python3 library for downloading YouTube Videos.

Python 949 119 Updated Jan 20, 2025
Python 7,182 564 Updated Jan 14, 2025

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 527 26 Updated Dec 19, 2024

Official implementation of Conflict-Free Inverse Gradients Method

Python 27 1 Updated Nov 14, 2024

Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).

Python 190 3 Updated Jan 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,574 731 Updated Jan 22, 2025

Vocal Remover using Deep Neural Networks

Python 1,616 238 Updated Jul 23, 2024

A script that shows warning messages to the user when the battery is almost empty. For i3wm users.

Shell 295 32 Updated Feb 4, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,729 4,322 Updated Aug 19, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,091 227 Updated Dec 12, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,078 2,153 Updated Dec 13, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,856 292 Updated Jan 9, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,119 4,417 Updated Jan 18, 2025

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 1,362 145 Updated Jan 17, 2025

Faster Whisper transcription with CTranslate2

Python 13,614 1,147 Updated Jan 1, 2025

Inference and training library for high-quality TTS models.

Python 4,927 508 Updated Dec 10, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,482 154 Updated Oct 28, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,604 649 Updated Aug 13, 2024

Longformer: The Long-Document Transformer

Python 2,072 276 Updated Feb 8, 2023

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Python 1,261 86 Updated Jan 22, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,833 3,420 Updated Jan 22, 2025

Piper based VoiceDock TTS implementation

C++ 9 1 Updated Aug 12, 2023

Foundational model for human-like, expressive TTS

Python 3,993 670 Updated Jul 30, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,567 3,041 Updated Jan 7, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,318 466 Updated Aug 10, 2024

A fast, local neural text to speech system

C++ 7,507 548 Updated Oct 21, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,066 4,594 Updated Aug 16, 2024