bratao
👻
Improving inefficiencies
  • Escavador

Highlights

  • Pro

Organizations

@FORMAS


A Python module to repair invalid JSON, commonly used to parse the output of LLMs

Python 1,466 71 Updated Feb 13, 2025

🔧 Repair JSON! Solution for JSON Anomalies from LLMs.

Go 214 10 Updated Jul 17, 2024

Simple GRPO scripts and configurations.

Python 54 4 Updated Feb 6, 2025

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python 1,766 120 Updated Feb 14, 2025

Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"

Python 15 1 Updated Feb 14, 2024

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…

Python 745 54 Updated Nov 29, 2024

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

Python 2,517 114 Updated Feb 14, 2025

Fast State-of-the-Art Static Embeddings

Python 1,052 49 Updated Feb 14, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 11,961 1,220 Updated Feb 2, 2025

Build fast and accurate GenAI apps with GraphRAG SDK at scale.

Python 250 27 Updated Feb 5, 2025

Automated Code Reviewer for GitLab merge requests

TypeScript 6 3 Updated Dec 27, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 67,230 9,835 Updated Feb 15, 2025

A playground to make it easy to try crazy things

Python 28 1 Updated Feb 14, 2025

Code for our paper accepted at EMNLP 2023 (Findings)

Python 13 Updated Jan 5, 2024

A p2p reverse proxy with NAT traversal. Inspired by frp, rathole and ngrok

Go 318 9 Updated Feb 1, 2025

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 10,559 1,003 Updated Feb 14, 2025

Dead Simple LLM Abliteration

Python 200 6 Updated Feb 14, 2025

Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and WordPiece tokenization in JavaScript, Python and Rust.

Rust 10 Updated Dec 28, 2024
Python 3 Updated Jan 5, 2025

Automatic code review using OpenAI API triggered by GitHub/GitLab webhooks

Python 3 Updated Dec 20, 2024

Python tool for converting files and office documents to Markdown.

HTML 37,071 1,681 Updated Feb 12, 2025

A tiny HTML5 parser

Python 5 Updated Oct 29, 2024

Get your documents ready for gen AI

Python 20,956 1,159 Updated Feb 14, 2025

🧟 The modern PHP app server

Go 7,470 281 Updated Feb 12, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,519 1,898 Updated Feb 15, 2025

Windows Dependencies

C# 510 13 Updated Feb 15, 2025

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Rust 933 35 Updated Dec 21, 2024

Create an issue on FireDucks

Jupyter Notebook 675 22 Updated Feb 7, 2025

RAG that intelligently adapts to your use case, data, and queries

Python 2,901 151 Updated Feb 13, 2025