Skip to content
View daphnei's full-sized avatar

Block or report daphnei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).

Python 150 13 Updated Oct 1, 2024

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

JavaScript 4,972 1,301 Updated Feb 7, 2024

Crawl BookCorpus

Python 818 110 Updated Jul 14, 2023

Easily fine tune GPT-2 to fill in missing text

Python 196 41 Updated Dec 8, 2022

Front-end for ChatEval platform. Written using React, Next.js and ES6 JavaScript.

JavaScript 2 2 Updated Jan 7, 2025

Python Implementation of HLL-tailcut with 4-bit buckets and MLE estimator

Python 1 Updated May 17, 2019

Public evaluation tool for non task driven neural open domain chatbots

Python 5 Updated May 4, 2022

A fast, efficient universal vector embedding utility package.

Python 1,641 119 Updated Aug 3, 2023

This repo contains code to process a once live-streamed video and annotate it with Tweets.

Jupyter Notebook 1 Updated May 22, 2018

Deep network that performs spectral clustering

Python 321 102 Updated Feb 23, 2023

Cluster paraphrases by word sense

Python 12 7 Updated Jan 3, 2019

LogLog space version of MinHash by combining ideas from HyperLogLog and b-bit MinHash

Python 52 8 Updated Feb 19, 2020

simple python3 module for scraping google images across many languages, based on input dictionaries

Python 9 8 Updated Nov 21, 2019
Python 1 Updated Sep 29, 2017
Showing results