Explore using RAG in ChatCraft #337

Closed
humphd opened this issue Jan 14, 2024 · 6 comments

@humphd
Collaborator

humphd commented Jan 14, 2024

As we start to consider adding the ability to attach files to a chat (see #325), we're going to run into cases where the context window of the chat is not enough to fit everything we have. Consider a PDF of a paper, or a zip file with a bunch of files and code that you want to ask questions about.

RAG (Retrieval-Augmented Generation) is a way to use a large piece of context (e.g., a big document, a database, etc.) to "retrieve" relevant chunks of context, then include those along with your prompt. For example, if I had a zip file of a source code project, I might only need to include 5 or 6 chunks of code with my question vs. the whole thing. RAG techniques allow you to find chunks of text within a larger document/database that are similar to what you are talking about.
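To make the retrieval step concrete, it can be as simple as ranking pre-computed chunk embeddings by cosine similarity against an embedding of the user's question. Here's a minimal sketch; the `Chunk` type and `topKChunks` helper are illustrative, not existing ChatCraft code:

```ts
// Illustrative retrieval step: given an embedding of the user's question and
// pre-computed embeddings for each chunk of an attached file, return the
// top-k most similar chunks to include alongside the prompt.
type Chunk = { text: string; embedding: number[] };

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

function topKChunks(questionEmbedding: number[], chunks: Chunk[], k = 5): Chunk[] {
  return chunks
    .map((chunk) => ({
      chunk,
      score: cosineSimilarity(questionEmbedding, chunk.embedding),
    }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k)
    .map(({ chunk }) => chunk);
}
```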

I was reminded of this reading Twitter today:

[Screenshot of a tweet, 2024-01-14 at 8:59 AM]

It's possible to generate embeddings in a browser (free), or have OpenAI ($$$) do it for you.
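For reference, here's roughly what both options might look like, assuming Transformers.js (`@xenova/transformers`) for the in-browser route and the official `openai` package for the paid route; the model names are just examples:

```ts
import { pipeline } from "@xenova/transformers";
import OpenAI from "openai";

// Free: runs locally in the browser; the model is downloaded once and cached.
async function embedInBrowser(text: string): Promise<number[]> {
  const extractor = await pipeline("feature-extraction", "Xenova/all-MiniLM-L6-v2");
  const output = await extractor(text, { pooling: "mean", normalize: true });
  return Array.from(output.data as Float32Array);
}

// $$$: calls OpenAI's embeddings endpoint with the user's API key.
async function embedWithOpenAI(text: string, apiKey: string): Promise<number[]> {
  const openai = new OpenAI({ apiKey, dangerouslyAllowBrowser: true });
  const res = await openai.embeddings.create({
    model: "text-embedding-3-small",
    input: text,
  });
  return res.data[0].embedding;
}
```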

RAG probably isn't the main way we'd use ChatCraft; but given that we have a database of all chats, and the ability to include files, we should probably explore whether we can leverage this for our use cases.

@tarasglek
Owner

I would actually start with RAG for code, e.g., teach ChatCraft to use code indexers à la https://tabby.tabbyml.com/docs/configuration/

The search/embedding makes more sense to me as an external web service for ChatCraft to invoke.

@humphd
Collaborator Author

humphd commented Jan 21, 2024

Looking at the Tabby code, it seems like they run a server on port 8080, and they have a JS agent library at https://github.com/TabbyML/tabby/tree/main/clients/tabby-agent. Maybe we could use that?

We'd have to include a way to point ChatCraft at a server running somewhere (local docker container, remote...) via settings or something.

Someone should investigate whether this is doable and worth doing.
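A settings-driven client could look something like the sketch below. The `/v1/search` endpoint and response shape here are placeholders rather than the actual tabby-agent API; whoever picks this up would need to check the real interface in the linked repo.

```ts
// Hypothetical sketch only: the endpoint path and response shape are placeholders.
interface RetrievalSettings {
  // e.g. "http://localhost:8080" for a local Docker container, or a remote URL
  indexServerUrl: string;
}

async function searchCodeIndex(
  settings: RetrievalSettings,
  query: string
): Promise<string[]> {
  const res = await fetch(`${settings.indexServerUrl}/v1/search`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ query }),
  });
  if (!res.ok) {
    throw new Error(`Index server error: ${res.status}`);
  }
  const { snippets } = await res.json();
  return snippets as string[];
}
```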

@rjwignar
Collaborator

I'd be willing to work on parts of this

@Amnish04
Collaborator

I would like to try contributing to this

@WangGithub0
Collaborator

Try to use the built-in OpenAI support.

@Amnish04 removed their assignment Apr 10, 2024
@humphd
Collaborator Author

humphd commented Jan 27, 2025

Closing in favour of #803

@humphd closed this as completed Jan 27, 2025