This repository serves as a comprehensive guide to mastering large language models (LLMs) through hands-on projects. It includes implementations of various concepts and techniques related to LLMs, such as Retrieval-Augmented Generation (RAG) and fine-tuning GPT models. Each part is well-structured, providing clear steps and practical examples to help you learn and apply LLM techniques effectively.
In this part, I implemented a simple local RAG pipeline that reads data from the `./data` directory and then performs the following steps (a minimal, hedged sketch of how these functions might fit together appears after the list):

- It sets the `pdf_directory` variable to `data`. You should replace this with the actual path to your PDF directory.
- The `extract_text_from_pdfs()` function is called with the `pdf_directory` as an argument. This function extracts text from all the PDF files in the specified directory and returns a list of documents.
- The `create_vector_database()` function is called with the `documents` list as an argument. This function creates a vector database from the extracted documents and returns the client, collection, and model objects.
- A message is printed to indicate that the PDF documents have been processed and stored.
- An infinite loop is started using a `while True` statement. Inside the loop:
  - The user is prompted to enter a question. If the user types `exit`, the loop breaks and the program terminates.
  - The `retrieve_relevant_documents()` function is called with the user's query, collection, and model objects as arguments. This function retrieves relevant documents from the vector database based on the query.
  - The retrieved documents are joined into a single string using the `join()` method with a newline character as the separator.
  - The `generate_response_with_ollama()` function is called with the context (the joined retrieved documents) and the user's query as arguments. This function generates a response using a locally served Ollama model.
  - The generated response is printed to the console.
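The actual implementations live in this part's script; the following is only a rough sketch of how the four helper functions might fit together. It assumes the `pypdf`, `chromadb`, `sentence-transformers`, and `ollama` Python packages, an in-memory Chroma collection, the `all-MiniLM-L6-v2` embedding model, and a locally pulled Ollama model named `llama3`; these are illustrative choices, not necessarily the ones used in the repository.

```python
# Minimal sketch of the local RAG loop described above.
# Assumes: pip install pypdf chromadb sentence-transformers ollama
# and a local Ollama server with a model such as "llama3" pulled.
from pathlib import Path

import chromadb
import ollama
from pypdf import PdfReader
from sentence_transformers import SentenceTransformer


def extract_text_from_pdfs(pdf_directory: str) -> list[str]:
    """Extract the text of every PDF in pdf_directory, one string per file."""
    documents = []
    for pdf_path in Path(pdf_directory).glob("*.pdf"):
        reader = PdfReader(pdf_path)
        text = "\n".join(page.extract_text() or "" for page in reader.pages)
        documents.append(text)
    return documents


def create_vector_database(documents: list[str]):
    """Embed the documents and store them in an in-memory Chroma collection."""
    client = chromadb.Client()
    collection = client.get_or_create_collection("pdf_docs")
    model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
    embeddings = model.encode(documents).tolist()
    ids = [f"doc_{i}" for i in range(len(documents))]
    collection.add(documents=documents, embeddings=embeddings, ids=ids)
    return client, collection, model


def retrieve_relevant_documents(query, collection, model, n_results=3):
    """Return the stored documents whose embeddings are closest to the query."""
    query_embedding = model.encode([query]).tolist()
    results = collection.query(query_embeddings=query_embedding, n_results=n_results)
    return results["documents"][0]


def generate_response_with_ollama(context: str, query: str) -> str:
    """Ask a locally served Ollama model to answer the query using the context."""
    prompt = f"Answer the question using only this context:\n{context}\n\nQuestion: {query}"
    response = ollama.chat(model="llama3",  # assumed model name
                           messages=[{"role": "user", "content": prompt}])
    return response["message"]["content"]


if __name__ == "__main__":
    pdf_directory = "data"  # replace with the path to your PDF directory
    documents = extract_text_from_pdfs(pdf_directory)
    client, collection, model = create_vector_database(documents)
    print("PDF documents processed and stored.")

    while True:
        query = input("Enter a question (or 'exit' to quit): ")
        if query.strip().lower() == "exit":
            break
        retrieved = retrieve_relevant_documents(query, collection, model)
        context = "\n".join(retrieved)
        print(generate_response_with_ollama(context, query))
```

Everything runs locally (embeddings, vector store, and generation), so no API keys are required; typing `exit` at the prompt ends the loop.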
The next part fine-tunes `GPT-2` using Hugging Face. The notebook walks through the following steps (a hedged sketch follows the list):

- **Imports**: Libraries required for fine-tuning `GPT-2`.
- **Check GPU Availability**: Ensures a GPU is used if available.
- **Define Model Name**: Specifies the base model (`GPT-2`).
- **Load the Tokenizer and Model**: Loads `GPT-2` and its tokenizer from Hugging Face.
- **Add Special Tokens**: Adds a padding token and resizes the token embeddings.
- **Prepare the Dataset**:
  - Dataset Loading: Loads text datasets for training and testing.
  - Tokenization: Prepares and tokenizes the data for training.
  - Data Collator: Formats batches for training.
- **Define Training Arguments**: Configures the hyperparameters for training.
- **Define the Trainer**: Initializes the `Trainer` for training and evaluation.
- **Fine-Tune the Model**: Trains the model on the provided dataset.
- **Save the Fine-Tuned Model**: Saves the trained model and tokenizer locally.
- **Test the Fine-Tuned Model**: Generates text from the fine-tuned model using custom prompts.
- **Evaluate the Model**: Tests the model with multiple prompts to assess its performance.
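As a companion to the list above, here is a hedged, self-contained sketch of the same flow using Hugging Face `transformers` and `datasets`. The file names `train.txt`/`test.txt`, the output directory, the test prompt, and the hyperparameters are placeholders, not the notebook's exact values.

```python
# Hedged sketch of the GPT-2 fine-tuning steps listed above.
# Assumes: pip install transformers datasets torch
# "train.txt" and "test.txt" are placeholder file names, and the
# hyperparameters are illustrative, not the notebook's exact values.
import torch
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

# Check GPU availability.
device = "cuda" if torch.cuda.is_available() else "cpu"

# Define the base model name, then load its tokenizer and weights.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).to(device)

# Add a padding token (GPT-2 ships without one) and resize the embeddings.
tokenizer.add_special_tokens({"pad_token": "[PAD]"})
model.resize_token_embeddings(len(tokenizer))

# Prepare the dataset: load raw text files, then tokenize them.
raw_datasets = load_dataset("text", data_files={"train": "train.txt", "test": "test.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128, padding="max_length")

tokenized = raw_datasets.map(tokenize, batched=True, remove_columns=["text"])

# The collator builds causal-LM batches (labels are the inputs, pads masked out).
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)

# Training arguments: a small, illustrative configuration.
training_args = TrainingArguments(
    output_dir="gpt2-finetuned",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    learning_rate=5e-5,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    data_collator=collator,
)

# Fine-tune, evaluate, and save the model and tokenizer locally.
trainer.train()
print(trainer.evaluate())
trainer.save_model("gpt2-finetuned")
tokenizer.save_pretrained("gpt2-finetuned")

# Quick test: generate text from a custom prompt with the fine-tuned model.
inputs = tokenizer("Once upon a time", return_tensors="pt").to(device)
outputs = model.generate(**inputs, max_new_tokens=50, pad_token_id=tokenizer.pad_token_id)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The saved directory can later be reloaded with `AutoModelForCausalLM.from_pretrained("gpt2-finetuned")` to run the multi-prompt evaluation described above.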
This part focuses on instruction-tuning, a method to refine large language models (LLMs) for specific tasks by leveraging task-specific instructions and examples. The notebook provides a step-by-step implementation of this process:
- **Introduction**: Provides an overview of instruction-tuning and its relevance in aligning LLM outputs to user-defined tasks.
- **Setup and Imports**: Ensures the required libraries are imported and the environment is configured for smooth execution.
- **Data Preparation**:
  - Prepares a dataset of task-specific examples.
  - Formats the data into instruction-response pairs suitable for fine-tuning.
- **Load the Pre-trained Model**: Initializes a pre-trained transformer model and tokenizer as the base for instruction-tuning.
- **Preprocessing the Data**: Tokenizes the instruction-response pairs, ensuring proper handling of special tokens and padding.
- **Define Training Arguments**: Configures the fine-tuning process with key parameters such as:
  - Learning rate
  - Batch size
  - Number of training epochs
- **Fine-Tuning the Model**: Leverages Hugging Face's `Trainer` API to train the model on the prepared dataset, aligning it to follow task-specific instructions.
- **Evaluation**: Evaluates the fine-tuned model using custom task examples to ensure the model performs as intended.
- **Save the Fine-Tuned Model**: Stores the instruction-tuned model locally for future use in specific applications.
- **Usage Examples**: Demonstrates how to use the fine-tuned model for instruction-based tasks, showcasing its performance improvements over the base model (a sketch of the instruction-formatting step follows this list).
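To make the data-preparation step concrete, here is a small, hedged sketch of turning instruction-response pairs into training text. The prompt template and the two example pairs are made up for illustration, and the base model (`gpt2`) is assumed; training then proceeds with the `Trainer` API exactly as in the GPT-2 sketch above.

```python
# Hedged sketch of the instruction-tuning data preparation described above.
# The prompt template and the example pairs are illustrative assumptions,
# not the notebook's actual dataset.
from datasets import Dataset
from transformers import AutoTokenizer

# A tiny, made-up set of instruction-response pairs.
pairs = [
    {"instruction": "Summarize: The cat sat on the mat all day.",
     "response": "A cat spent the day sitting on a mat."},
    {"instruction": "Translate to French: Good morning.",
     "response": "Bonjour."},
]

PROMPT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n{response}"

def format_example(example):
    # Fold each pair into a single training string a causal LM can learn from.
    example["text"] = PROMPT_TEMPLATE.format(**example)
    return example

dataset = Dataset.from_list(pairs).map(format_example)

# Tokenize with padding/special-token handling, as in the GPT-2 sketch.
tokenizer = AutoTokenizer.from_pretrained("gpt2")  # assumed base model
tokenizer.pad_token = tokenizer.eos_token

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=256, padding="max_length")

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

# From here, the tokenized dataset is passed to a Trainer configured with a
# learning rate, batch size, and number of epochs, as in the previous sketch.
print(tokenized)
```

At inference time the same template is used with the response section left empty, and the model's continuation is read back as the answer to the instruction.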
This part is a valuable addition to the repository, guiding users through the process of aligning LLM outputs to desired tasks using instruction-tuning.