llm-export

llm-export是一个llm模型导出工具，能够将llm模型导出到onnx模型。

🚀 均完成onnxruntime正确性测试
🚀 优化原始代码，支持动态形状
🚀 优化原始代码，减少常量部分

模型支持与下载

用法

将该项目clone到本地

git clnoe [email protected]:wangzhaode/llm-export.git

将需要导出的LLM项目clone到本地，如：chatglm2-6b

git clone https://huggingface.co/THUDM/chatglm2-6b
# 如果huggingface下载慢可以使用modelscope
git clone https://modelscope.cn/ZhipuAI/chatglm2-6b.git

执行LLMExporter导出模型

cd LLMExporter
python llm_export.py --path ../chatglm2-6b --export_path ./onnx --export

功能

支持将模型完整导出为一个onnx模型，使用--export
支持将模型分段导出为多个模型，使用--export_split
支持导出模型的词表到一个文本文件，每行代表一个token；其中token使用base64编码；使用--export_verbose
支持导出模型的Embedding层为一个onnx模型，使用--export_embed，同时支持bf16格式，使用--embed_bf16
支持分层导出模型的block，使用--export_blocks导出全部层；使用--export_block $id导出指定层
支持导出模型的lm_head层为一个onnx模型，使用--export_lm
支持对模型进行对话测试，使用--test $query会返回llm的回复内容
支持在导出onnx模型后使用onnxruntime对结果一致性进行校验，使用--export_test
支持将tokenizer导出为文本文件，使用--export_token

参数

usage: llm_export.py [-h] --path PATH [--type {chatglm-6b,chatglm2-6b,codegeex2-6b,Qwen-7B-Chat,Baichuan2-7B-Chat,Llama-2-7b-chat-ms}]
                     [--export_path EXPORT_PATH] [--export_verbose] [--export_test] [--test TEST] [--export] [--export_split] [--export_token]
                     [--export_embed] [--export_lm] [--export_block EXPORT_BLOCK] [--export_blocks] [--embed_bf16]

LLMExporter

optional arguments:
  -h, --help            show this help message and exit
  --path PATH           path(`str` or `os.PathLike`):
                        Can be either:
                        	- A string, the *model id* of a pretrained model like `THUDM/chatglm-6b`. [TODO]
                        	- A path to a *directory* clone from repo like `../chatglm-6b`.
  --type {chatglm-6b,chatglm2-6b,codegeex2-6b,Qwen-7B-Chat,Baichuan2-7B-Chat,Llama-2-7b-chat-ms}
                        type(`str`, *optional*):
                        	The pretrain llm model type.
  --export_path EXPORT_PATH
                        export onnx model path, defaut is `./onnx`.
  --export_verbose      Whether or not to export onnx with verbose.
  --export_test         Whether or not to export onnx with test using onnxruntime.
  --test TEST           test model inference with query `TEST`.
  --export              export model to an `onnx` model.
  --export_split        export model split to some `onnx` models:
                        	- embedding model.
                        	- block models.
                        	- lm_head model.
  --export_token        export llm tokenizer to a txt file.
  --export_embed        export llm embedding to an `onnx` model.
  --export_lm           export llm lm_head to an `onnx` model.
  --export_block EXPORT_BLOCK
                        export llm block [id] to an `onnx` model.
  --export_blocks       export llm all blocks to `onnx` models.
  --embed_bf16          using `bfloat16` replace `float32` in embedding.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
llm_models		llm_models
README.md		README.md
llm_export.py		llm_export.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llm-export

模型支持与下载

用法

功能

参数

About

Releases

Packages

Languages

MetaAlms/llm-export

Folders and files

Latest commit

History

Repository files navigation

llm-export

模型支持与下载

用法

功能

参数

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages