We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
比如我有一个数据集,使用自然语言描述数据结构,模型的任务是还原出数据结构。 是否需要从头构建tokenizer和预训练数据集呢,以及tokenizer和预训练数据集是否要完全基于我的数据集构建呢,望解惑。
The text was updated successfully, but these errors were encountered:
tokenizer 在任意数据集中都不需要重新构建
tokenizer
minimind2 将会给出新的数据集格式,可以用于构建自己的垂直任务
minimind2
未来几天很快发布
Sorry, something went wrong.
No branches or pull requests
比如我有一个数据集,使用自然语言描述数据结构,模型的任务是还原出数据结构。
是否需要从头构建tokenizer和预训练数据集呢,以及tokenizer和预训练数据集是否要完全基于我的数据集构建呢,望解惑。
The text was updated successfully, but these errors were encountered: