Add 52k Alpaca dataset from the Stanford Alpaca project #32

Open
wants to merge 1 commit into
base: main

joecodecreations

Adds .jsonl data for the 52k dataset from the Alpaca project.

orangetin (Member) commented Mar 31, 2023

Hey, great PR! Unfortunately, the Alpaca dataset is under a Creative Commons non-commercial license (CC BY-NC 4.0): https://github.com/tatsu-lab/stanford_alpaca/blob/aa65c492bb788e144712daab42bc5d11c2761591/DATA_LICENSE

This issue mentions them working on changing the license: tatsu-lab/stanford_alpaca#25 (comment)

As mentioned in this comment, open source licensed data sets are preferred.

It would be great if Alpaca changed the license, but as it stands, the non-commercial terms restrict how the data can be used.

Maybe someone else can comment on this.
