Skip to content

Conversation

Muennighoff
Copy link

@Muennighoff Muennighoff commented Jul 21, 2022

@KhalidAlt Great work on the dataset & sorry for just opening a 2nd pr with it

If someone has bandwidth would be great to add the arabic prompts via PR into this branch from: https://github.com/bigscience-workshop/promptsource/pull/763/files (Maybe @haileyschoelkopf , @KhalidAlt ?)

cc cc @thomasw21 & @lintangsutawika to double-check my Indonesian 👻

@haileyschoelkopf
Copy link

Just saw this! Unfortunately, I’ll be away from my computer today and tomorrow.

Also just a note: we should not merge this into the
eval-hackathon branch,

because the eval WG is using this dataset with the prompts + lang-specific if stmts in their experiments.

@Muennighoff
Copy link
Author

Just saw this! Unfortunately, I’ll be away from my computer today and tomorrow.

Also just a note: we should not merge this into the eval-hackathon branch,

because the eval WG is using this dataset with the prompts + lang-specific if stmts in their experiments.

I think it's fine to merge this as it's a new dataset.
We can just leave the other tydiqa in the repo so there are both options.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants