-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Orb v3 #240
base: main
Are you sure you want to change the base?
Orb v3 #240
Conversation
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
@janosh / @CompRhys this is ready now - I am still seeing some error from the tests regarding the csv predictions - I can include them here, but I think it's not being fetched from the aws bucket - is it preferable to have them stored remotely? I'll send a follow up PR when the paper is out to update a couple of the yaml sections, shouldn't be too long (but the models are available as of now in |
Sounds good. |
Description
Orb-v3 - coming in the next few days! I've also included the script I used to do the geo_opt analysis (mostly based on the one in matbench, but isolated to a single script), because I know that was slightly in the air based on #230. To check, I ran a couple of other models using their predictions in figshare (showing the utility of getting people to to this!):
We are still in the process of adding the exact models to
orb-models
but we'll make a release over the weekend and then I will update this PR.Checklist
Please check the following items before submitting your PR:
models/<arch_name>/<model_variant>.yml
for my submission.arch_name
is the name of the architecture andmodel_variant.yml
includes things like author details, training set names and important hyperparameters.Model.<arch_name>
enum inenums.py
.<yyyy-mm-dd>-<model_variant>-preds.csv.gz
).<yyyy-mm-dd>-wbm-IS2RE-FIRE.jsonl.gz
). JSON Lines allows fast loading of small numbers of structures withpandas.read_json(lines=True, nrows=100)
for inspection.<yyyy-mm-dd>-kappa-103-FIRE-<values-of-dist|fmax|symprec>.gz
).models/<arch_name>/<model_variant>.yml
). If not using Figshare I have included the urls to the cloud storage service in the description of the PR.test_<arch_name>_<task>.py
fortask
indiscovery
,kappa
,diatomics
) that generated the prediction files.Additional Information (Optional)
train_<arch_name>.py
) if I trained a model specifically for this benchmark.readme.md
with additional details about my model.