Skip to content

Ontology loader #923

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 42 commits into
base: main
Choose a base branch
from
Open

Ontology loader #923

wants to merge 42 commits into from

Conversation

sierra-moxon
Copy link
Member

@sierra-moxon sierra-moxon commented Mar 5, 2025

This PR adds a dagster job for loading ontologies. For now, it uses the default ontology configured in the ontology-loader package (ENVO). Its implementation parallels development of the "alldocs" job runner (but has nothing to do with referential integrity or the alldocs collection specifically).
It echoes the implementation in that it:

The other changes are to README and in-line doc to document the places I needed to look elsewhere for information.
fixes microbiomedata/issues#1055

sierra-moxon and others added 16 commits March 11, 2025 13:20
      1. Refactored the existing  job to  with explicit ENVO
      source
      2. Added new job for loading the Uberon ontology:
      3. Added new job for loading the Plant Ontology:
      4. Created separate weekly schedules for each ontology loading job
      5. Updated log messages to show the correct ontology source
      6. Updated all references to use the new job names

      All three jobs use the same underlying  method but with different source_ontology
      parameters.
@sierra-moxon sierra-moxon marked this pull request as ready for review April 23, 2025 04:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Incorporate ontology loading code into nmdc-runtime as a "scheduled job" via dagster.
1 participant