Skip to content

Commit

Permalink
Merge pull request #271 from jnywong/nasa-veda-post
Browse files Browse the repository at this point in the history
VEDA blog post
  • Loading branch information
jnywong authored Jul 12, 2024
2 parents d52b16f + 84ae6bd commit 2cdb796
Show file tree
Hide file tree
Showing 5 changed files with 71 additions and 0 deletions.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
71 changes: 71 additions & 0 deletions content/blog/2024/veda-devseed-collab/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,71 @@
---
title: Collaborating with Development Seed to deliver cyberinfrastructure for NASA VEDA
date: "2024-07-02"
authors: ["Jenny Wong"]
tags: [geoscience, open source]
categories: [impact]
featured: false
draft: false
---

![Landing page of the public-facing NASA VEDA dashboard](featured.png "The [VEDA dashboard](https://www.earthdata.nasa.gov/dashboard/)")

The 2i2c team are proud to continue our strong working collaboration with [Development Seed](https://developmentseed.org/), following our previous work on launching the [US GHG center](blog/2023/us-ghg-center-launches/index) (also see the [Development Seed blog post](https://developmentseed.org/blog/2023-12-14-ghg-center)). Together with scientists at NASA in our regular sync touchpoints, we have recently delivered a tranche of improvements to [the Visualization, Exploration and Data Analysis (VEDA) project](https://www.earthdata.nasa.gov/esds/veda).

This platform is designed to thread open-source components together to consolidate GIS delivery mechanisms, processing, analysis and visualization tools, and presented in a collaborative interactive computing environment. All code repositories and associated resources stemming from this work are available on the [VEDA GitHub page](https://github.com/NASA-IMPACT/VEDA/wiki).

In the spirit of fully open development, you can [see the objectives](https://github.com/NASA-IMPACT/veda-jupyterhub/issues?q=is%3Aissue+jh+is%3Aclosed+label%3A%22PI+24.3%22+)
the combined 2i2c and Development Seed team had for the last quarter. In this blog post, we will describe some of the significant ones!

## Better image management and testing

The [repo2docker-action](https://github.com/jupyterhub/repo2docker-action) is a GitHub action simplifying image building and testing for use with JupyterHub, using either a `Dockerfile` or various [configuration files](https://repo2docker.readthedocs.io/en/latest/config_files.html) (like `requirements.txt`, `environment.yml`, etc) supported by [repo2docker](https://github.com/jupyterhub/repo2docker). We migrated our image building pipeline from a somewhat homegrown solution to this upstream action, making image updates and testing *much* easier. In particular, we can [automatically run test notebooks](https://github.com/NASA-IMPACT/pangeo-notebook-veda-image/pull/4) on every change we make to the image! This way, we can easily catch any breaking changes in library versions or other package installs without disrupting users. We also debugged and [contributed upstream](https://github.com/jupyterhub/repo2docker-action/pull/124) fixes to the testing infrastructure so everyone could benefit from this, rather than just us.

## Automatically pulling example notebooks on startup

When a user logs into a JupyterHub, it is very helpful if we could have a bunch of example notebooks and other content pre-populated for them so they can get started right away. [nbgitpuller](https://nbgitpuller.readthedocs.io/) is heavily used for this particular use case. However, it requires that nbgitpuller is installed inside the image the user is using - and not all images have it installed. In particular, we wanted to continue using the (wonderful) [Rocker images](https://rocker-project.org/) maintained upstream for R users, however they do not have nbgitpuller installed. To solve this problem we built [jupyterhub-gitpuller-init](https://github.com/NASA-IMPACT/jupyterhub-gitpuller-init), which can be used as an [init container](https://kubernetes.io/docs/concepts/workloads/pods/init-containers/) to pre-populate user content on persistent home directories regardless of the image used. We also made sure to build this in a way that *anyone* can use it, and it is not tied into either 2i2c or VEDA infrastructure!

## Opening specific visualizations in QGIS via URL

[QGIS](https://www.qgis.org/) is the world's most used open source GIS software, and previously 2i2c had [worked with Openscapes and QGreenland](https://blog.jupyter.org/desktop-gis-software-in-the-cloud-with-jupyterhub-ddced297019a) to bring this *desktop* software to JupyterHub. We had previously worked on a [container image](https://github.com/2i2c-org/nasa-qgis-image) that allows users to access large datasets stored in the cloud directly through QGIS on the JupyterHub, allowing users to work with much larger datasets than they could on their desktops by bringing cloud compute adjacent to the data. As a continuation of this work, we developed [jupyter-remote-qgis-proxy](https://github.com/sunu/jupyter-remote-qgis-proxy), which builds QGIS specific features on top of [jupyter-remote-desktop-proxy](https://github.com/jupyterhub/jupyter-remote-desktop-proxy). In particular, it allows creation of shareable links that when clicked, opens specific datasets and layers in QGIS in a JupyterHub! You can see this in action:

<figure>
{{< video autoplay="true" loop="true" src="qgis.mp4" >}}
<figcaption>Launching QGIS on a Linux desktop served by the VEDA JupyterHub</figcaption>
</figure>

This opens up exciting future possibilities. Imagine this [exploration of the Camp Fire](https://www.earthdata.nasa.gov/dashboard/data-catalog/campfire_ndvi_difference_2015_2022) having an 'Open in QGIS' button that enables further exploration of the data without the user needing to download or install anything! Work will continue in the coming quarter towards achieving this vision.

We are also excited to see recent work in this space [from QuantStack and Simula Labs](https://blog.jupyter.org/jupytergis-d63b7adf9d0c), and will follow up to ensure an orderly transition to more web native workflows for existing users of QGIS in due time.

## Better Profile Selection

This is a continuation of our [GESIS collaboration](blog/2024/jupyterhub-binderhub-gesis/index). In the path to deploying dynamic image building to end users, we wanted to stabilize [jupyterhub-fancy-profiles](https://github.com/yuvipanda/jupyterhub-fancy-profiles) enough to deploy to users of VEDA (and eventually everyone else). This is the primary interface users see *after* they log in to JupyterHub, and was ripe for UX improvements. The default interface looks like this:

![Default profile list page](old-profile.png)

The revamped one is much more streamlined and looks like this:

<figure>
{{< video autoplay="true" loop="true" src="new-profile.mp4" >}}
<figcaption>Revamped Profile Screen</figcaption>
</figure>

This is currently deployed to a staging hub and has helped us shake out a lot of bugs! We expect the improved interface will be rolled out to all users in the near future. We are also planning further development to make the user experience even better and smoother for everyone.

## Supporting workshops

End users benefiting from our work is what ultimately gives meaning to our work. To that end, we were very happy to support running workshops during this collaboration – see our related blog post [US Greenhouse Gas Center supports summer school at CIRA](blog/2024/ghg-summer-school/index) for more information.

## Ongoing Collaboration

Delivering on these objectives in a timely way heavily depended on the success of the team collaboration. [Sanjay Bhangar](https://developmentseed.org/team/sanjay-bhangar) of Development Seed commented

> Working closely with the 2i2c team on growing features to support users on the VEDA and GHG Center hubs has been absolutely amazing. With 2i2c’s deep experience in the Jupyter ecosystem, we have been able to implement some fairly complex features quite easily, and their strong open-source roots have ensured that whatever we work on is broadly useful to the wider Jupyter and scientific computing communities.
This collaboration continues, and we have now [published our objectives for the coming quarter](https://github.com/NASA-IMPACT/veda-jupyterhub/issues?q=is%3Aissue+jh%3A+label%3A%22PI+24.4%22+). Watch this space!

## Acknowledgements

- [Development Seed](https://developmentseed.org/)
- [NASA IMPACT](https://impact.earthdata.nasa.gov/)
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added content/blog/2024/veda-devseed-collab/qgis.mp4
Binary file not shown.

0 comments on commit 2cdb796

Please sign in to comment.