Browse all notebooks in this collection, grouped by data origin (the OSDF origin a notebook streams from). Each entry shows a short description and the key tags so you can spot relevant ones at a glance.
For the tag taxonomy and conventions, see CONTRIBUTING.md. For background on OSDF and PelicanFS, see Project Pythia’s OSDF Cookbook.
Note on `platform:` tags. Most notebooks default to a Dask `LocalCluster` (with PBS/Slurm options for HPC users), so they should run on a laptop with no changes other than a cluster-switch toggle. The `platform:` tag records where the notebook was verified; it is not the only place the notebook can run.
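As a rough illustration of the cluster-switch toggle described in the note, the sketch below picks a Dask cluster class from a single string. The helper name `make_cluster` and its `kind` argument are hypothetical and are not taken from the notebooks, which may implement the switch differently. It assumes `dask.distributed` is installed, plus `dask_jobqueue` for the PBS and Slurm branches.

```python
def make_cluster(kind="local", **kwargs):
    """Create a Dask cluster for the chosen backend (hypothetical helper).

    kind: "local" for a laptop, "pbs" or "slurm" for an HPC scheduler.
    kwargs: passed through unchanged to the cluster constructor.
    """
    if kind == "local":
        # Imported lazily so laptop users do not need dask_jobqueue.
        from dask.distributed import LocalCluster
        return LocalCluster(**kwargs)
    if kind == "pbs":
        from dask_jobqueue import PBSCluster
        return PBSCluster(**kwargs)
    if kind == "slurm":
        from dask_jobqueue import SLURMCluster
        return SLURMCluster(**kwargs)
    raise ValueError(f"unknown cluster kind: {kind!r}")
```

On a laptop this could be used as `cluster = make_cluster("local", n_workers=2)` followed by `client = dask.distributed.Client(cluster)`. The PBS and Slurm branches additionally need site-specific settings (queue, cores, memory, account), which are deliberately left to the caller here.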
Find a notebook
Have NCAR HPC access (Casper/Derecho)? → Check the GDEX / NCAR Data Origin section — these stream from NCAR’s GDEX origin.
Want to run on your laptop or in the cloud? → Look for platform:laptop or platform:jetstream2 notebooks under Cross-platform / cloud.
Looking for ML examples? → See Machine learning.
Comparing OSDF performance? → See Benchmarks.
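The `key:value` tags also lend themselves to simple programmatic filtering. The sketch below is only an illustration: the `CATALOG` dictionary copies a handful of entries from the tables on this page, and `parse_tags` and `find_notebooks` are hypothetical helpers, not part of this collection.

```python
# A few entries copied from the tables on this page: notebook -> tag string.
CATALOG = {
    "cmip6_gmst_zarr.ipynb": "dataset:cmip6 task:visualization",
    "hrrr_aws.ipynb": "dataset:hrrr origin:aws",
    "simple_aws_example.ipynb": "origin:aws level:beginner",
    "jetstream_intro.ipynb": "platform:jetstream2 level:beginner",
    "jetstream_cmip6_gmst.ipynb": "platform:jetstream2 dataset:cmip6",
    "ndc_workflows/ncar_benchmark_simple.ipynb": "task:benchmark level:beginner",
}


def parse_tags(tag_string):
    """Split 'key:value key:value' into a dict mapping key -> set of values."""
    tags = {}
    for token in tag_string.split():
        key, _, value = token.partition(":")
        tags.setdefault(key, set()).add(value)
    return tags


def find_notebooks(catalog, **wanted):
    """Return sorted notebook names whose tags match every key=value in wanted."""
    matches = []
    for name, tag_string in catalog.items():
        tags = parse_tags(tag_string)
        if all(value in tags.get(key, set()) for key, value in wanted.items()):
            matches.append(name)
    return sorted(matches)


beginner = find_notebooks(CATALOG, level="beginner")
jetstream_cmip6 = find_notebooks(CATALOG, platform="jetstream2", dataset="cmip6")
```

Here `beginner` collects the three `level:beginner` notebooks in the sample, and `jetstream_cmip6` contains only `jetstream_cmip6_gmst.ipynb`.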
GDEX / NCAR Data Origin
Notebooks that stream data from NCAR’s OSDF origin — i.e. NCAR’s Geoscience Data Exchange (GDEX) — covering CESM2 LENS, ERA5, JRA-3Q, DART, CONUS404, NA-CORDEX, SAAG, HadISST, and more. All of these run on Casper.
AWS Open Data
Notebooks that stream from the OSDF AWS open-data origin.
| Notebook | Description | Tags |
|---|---|---|
| cmip6_gmst_zarr.ipynb | Multi-model GMST from CMIP6 zarr (~27 GCMs), compared to HadCRUT5 | dataset:cmip6 task:visualization |
| cmip6_ecs.ipynb | Equilibrium Climate Sensitivity from CMIP6 | dataset:cmip6 task:ecs |
| cmip6_bias_correction.ipynb | Bias-correct CMIP6 output | dataset:cmip6 task:bias-correction |
| cmip6_precipitation.ipynb | CMIP6 precipitation diagnostics | dataset:cmip6 task:visualization |
| hrrr_aws.ipynb | Stream HRRR data from AWS | dataset:hrrr origin:aws |
| simple_aws_example.ipynb | Minimal AWS-origin example (good first notebook) | origin:aws level:beginner |
Cross-platform / cloud examples
Notebooks demonstrating execution outside Casper.
| Notebook | Description | Tags |
|---|---|---|
| cesm_osdf_stampede3.ipynb | Bias-correction workflow on TACC Stampede3 | platform:stampede3 dataset:cesm task:bias-correction |
| jetstream_intro.ipynb | Introduction to running on Jetstream2 | platform:jetstream2 level:beginner |
| jetstream_cesm_oceanheat.ipynb | Ocean-heat workflow on Jetstream2 | platform:jetstream2 dataset:cesm |
| jetstream_cmip6_gmst.ipynb | CMIP6 GMST on Jetstream2 | platform:jetstream2 dataset:cmip6 |
Benchmarks
Notebooks that measure data-access throughput from various OSDF origins.
| Notebook | Description | Tags |
|---|---|---|
| ndc_workflows/aws_benchmark.ipynb | Benchmark CESM2 LENS access from AWS origin | task:benchmark origin:aws |
| ndc_workflows/ncar_benchmark.ipynb | Benchmark NCAR-origin access (DART) | task:benchmark origin:ncar-data-origin |
| ndc_workflows/ncar_benchmark_simple.ipynb | Simplified NCAR benchmark | task:benchmark level:beginner |
| ndc_workflows/ncar_benchmark_ap40.ipynb | NCAR-origin benchmark via OSPool AP40 | task:benchmark platform:ospool |
Machine-learning workflows
| Notebook | Description | Tags |
|---|---|---|
| ml_workflows/nino3.4_index.ipynb | Logistic-regression Niño 3.4 prediction from SST | task:ml dataset:hadisst |
NDC pathfinder workflows
Notebooks developed under the National Discovery Cloud Pathfinder initiative. Most run on a laptop without HPC access.
| Notebook | Description | Tags |
|---|---|---|
| ndc_workflows/sonar_ai.ipynb | Plot echograms from NOAA SONAR data | dataset:sonar origin:aws |
| ndc_workflows/pycogss_spectral_change.ipynb | Spectral change from Sentinel-2 imagery | dataset:sentinel2 origin:aws |
| ndc_workflows/envistor_test_ap40.ipynb | Envistor test via OSPool AP40 | platform:ospool |
Scripts (non-notebook)
| Script | Description |
|---|---|
| scripts/ospool_example/ | Submit OSDF benchmarks to OSPool via HTCondor |