Satellite-Driven Estimates of Global Chlorophyll Depth Profiles using PACE Hyperspectral Ocean Color Data
Fisheries
Fish live and eat in layers, not just at the surface.
Carbon cycle
ca 50% of global primary production is carried out by ocean phytoplankton
In this talk, I will introduce you to an experimental product of CHLA(z), depth profiles, developed using PACE remote-sensing reflectance (Rrs) data. You can access the cloud-optimized dataset as follows. zarr version 3+ required. Use zarr.__version__ to see what you have.
March 5, 2024. Red box is where we will make a time series.


Remote-sensing reflectance is the spectral signal of upwelling light emerging from the ocean, formed by depth-integrated absorption and scattering.
Sunlight enters the ocean and then some is absorbed and some is scattered:
The scattered light that escapes the ocean surface carries information from those layers and what is in them.
PACE provides hyperspectral water-leaving radiance reflectances, Rrs(λ), across the visible spectrum.

Even though Rrs is light emerging from the ocean surface:
PACE’s spectral richness gives us new power to infer what’s happening at depth.
Caveat: Mostly trained on data from open ocean (BGC-Argo). Most data was for surface CHLA < 1. Need more coastal data. Though it does do a decent job for the surface CHLA > 10 data that we do have.



Raw profiles come at irregular depths → ML models need consistent targets.
👉 This is why we train one BRT per depth bin instead of forcing a single parametric profile shape.
sklearn HistGradientBoostingRegressor() function. Standard. Easy.Each model predicts CHLA in a single depth range:
Benefits:
Overall we see a close match between our predictions and our test data for CHLA in the upper 10m — for non-coastal data.

Using PACE data for hyperspectral Rrs, we can make a prediction of surface CHLA. Let’s compare to PACE’s surface CHLA product as a first pass check, but note that these are different products trained on different in-situ data. Note surface CHLA is not our goal.

We do not expect these to be identical as the PACE chlor_a is based on the classic Rrs ratio algorithm while the BRT uses the whole spectrum but most importantly was trained on Argo and OOI florometer measurements.
PACE chlor_a to BRT with type = 1 (ooi) and solar_hour = 0 (midnight)
Notice that at high PACE chlor_a, the BRT model predicts lower CHLA_0_10. We might be able to correct this by using the whole CHLA depth profile (next section).

We can do this for the whole globe.


Not enough data yet. Need to correct for bathymetry.





Once the per-depth-bin BRTs are trained:
This moves us toward estimates of global, depth-resolved CHLA density from space.