Unveiling 3D ocean biogeochemical provinces in the North Atlantic: A systematic comparison and validation of clustering methods


Contact
bkoch [ at ] awi.de

Abstract

Defining ocean regions and water masses helps to understand marine processes and can serve downstream tasks such as defining marine protected areas. However, such definitions often result from subjective decisions potentially producing misleading, unreproducible outcomes. Here, the aim was to objectively define regions of the North Atlantic through systematic comparison of clustering methods within the Native Emergent Manifold Interrogation (NEMI) framework (Sonnewald, 2023). About 300 million measured salinity, temperature, and oxygen, nitrate, phosphate and silicate concentration values served as input for various clustering methods (k-Means, agglomerative Ward, and Density-Based Spatial Clustering of Applications with Noise (DBSCAN)). Uniform Manifold Approximation and Projection (UMAP) emphasised (dis-)similarities in the data while reducing dimensionality. Based on systematic validation of clustering methods and their hyperparameters using internal, external and relative validation techniques, results showed that UMAP-DBSCAN best represented the data. Strikingly, internal validation metrics proved systematically unreliable for comparing clustering methods. To address stochastic variability, 100 UMAP-DBSCAN clustering runs were conducted and aggregated following NEMI, yielding a final set of 321 clusters. Reproducibility was evaluated via ensemble overlap (88.81±1.8%) and mean grid cell-wise uncertainty (15.49±20%). Case studies of the Mediterranean Sea, deep Atlantic waters and Labrador Sea showed strong agreement with common water mass definitions. This study revealed a more detailed regionalisation compared to previous concepts such as the Longhurst provinces through systematic clustering method comparison. The applied method is objective, efficient and reproducible and will support future research on biogeochemical differences and changes in oceanic regions.



Item Type
Article
Authors
Divisions
Primary Division
Programs
Primary Topic
Publication Status
Published
Eprint ID
60579
DOI 10.1016/j.ecoinf.2025.103390

Cite as
Jenniges, Y. , Sonnewald, M. , Maneth, S. , Olsen, A. and Koch, B. P. (2025): Unveiling 3D ocean biogeochemical provinces in the North Atlantic: A systematic comparison and validation of clustering methods , Ecological Informatics, 91 , p. 103390 . doi: 10.1016/j.ecoinf.2025.103390


Download
[thumbnail of Jenniges et al 2025 - 3D provinces.pdf]
Preview
PDF
Jenniges et al 2025 - 3D provinces.pdf - Other

Download (6MB) | Preview

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email


Citation

Research Platforms
N/A


Actions
Edit Item Edit Item