
CLOINet: Ocean State Reconstructions A PREPRINT
have highlighted the importance of resolving submesoscale dynamics to account for the majority of vertical ocean
transport, which is vital for carbon export, fisheries, nutrient availability, and pollution displacement [Pascual et al.,
2017]. These challenges underscore the need for high-resolution, three-dimensional representations of the ocean state.
High-resolution numerical models and data assimilation techniques, which align model outputs with actual observations,
are currently the most common solutions [Carrassi et al., 2018, Mourre et al., 2004].
Operational simulations now assimilate near-real-time observations, including in-situ (ship-based observations, under-
water gliders, and floats) and remote sensing data. CIT Satellite observations provide frequent global snapshots of the
sea surface, for instance Sea Surface Temperature and Chlorophyll concentration images offer resolutions as fine as 1
km on a daily basis. In contrast, the current capabilities of remote altimeters are limited to a 200 km wavelength for the
global ocean at mid-latitudes and about 130 km for the Mediterranean Sea [Ballarotta et al., 2019], though significant
advancements are upcoming with the Surface Water and Ocean Topography (SWOT) mission successfully launched in
December 2022 [Morrow et al., 2019]. Notably, Sea Surface Height (SSH) data are unaffected by cloud cover. However,
the uncertainties regarding the ocean interior remain significant due to the sparse distribution of in-situ observations in
time and space [Siegelman et al., 2019]. As a result, while data-assimilating models adhere to physical balances, they
still lack accuracy [Arcucci et al., 2021].
The ocean twin strategy proposes data-driven approaches as a complementary method for revealing the ocean state.
In previous oceanographic studies, multivariate methods allowed to elaborate three-dimensional hydrographic fields
relying on their vast in-situ measurements collected during ocean campaigns [Cutolo et al., 2022, Gomis et al., 2001].
However, these methods are not easily scalable to a global observing system due to the sheer number of parameters
involved, such as correlation lengths. Machine learning techniques offer a solution to these scalability issues, as the
models are directly learned from the data. A key challenge for these techniques is the need for a substantial quantity of
realistic training data. General circulation and process study models play a new role here, providing a cost-effective way
to generate large datasets that adhere to ocean physics. Even datasets that only approximately match the true ocean state
can be valuable, provided they encompass a wide range of scenarios.This last point is especially crucial in preventing
the risk of deep networks memorizing the input climatology rather than capturing the actual ocean dynamics. Such a
focus ensures that the networks can understand and adapt to scenarios that significantly deviate from the average, rather
than being confined to repetitive patterns. To effectively generalize beyond their training data, neural networks require
careful design to preserve relevant input features across their layers. In this context, explainable AI aims to advance
beyond the black-box applications typical in ocean remote sensing studies, promoting a deeper understanding of the
model workings ([Zhu et al., 2017].
Despite these difficulties, recent studies have demonstrated the potential of deep-learning methods for various dynamical
system tasks. These range from idealized situations [Fablet et al., 2021] to realistic case studies, such as interpolating
missing data in satellite-derived observations of sea surface dynamics [Barth et al., 2020, Manucharyan et al., 2021,
Fablet et al., 2020]. With regard to reconstructing hydrographic profiles from satellite data, there’s a spectrum of
approaches: from proof-of-concept studies using self-organizing maps (SOMs) and neural networks (Charantonis et al.
[2015], Gueye et al. [2014]) and feed-forward or long short-term memory (LSTM) neural networks [Contractor and
Roughan, 2021, Sammartino et al., 2020, Jiang et al., 2021, Fablet et al., 2021] as well as [Pauthenet et al., 2022]
relying instead on multilayer perceptron. Even considering these past works the interpolation of temperature and salinity
profiles given some in-situ and sea surface information is an open challenge.
In this study, we introduce an innovative modular neural network designed to seamlessly integrate remote-sensing
images with in-situ observations for a complete 3D reconstruction of the ocean state. This integration is underpinned by
the Optimal Interpolation (OI) scheme’s mathematical principles [Gandin, 1966]. Unlike traditional applications of OI,
which typically use Euclidean distance to estimate the correlation between points, our approach involves computing
distances within a specially designed latent space. A specific module within our neural network transform all our input
information into this latent space made of ’clusters’. Within these clusters, non-local correlations become more easily
identifiable and can be effectively applied to enhance the correlation matrix. Like attention mechanisms in advanced
neural models [Vaswani et al., 2017], which focus on key aspects in large datasets for tasks such as language processing
or image recognition, our neural network module similarly identifies crucial correlational patterns through the latent
space of clusters.
We privileged a network structure composed of independent nested modules to facilitate the understanding and analysis
of its internal information flow from the input data to the covariance structure. To the best of our knowledge, this is the
first work in which neural networks achieve the most optimal combination of remote-sensing and in-situ observations
without previous knowledge of the study area’s climatology. This study is structured as follows: section 2 presents the
main synthetic dataset that we used for the training and testing and some real observations for some preliminar use case
scenario. All the details regarding the network architecture are in section 3 while the results are shown in section 4.
2