Universal hidden monotonic trend estimation
with contrastive learning
Edouard Pineau
EthiFinance
edouard.pineau@ethifinance.com
Sébastien Razakarivony
Safran
sebastien.razakarivony@safrangroup.com
Mauricio Gonzalez
EthiFinance
mauricio.gonzalez@ethifinance.com
Anthony Schrapffer
EthiFinance
anthony.schrapffer@ethifinance.com
Abstract
In this paper, we describe a universal method for extracting the underlying
monotonic trend factor from time series data. We propose an approach
related to the Mann-Kendall test, a standard monotonic trend detection
method, and call it contrastive trend estimation (CTE). We show that the
CTE method identifies any hidden trend underlying temporal data while
avoiding the standard assumptions used for monotonic trend identification.
In particular, CTE can take any type of temporal data (vector, images,
graphs, time series, etc.) as input. Finally, we illustrate the benefits of our
CTE method through several experiments on different types of data and
problems.
1 Introduction
Our paper focuses on the estimation of a monotonic trend factor underlying temporal data.
Such estimation is interesting in many fields, e.g., health monitoring [38], survival analysis
[35] or climate change monitoring [23]. In all these fields and related trend estimation
problems, we observe samples generated by a monitored system (e.g., an ageing mechanical
system, a credit debtor, Earth's weather and climate conditions) at different times in its life,
and we assume that the state of the system drifts monotonically. These observed samples
may be of any type (e.g., vectors, images, time series, graphs), depending on the monitored
system. Figure 1 illustrates the general context of trend estimation.
More generally, when studying temporal data, it is common to assume the existence of
structural latent factors, assumed to be meaningful, that generated the data [21]. These components are generally allocated into four groups. The trend components are monotonic
long-term signals. The cycle components are factors exhibiting rises and falls that are not
of a fixed frequency. The seasonality components are periodic patterns occurring at a fixed
frequency. The irregularity factors represent the rest of the information (considered as a
noise). We assume independent structural factors. The challenging yet essential task is the
identification of one or several of these factors, a task known as blind source separation [8],
independent component analysis [25] or disentanglement [4]. In this paper, the objective
is to detect, isolate and identify only the trend component. [24] shows that if we know
one hidden component underlying time series data, we can find the others conditionally. Hence,
finding the trend component is not only useful for many monitoring problems; it is also relevant
for further analysis.
arXiv:2210.09817v2 [cs.LG] 23 Apr 2023
Figure 1: Illustration of the context of the paper's contribution. We have a monitored
system $S$ that generates data samples (colored curves) at random times. The hidden trend $\tau$
underlying the system (colors from green to red) represents the hidden state of $S$, which changes
monotonically until a state restoration is applied (tools in hexagons): samples between two
state restorations form a sequence with a monotonic hidden trend. The relation between the trend
and the observed data may be an arbitrary function, yet it is assumed to preserve the information
about the trend.
Often, trend estimation methods seek monotonic variations in the values of the data or in
expert-based statistics computed from data [7, 36]. In practice, the trend can be deeply
hidden in the data or may not be well defined because of a lack of information about the
monitored system. Hence, we may not know which variable or statistic to follow to find
the trend.
In this paper, we learn to infer the trend factor from data (of any type) without labels or
expert supervision, using only samples’ time index. To do so, we develop a general method
based on Contrastive Learning (CL). CL recently received high interest in self-supervised
representation learning [33], in particular for time series data (see, e.g., [11, 3, 45]). Our
CL approach uses a loss inspired by Mann-Kendall test [34], a standard trend detection
method.
The rest of the paper presents our universal trend inference method called Contrastive
Trend Estimation (CTE). Section 2 presents the method. Section 3 analyzes the theoretical
foundation of our method in terms of identifiability. Section 4 lists related works on trend
detection and estimation. Section 5 presents a set of experiments to illustrate the benefits of
our approach for trend estimation and survival analysis. Concluding remarks are presented
in Section 6.
2 Contrastive trend detection
Notations. Let $X$ be a sequence of $N_X \in \mathbb{N}$ observed samples generated by a monitored
system denoted by $S$. We assume that a hidden state of $S$ drifts monotonically. We note
$\mathcal{X}$ the dataset of all sequences $X$ in which there exists a hidden monotonic factor. We note
$t_i$ the time index of the $i$-th observed sample, $i \in \llbracket 1, N_X \rrbracket$. We assume that each sequence
$X \in \mathcal{X}$ has been generated from structural factors through a function $F$, such that at least
the information about the trend is not annihilated (in blind source separation problems, $F$
would be assumed invertible). That is, for each $X$ there exists $Z^X := (\tau^X, c^X, s^X, \epsilon^X)$ such
that $X_{t_i} = F(Z^X_{t_i})$, where $\tau^X$, $c^X$, $s^X$, and $\epsilon^X$ represent respectively the monotonic
trend, the cycle, the seasonality, and the irregularity that generated $X$. The paper's goal is
to estimate the factor $\tau^X$ from $X$.
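To make this generative assumption concrete, here is a minimal synthetic sketch; the functional forms of the four factors and of $F$ are ours, chosen only for illustration (any trend-preserving observation map would do):

```python
import numpy as np

rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 200)                   # sampling times t_i

tau = t ** 2                                      # monotonic trend factor
c = 0.3 * np.sin(2 * np.pi * 1.3 * t ** 1.5)      # cycle: rises and falls, no fixed frequency
s = 0.2 * np.sin(2 * np.pi * 10.0 * t)            # seasonality: fixed-frequency pattern
eps = 0.05 * rng.standard_normal(t.shape)         # irregularity (noise)

# An arbitrary observation map F; tanh is strictly increasing, so the
# information about the trend is preserved rather than annihilated.
X = np.tanh(tau + c + s + eps)
```

Here the trend is recoverable in principle because tanh is invertible on its range; in practice $F$ is unknown and may be far more complex (e.g., producing images or graphs).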
Our CTE approach. For each $X \in \mathcal{X}$, we select two sampling times $(t_u, t_v) \in
\{t_1, \ldots, t_{N_X}\}^2$, such that, without loss of generality (w.l.o.g.), $t_u < t_v$. The value of the
hidden trend at the sampling time $t$ for $X_t$ is noted $\tau^X_t$. Since we do not have access to the
true hidden trend, we need assumptions about $\tau^X$. We use the natural Assumption 1 to
estimate $\tau^X$.
Assumption 1. (Monotonicity). For each sequence $X \in \mathcal{X}$ and all sample couples
$(X_{t_u}, X_{t_v})$, we have that $t_u \le t_v \Rightarrow \tau^X_{t_u} \le \tau^X_{t_v}$.
To extract the trend component, we use a neural network (NN) $F_\phi$ with parameters $\phi$
that embeds each sample $X_t$ into a $d_e$-dimensional vector space, with which we define
a parametric logistic regressor $g_\beta : \mathbb{R}^{d_e} \times \mathbb{R}^{d_e} \to [0, 1]$ defined as follows:

$$g_\beta(X_{t_u}, X_{t_v}) = \sigma\left(\beta^\top F_\phi(X_{t_v}) - \beta^\top F_\phi(X_{t_u})\right), \quad (1)$$

where $\sigma(x) := (1 + e^{-x})^{-1}$ is the sigmoid function. Let $C_{uv} := \mathbb{1}\{\tau^X_{t_u} \le \tau^X_{t_v}\}$ be the indicator
function that describes the trend direction between $t_u$ and $t_v$ for any sample $X$. Under
Assumption 1, we also have $C_{uv} = \mathbb{1}\{t_u \le t_v\}$, so we can build $C_{uv}$ from the samples'
time indices. We can then learn the posterior distribution $p(C_{uv} \mid X_{t_u}, X_{t_v})$, i.e., learn the
identity:

$$p(C_{uv} = 1 \mid X_{t_u}, X_{t_v}) = g_\beta(X_{t_u}, X_{t_v}). \quad (2)$$
As in common binary classification problems, training is done by minimizing the binary
cross entropy (BCE) between $C_{uv}$ and the regressor $g_\beta(X_{t_u}, X_{t_v})$, for all pairs of time
indices $(t_u, t_v)$ and all $X \in \mathcal{X}$, i.e., by minimizing:

$$\mathcal{R}(\beta, \phi; \mathcal{X}) = -\,\mathbb{E}_{X \in \mathcal{X}}\left[\sum_{i,j=1}^{N_X} C_{ij} \log g_\beta\left(X_{t_i}, X_{t_j}\right)\right]. \quad (3)$$

Note that summing over all ordered pairs $(i, j)$ recovers both BCE terms, since for $t_i \neq t_j$
we have $C_{ji} = 1 - C_{ij}$ and $g_\beta(X_{t_j}, X_{t_i}) = 1 - g_\beta(X_{t_i}, X_{t_j})$.
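A minimal numpy sketch of the objective in eq. (3) for one sequence, assuming the scalar trend scores $\beta^\top F_\phi(X_{t_i})$ have already been computed (the function name and vectorized form are ours, not from the paper):

```python
import numpy as np

def cte_loss(scores, times):
    """Pairwise contrastive trend loss for one sequence.

    scores: shape (N,), scalar trend scores beta^T F_phi(X_{t_i})
    times:  shape (N,), the sampling times t_i
    """
    diff = scores[None, :] - scores[:, None]              # s_j - s_i for all pairs (i, j)
    g = 1.0 / (1.0 + np.exp(-diff))                       # g_beta(X_{t_i}, X_{t_j}), eq. (1)
    C = (times[:, None] <= times[None, :]).astype(float)  # C_ij = 1{t_i <= t_j}
    return -np.sum(C * np.log(g + 1e-12))                 # cross-entropy terms with C_ij = 1
```

Scores that increase with time yield a lower loss than scores that decrease, which is what drives $\beta^\top F_\phi$ toward a monotonic representation of the hidden trend.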
Remark 2. Eq. (3) is similar to the Mann-Kendall statistic of eq. (7), presented in the
related work of Section 4.
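For reference, the Mann-Kendall statistic mentioned in Remark 2 sums the signs of all forward pairwise differences of an observed series; a minimal sketch:

```python
import numpy as np

def mann_kendall_s(x):
    """Mann-Kendall S statistic: sum of sign(x_j - x_i) over all pairs i < j.

    S close to +n(n-1)/2 suggests an increasing trend, close to
    -n(n-1)/2 a decreasing trend, and close to 0 no monotonic trend.
    """
    x = np.asarray(x, dtype=float)
    s = 0.0
    for i in range(len(x) - 1):
        s += np.sign(x[i + 1:] - x[i]).sum()  # compare x_i with all later samples
    return s
```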
Once the parameters $(\phi, \beta)$ are fitted, we build an estimator $\beta^\top F_\phi(X_t)$ of the trend factor
$\tau^X_t$. In the next section, we show to what extent this estimator effectively recovers the
hidden trend factor.
3 Identifiability study
We assume that $F_\phi$ is a universal approximation function (e.g., a sufficiently large NN) and
that the amount of data is large enough (equivalent to infinite data), such that we achieve
the identity of eq. (2).

Definition 1. (Minimal sufficiency). A sufficient statistic $T$ is minimal sufficient if
for any sufficient statistic $U$, there exists a function $h$ such that $T = h(U)$. If $U$ is also
minimal, then $h$ is a bijection.

Proposition 1. $\beta^\top\left(F_\phi(X_{t_v}) - F_\phi(X_{t_u})\right)$ is a minimal sufficient statistic for the trend label
$C_{uv}$.
Proof. First we recall that logistic regression learns likelihood ratios, i.e., $F_{\beta,\phi}$ is a log-likelihood difference. In fact, using the Bayes rule, we get

$$p(C_{uv} = 1 \mid X_{t_u}, X_{t_v}) = \frac{p(X_{t_u}, X_{t_v} \mid C_{uv} = 1)\, p(C_{uv} = 1)}{p(X_{t_u}, X_{t_v})}. \quad (4)$$

Moreover, using properties of the sigmoid function $\sigma$ and eq. (2), we have

$$e^{\beta^\top \left(F_\phi(X_{t_v}) - F_\phi(X_{t_u})\right)} = \frac{p(C_{uv} = 1 \mid X_{t_u}, X_{t_v})}{p(C_{uv} = 0 \mid X_{t_u}, X_{t_v})}. \quad (5)$$
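The identity of eq. (5) is the standard log-odds property of the sigmoid, $\sigma(d)/(1 - \sigma(d)) = e^d$; a quick numerical check, where $d$ stands in for the score difference $\beta^\top(F_\phi(X_{t_v}) - F_\phi(X_{t_u}))$:

```python
import numpy as np

for d in [-2.0, 0.0, 0.7, 3.5]:
    p1 = 1.0 / (1.0 + np.exp(-d))  # p(C_uv = 1 | X_{t_u}, X_{t_v}), via eq. (2)
    p0 = 1.0 - p1                  # p(C_uv = 0 | X_{t_u}, X_{t_v})
    assert np.isclose(p1 / p0, np.exp(d))  # the odds ratio equals e^d
```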