Two-parameter sums signatures and corresponding quasisymmetric functions Joscha Diehl Leonard Schmitz

2025-05-06 0 0 1.2MB 61 页 10玖币
侵权投诉
Two-parameter sums signatures and
corresponding quasisymmetric functions
Joscha Diehl Leonard Schmitz
University of Greifswald
Quasisymmetric functions have recently been used in time series analysis
as polynomial features that are invariant under, so-called, dynamic time
warping. We extend this notion to data indexed by two parameters and
thus provide warping invariants for images. We show that two-parameter
quasisymmetric functions are complete in a certain sense, and provide a
two-parameter quasi-shuffle identity. A compatible coproduct is based on
diagonal concatenation of the input data, leading to a (weak) form of Chen’s
identity.
Keywords: quasisymmetric functions, warping invariants, matrix compositions, Hopf
algebra, image analysis, signatures and data streams
Contents
1 Introduction 2
1.1 Warping invariants motivate the signature .................. 3
1.2 Illustrations of quasi-shuffle relations and of Chen’s identity ........ 6
2 Two-parameter sums signature 7
2.1 The Hopf algebra of matrix compositions .................. 11
2.2 Quasi-shuffle identity .............................. 13
2.3 Invariance to zero insertion .......................... 15
2.4 Invariance to warping ............................. 17
2.5 Chen’s identity ................................. 20
3 Two-parameter quasisymmetric functions 22
3.1 Closure property under multiplication .................... 23
3.2 Polynomial invariants ............................. 27
4 Algorithmic considerations 30
4.1 Iterated two-parameter sums ......................... 30
1
arXiv:2210.14247v3 [math.CO] 9 Oct 2024
4.2 Two-parameter quasi-shuffle .......................... 34
5 Technical details and proofs 35
5.1 Signature properties .............................. 35
5.2 Two-parameter quasi-shuffle .......................... 43
5.3 Bialgebra structure of matrix compositions ................. 47
6 Conclusions and outlook 55
1 Introduction
Certain forms of signatures have proven beneficial as features in time series analysis. The
iterated-integrals signature was introduced by Chen in the 1950s [Che54] for homological
considerations on loop space. After applications in control theory, starting in the 1970s,
[Fli76,Fli81] and rough path theory, starting in the 1990s, [Lyo98], it has in the last
decade been successfully applied in machine learning tasks on time series [KBPA+19,
KO19,DR19,KMFL20,TBO20]. As the name suggests, it applies to continuous objects,
namely (smooth enough) curves in Euclidean space. For discrete time series to fit in the
machinery, they have to undergo a (simple) interpolation step.
The iterated-sums signature, introduced in [DEFT20], forgoes this intermediate step
and immediately works on the discrete-time object. This discrete perspective brings
additional benefits: a broader class of features (even for one-dimensional time series,
whose integral signature is trivial), flexibility in the underlying ground field [DEFT22],
and a tight, new-found, connection to the theory of quasisymmetric functions [MR95]
and dynamic time warping [SC78,BC94,KR05].
In the present work we will take the latter perspective and apply it to data indexed by
two parameters, the canonical example being image data.
Related work
In data science, two recent works have very successfully applied iterated integrals to
images. In [IL22] the classical, one-parameter, iterated-integrals signature is used for
images (by working “row-by-row”), whereas certain multi-parameter iterated integrals
are used in [ZLT22]. A principled extension of Chen’s iterated integrals, based on their
original use in topology, is presented in [GLNO22].
More generally, the use of “signature-like” feature-maps has recently been extended to
graphs [TLHO22,CDEFV22] and trees [CFC+21].
Notation
Throughout, N={1,2, . . . }denotes the strictly positive integers and N0={0} ∪ Nde-
notes the non-negative integers. Let (N2,) denote the product poset (partially ordered
2
set), i.e. (here, and throughout, we denote tuples with bold letters)
iji1j1and i2j2.
For every matrix AMm×nwith entries from an arbitrary set Mlet size(A) :=
(rows(A),cols(A)) := (m, n)N2denote the number of rows and columns in Arespec-
tively. Let gf:MP, m 7→ g(f(m)) be the set-theoretic composition of functions
f:MNand g:NP. Let Cdenote the complex number field.
1.1 Warping invariants motivate the signature
We briefly recall the notion of (classical, one-parameter) time warping invariants, as
covered in [DEFT20]. For simplicity, we consider eventually-constant, C-valued time
series in discrete time,
evC(N,C) := x:NC| ∃nN:xi̸=xj=in.
Intuitively one might think of complex numbers as colored pixels, becoming especially
valuable for visualization in the two-parameter case. Later in this paper, the complex
numbers are actually replaced by the module Kdover some arbitrary integral domain,
covering the classical encoding of colors via R3. A single time warping operation is
formalized by the mapping
warpk:evC(N,C)evC(N,C),(warpkx)i:= (xiik
xi1i > k,
which leaves all entries until the k-th unchanged, copies this value once, attaches it at
position k+ 1, and shifts all remaining successors by one.
For example, consider
warp2´2 1 3 1 · · ·¯=21131· · ·
where the dots on the right hand side indicate that all relevant information is provided,
i.e., that the series has reached a constant and will not change again. A time warping
invariant is a function from the set of time series to the complex numbers which remains
unchanged under warping. An example of such an invariant is
φ:evC(N,C)C, x 7→ x1lim
t→∞ xt(1)
where the limit exists, since xwas assumed to be eventually constant. This invariant
does not “see”, whether certain entries of a time series are repeated over and over again.
Indeed, neither the first entry nor the limt can be changed by any warping. In the
3
numerical example
φ´231511· · ·¯=φ´23311151· · ·¯
=φ´2223151· · ·¯
=φ´23315551· · ·¯
=21
the initial time series is warped to three different representatives and yet still yields the
same value under φ.
Next, we move to the two-parameter case, which is the focus of this paper.
We denote by
evC(N2,C) := nX:N2C| ∃nN2:Xi̸=Xj=ino
the set of two-parameter functions which are eventually constant. A function from
this set is a two-parameter analog to a (classical, one-parameter) time series that is
eventually constant and can be thought of as an image of arbitrary size, with its pixels
being encoded by C.
We define a single warping operation warpa,k similar to the one-parameter case, except
that we add a second index a∈ {1,2}indicating on which axis the warping takes place.
For the axis a= 1 we obtain an operation on rows, i.e., at position kwe copy a row and
shift all remaining rows by one. Illustratively, for k= 2 we have
warp1,2¨
˚
˚
˚
˝
21311
32511
11111
...
˛
=
21311
32511
32511
11111
...
whereas for the axis a= 2 we get the warping of columns, illustrated by
warp2,2¨
˚
˚
˚
˝
2 1 3 2 2
3 0 5 2 2
2 2 2 2 2
...
˛
=
2 1 1 3 2 2
3 0 0 5 2 2
2 2 2 2 2 2
...
also at position k= 2. A formal definition of warpa,k is provided in Section 2.4.
We call a function ψ:evC(N2,C)Can invariant to warping (in both directions
independently), if it remains unchanged under warping, i.e.,
ψwarpa,k =ψ(a, k)∈ {1,2} × N.
4
(a) (b) (c) (d)
(e) (f) (g) (h)
Figure 1: Functions in evC(N2,C). The images (a), (d) and (e) are equivalent under warping,
so are (b) and (f), as well as (c), (g) and (h).
Figure 1illustrates eight two-parameter functions (i.e. images), obtained from three
initial functions by repeatedly applying warping.1Hence, the value of any warping
invariant will coincide on pairwise equivalent inputs.
As an example, consider the two-parameter analogon of Equation (1),
Ψr1s:evC(N2,C)C, X 7→ X1,1lim
s,t→∞ Xs,t (2)
which is a warping invariant. In fact it belongs to an entire class Ψ of invariants con-
structed as follows.
An N0-valued matrix is called a matrix composition, if it has no zero lines or zero
columns. For every eventually-zero ZevC(N2,C), that is lims,t→∞ Zs,t = 0, we define
the two-parameter sums signature coefficient of Zat the matrix composition avia
SS(Z),a:= X
ι1<···rows(a)
κ1<···cols(a)
rows(a)
Y
s=1
cols(a)
Y
t=1
Zas,t
ιstC.
Note that this sum over strictly increasing chains ιand κis always finite since Zis zero
almost everywhere. A numerical illustration is provided in Example 2.6.
We collect the second ingredient for warping invariants. For XevC(N2,C) let δX
evC(N2,C) be defined via the forward difference operator
(δX)i,j := Xi+1,j+1 Xi+1,j Xi,j+1 +Xi,j .
1In the sense of Definition 2.29 each of the eight images is equivalent to one of the initial three.
5
摘要:

Two-parametersumssignaturesandcorrespondingquasisymmetricfunctionsJoschaDiehlLeonardSchmitzUniversityofGreifswaldQuasisymmetricfunctionshaverecentlybeenusedintimeseriesanalysisaspolynomialfeaturesthatareinvariantunder,so-called,dynamictimewarping.Weextendthisnotiontodataindexedbytwoparametersandthus...

展开>> 收起<<
Two-parameter sums signatures and corresponding quasisymmetric functions Joscha Diehl Leonard Schmitz.pdf

共61页,预览5页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:61 页 大小:1.2MB 格式:PDF 时间:2025-05-06

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 61
客服
关注