Deformations of Boltzmann Distributions Bálint Máté University of Geneva_2

2025-05-06 0 0 870.5KB 8 页 10玖币
侵权投诉
Deformations of Boltzmann Distributions
Bálint Máté
University of Geneva
balint.mate@unige.ch
François Fleuret
University of Geneva
francois.fleuret@unige.ch
Abstract
Consider a one-parameter family of Boltzmann distributions
pt(x) = 1
ZteSt(x)
.
This work studies the problem of sampling from
pt0
by first sampling from
pt1
and then applying a transformation
Ψt0
t1
so that the transformed samples follow
pt0
.
We derive an equation relating
Ψ
and the corresponding family of unnormalized
log-likelihoods
St
. The utility of this idea is demonstrated on the
φ4
lattice field
theory by extending its defining action
S0
to a family of actions
St
and finding a
τ
such that normalizing flows perform better at learning the Boltzmann distribution
pτthan at learning p0.
1 Introduction
Sampling from unnormalized densities has been studied by many due to its relevance for the sciences
[
1
11
]. The problem can be summarized as follows. Given an unnormalized log-density
S:RnR
can we efficiently generate samples from the probability density
p(x) = 1
ZeS(x)
? In particular,
there are no samples given, all we have is the ability to evaluate
S
for any sample candidate. A
popular technique for attacking this problem is to use a normalizing flow to parametrise a distribution
qθand optimize the parameters θto minimize the reverse KL divergence
KL[qθ, p] = Exqθ(log qθ(x)log p(x)) = Exqθ(log qθ(x) + S(x)) + Z. (1)
As a motivating example for this paper, let
S
be the defining action of the lattice
φ4
theory (See §4
for details), and consider the family of distributions
pβ(x)eβS(x)
parametrized by
βR+
. In
terms of statistical physics
β
corresponds to the inverse temperature and controls how ordered the
given system is. As seen in Fig. 1 the performance of a normalizing flow is sensitive to the parameter
β
. Continuous normalizing flows converge faster at higher temperatures. Somewhat surprisingly,
the RealNVP architecture converges faster both at
β=.1
and
β= 10
than at
β= 1
. A possible
explanation is that both the smoother distribution (
β= 0.1
) and the more localized (
β= 10
) is easier
to learn than a combination of these characteristics at β= 1.
Motivated by this observation, this paper studies the following problem. Suppose we are given a
family of actions
St(x)
, parametrised by
t
, inducing a family of distributions,
pt(x)eSt(x)
.
We are interested in how these distributions are related for different values of
t
. From a practical
viewpoint, the goal is to sample from
pt2
by first sampling from
pt1
and making the samples flow
along some vector field Vt. We will refer to this Vtas the deformation field or transport field.
The contributions of this paper can be summarized as:
In §3 we derive a PDE, the deformation equation, that translates between infinitesimal
deformations of actions and the infinitesimal deformations of the Boltzmann distributions
they induce.
In §4 we put the theory into practice in the case of a simple deformation of the lattice
φ4
theory and show that it improves the performance of normalizing flows.
Machine Learning and the Physical Sciences workshop, NeurIPS 2022.
arXiv:2210.13772v3 [hep-lat] 14 Nov 2022
Figure 1: Sensitivity of the training to the inverse temperature
β
on a
12 ×12
lattice. We trained
the continous normalizing flow of Gerdes et al.
[7]
(left) and a RealNVP (right). In both cases we
used the action of the
φ4
theory with different values of
β∈ {0.1,1,10}
and with
m2
and
λ
values
same as in the work of Gerdes et al.
[7]
. The
x
-axis represents the number of training steps, while the
y-axis represents the performance metrics. Mean and standard deviation over 5 runs are shown.
Figure 2: Deformation of a two-dimensional standard Gaussian into a family of isotropic Gaussians
with different variances. Samples from a two-dimensional standard Gaussian (left). The deformation
field (center). The samples in the left plot flowing along the deformation field from
1
to
t
follow a
Gaussian centered at the origin with covariance 1/t (right).
2 Background
Change of variables
Let
p0(z)
be a probability density on
Rn
and
Ψ : RnRn
a diffeomorphism.
Pushing the the density forward along Ψinduces a new probability density pimplicitly defined by
log p0(z) = log pz) + log |det JΨ(z)|.(2)
The term log |det JΨ(z)|measures how much the function Ψexpands volume locally at z.
Continous change of variables
Suppose
Vt
is a time-dependent vector field. Let
Ψτ
denote the
diffeomorphism of following the trajectories of
Vt
from
0
to
τ
. This family of diffeomorphisms
generates a one-parameter family of densities
pτ
. The amount of volume expansion a particle
experiences along this trajectory of
Ψτ
is
Rτ
0div Vttz)dt
. The log-likelihoods are then related by
log p0(z) = log pττz) + Zτ
0
div Vttz)dt. (3)
2
摘要:

DeformationsofBoltzmannDistributionsBálintMátéUniversityofGenevabalint.mate@unige.chFrançoisFleuretUniversityofGenevafrancois.fleuret@unige.chAbstractConsideraone-parameterfamilyofBoltzmanndistributionspt(x)=1ZteSt(x).Thisworkstudiestheproblemofsamplingfrompt0byrstsamplingfrompt1andthenapplyingatra...

展开>> 收起<<
Deformations of Boltzmann Distributions Bálint Máté University of Geneva_2.pdf

共8页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!

相关推荐

分类:图书资源 价格:10玖币 属性:8 页 大小:870.5KB 格式:PDF 时间:2025-05-06

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 8
客服
关注