Ecient Solutions of Fermionic Systems using Articial Neural Networks Even M. Nordhagen Department of Physics and Njord Center University of Oslo N-0316 Oslo Norway

2025-05-03 0 0 2.39MB 16 页 10玖币
侵权投诉
Efficient Solutions of Fermionic Systems using Artificial Neural Networks
Even M. Nordhagen
Department of Physics and Njord Center, University of Oslo, N-0316 Oslo, Norway
Jane M. Kim
Department of Physics and Astronomy and Facility for Rare Isotope Beams,
Michigan State University, East Lansing, MI 48824, USA
Bryce Fore and Alessandro Lovato
Physics Division, Argonne National Laboratory, Argonne, IL 60439, USA
Morten Hjorth-Jensen
Department of Physics and Astronomy and Facility for Rare Isotope Beams,
Michigan State University, East Lansing, MI 48824, USA and
Department of Physics and Center for Computing in Science Education, University of Oslo, N-0316 Oslo, Norway
We discuss differences and similarities between variational Monte Carlo approaches that use con-
ventional and artificial neural network parameterizations of the ground-state wave function for
systems of fermions. We focus on a relatively shallow neural-network architectures, the so called
restricted Boltzmann machine, and discuss unsupervised learning algorithms that are suitable to
model complicated many-body correlations. We analyze the strengths and weaknesses of conven-
tional and neural-network wave functions by solving various circular quantum-dots systems. Results
for up to 90 electrons are presented and particular emphasis is placed on how to efficiently implement
these methods on homogeneous and heterogeneous high-performance computing facilities.
arXiv:2210.00365v1 [cond-mat.mes-hall] 1 Oct 2022
2
I. INTRODUCTION
Solving the Schr¨odinger equation for systems of many interacting bosons or fermions is classified as an NP-hard
problem due to the complexity of the required many-dimensional wave function, resulting in an exponential growth of
degrees of freedom. Reducing the dimensionalities of quantum mechanical many-body systems is an important aspect
of modern physics, ranging from the development of efficient algorithms for studying many-body systems to exploiting
the increase in computing power. To write software that can fully utilize the available resources has long been known
to be an important aspect of these endeavors. Despite tremendous progress has been made in this direction, traditional
many-particle methods, either quantum mechanical or classical ones, face huge dimensionality problems when applied
to studies of systems with many interacting particles.
Over the last two decades, quantum computing and machine learning have emerged as some of the most promising
approaches for studying complex physical systems where several length and energy scales are involved. Machine
learning techniques and in particular neural-network quantum states [1] have recently been applied to studies of
many-body systems, see for example Refs. [29], in various fields of physics and quantum chemistry, with very
promising results. In many of these studies, one has obtained results that align well with exact analytical solutions
or are in close agreement with state-of-the-art quantum Monte Carlo calculations.
The variational and diffusion Monte Carlo algorithms are among the most popular and successful methods available
for ground-state studies of quantum mechanical systems. They both rely on a suitable ansatz for the ground-state of
the system, often dubbed trial wave function, which is defined in terms of a set of variational parameters whose optimal
values are found by minimizing the total energy of the system. Devising flexible and accurate functional forms for the
trial wave functions requires prior knowledge and physical intuition about the system under investigation. However,
for many systems we do not have this intuition, and as a result it is often difficult to define a good ansatz for the
state function.
According to the universal approximation theorem, a deep neural network can represent any continuous function
within a certain error [10] — see also Refs. [1,1113] for further discussions of deep leaning methods. Since the
variational state wave function in principle can take any functional form, it is natural to replace the trial wave
function with a neural network and treat it as a machine learning problem. This approach has been successfully
implemented in recent works, see for example Refs. [2,4,8,9,14], and forms the motivation for the present study.
Here, the neural network of choice was derived from so-called restricted Boltzmann machines, much inspired by the
recent contributions by Carleo et al., see for example Refs. [2,6]. Note that neural-networks representations of
variational states are more general, as they do not in principle require prior knowledge on the ground-state wave
function, thereby opening the door to systems that have yet to be solved. Particular attention however has to be
devoted to the symmetries of the problem, whose inclusion is critical to achieve accurate results [].
In this work, we will focus on systems of electrons confined to move in two-dimensional harmonic oscillator sys-
tems, so-called quantum dots. These are strongly confined electrons and offer a wide variety of complex and subtle
phenomena which pose severe challenges to existing many-body methods. Due to their small size, quantum dots are
characterized by discrete quantum levels. For instance, the ground states of circular dots show similar shell structures
and magic numbers as seen for atoms and nuclei. These structures are particularly evident in measurements of the
change in electrochemical potential due to the addition of one extra electron. Here these systems will serve as our
test of the applicability of artificial neural network variational states, including restricted Boltzmann Machines.
The theoretical foundation and the methodology are explained in section II. The subsequent sections present our
results with an analysis of computational methods and resources. In the last section we present our conclusions and
perspectives for future work.
II. METHOD
For any Hamiltonian ˆ
Hand trial wave function ψT, the variational principle guarantees that the expectation value
of the energy ETis greater than or equal to the true ground state energy E0,
E0ET=hψT|ˆ
H|ψTi
hψT|ψTi.(1)
Thus approximate solutions to the time-independent Schr¨odinger equation can be obtained by choosing a careful
parameterization of the wave function and minimizing the energy ETwith respect to the parameters. Since the
integrals representing ETare normally high dimensional, it is most efficient to evaluate them by means of Monte
3
Carlo methods
ET≈ hELi=1
n
n
X
i=1
EL(Ri),Ri∼ |ψT(R)|2.(2)
This involves collecting nsamples of configurations and averaging over the so-called local energies
EL(R) = 1
ψT(R)ˆ
HψT(R).(3)
We apply the variational Monte Carlo (VMC) method to various circular quantum dots systems. These are systems
of interacting electrons confined to move in a two-dimensional harmonic oscillator well. The (scaled)[15] Hamiltonian
is given by
ˆ
H=1
2X
i
−∇2
i+ω2r2
i+X
j6=i
1
rij
,(4)
where ωis the oscillator frequency, riis the distance between electron iand the origin, and rij is the distance between
electrons iand j. We will henceforth assume the total number of electrons Nto be even and the total spin of the
system to be zero.
A simple ansatz can be built starting from the analytical solutions to the non-interacting case. The harmonic
oscillator eigenfunctions are given by
φm,n(x, y)eω(x2+y2)Hm(ωx)Hn(ωy),(5)
where Hnare the Hermite polynomials of degree n. To constrain the antisymmetry of the many-body wave function,
products of the lowest N/2 spatial states and the two spin states ξ±(σ) are used as a basis for a Slater determinant
ψSD(R) = det hnφm,n(xi, yi)ξk(σi)oi,
where m, n, k label the single-particle state, ilabels the particle, and Rcontains all coordinates of the Nparticles.
As an aside, we do not include the spin projections σias explicit inputs to the wave function as we will describe how
to treat them separately in Section II.B. We then define a reference state by pulling the common exponential term
out of the determinant and inserting a single variational parameter α
ψRef(R;α) = eαω Pi(x2
i+y2
i)det hnHm(ωxi)Hn(ωyi)ξk(σi)oi.(6)
Correlations among electrons can be handled by a Pad´e-Jastrow factor [16],
g(R;β) = exp
N
X
i=1
N
X
j>i
aij rij
1 + βrij
,(7)
where βis a variational parameter and
aij =(1/3 if σi=σj
1 if σi6=σj
,
in order for the Kato cusp condition to be satisfied [17]. The product of the Slater determinant and the Pad´e-Jastrow
factor is commonly named the Slater-Jastrow ansatz,
ψSlater-Jastrow(R;α, β) = ψRef(R;α)×g(R;β).(8)
A. Gaussian-binary restricted Boltzmann machine
There are many possible choices for a machine learning inspired wave function, but using an artificial neural network
is natural. Inspired by Ref. [2], our choice is to start from a restricted Boltzmann machine (RBM) configured for
4
x2
x1
x3
h1
h2
h3
1 1
w11
b3
a3
visible hidden
Figure 1. Architecture of a restricted Boltzmann machine. Inter-layer connections between the visible and the hidden layer
are represented by the black lines, where, for instance, the line connecting x1to h1represents the weight w11. The red lines
represent the visible biases, where the line going from the bias unit to the visible unit x3represents the bias weight a3. The
purple lines represent the hidden biases, where the line going from the bias unit to the hidden unit h3represents the bias weight
b3.
continuous inputs, illustrated in Fig. 1. The inputs xR2Nare the flattened particle positions and interactions
between the particles are mediated by Hhidden binary nodes. After summing over all the possible values of the
hidden nodes, the marginal distribution of the inputs to the Gaussian-binary RBM takes the form
P(R;a,b,w) = exp
2N
X
i=1
(xiai)2
2σ2
i!H
Y
j=1 "1 + exp bj+
2N
X
i=1
xiwij
σ2
i!#.(9)
Here, aR2Nand bRHare the bias parameters of the input and hidden nodes, respectively. The weights between
the input and hidden nodes are wR2N×H, while σR2Nare the widths of the Gaussian input nodes (not to be
confused with the spin projections). It is possible to train these widths by reparameterizing them as σi= exp(si),
but in this work all of the widths were fixed to σ= 1/ωand only the biases and weights are treated as variational
parameters. See Appendix A 1 for the derivation of the marginal probability.
Notice how the marginal distribution in Eq. (9) mimics the Gaussian parts of our aforementioned ans¨atze in Eqs. (6)
and (8). Based on such observations, our next step to is construct two corresponding ans¨atze
ψRBM(R;a,b,w) = P(R;a,b,w)×det hnHm(ωxi)Hn(ωyi)ξk(σi)oi,(10)
and
ψRBM+PJ(R;a,b,w, β) = P(R;a,b,w)×g(R;β)×det hnHm(ωxi)Hn(ωyi)ξk(σi)oi.(11)
The two trial wave functions above apply different levels of physical intuition. While ψRBM does not contain specific
information about the electron-electron interactions, ψRBM+PJ contains a correlation factor that explicitly upholds the
cusp condition. Both ans¨atze contain knowledge about the required antisymmetry and the Gaussians in the marginal
distribution help localize the wave functions to satisfy the boundary conditions far from the oscillator well. Also,
as the marginal distribution is positive definite, these ans¨atze will never collapse into the bosonic state even if the
marginal distribution is not symmetric.
B. Code optimization
Parallel computing is an important part of our efforts for developing an efficient VMC solver. However, increasing
the available computational resources alone is often not sufficient. One should also consider developing sophisticated
摘要:

EcientSolutionsofFermionicSystemsusingArti cialNeuralNetworksEvenM.NordhagenDepartmentofPhysicsandNjordCenter,UniversityofOslo,N-0316Oslo,NorwayJaneM.KimDepartmentofPhysicsandAstronomyandFacilityforRareIsotopeBeams,MichiganStateUniversity,EastLansing,MI48824,USABryceForeandAlessandroLovatoPhysicsDi...

展开>> 收起<<
Ecient Solutions of Fermionic Systems using Articial Neural Networks Even M. Nordhagen Department of Physics and Njord Center University of Oslo N-0316 Oslo Norway.pdf

共16页,预览4页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:16 页 大小:2.39MB 格式:PDF 时间:2025-05-03

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 16
客服
关注