Maximum Entropy Approach for the Prediction of Urban Mobility Patterns Simone DaniottiBernardo Monechi and Enrico Ubaldi Dated October 5 2022

2025-05-02 0 0 8.29MB 14 页 10玖币
侵权投诉
Maximum Entropy Approach for the Prediction of Urban Mobility Patterns
Simone Daniotti,Bernardo Monechi, and Enrico Ubaldi
(Dated: October 5, 2022)
The science of cities is a relatively new and interdisciplinary topic. It borrows techniques from
agent-based modeling, stochastic processes, and partial differential equations. However, how the
cities rise and fall, how they evolve, and the mechanisms responsible for these phenomena are still
open questions. Scientists have only recently started to develop forecasting tools, despite their
importance in urban planning, transportation planning, and epidemic spreading modeling. Here,
we build a fully interpretable statistical model that, incorporating only the minimum number of
constraints, can predict different phenomena arising in the city. Using data on the movements
of car-sharing vehicles in different Italian cities, we infer a model using the Maximum Entropy
(MaxEnt) principle. With it, we describe the activity in different city zones and apply it to activity
forecasting and anomaly detection (e.g., strikes, and bad weather conditions). We compare our
method with different models explicitly made for forecasting: SARIMA models and Deep Learning
Models. We find that MaxEnt models are highly predictive, outperforming SARIMAs and having
similar results as a Neural Network. These results show how relevant statistical inference can be in
building a robust and general model describing urban systems phenomena. This article identifies
the significant observables for processes happening in the city, with the perspective of a deeper
understanding of the fundamental forces driving its dynamics.
INTRODUCTION
In recent years, pressing societal and environmental problems such as population growth, migrations, and climate
change, boosted the research on the Science of Cities and the related study of mobility. The availability of large
and detailed datasets covering the mobility of individuals at different granularity contributed largely to enhance the
interest of researchers in this field[1, 2]. Being multi-disciplinary, Science of Cities studies embrace diverse areas
of research. For example, statistical methods have been applied to city growth [3], multi-layer networks to urban
resilience [4] and spatial networks to describe and characterize the structure and the evolution of phenomena arising
on it [5]. On the modeling side, co-evolution models [6] and agent-based simulations [7] have been used to model
stylized facts and take into account policy making. Moreover, other frameworks such as ranking dynamics [8] have
been used to study urban environments and universal laws arising in them.
The mobility patterns of individuals diffusing in an urban environment are determined by the interplay of different
mechanisms, such as the daily routines of the individuals and environmental constraints, both again regulated by even
more fundamental and interrelated phenomena, like wealth, trends, economic relations, socio-political disparities, and
cultural movements [9–11]. Urban environments are complex systems [12], and describing their growth and relation
with the surrounding cities is not unanimously understood by the scientific community [13]. Being that the commuting
phenomenon inside and between cities may be driven by different events and may be related to different causes, different
tools and understandings must be developed [14, 15].
In this work, we study urban mobility patterns, building a model based on only a few constraints driven by
data observations. The model can predict different events in the urban environment with high precision and can
be generalized to other dynamical processes unfolding in urban spaces. Here, we propose a Statistical Inference
(Maximum Entropy) approach to study and predict urban mobility patterns. Phenomena and relative observables
that happen inside the city are complex, they entangle with various indicators and are difficult to predict. To build a
powerful, general, and robust statistical model, in principle we need to understand what are the important variables
that play a central role in the phenomenon we want to model. To solve it, we need to analyze the dataset we want to
study, identify the important dynamical properties and then model it solving the problem of optimizing the resulting
entropy.
The data represents the 30 minute binned multi-variate time-series of the activity of different zones inside the
city.
Being that urban systems are notoriously complex and that the fundamental causes of the observed mobility patterns
are various and interrelated, our methodology is novel in this field in the sense that builds the most general model
constrained to reproduce the correlations observed.
Complexity Science Hub Vienna, Vienna, 1080, Austria; daniotti@csh.ac.at
arXiv:2210.01491v1 [physics.soc-ph] 4 Oct 2022
2
In section Results we present the results for the Metropolitan City of Milan: multi-variate forecasting of the zones
activity and forecasting outliers events, discussed in Discussion. We also settle the basis for the formal derivation of
the model. We give an introduction to Maximum Entropy models, the formalism used in the following sessions and
the optimization algorithm used. Afterward,we introduce the formalism to compute an approximated Log-Likelihood.
In Additional Informations can be found similar results for the cities of Rome, Florence, and Turin.
In section Methods we describe the data and the variables in use. To find out which are the important variables to
represent, we do a historical analysis and observe a dynamics of contraction and dilation of the time-shifted correlations
between different zones. With this, we understand the important effects and and finally we define and derive the
formulae for the ME model describing our data.
We use MaxEnt inference to obtain the parameters (using gradient ascent algorithm) and reproduce lag-correlations
of definite positive time series. We derive a model that is highly predictive at least as more sophisticated models that
take into account nonlinear correlations and have more parameters. We also use the obtained statistical model to find
extrema events (outliers, such as strikes and bad weather days). As a result, we find the dynamics of cars’ presence
in urban areas to be extremely rich and complex. We infer the couplings parameters between the activity profiles of
different areas and use them to project the cars’ locations in time efficiently.
RESULTS
In this section, given a brief introduction to the method and the data studied, we present the results. For a more
in depth description, we refer to sec. Methods.
Data
The data we use is a normalised multivariate time-series representing the parking activity of the different munici-
palities of the city. The procedure to obtain the final form of the time-series in described in sec. Methods. In Fig. 1
we show an example of activity for two zones of our dataset for the Metropolitan City of Milan.
FIG. 1: Time series activity data for the city center (a) and for one of the suburbs (b).
3
Maximum Entropy Principle
The principle of maximum entropy states that the probability distribution which best represents the current state
of knowledge is the one with the largest entropy, in the context of precisely stated prior data. According to this
principle, the distribution with maximal information entropy is the best choice. The principle was first shown by E.
T. Jaynes in two papers in the late fifties where he emphasized a natural correspondence between statistical mechanics
and information theory [16, 17].
In particular, Jaynes offered a new and very general rationale on why the Gibbsian method of statistical mechanics
works. He showed that statistical mechanics, and in particular Ensemble Theory, can be seen just as a particular
application of information theory, hence there is a strict correspondence between the entropy of statistical mechanics
and the Shannon information entropy.
Maximum Entropy models unveiled interesting results throughout the years for a large variety of systems, like
flocking birds [18], proteins [19], brain [20] and social systems [21].
We will then implement this approach to define the model of our real-world system in the following sections. A more
general introduction to the maximum entropy formalism is out of scope here and we refer to the existing literature
for details [22–25].
The probability distribution with Maximum Entropy PME results from the extreme condition of the so-called
Generalized Entropy:
SP=SP+
K
X
k=1
θk(hOkiP(X)− hOkiobs),(1)
where
SP=X
X
P(X) log(P(X)) (2)
is the Shannon Entropy of the probability distribution P(X). The maximum of the Generalized Entropy is the
maximum of the Entropy of the model when it is subject to constraints. Computing the functional-derivative (1) with
respect to P(X) and equating to zero results in:
Pme(X) = 1
Z(θ)exp h
K
X
k=1
θkOk(X)i,(3)
where
Z(θ) = Z
dX exp h
K
X
k=1
θkOk(X)i(4)
is the normalization (making a parallel with statistical physics, can be called Partition Function). Z(θ) is written
as a sum if Ω is discrete. Hence, the maximum entropy probability distribution is a Boltzmann distribution in the
canonical ensemble at temperature T= 1K, with effective Hamiltonian H(X) = PK
k=1 θkOk(X).
It must be noticed that the minimization of the generalized entropy is equivalent to the maximization of the
experimental average of the likelihood:
SP= log Z(θ)
K
X
k=1
θkhOkie=−hlog Pmeie=1
M
M
X
m=1
log P(X(m)).(5)
In other words, the θkare chosen by imposing the experimental constraints on Entropy or, equivalently, by max-
imizing the global, experimental likelihood according to a model with the constraints cited above. Focusing on this
last sentence, we can say that the optimal parameters of θ(called effective couplings) can be obtained through Max-
imum Likelihood, but only once one has assumed (by the principle of Maximum Entropy) that the most probable
distribution has the form of PM E .
Given the generative model probability distribution of configurations P(X|θ) and its corresponding partition
function by log Z(θ), the estimator of θcan be found by maximizing the log-likelihood:
摘要:

MaximumEntropyApproachforthePredictionofUrbanMobilityPatternsSimoneDaniotti,BernardoMonechi,andEnricoUbaldi(Dated:October5,2022)Thescienceofcitiesisarelativelynewandinterdisciplinarytopic.Itborrowstechniquesfromagent-basedmodeling,stochasticprocesses,andpartialdi erentialequations.However,howthecit...

展开>> 收起<<
Maximum Entropy Approach for the Prediction of Urban Mobility Patterns Simone DaniottiBernardo Monechi and Enrico Ubaldi Dated October 5 2022.pdf

共14页,预览3页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:14 页 大小:8.29MB 格式:PDF 时间:2025-05-02

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 14
客服
关注