Dynamic Inventory Management with Mean-Field Competition

2025-08-18 0 0 1.18MB 37 页 10玖币
侵权投诉
Dynamic Inventory Management with Mean-Field Competition
Ryan Donnellya, Zi Lia
aDepartment of Mathematics, King’s College London,
Strand, London, WC2R 2LS, United Kingdom
Abstract
Agents attempt to maximize expected profits earned by selling multiple units of a perishable
product where their revenue streams are affected by the prices they quote as well as the
distribution of other prices quoted in the market by other agents. We propose a model which
captures this competitive effect and directly analyze the model in the mean-field limit as
the number of agents is very large. We classify mean-field Nash equilibrium in terms of the
solution to a Hamilton-Jacobi-Bellman equation and a consistency condition and use this to
motivate an iterative numerical algorithm. Convergence of this numerical algorithm yields
the pricing strategy of a mean-field Nash equilibrium. Properties of the equilibrium pricing
strategies and overall market dynamics are then investigated, in particular how they depend
on the strength of the competitive interaction and the ability to oversell the product.
Keywords: mean-field game, dynamic pricing, optimal control
1. Introduction
This paper considers an optimal price setting model in which agents attempt to liquidate a
given product inventory over a finite time horizon. Agents quote prices to potential buyers
which affects both the revenue made on individual sales and the intensity at which sales
are made. In addition, the intensity of sales is affected by the distribution of prices quoted
across all agents, meaning the revenue rate of an individual agent is impacted by the effects
of competition. When the finite time horizon is reached, each agent that has unsold units of
the perishable good recovers a salvage cost for their remaining inventory, essentially selling it
with an imposed penalty. We employ a mean-field game approach to the model which lowers
The authors would like to thank Matt Loring and Ronnie Sircar for their helpful comments on an earlier
draft.
Email addresses: ryan.f.donnelly@kcl.ac.uk (Ryan Donnelly), zi.2.li@kcl.ac.uk (Zi Li)
1
arXiv:2210.17208v2 [q-fin.TR] 15 Apr 2025
the complexity of numerical computation of optimal pricing strategies compared to the finite
agent setting.
Models of dynamic inventory pricing have been used in the past to dictate optimal pricing
strategies for products which have a finite shelf life, notably fashion garments, seasonal leisure
spaces, and airline tickets (see for example Gallego and Van Ryzin (1994), Anjos et al. (2004),
Anjos et al. (2005), and Gallego and Hu (2014)). Some early work in dynamic inventory
pricing restricts the underlying dynamics to be deterministic (see Jørgensen (1986), Dock-
ner and Jørgensen (1988), and Eliashberg and Steinberg (1991)) which may allow for more
tractability and further analysis of optimal policies, with or without considering competition.
Models with stochastic demand have also been studied (see Gallego and Van Ryzin (1994)
and Zhao and Zheng (2000)), but most results pertain to the case of a monopolistic agent
without competition. The paper Gallego and Hu (2014) considers an oligopolistic market, but
equilibrium with stochastic revenue streams is only classified in terms of a system of coupled
differential equations. Instead of trying to analyse the solution to these equations, which is
very computationally intense even for only two agents, the authors show that the stochastic
game is well approximated by a deterministic differential game under suitable scaling of model
parameters. The mean-field setting we consider can be thought of as allowing the number of
agents to grow very large rather than the parameters which control the underlying dynamics.
This retains a level of computational complexity at a level similar to the single agent case
which allows us to investigate the effects of competition.
Our model is a generalization of the single agent model considered in Gallego and Van Ryzin
(1994), which we briefly summarize and refer to as the reference model when discussing the
mean-field case. Specifically, agents hold a positive integer number of units of an asset and
quote a selling price for each unit continuously through time. Sales occur at random times
with an intensity that depends on the price quoted by the agent such that higher prices result
in less frequent trades, creating a trade-off between large but infrequent revenue of quoting
high prices, versus small but frequent revenue of low prices. Additionally, competition between
agents is modelled by specifying that the sell intensity also depends on the distribution of
prices quoted by all agents. Thus, an agent’s selling rate increases if other agents begin to
quote higher prices. Our model setting is quite similar to the one in Yang and Xia (2013),
especially in regards to the dynamics of agents’ inventory levels. The main difference is that
our model allows prices to be continuous rather than being selected from a finite set of either
high price or low price. Additionally, as a mean-field extension of one of the models presented
in Dockner and Jørgensen (1988), Chenavaz et al. (2021) has a similar sell intensity to our
work that depends on the distribution of prices quoted by all agents. Our work mainly differs
from Chenavaz et al. (2021) in that their model considers agents with states determined by
2
a continuous quantity which changes deterministically, whereas in our work a representative
agent has a discrete inventory level which changes stochastically.
By working in a mean-field setting, our conditions for equilibrium do not require the solution
to a coupled system of differential equations. Instead, equilibrium is classified by a single
differential equation and a consistency condition. This lends itself to an efficient iterative
algorithm which upon convergence yields a mean-field Nash equilibrium. We are unable
to prove the existence of a mean-field Nash equilibrium, but we have conducted extensive
numerical experiments with the iterative algorithm which always converge to the same result
within a small numerical tolerance. The low dimension of the system of equations which
classify equilibrium allows us to easily demonstrate the resulting pricing strategies, and hence
investigate how prices under the effects of competition compare to those of the reference
model.
The tractability offered by a mean-field game framework over finite agent models has also led
to their use in studying competition in other types of markets. In Chan and Sircar (2017)
and Ludkovski and Yang (2017), the effects of competition through mean-field interaction are
incorporated into models of energy production and commodity extraction. In Donnelly and
Leung (2019), agents compete for a reward in an R&D setting within a mean-field framework,
in which earlier success yields greater rewards for the expended effort. In Li et al. (2024)
agents expend effort to mine cryptocurrency, where an agent’s rate of mining depends on
their hash rate as well as that of the entire population of miners. Our modeling framework
has some similarities to these papers which allows us to employ a nearly direct adaptation of
relevant numerical methods to compute equilibrium in our model.
A novel focus of our work is regarding how overall market behaviour is affected by features
describing individual agents. In particular, we investigate how the magnitude of competitive
interaction, the ability to oversell the asset (with penalty), and price caps affect the total
wealth transferred from consumers in the market, the average price paid per unit asset, and
the probability that a particular consumer will end up empty handed due to overselling of
the asset. The dependence of market behaviour on these phenomena could be used to guide
regulatory framework with the goal of achieving desired levels of various measurements of
economic welfare.
The rest of the paper is organized as follows: in Section 2 we give an overview of the reference
model, which is the single-agent equivalent to the mean-field setting we cover in more detail.
In Section 3 we specify our model that incorporates the effects of competition, including
the definition of equilibrium which we consider. In Section 4 we show several examples of
numerically solving for equilibrium and investigate the effects of competition and other market
3
phenomena. Section 5 concludes, and longer proofs are contained in the Appendix.
1.1. General Notation
Here we introduce the general notation which is used throughout the remainder of the work.
(Ω,F,{Ft}0tT,P) A filtered probability space.
XAIndicator function of the event A.
Et,s,x,q[Y] Conditional expectation of Ygiven St=s,Xt=xand Qt=q.
Q, Q Finite upper and lower bounds of inventory.
TFinite terminal time horizon.
S= (St)t[0,T ]Reference price process.
AThe set of admissible controls.
δi, δfSpread process of agent i, or induced by feedback control f.
δ, δfMean spread posted by all agents, or induced by the feedback control f.
Nλ(δ), Nλ(δi)
iCounting process of the number of items sold.
Qδ, Qδi
iRemaining inventory held by an agent.
Xδ, Xδi
iThe cash generated by an agent selling inventory.
λ(δ), λ(δ, δ) The intensity function which determines the rate of inventory sales.
f(t, q) Feedback form of a Markov strategy.
Pf,M
q, P f
qProportion process in the M-player setting and in the mean-field limit.
2. Single Agent Reference Model
In this section, we introduce a reference model where we only consider one agent. Many
aspects of the dynamics we consider are equivalent to those found in Gallego and Van Ryzin
(1994) with some modifications made to the price process and agent’s performance criterion.
This style of model which relates intensity to price has also been used frequently in the
literature on algorithmic trading. See for example Gu´eant et al. (2012), Gu´eant and Lehalle
(2015), and Cartea and Jaimungal (2015).
We work on a probability space (Ω,F,P) which we assume supports all random variables
and stochastic processes defined below. We consider an agent who has to liquidate a finite
quantity Qof a given product within a finite time horizon of length T. The reference price
process of the product is denoted by S= (St)t[0,T ]with dynamics
dSt=σ dWt,(1)
4
where σ > 0 is a constant and W= (Wt)t[0,T ]is a Brownian motion.
We denote the agent’s spread process above the reference price by δ= (δt)t[0,T ], so that she
continuously quotes her selling price at St+δt1at every time point, and is committed to sell
one item2with the quoted price. Once the agent clears her inventory, she will stop trading.
Given a non-negative bounded process λ= (λt)t[0,T ], her number of items sold follows a
counting process denoted by Nλ= (Nλ
t)t[0,T ]defined as follows: let {un}
n=1 be a sequence
of independent standard uniform random variables. Define
τ0= 0 ,
τn= inf{tτn1:eRt
τn1λudu un},
Nλ
t= sup{n0 : tτn}.
Then Nλis a doubly stochastic Poisson process with intensity process λ(see Lando (1998)),
and the sequence of times at which Nλjumps is {τn}
n=1. Subsequently, we will let the
intensity process depend on her spread through the relation λt=λ(δt) for a function to be
specified later, and we will write Nλ(δ)to denote the number of items sold when the agent
quotes a spread according to δ= (δt)t[0,T ]. We denote the indicator function of any event A
by XA. Thus, the agent’s inventory Qδ= (Qδ
t)t[0,T ]satisfies
dQδ
t=−XQδ
t>0dNλ(δ)
t,
with initial value Qδ
0=Q. As a consequence of her trades, the agent accumulates cash denoted
by Xδ= (Xδ
t)t[0,T ]with dynamics given by
dXδ
t=XQδ
t>0(St+δt)dNλ(δ)
t,
with given initial cash Xδ
0=x0.
The agent’s goal is to maximize her expected P&L at time Twith an inventory penalty.
Specifically, her value functional is
J(δ) = EXδ
T+Qδ
TSTα Qδ
TϕZT
0Qδ
u2du,
where αand ϕare positive constants. The objective functional consists of three parts.
The first term in the expectation Xδ
Tis the amount of cash at time T. The second term
Qδ
TSTα Qδ
Tcorresponds to the salvage value of unsold items remaining at time T, and
1Here by t, we mean the left limit to time t.
2Note that selling one item may be understood as selling a block of units of the product, each block being
of the same size.
5
摘要:

DynamicInventoryManagementwithMean-FieldCompetition⋆RyanDonnellya,ZiLiaaDepartmentofMathematics,King’sCollegeLondon,Strand,London,WC2R2LS,UnitedKingdomAbstractAgentsattempttomaximizeexpectedprofitsearnedbysellingmultipleunitsofaperishableproductwheretheirrevenuestreamsareaffectedbythepricestheyquote...

展开>> 收起<<
Dynamic Inventory Management with Mean-Field Competition.pdf

共37页,预览5页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:37 页 大小:1.18MB 格式:PDF 时间:2025-08-18

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 37
客服
关注