D.G. Gioia et al. ArXiv Preprint - Rolling horizon policies for multi-stage ATO November 23, 2023
stochastic programming, as they place emphasis on the replenishment of components from suppliers, under random
lead times. van Jaarsveld and Scheller-Wolf (2015), too, deal with component replenishment and allocation, rather than
finite-capacity production. See also Huang and de Kok (2015). Nonås (2009) adopts stochastic programming, providing
analytical insights for small-scale component replenishment problems, limited to just a few (three) end items.
Unlike, e.g., DeValve, Pekeˇ
c, and Wei (2020) and Thevenin, Adulyasak, and Cordeau (2022), we deliberately steer
away from ad-hoc solution approaches, as they may be fragile when dealing with general models. Since we aim at
providing support to two-level master production scheduling, we prefer to use standard solvers. Furthermore, we insist
on complex demand patterns. We allow for seasonality and deal with realistic issues related to ATO system, where some
end items may be variations on a basic family. Hence, we shall cope with intra-family demand, which needs a better
approach than just pairwise correlations. Differently from DeValve, Pekeˇ
c, and Wei (2020) and Borodin et al. (2016),
we do not consider component replenishment from suppliers, but finite capacity production. This results in challenging
issues, especially when there are demand peaks, possibly due to seasonality, that must be suitably addressed.
Our problem setting is more related with, e.g., Englberger, Herrmann, and Manitz (2016) and Thevenin, Adulyasak, and
Cordeau (2021). However, we consider a different information structure, related to two-level, rather than single-level
master production scheduling. On the other hand, we disregard lot sizing issues associated with setup costs and times.
We do not use sophisticated scenario generation strategies as in Thevenin, Adulyasak, and Cordeau (2022), like
quasi-Monte Carlo sampling, as we rely on a data-driven approach whereby probability distributions are unknown
and only a few data are available. Furthermore, we do not evaluate performance in terms of in-sample computational
efficiency, but rather in terms of actual expected profit estimated by realistic out-of-sample, rolling horizon simulations
(where the true demand distributions are used).
2.2 Approaches based on deterministic models and safety stock buffers
The practical strategy adopted in typical MRP systems relies on deterministic planning, integrated with the use of
safety stocks as a hedging tool, within a rolling horizon process. This is reflected in academic research based on the
application of simulation models and optimization heuristics to set safety stocks.
The problem of sizing safety stocks has been considered in several papers and a recent literature review is provided by
Gonçalves, Sameiro Carvalho, and Cortez (2020), who analyze a set of 95 articles published from 1977 to 2019 and
show that 65% of them apply analytical techniques, often within an inventory control framework, and do not contain
references to realistic applications. On the contrary, simulation seems to play a key role in practice, often with reference
to MRP systems. Simulation-based optimization is adopted by Gansterer, Almeder, and Hartl (2014) and Seiringer
et al. (2021), as well as by Zhao, Lai, and Lee (2001) to evaluate the performance of MRP systems. A deterministic
optimization model is proposed by Ziarnetzky, Mönch, and Uzsoy (2020), using safety stocks to buffer against demand
uncertainty. They use simulation to assess actual performance within a rolling horizon framework, which is what we
also do in our computational experiments.
2.3 Paper positioning and relationships with approximate dynamic programming
The distinguishing feature of our work is the integration of stochastic programming models for ATO with tools to ease
end-of-horizon effects (Grinold 1983; Fisher, Ramdas, and Zheng 2001). We should note that this kind of issue has been
addressed in the financial domain Myers, Ziemba, and Cariño (1998); Konicz et al. (2015) too. However, in finance we
essentially have to deal with one commodity, namely, wealth. In an ATO environment, the matter is more complicated,
due to the relationships among components used for the same end item. Finding an exact expression of the terminal
value function is impractical. Nevertheless, experience with rollout algorithms (Bertsekas 2020) shows that even a
rough approximation may be remarkably effective in improving performance. A useful strategy for this aim is to resort
to separable approximations as proposed, e.g., by Simão et al. (2009), which is what we pursue in this paper.
The comparison of solutions based on different scenario tree structures has been tackled for financial portfolio choice
problems by, e.g., Birge, Blomvall, and Ekblom (2022); Blomvall and Shapiro (2006). However, financial and
manufacturing domains are deeply different. While in the financial domain care must be taken to avoid building a
tree allowing arbitrage opportunities, which places additional requirements on the shape of the scenario tree, in the
production field there are no such problems. Moreover, in finance, there is an abundance of data, which is a much scarcer
commodity in manufacturing, requiring a data-driven approach. These differences motivate a specific investigation
within a manufacturing setting.
In principle, the problem that we address in the paper could be solved exactly by stochastic dynamic programming,
which is the reference approach to cope with sequential decisions under uncertainty. It is well-known, however, that
the curse of dimensionality precludes its application to most practical size problems. Nevertheless, concepts from
4