
Record loan
application
Verify
employment
Review credit
report
Request credit
report
Review title
report
Perform title
search
Review loan
application
Send rejection
Send approval
Figure 1: Example of a mortgage loan application process (Source: Chakraborti et al. [2020b])
Similar to natural language, images, or code snippets, business processes are yet another information
representation paradigm. However, the unique and particular nature of process features and modalities
can render existing foundation models inadequate to accurately understand and reason over them.
Hence, developing successful foundation models for business process decision making requires
research efforts to treat process data in a holistic manner instead of separate, independent modalities.
In this paper, we propose an approach to creating foundation models that factor in the complexity
of process data. We also discuss some of the challenges of creating foundation models for business
processes and the risks and opportunities of foundation models’ emergent behavior. First, however,
we provide an overview of business processes, their unique properties and the tasks
4
that may be best
suited for foundation models.
2 Background
2.1 Business Process Management
Abusiness process is a collection of ordered tasks, followed by a business to produce a product
or a service Weske [2012]. Figure 1 shows the example of a mortgage application process where
every application must go through the same steps before a decision is made. This allows mortgage
lenders to structure their process, improve consistency across loan officers and track the execution of
the process for accountability, auditing and improving the provided service. A graphical notation,
known as business process model and notation (BPMN) Grosskopf et al. [2009], is generally used to
represent such processes, capturing the relationship between tasks (rectangular boxes with rounded
edges) that must be completed by employee roles within an organization, events (circles) that can
trigger processes or specific tasks within them, and decision points (diamonds) that allow paths within
the process flow to merge or diverge. Swim lanes are usually defined to place specific tasks within
the scope of an employee role or department. A trace is an execution of a process; each process can
produce many distinct traces when executed depending on input events and other factors.
Business process management consists of many problems related to the modeling or design, execution
and governance of processes. Process mining or discovery analyzes event data to identify and derive
processes from raw, unstructured data Van Der Aalst [2012]. Ideally, process mining should produce
a BPMN or similar representation for the discovered process. Process optimization or re-engineering
looks to improve existing processes Arlbjørn and Haug [2010]. This requires making changes
to the process representation while maintaining the properties that characterize a valid process.
Conformance checking verifies that the “as-is” process (i.e., how the process is being executed in
reality) does not deviate from the “to-be” process (i.e., how the process was theoretically designed
to be executed) Dunzer et al. [2019]. Task automation through robotic process automation looks to
create automation scripts that can programmatically execute tasks instead of humans Van der Aalst
et al. [2018], whereas automation mining programmatically identifies the best tasks to automate
Geyer-Klingeberg et al. [2018].
2.2 Foundation Models
Foundation models, coined in Bommasani et al. [2021], refer to deep neural network models trained
on massive data and can be reused (with minimal modifications) for multiple downstream tasks.
4
Since processes also call individual nodes within a process a task, we will use “downstream tasks” to refer
to foundation model specific prediction tasks and “process tasks” to refer to tasks within a process.
2