
Zero-Shot Prompting for Implicit Intent Prediction and Recommendation
with Commonsense Reasoning
Hui-Chi Kuo Yun-Nung Chen
National Taiwan University, Taipei, Taiwan
r09922a21@csie.ntu.edu.tw y.v.chen@ieee.org
Abstract
The current generation of intelligent assistants
require explicit user requests to perform tasks
or services, often leading to lengthy and com-
plex conversations. In contrast, human assis-
tants can infer multiple implicit intents from
utterances via their commonsense knowledge,
thereby simplifying interactions. To bridge this
gap, this paper proposes a framework for multi-
domain dialogue systems. This framework au-
tomatically infers implicit intents from user ut-
terances, and prompts a large pre-trained lan-
guage model to suggest suitable task-oriented
bots. By leveraging commonsense knowledge,
our framework recommends associated bots in
a zero-shot manner, enhancing interaction ef-
ficiency and effectiveness. This approach sub-
stantially reduces interaction complexity, seam-
lessly integrates various domains and tasks, and
represents a significant step towards creating
more human-like intelligent assistants that can
reason about implicit intents, offering a supe-
rior user experience.1
1 Introduction
Intelligent assistants have become increasingly pop-
ular in recent years, but they require users to explic-
itly describe their tasks within a single domain. Yet,
the exploration of gradually guiding users through
individual task-oriented dialogues has been rela-
tively limited (Chiu et al.,2022). This limitation
is amplified when tasks extend across multiple do-
mains, compelling users to interact with numerous
bots to accomplish their goals (Sun et al.,2016).
For instance, planning a trip might involve inter-
acting with one bot for flight booking and another
for hotel reservation, each requiring distinct, task-
specific intentions like “Book a flight ticket” to
activate the corresponding bot, such as an airline
bot. In contrast, human assistants can manage high-
level intentions spanning multiple domains, utiliz-
1Code: http://github.com/MiuLab/ImplicitBot.
ing commonsense knowledge. This approach ren-
ders conversations more pragmatic and efficient, re-
ducing the user’s need to deliberate over each task
separately. To overcome this limitation of current
intelligent assistants, we present a flexible frame-
work capable of recommending task-oriented bots
within a multi-domain dialogue system, leveraging
commonsense-inferred implicit intents as depicted
in Figure 1.
Multi-Domain Realization Sun et al. (2016) pin-
pointed the challenges associated with a multi-
domain dialogue system, such as 1) comprehend-
ing single-app and multi-app language descriptions,
and 2) conveying task-level functionality to users.
They also gathered multi-app data to encourage
research in these directions. The HELPR frame-
work (Sun et al.,2017) was the pioneering attempt
to grasp users’ multi-app intentions and conse-
quently suggest appropriate individual apps. Nev-
ertheless, previous work focused on understanding
individual apps based on high-level descriptions
exclusively through user behaviors, necessitating a
massive accumulation of personalized data. Due to
the lack of paired data for training, our work lever-
ages external commonsense knowledge to bridge
the gap between high-level utterances and their
task-specific bots. This approach enables us to
consider a broad range of intents for improved gen-
eralizability and scalability.
Commonsense Reasoning Commonsense rea-
soning involves making assumptions about the
nature and essence of typical situations humans
encounter daily. These assumptions encompass
judgments about the attributes of physical objects,
taxonomic properties, and individuals’ intentions.
Existing commonsense knowledge graphs such as
ConceptNet (Bosselut et al.,2019), ATOMIC (Sap
et al.,2019), and TransOMCS (Zhang et al.,2021)
facilitate models to reason over human-annotated
commonsense knowledge. This paper utilizes a
arXiv:2210.05901v2 [cs.CL] 6 Jun 2023