experience and reduce the risk of customer churn
(Liu et al.,2021a,b). As shown in Fig.1, when a
chatbot try to address the user’s needs by giving
an inappropriate response, the user will feel disap-
pointed, and give a low global satisfaction score
for current dialogue, which means a service failure
and may lead to customer loss. If deploying with
MHCH mechanism, a human can take over the dia-
logue and give a satisfactory response to meet the
user’s needs, thus ensuring the user experience and
service quality (Radziwill and Benton,2017).
In fact, a high-quality MHCH service should
consider multiple factors, such as dialogue context,
local sentiments, global satisfaction, user state, and
labor cost, etc. However, most existing MHCH
methods mainly concerned on the dialogue context
(Liu et al.,2021a) or assisting with global satisfac-
tion prediction under the multi-task learning setting
(Liu et al.,2021b), ignoring the grounded relation-
ships among the other causal variables of MHCH,
like the user state and human cost.
To address above issues and improve the perfor-
mance of MHCH, we propose a general Causal-
Enhance Module (CEM), which can be plugged
into existing MHCH networks (Liu et al.,2021a,b),
to incorporate the considerations of other potential
causal variables of MHCH. Specifically, we first
analyzes MHCH task based on causal graph by
mining all potential causal variables and deduce
that user states and labor cost are the other two
causal variables that should be considered for high-
quality customer service. Then, to incorporate the
consideration of user state, we train a user state
network mainly driven by local sentiments to main-
tain the changes of user state during the dialogue
and adjust the handoff predictions by correcting the
prediction bias according to the causal relationship
between user states and handoff decisions. To con-
sider the labor cost of customer service and reduce
it as much as possible while maintaining the same
service quality, we construct a counterfactual-based
cost simulator to regress the cost of a dialogue as
an auxiliary task which can make the MHCH back-
bone become cost-aware and minimize the labor
cost as much as possible.
The contributions of our CEM can be summa-
rized as follows:
•
We conduct causal analysis based on causal
graph for MHCH and identify the other two
causal variables: user state and human cost,
which should be considered to build high-
quality MHCH service.
•
To consider the impact of user state, the user
state is applied to correct the handoff predic-
tion bias according to the causal relationship
between user states and handoff decisions.
•
To minimize the labor cost of customer ser-
vice while maintaining the same service qual-
ity, we construct a counterfactual-based cost
simulator to regress the cost of a dialogue as
an auxiliary task, which can make the MHCH
backbone become cost-aware.
2 Related Work
Machine-Human Chatting Handoff.
The re-
search on MHCH is originated in 2018. Using
the idea of reinforcement learning, Huang et al.
(2018) proposed a dialogue robot to choose an as-
sistant. Rajendran et al. (2019) utilize a reinforce-
ment learning framework to maximize success rate
and minimize human workload. Liu et al. (2021a,b)
regraded the MHCH as a classificagtion problem
and focused on identifying which sentence should
be transferred to the human service.
Causal inference and counterfactual learning.
For structural causal models (Halpern et al.,2005),
related studies (Heskes,2013;Claassen et al.,2014;
Xia et al.,2021) utilize graph neural networks for
directed acyclic graph structure learning. For Ru-
bin causal models, Rubin (2006) and Bengio et al.
(2019) use neural networks to approximate the
propensity scores, matching weights, etc., which
can satisfy the covariate balancing (Kallus,2020;
Kuang et al.,2017); The representation learning
(Huang et al.,2020b;Liang et al.,2020) can also
be used to matched the covariate balance between
the test group and the reference group (Shalit et al.,
2017;Louizos et al.,2017;Lu et al.,2020). Several
studies (Yoon et al.,2018;Yuan et al.,2019;Liu
et al.,2020) uses counterfactual methods based on
the generative models over the observed distribu-
tions to causal inference.
Multi-task learning in dialogue systems.
Xu
et al. (2020) uses multi-task learning for auxiliary
pre-training tasks of dialogue data. Qin et al. (2020)
combines dialogue behavior recognition and sen-
timent classification. Ide and Kawahara (2021)
proposes a model which includes generation and
classification tasks.