
Choose
Research
Question
Scientific Process
Generate
Hypothesis
Experimental
Setup /
Data Collection
Analysis Interpretation Report to
Community
NP Lifecycle
Data Curation
(Sec 3.1)
Encoding Domain
Knowledge
(Sec 3.2)
NP Model
Training
(Sec 3.3; 3.4)
Evaluation &
Interpretability
(Sec 3.5; Sec 3.6)
Deployment
(Sec 3.7)
Iterative
Iterative
Domain
Knowledge
World
Figure 1: Synergy between the scientific and neurosymbolic programming workflow.
Collectively, these characteristics enable a transparent and interactive process where an AI system
and a human expert collaborate on evidence-based reasoning and the discovery of new scientific facts.
Here, we use behavior analysis as a concrete, illustrative example. We start with an introduction to
NP (Section 2), then outline challenges and opportunities for future research (Section 3).
Behavior analysis as running example.
We chose behavior analysis as an example use case for
several reasons. Behavioral data is spatiotemporal, which is a common data type across the sciences.
Correspondingly, underlying challenges are shared in other domains, from monitoring vital signs to
modeling physical systems, to studying the dynamics of chemical reactions. Additionally, behavioral
data illustrate common challenges with scientific data. These datasets often contain rare behaviors
with noisy and imperfect data and can vary significantly in relevant time scales (e.g., milliseconds vs
hours). Datasets also vary across labs, organisms/systems, and experimental setups. Finally, automatic
behavior quantification is becoming increasingly crucial in many fields, such as neuroscience, ecology,
biology, and healthcare. As computational behavior analysis and neurosymbolic learning are both
developing research areas, there are many exciting opportunities to explore at their intersection.
Background on behavior analysis.
An important objective of behavior analysis is to quantify
behavior from video using continuous or discrete representations. We focus on the case of animal
behavior analysis in science [Anderson and Perona, 2014, Datta et al., 2019], where there are diverse
organisms and naturalistic behaviors. A common approach is first to perform animal pose tracking
from video [Mathis et al., 2018, Pereira et al., 2022], then categorize behaviors of interest from animal
pose [Segalin et al., 2021] (as discussed later in Figure 4). From an NP perspective, this approach can
be viewed as learning a symbolically interpretable intermediate representation (tracked keypoints).
Existing challenges in behavior analysis.
Similar to other scientific fields, data collection and
annotation are expensive for behavioral experiments. Analyzing data is also time-consuming and
expensive since specialized domain expertise is required for identifying behaviors of interest and
extracting knowledge. Models need to interface efficiently with scientists and data at both the inputs
and outputs from the scientific process (Figure 1). For NP models, leveraging domain expertise in the
form of behavioral attributes has been demonstrated to improve data efficiency [Sun et al., 2021] and
interpretability [Tjandrasuwita et al., 2021].
There is a variety of domain expertise that requires new algorithmic designs to integrate into the NP
workflow, such as experimental context, existing ethograms, and scientific spatiotemporal constraints.
Incorporating such domain knowledge has the potential to enable NP models to be more robust
to noisy and imperfect data, and enable new scientific inquiries that were too expensive to study
previously. Furthermore, when black-box models are used for studying behavior, it is difficult to
diagnose errors and explain model outputs [Rudin, 2019]. NP models have the potential to produce
symbolic descriptions of behavior (Figure 2), which enables experts to connect model interpretations
with other parts of the behavior analysis workflow, e.g., describing behavioral differences across
different strains of mice. Finally, to enable the use of NP models in real-world science workflows,
these models must be scalable and produce robustly reproducible interpretations.
2