
Course-Prerequisite Networks for Analyzing and Understanding Academic Curricula
Pavlos Stavrinides1and Konstantin Zuev1, ∗
1Department of Computing and Mathematical Sciences,
California Institute of Technology, Pasadena, CA 91125, USA
Understanding a complex system of relationships between courses is of great importance for the
university’s educational mission. This paper is dedicated to the study of course-prerequisite net-
works (CPNs), where nodes represent courses and directed links represent the formal prerequisite
relationships between them. The main goal of CPNs is to model interactions between courses, rep-
resent the flow of knowledge in academic curricula, and serve as a key tool for visualizing, analyzing,
and optimizing complex curricula. First, we consider several classical centrality measures, discuss
their meaning in the context of CPNs, and use them for the identification of important courses.
Next, we describe the hierarchical structure of a CPN using the topological stratification of the
network. Finally, we perform the interdependence analysis, which allows to quantify the strength
of knowledge flow between university divisions and helps to identify the most intradependent, in-
fluential, and interdisciplinary areas of study. We discuss how course-prerequisite networks can be
used by students, faculty, and administrators for detecting important courses, improving existing
and creating new courses, navigating complex curricula, allocating teaching resources, increasing
interdisciplinary interactions between departments, revamping curricula, and enhancing the overall
students’ learning experience. The proposed methodology can be used for the analysis of any CPN,
and it is illustrated with a network of courses taught at the California Institute of Technology. The
network data analyzed in this paper is publicly available in the GitHub repository.
I. INTRODUCTION
An academic curriculum is a complex system of courses
and interactions between them that lies at the heart of
a university and underlies its educational mission. Un-
derstanding a university curriculum as a whole is an im-
portant prerequisite for providing students with a high
quality education and meaningful learning experiences.
Moreover, designing an appropriate curriculum is of great
importance not only from an academic point of view, but
also for organizational and financial management.
A full list of courses together with their descriptions
given in the university catalog allows, at least in princi-
ple, to know everything about the curriculum and answer
any question about it. However, it is hard to comprehend
this raw data, extract actionable knowledge, and make
data-driven decisions.
A network, where nodes represent courses and links
represent certain relationships between them, is a nat-
ural model for conceptualizing, representing, and ana-
lyzing a curriculum. For example, links can represent
the temporal relationships between courses based on how
many students move from one course to another through
out their studies [1]. These temporal networks can be
used for forecasting course enrollments, predicting stu-
dent performance [2], and estimating the relative contri-
bution of courses [3]. Alternatively, links between courses
can reflect the influence that some subjects have on oth-
ers based on the expert knowledge of the professors. Such
influence network models can be used for the curriculum
design and recommendations [4].
∗Corresponding author, email: kostia@caltech.edu
This paper focuses of the study of course-prerequisite
networks (CPNs), where nodes represent courses and di-
rected links represent the formal prerequisite require-
ments between them listed in the university catalog. Un-
like temporal and influence networks, CPNs are objec-
tively defined and, when in a steady state, don’t change
substantially from year to year. Over the last decade,
CPNs have attracted a lot of interest from researchers
due to their key role in the understanding of the complex
structure of academic curricula. For example, Slim et al.
used CPNs for detecting crucial courses that have a high
impact on students progress and graduation rates [5],
Aldrich discussed applications of CPNs to advising and
curriculum reform [6], and Molontay et al. introduced a
data-driven probabilistic approach for studying the dis-
tribution of graduation time based on the CPN topol-
ogy [7].
In this paper, we propose a general network-science-
based framework for analysis of CPNs and illustrate it
with a CPN based on the courses offered at the Califor-
nia Institute of Technology in the 2021-2022 academic
year. We show that a CPN is an indispensable tool for
visualizing, understanding, and optimizing an academic
curriculum. It can be used not only for identification of
important courses, but also for improving existing and
creating new courses. We discuss how students can use
a CPN to navigate their complex curriculum, and how a
CPN can help faculty and administrators to meaningfully
allocate teaching resources, increase interactions between
divisions and departments, revamp the curriculum, and
enhance the overall students’ learning experience.
The proposed framework is based on network sci-
ence [8–11], which is an interdisciplinary field that
emerged at the intersection of graph theory, compu-
tational statistics, computer science, and statistical
arXiv:2210.01269v3 [physics.soc-ph] 28 Apr 2023