
1
Granular-Ball Fuzzy Set and Its Implementation in
SVM
Shuyin Xia, Xiaoyu Lian, Guoyin Wang*, Xinbo Gao, Yabin Shao
Abstract—Most existing fuzzy set methods use points as their
input, which is the finest granularity from the perspective of
granular computing. Consequently, these methods are neither
efficient nor robust to label noise. Therefore, we propose a frame-
work called granular-ball fuzzy set by introducing granular-ball
computing into fuzzy set. The computational framework is based
on the granular-balls input rather than points; therefore, it is
more efficient and robust than traditional fuzzy methods, and
can be used in various fields of fuzzy data processing according
to its extensibility. Furthermore, the framework is extended to
the classifier fuzzy support vector machine (FSVM), to derive the
granular ball fuzzy SVM (GBFSVM). The experimental results
demonstrate the effectiveness and efficiency of GBFSVM. The
source codes and data sets are available on the public link:
http://www.cquptshuyinxia.com/GBFSVM.html.
Index Terms—Fuzzy set, granular-ball, SVM, granular com-
puting, label noise.
I. INTRODUCTION
IN the practical world, there are numerous fuzzy phenomena
or concepts in the objective world, such as big and small,
light and heavy, fast and slow, dynamic and static, deep and
shallow, beauty and ugliness, etc., which cannot be clearly
and completely distinguished. In fact, fuzzy information is
also reliable information. In order to quantitatively describe
the objective laws of fuzzy concepts and fuzzy phenomena,
Professor L.A. Zadeh, an American computer and cybernetics
expert, put forward the important concept of fuzzy set [1] in
1965. He used membership functions to represent fuzzy sets,
which are functions of [0,1] closed intervals, to describe the
degree to which elements belong to fuzzy sets. The greater the
function value, the greater the degree of membership. Since
Zadeh introduced fuzzy sets [2], it has been applied to various
fields such as control systems, pattern recognition, machine
learning, etc, and its another branch, fuzzy rough set, has also
been developed rapidly. Several scholars have conducted in-
depth research in the direction of feature selection [3], [4],
[5], [6], [7], [8], clustering [9], decision making [10], [11] ,
classification [12] and so on.
Considering the classification problem of fuzzy data sets,
Lin et al. [1] proposed a fuzzy support vector machine (FSVM)
model by applying fuzzy membership to each input point.
The model can make full use of the sample information,
however, the complexity of the training stage is still high
for a large number of data classification problems. For the
research on fuzzy set classification tasks in the field of machine
S. Xia, X. Lian, G. Wang, X. Gao and Y. Shao are with the Chongqing
Key Laboratory of Computational Intelligence, Chongqing University of
Telecommunications and Posts, 400065, Chongqing, China. E-mail: xi-
asy@cqupt.edu.cn, 1258852995@qq.com, shaoyb@cqupt.edu.cn.
Fig. 1. Human cognition the coarse-grained large range is preferred.
learning, Aydogan et al. [13] proposed a hybrid heuristic
method based on the genetic algorithm (GA) and integer
programming formula (IPF) to solve the high-dimensional
classification problem in the classification system of linguistic
fuzzy rules. The method can find accurate and concise classi-
fication rules, but can not flexibly consider the number of rule
sets generated in the classification. Sanz et al. [14] directly
learned interval-valued fuzzy rules by defining a packaging
method to obtain a classification system based on the interval
valued fuzzy principle. Compared with the existing algorithm
at that time, the accuracy of this method has been significantly
improved, but the unbalanced classification problem can not
be well tested. The algorithm is inefficient owing to its two
evolutionary processes. Li et al. [15] proposed an interval
extreme learning machine for interval fuzzy set classification
of continuous-valued attributes, in which the discretization
of conditional attributes and fuzzification of class labels are
considered. Recently, an associative fuzzy classifier called
CFM-BD [16] was been developed, which has shown ro-
bust predictive performance against more complex algorithms
such as fuzzy decision trees [17]. To simplify the rule set,
Aghaeipoor et al. [18] proposed a new scalable fuzzy classifier
for big data, namely Chi-BD-DRF, which added the method of
"dynamic rule filtering (DRF)" to supplement fuzzy big data
learning.
The aforementioned mentioned processing methods are
based on the finest granularity from the perspective of granular
computing [19], [20], as shown in Fig. 2(a), therefore, it is not
efficient and robust. Human cognition has the rule of "large
scope first," and the visual system is particularly sensitive
to the global topological characteristics, from large to small,
from coarse-grained to fine-grained as shown in Fig. 1 [21]. In
granular computing, the larger the granularity size, the higher
the efficiency and the better the robustness to noise. However,
arXiv:2210.11675v2 [cs.LG] 26 Nov 2022