A Systematic Review of Machine Learning Techniques for Cattle Identification Datasets Methods and Future Directions Md Ekramul Hossainae Muhammad Ashad Kabirabe Lihong Zhengae Dave L. Swainbce Shawn McGrathbde

2025-04-30 0 0 1.19MB 34 页 10玖币
侵权投诉
A Systematic Review of Machine Learning Techniques for Cattle Identification:
Datasets, Methods and Future Directions
Md Ekramul Hossaina,e, Muhammad Ashad Kabira,b,e,
, Lihong Zhenga,e, Dave L. Swainb,c,e, Shawn McGrathb,d,e,
Jonathan Medwayb,e
aSchool of Computing, Mathematics and Engineering, Charles Sturt University, Bathurst, NSW 2795, Australia
bGulbali Institute for Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, NSW, 2678, Australia
cTerraCipher Pty. Ltd., Alton Downs, QLD 4702, Australia
dFred Morley Centre, School of Animal and Veterinary Sciences, Charles Sturt University, Wagga Wagga, NSW 2678, Australia
eFood Agility CRC Ltd, Sydney, NSW 2000, Australia
Abstract
Increased biosecurity and food safety requirements may increase demand for ecient traceability and identifica-
tion systems of livestock in the supply chain. The advanced technologies of machine learning and computer vision
have been applied in precision livestock management, including critical disease detection, vaccination, production
management, tracking, and health monitoring. This paper oers a systematic literature review (SLR) of vision-based
cattle identification. More specifically, this SLR is to identify and analyse the research related to cattle identification
using Machine Learning (ML) and Deep Learning (DL). This study retrieved 731 studies from four online scholarly
databases. Fifty-five articles were subsequently selected and investigated in depth. For the two main applications of
cattle detection and cattle identification, all the ML based papers only solve cattle identification problems. However,
both detection and identification problems were studied in the DL based papers. Based on our survey report, the most
used ML models for cattle identification were support vector machine (SVM), k-nearest neighbour (KNN), and arti-
ficial neural network (ANN). Convolutional neural network (CNN), residual network (ResNet), Inception, You Only
Look Once (YOLO), and Faster R-CNN were popular DL models in the selected papers. Among these papers, the
most distinguishing features were the muzzle prints and coat patterns of cattle. Local binary pattern (LBP), speeded
up robust features (SURF), scale-invariant feature transform (SIFT), and Inception or CNN were identified as the
most used feature extraction methods. This paper details important factors to consider when choosing a technique or
method. We also identified major challenges in cattle identification. There are few publicly available datasets, and
the quality of those datasets are aected by the wild environment and movement while collecting data. The process-
ing time is a critical factor for a real-time cattle identification system. Finally, a recommendation is given that more
publicly available benchmark datasets will improve research progress in the future.
Keywords: Cattle identification, cattle detection, machine learning, deep learning, cattle farming.
Corresponding author: School of Computing, Mathematics and Engineering, Charles Sturt University, Panorama Ave, Bathurst, NSW 2795.
Ph.+61263386259, Email: akabir@csu.edu.au
Email addresses: mdhossain@csu.edu.au (Md Ekramul Hossain), akabir@csu.edu.au (Muhammad Ashad Kabir),
Published in Artificial Intelligence in Agriculture, vol 6, pp. 138-155, 2022, https: // doi. org/ 10. 1016/ j. aiia. 2022. 09. 002 October 18, 2022
arXiv:2210.09215v1 [cs.CV] 13 Oct 2022
1. Introduction
The demand for ecient traceability and identification systems for livestock is growing due to biosecurity and
food safety requirements in the supply chain. The advanced technologies of machine learning and computer vision
have been applied in precision livestock management, including critical disease detection, vaccination, production
management, tracking, health monitoring, and animal well-being monitoring (Awad,2016). ‘Cattle identification’
refers to ‘cattle detection’ and ‘cattle recognition’ (Mahmud et al.,2021). Cattle identification systems start from
manual identification to automatic identification with the help of image processing. Traditional cattle identification
systems such as ear tagging (Awad,2016), ear notching (Neary and Yager,2002), and electronic devices (Ruiz-Garcia
and Lunadei,2011) have been used for individual identification in cattle farming. Disadvantages of these individual
identification methods include the possibility of losses, duplication, electronic device malfunctions, and fraud of the
tag number (Rossing,1999;Roberts,2006). These are the issues and challenges for cattle identification in livestock
farm management.
With the advent of computer-vision technology, cattle visual features have gained popularity for cattle identifi-
cation (Kusakunniran and Chaiviroonjaroen,2018;Andrew et al.,2016,2017;de Lima Weber et al.,2020). Visual
feature based cattle identification systems are used to detect and classify dierent breeds or individuals based on a set
of unique features. In recent years, machine learning (ML) and deep learning (DL) approaches have been widely used
for automatic cattle identification using visual features (Andrew et al.,2016;Tharwat et al.,2014b;Andrew et al.,
2019;Qiao et al.,2019;Li et al.,2021b). ML and DL are subfields of artificial intelligence that can solve complex
problems for automatic decision-making. ML is mainly divided into two approaches, such as supervised learning
and unsupervised learning. The supervised ML approach is defined by its use of labelled datasets, whereas the unsu-
pervised learning uses ML algorithms to analyse and cluster unlabeled datasets. An unsupervised ML approach can
detect hidden patterns in data without human supervision (Janiesch et al.,2021). DL approaches are useful in areas
with large and high-dimensional datasets. Thus, DL models are usually outperformed over traditional ML models in
the area of text, speech, image, video, and audio data processing (LeCun et al.,2015). There are two main steps in the
development of ML and DL models. In the first step, a training dataset is used to train the model, and in the second,
the model is validated using a separate validation dataset. Thus, a trained model is created that is later used on the test
dataset to determine its performance based on the test dataset. The dataset used for ML models includes the features
and their corresponding outcomes or labels. The features are extracted from the input data using a feature extraction
method. DL algorithms can automatically extract high-level features from the dataset and learn from these features.
Although the implementation of the ML and DL models is straightforward, there are some challenges with selecting
algorithms, tuning parameters, and features for better prediction accuracy (Janiesch et al.,2021).
Several important review studies have been completed in livestock farm management. Some recent literature re-
lzheng@csu.edu.au (Lihong Zheng), dave.swain@terracipher.com (Dave L. Swain), shmcgrath@csu.edu.au (Shawn McGrath),
jmedway@csu.edu.au (Jonathan Medway)
ii
views have addressed various research challenges in livestock farming, such as identification, tracking, and health
monitoring, using tag-based, ML, and DL approaches. Recently, Awad (2016) and Kumar and Singh (2020) reviewed
the literature on using dierent classical and visual biometrics methods for cattle identification and tracking. Li et al.
(2021a) reviewed the deep learning-based approaches for classification, object detection and segmentation, pose es-
timation, and tracking for dierent kinds of animals such as cattle, pigs, sheep, and poultry. A systematic literature
review based on applying ML and DL approaches in precision livestock farming by Garcia et al. (2020) focused on
grazing and animal health. Qiao et al. (2021) summarised the ML and DL approaches in precision cattle farming
for cattle identification, body condition score evaluation, and live weight estimation. They reviewed a small number
of articles (n=13) related to cattle identification using ML and DL approaches. Mahmud et al. (2021) conducted a
systematic literature review showing the recent progress of DL applications for cattle identification and health mon-
itoring. Their review included only a few articles related to cattle identification. Moreover, these review articles
focused on the combination of dierent types of challenges (e.g., tracking, pose estimation, weight estimation, identi-
fication, and detection) solved by tag-based, ML, and DL methods in precision livestock farming. Thus, they lack in
providing a comprehensive review on cattle identification. Also, the existing review articles lack information on ML
and DL applications combined for cattle identification as they cover partly either ML or DL for cattle identification.
Moreover, the details of the cattle dataset for identification are not discussed. In this context, an extensive systematic
literature review is needed, particularly for the challenge of cattle identification addressed by ML and DL approaches.
Also, the details of the dataset used in the relevant articles need to be discussed, and the current trend of using ML
and DL techniques in cattle identification and future research opportunities with challenges need to be identified.
This systematic literature review (SLR) aims to summarise and analyse the ML and DL applications used exten-
sively in cattle identification. A total of 55 articles for cattle identification and detection have been selected for this
SLR. The reviewed articles are first summarised, and then the datasets used in the selected articles are discussed. We
then analyse the reviewed articles for trends in using ML and DL approaches for cattle identification in recent years
before presenting the feature extraction methods and performance evaluation metrics extracted from the reviewed
articles. Finally, the challenges and future research directions in this field are discussed.
2. Methodology
2.1. Review process
The review process of an SLR is divided into three phases – planning, conducting, and reporting the review
(Kitchenham and Charters,2007). In the first phase, the research questions for the SLR are identified. Based on the
research questions, the electronic databases and search terms or keywords were determined. The search keywords
are used to create a search string that is applied to the dierent electronic databases to extract the related articles for
the SLR. This study used the IEEE Xplore, Science Direct, Scopus, and Web of Science databases. These databases
were selected to cover a wide range of studies in our targeted sector as they index most of the journals from various
iii
publishers such as Springer, ACM, Inderscience, Elsevier, Sage, Taylor & Francis, IOS, Wiley, and so on. In the
second phase, the relevant research studies are identified by searching the databases. After that, the selection criteria
are determined for the quality assessment of the primary studies. The eligible studies are selected by applying the
selection criteria, and then the relevant data are extracted from the selected articles based on the research questions.
In the final phase, the extracted data are analysed and used to address the research questions. Then, the results are
reported in the form of tables and figures followed by a brief discussion of research challenges and future research
opportunities.
2.2. Research questions
This SLR focuses on published research studies into cattle identification using ML and DL approaches. The search
process identifies potential primary studies that address the research questions. The answers to the research questions
are discussed based on the data extracted from the selected studies. This study defined the following seven research
questions (RQs) for the SLR.
RQ1: What ML models are used in cattle identification?
RQ2: What DL models are used in cattle identification?
RQ3: What datasets are used in cattle identification?
RQ4: What feature extraction methods are used in cattle identification?
RQ5: What performance evaluation metrics are used for ML and DL models in cattle identification?
RQ6: What are the best ML and DL models used in a specific cattle identification problem?
RQ7: What are the challenges in solving cattle identification problems?
2.3. Search strategy
A search strategy is applied to keep the search results within the scope of the SLR. In this study, the initial
search was performed using a string with four keywords. The search string was (“cattle” AND “identification”)
AND (“machine learning” OR “deep learning”). Some articles were extracted from the search results, and the title,
abstract, and author-specified keywords were read to find the synonyms for the basic search keywords. For “cattle”,
synonyms considered were “cow” and “livestock”. For “identification”, synonyms considered were “recognition” and
“detection”. The keywords “neural network”, “image processing” and “vision” were added with “machine learning”
and “deep learning” as similar terms. Thus, the general search string was (“cattle” OR “cow*” OR “livestock”)
AND (“identification” OR “recognition” OR “detection”) AND (“machine learning” OR “deep learning” OR “neural
network” OR “image processing” OR “vision”). The search keywords were used for articles in four databases (August
2021). The search strings for the databases are shown in Table 1.
iv
Table 1: Search strings for the selected databases.
Database name Search string
IEEE Xplore ((cattle OR cow* OR livestock) AND (identification OR recognition OR de-
tection) AND (“deep learning” OR “machine learning” OR “neural network”
OR “image processing” OR vision)) (anywhere).
Science Direct (cattle OR cow) AND (identification OR recognition OR detection) AND
(“deep learning” OR “machine learning” OR “neural network” OR “image pro-
cessing”). It was used to search in the title, abstract and keywords.
Scopus TITLE-ABS-KEY ((“cattle identification” OR “cow* identification” OR “live-
stock identification” OR “cattle recognition” OR “cow* recognition” OR “live-
stock recognition” OR “cattle detection” OR “cow* detection” OR “livestock
detection”) AND (“deep learning” OR “machine learning” OR “neural net-
work” OR “image processing” OR vision)). It was used to search in the title
(TITLE), abstract (ABS) and keywords (KEY).
Web of Science AB=((cattle OR cow* OR livestock) AND (identification OR recognition OR
detection) AND (“deep learning” OR “machine learning” OR “neural net-
work”)) OR AK=((cattle OR cow* OR livestock) AND (identification OR
recognition OR detection) AND “deep learning” OR“machine learning” OR
“neural network”)) OR TI=((cattle OR cow* OR livestock) AND (identifica-
tion OR recognition OR detection) AND (“deep learning” OR “machine learn-
ing” OR “neural network”)). It was used to search in the title (TI), abstract
(AB) and author keywords (AK).
This study reduced some keywords from the search string for the Science Direct database as the maximum Boolean
connectors (AND/OR) for this database is eight. Since the Scopus database yielded many articles with the general
search string, the search results were reduced by putting two dierent keywords together. In this SLR, we did not
limit the publication year during the search. After performing the above search strings, a total of 731 articles were
retrieved.
2.4. Study selection criteria
The selection criteria are used to identify the studies that can answer the research questions. In this study, inclusion
and exclusion criteria were defined based on the research questions. The search results from all databases were
recorded on a spreadsheet for scrutiny using the inclusion and exclusion criteria. A study was selected for the SLR
when the inclusion criteria were true but the exclusion criteria were false. The exclusion criteria were: (i) publication
is not related to ML or DL for cattle identification, (ii) publication is a survey or review paper, (iii) publication is
v
摘要:

ASystematicReviewofMachineLearningTechniquesforCattleIdentication:Datasets,MethodsandFutureDirectionsMdEkramulHossaina,e,MuhammadAshadKabira,b,e,,LihongZhenga,e,DaveL.Swainb,c,e,ShawnMcGrathb,d,e,JonathanMedwayb,eaSchoolofComputing,MathematicsandEngineering,CharlesSturtUniversity,Bathurst,NSW2795,...

展开>> 收起<<
A Systematic Review of Machine Learning Techniques for Cattle Identification Datasets Methods and Future Directions Md Ekramul Hossainae Muhammad Ashad Kabirabe Lihong Zhengae Dave L. Swainbce Shawn McGrathbde.pdf

共34页,预览5页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:34 页 大小:1.19MB 格式:PDF 时间:2025-04-30

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 34
客服
关注