Transformer-based Flood Scene Segmentation for Developing Countries Ahan M R

2025-05-06 0 0 819.75KB 7 页 10玖币
侵权投诉
Transformer-based Flood Scene Segmentation for
Developing Countries
Ahan M R
BITS Pilani Goa Campus
ahanmr98@gmail.com
Roshan Roy*
BITS Pilani
rroshanroy@gmail.com
Shreyas Sunil Kulkarni
BITS Pilani Hyderabad Campus
sskshreyas@gmail.com
Vaibhav Soni
MANIT Bhopal
vaibsoni@gmail.com
Ashish Chittora
BITS Pilani Goa Campus
ashishc@goa.bits-pilani.ac.in
Abstract
Floods are large-scale natural disasters that often induce a massive number of
deaths, extensive material damage, and economic turmoil. The effects are more
extensive and longer-lasting in high-population and low-resource developing coun-
tries. Early Warning Systems (EWS) constantly assess water levels and other
factors to forecast floods, to help minimize damage. Post-disaster, disaster response
teams undertake a Post Disaster Needs Assessment (PDSA) to assess structural
damage and determine optimal strategies to respond to highly affected neighbor-
hoods. However, even today in developing countries, EWS and PDSA analysis of
large volumes of image and video data is largely a manual process undertaken by
first responders and volunteers. We propose FloodTransformer, which to the best
of our knowledge, is the first visual transformer-based model to detect and segment
flooded areas from aerial images at disaster sites. We also propose a custom metric,
Flood Capacity (FC) to measure the spatial extent of water coverage and quantify
the segmented flooded area for EWS and PDSA analyses. We use the SWOC Flood
segmentation dataset and achieve 0.93 mIoU, outperforming all other methods.
We further show the robustness of this approach by validating across unseen flood
images from other flood data sources.
1 Introduction and Context
The Center for Research on the Epidemiology of Disasters, in affiliation with the World Health
Organization (WHO), reported that natural disasters accounted for 1.3 million deaths and over USD
2 trillion in economic damage — all between 1998 and 2017 [
19
]. Flooding related damage is
a factor in most of them [
4
] and frequent the list of most expensive disasters [
17
]. Developing
economies of Asia are disproportionately affected and are the worst-hit by floods, accounting for
44% of all flood disasters from 1987-1997 [
18
]. India alone registers 1/5th of global deaths from
floods [
11
]. Rapid urbanization, global climate change, and rising sea water levels will expose 1.47
billion more people to flood risk, with 89% of them living in low-middle income countries [
5
]. Flood
Segmentation technology is instrumental for Disaster Prediction and Response is critical to save lives
and livelihoods.
Flood Response
: Typically, disaster management teams complete a Post Disaster Needs Assessment
(PDSA) and rapidly develop infrastructure based on this report on the collected data [
6
]. Unmanned
Aerial Vehicles (UAVs) are deployed to collect large volumes of image and video data in affected
Equal Contribution
35th Conference on Neural Information Processing Systems (NeurIPS 2021), virtual.
arXiv:2210.04218v1 [cs.CV] 9 Oct 2022
regions. PDSA is essential for identification of submerged regions, sanity check of large building
structures, debris identification, and search-and-rescue (S&R) operations.
Flood Forecasting
: Flood segmentation techniques can be critically important for flooding-related
Early Warning Systems (EWS). According to research in [
12
], Indians given a flood warning are
twice as likely to evacuate safely than Indians without any notice. which require constant monitoring
of river or sea water levels. Comparison of current levels with historical evidence of flood-prone
water levels can help understand when to trigger warnings appropriately.
Constraints
: Developing countries are plagued by resource and economic constraints. Failure of
macro- and micro- infrastructure planning in Nicaragua led to re-construction on top of an earthquake
faultline [
7
]. Weak social safety and insurance policies inflate recovery time [
27
]. Economic
vulnerability renders countries like Haiti, Ethiopia, Nepal, El Salvador in a near-permanent state of
emergency alert [
7
]. In these countries, processing and analysis of large-scale visual data from UAVs
for PDSA in Flood response is a manual process that requires multi-team intervention, which poses a
serious bottleneck in search-and-response speed. Deployment of EWSs is infeasible because human
monitoring of video feeds is too cumbersome and expensive.
AI Technology
: To reduce the burden of manual analysis on crisis responders, Deep learning is
well-suited to scale, automate and expedite these operations. The last few years have witnessed
a tremendous rise in CNN-based image classification and segmentation research [
21
]. However,
CNNs suffer from a well-known problem — large inductive biases. Conceretely, CNNs assume
locality and translation equivariance, which hurt the interpretability of pure CNN-based algorithms.
Recently, visual transformers have garnered attention for image classification, segmentation and
object detection tasks [2, 20, 26, 3] for challenging these assumptions with comparable accuracy.
Contributions
: In this work, we propose a hybrid fused CNN-Transformer: FloodTransformer to
tackle flood water segmentation on the Water Segmentation Open Collection (WSOC) dataset [
22
].
First, we achieve state-of-the-art results and are the first work (to the best of our knowledge) to apply
new transformer-driven research to the flood data domain. Second, our approach is extendable —
we demonstrate the ability of our model to generalize well on unseen data sources. Further local
calibration, if required at all, simply requires weight fine-tuning with previous, region-specific, flood
scene data. Third, our model does not suffer from data scarcity - it only requires image data input and
not complex sensor data which is hard to collect [
13
]. Last, the transformer-based encoder applies
recent DL innovations to the flood data domain. Although the hybrid method still uses CNNs in
the decoder network, the aforementioned spatial inductive biases no longer occur throughout the
entire network. Dependencies between patch embeddings are learnt from scratch. This improves the
robustness of our approach.
2 Methodology
To achieve the Flood Scene Understanding, we introduce a Deep Learning model for Flood image
segmentation and quantify the impact of flooding with a custom metric called Flooding Capacity.
2.1 Method
Inspired by Zhang et al. [
23
], we propose FloodTransformer to solve segmentation for the flood data
domain. It is a fusion architecture of Visual Transformer [
25
] and Convolution Neural Networks
(CNNs) and its model architecture is displayed in Figure 1.
Complex flooding imagery may contain heterogeneous objects, flooding patterns and backgrounds.
Using the self-attention module of the visual Transformer module from [
25
] and global vector
representation learned from the CNN network, FloodTransformer fuses the trained embeddings to
learn long-term spatial relationships between the aforementioned entities in images of flood affected
areas. Using Hadamard bilinear product [
23
], the fusion module fuses information via embeddings
from both parallel streams into a dense representation. The combination of multi-level fusion maps
generates the segmentation output of the model. We summarize each component below, per Zhang et
al. [23].
Transformer Module
: We use the encoder-decoder network using Visual Transformer [
25
]. The
input image
xRH×W×3
is sliced into N patches, where
N=H
F×W
F
and F is usually set to 16 or
2
摘要:

Transformer-basedFloodSceneSegmentationforDevelopingCountriesAhanMRBITSPilaniGoaCampusahanmr98@gmail.comRoshanRoy*BITSPilanirroshanroy@gmail.comShreyasSunilKulkarniBITSPilaniHyderabadCampussskshreyas@gmail.comVaibhavSoniMANITBhopalvaibsoni@gmail.comAshishChittoraBITSPilaniGoaCampusashishc@goa.bits-...

展开>> 收起<<
Transformer-based Flood Scene Segmentation for Developing Countries Ahan M R.pdf

共7页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:7 页 大小:819.75KB 格式:PDF 时间:2025-05-06

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 7
客服
关注