Search and rescue operations often face difficulties due to unpredictable weather, rugged terrain, and limited resources. Traditional methods rely heavily on experienced personnel, making the process time-consuming and labor-intensive. While unmanned aerial vehicles(UAVs) offer a promising alternative, their effectiveness has been constrained by difficulties in detecting small or partially visible individuals from aerial imagery. These limitations highlight the need for improved APD technologies to enhance the accuracy and efficiency of rescue efforts.
Published (DOI: 10.34133/remotesensing.0474) on March 13, 2025, in the journal Journal of Remote Sensing, researchers from Northwestern Polytechnical University and Yan’an University have developed an AI-powered APD system designed to improve the detection of individuals in aerial images captured by drones. The system addresses critical challenges such as occlusion, scale variation, and changing lighting conditions, aiming to increase the precision and reliability of SaR operations, particularly in remote and inaccessible areas.
The research team compiled the VTSaR dataset, which includes diverse environments, human behaviors, and multiple capture angles. The dataset integrates visible and infrared images alongside synthetic data to establish a comprehensive benchmark for APD. Testing of several detection algorithms demonstrated notable improvements in detection accuracy and efficiency. The proposed system performed well under challenging conditions, offering better results compared to existing technologies in handling occlusions and variations in scale and lighting.
"Our research contributes to the development of more effective Aerial Person Detection for search and rescue missions," said Dr. Xiangqing Zhang, the study’s lead researcher. "By integrating AI with multimodal data fusion, we have designed a system that improves detection capabilities in complex environments, making SaR operations more efficient and reliable."
The study employed a specially designed unmanned helicopter with a dual-camera gimbal system to capture aerial images from various environments, including urban, suburban, maritime, and wilderness areas. The VTSaR dataset consists of three versions: Unaligned VTSaR (UA-VTSaR), Aligned VTSaR (A-VTSaR), and Aligned Synthetic VTSaR (AS-VTSaR), containing a total of 19,956 real-world instances and 54,749 synthetic instances. Researchers tested models such as YOLOv8-s and EfficientViT, achieving a precision of 95.03% and a mean average precision (mAP) of 94.91%. The study emphasized the importance of combining visible and infrared imagery to enhance detection performance across different environmental conditions.
Beyond SaR, the APD system has potential applications in disaster response, security monitoring, and law enforcement. Improved accuracy in APD could support efforts in locating missing persons, patrolling high-risk areas, and responding to emergencies more effectively. As AI and UAV technologies advance, the system may also be adapted for applications such as wildlife monitoring and border surveillance, further contributing to safety and security efforts.
This study presents an AI-powered APD system that improves SaR efficiency by addressing key technical challenges. By leveraging AI-driven analysis and multimodal data fusion, the system offers a more precise and adaptable approach to locating individuals in complex environments. These advancements contribute to making rescue operations more effective and increasing the likelihood of timely intervention in critical situations.
###
Refereneces
DOI
10.34133/remotesensing.0474
Original Source URL
https://doi.org/10.34133/remotesensing.0474
Funding information
This work was supported in part by the National Natural Science Foundation of China under Grants 62171381.
About Journal of Remote Sensing
The Journal of Remote Sensing, an online-only Open Access journal published in association with AIR-CAS, promotes the theory, science, and technology of remote sensing, as well as interdisciplinary research within earth and information science.