Abstract
Highway accidents remain a critical public safety challenge, where delayed response times intensify injury severity and fatalities. Traditional systems relying on manual reporting suffer from inefficiencies and critical delays during emergencies. This paper presents an autonomous drone based highway incident management system that integrates advanced aerial technology with Vision-Language Models (VLMs) and Large Language Models (LLMs) to enable rapid detection, analysis, and response. The framework employs a DJI Matrice 30T drone equipped with a dock station for autonomous deployment for incident analysis by yielding incident coordinates provided by authorities. Upon arrival, real-time video is analyzed using an optimized fine-tuned YOLOv12n model selected after rigorous comparison of nine YOLO variants to detect incidents such as collisions and fires with high precision. To generate actionable intelligence, high-confidence video frames are processed using a hybrid vision-language pipeline comprising of LLaVA-OneVision-Qwen2 and GPT-4o API. This combination yields detailed natural language descriptions and structured summaries of incident scenes, offering enhanced contextual awareness of the incident site. By combining autonomous drone navigation, real-time object detection, VLM-based scene interpretation, and LLM-driven summarization, this end-to-end solution enhances situational awareness and decision-making for emergency responders. These results demonstrate the practical value of integrating multi-modal AI with UAVs in critical safety domains, laying the groundwork for scalable, intelligent transportation infrastructure and next-generation emergency response systems.
| Original language | English |
|---|---|
| Pages (from-to) | 183314-183329 |
| Number of pages | 16 |
| Journal | IEEE Access |
| Volume | 13 |
| DOIs | |
| State | Published - 2025 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
-
SDG 11 Sustainable Cities and Communities
Keywords
- GPT-4o mini
- Highway incident detection
- LLMs
- LLaVA-OneVision-Qwen2
- VLMs
- YOLOv12n
- autonomous drone deployment
- multimodal AI pipeline
- real-world validation
Fingerprint
Dive into the research topics of 'Multi-Modal Autonomous Drone System for Real-Time Highway Incident Detection and Analysis'. Together they form a unique fingerprint.Press/Media
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver