Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Edge AI-Enhanced Traffic Monitoring and Anomaly Detection Using Multimodal Large Language Models

Conference ·

This paper addresses the challenge of traffic monitoring and incident detection in remote areas, utilizing multimodal large language models (LLMs) deployed on edge AI devices. The key novelty of the LLM is to convert real-time video streams into descriptive texts, enabling low-bandwidth transmissions and reliable detection of anomalies and incidents in environments of intermittent connectivity. The model is developed based on fine-tuning open-source LLMs and extending it with multi-modal capabilities to analyze video frames. Our work also involves deploying this model on edge devices such as Nvidia IGX Orin and is planned to be tested in realistic environments in future work. The methodology includes data set curation, iterative model fine-tuning and compression, and hardware-based optimization. This approach aims to enhance traffic safety and response speed in remote areas, marking a significant advancement in the application of AI for traffic monitoring and safety management.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE); USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2573307
Country of Publication:
United States
Language:
English

Similar Records

SCITUNE: Aligning Large Language Models with Human-Curated Scientific Multimodal Instructions
Conference · Mon Nov 11 23:00:00 EST 2024 · OSTI ID:2477906

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators
Conference · Tue Dec 31 23:00:00 EST 2024 · OSTI ID:2563712

Related Subjects