KYA: Vision–Language Assistant for Emotional Reactions to Risky Driving

Abstract

This study introduces a vision–language pipeline that detects risky driving behaviors and generates emotionally expressive responses to support driver awareness and comfort. Although vision–language models have advanced perception and reasoning in autonomous driving, existing systems rarely consider the emotional dimension or real-world user experience. Keep Yelling Assistant (KYA) detects high-risk driving maneuvers, such as sudden cut-ins, in real time and then produces emotional responses through a large language model (LLM) tailored to driver preferences. The framework comprises two core modules. The vision module uses You Only Look Once (YOLO) v8 variants to detect nearby vehicles and identify risky behaviors. Key driving metrics, including relative distance, speed, and projected reach time, are extracted and normalized to produce a structured behavior log. The language module processes this log together with user-defined emotional tone settings (e.g., neutral, humorous, analytical) and generates verbal reactions using four state-of-the-art LLMs (ChatGPT-4o, Claude 3, Gemini 2.5, and Copilot). We evaluated the proposed system using dashcam videos containing risky driving behaviors and a user study involving 108 participants. Participants selected preferred response styles, and the LLMs were evaluated on emotional alignment. All models received favorable ratings, though preferences varied across personas. Notably, the combination of YOLOv8s and ChatGPT-4o achieved the highest score, 4.29 out of 5.00. By integrating real-world perception with emotionally adaptive dialogue, KYA advances emotionally intelligent in-vehicle artificial intelligence and highlights new opportunities to improve safety, trust, and driver comfort in conventional and autonomous vehicles.
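
The handoff between the two modules can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give formulas, so the reach-time definition (distance divided by closing speed), the 5-second normalization horizon, and every field and function name below are assumptions made for illustration. No LLM is called; the sketch only builds the prompt text that a language module might receive.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class BehaviorLogEntry:
    """One record of a structured behavior log (field names are hypothetical)."""
    event: str
    relative_distance_m: float
    closing_speed_mps: float
    reach_time_s: Optional[float]  # None when the gap is not closing
    risk_score: float              # normalized to [0, 1]


def projected_reach_time(distance_m: float, closing_speed_mps: float) -> Optional[float]:
    """Assumed definition: time for the gap to close at the current closing speed."""
    if closing_speed_mps <= 0:
        return None
    return distance_m / closing_speed_mps


def normalize_risk(reach_time_s: Optional[float], horizon_s: float = 5.0) -> float:
    """Map reach time onto [0, 1]: 1 is imminent, 0 is at or beyond the horizon.

    The 5-second horizon is an illustrative assumption, not a value from the study.
    """
    if reach_time_s is None:
        return 0.0
    return max(0.0, min(1.0, 1.0 - reach_time_s / horizon_s))


def build_entry(event: str, distance_m: float, closing_speed_mps: float) -> BehaviorLogEntry:
    """Combine the extracted metrics into one log entry."""
    reach = projected_reach_time(distance_m, closing_speed_mps)
    return BehaviorLogEntry(event, distance_m, closing_speed_mps, reach, normalize_risk(reach))


def build_prompt(entry: BehaviorLogEntry, tone: str) -> str:
    """Pair the log entry with a user-selected tone for the language module."""
    return (
        f"Respond in a {tone} tone to this driving event.\n"
        f"Behavior log: {json.dumps(asdict(entry))}"
    )


entry = build_entry("sudden_cut_in", distance_m=10.0, closing_speed_mps=4.0)
prompt = build_prompt(entry, tone="humorous")
print(prompt)
```

Keeping the log as a plain serializable record means the same entry could be sent to any of the evaluated LLMs with only the tone setting changed.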

Keywords

advanced driver assistance systems; in-vehicle warning systems; vision–language models; driver behavior; risky driving
