Improving Driver Drowsiness Detection via Personalized EAR/MAR Thresholds and CNN-Based Classification

arXiv cs.CV / 4/27/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The paper addresses the common limitation of driver drowsiness monitoring that uses fixed EAR/MAR thresholds, which often do not generalize well across different individuals and driving conditions.
  • It proposes a personalized real-time detection approach that calibrates driver-specific EAR and MAR thresholds before driving and monitors eyelid movement, head position, and yawning to issue fatigue warnings.
  • To improve performance in difficult scenarios, the system integrates CNN-based classification alongside classical metric-based EAR/MAR detection.
  • Experiments on public datasets and a custom dataset collected under varied lighting, head poses, and user characteristics show that personalized thresholding boosts accuracy by about 2–3% over fixed thresholds.
  • The CNN components report high performance, achieving 99.1% accuracy for eye-state detection and 98.8% for yawning detection, supporting the effectiveness of combining classical signals with deep learning for robust monitoring.

Abstract

Driver drowsiness is a major cause of traffic accidents worldwide, posing a serious threat to public safety. Vision-based driver monitoring systems often rely on fixed Eye Aspect Ratio (EAR) and Mouth Aspect Ratio (MAR) thresholds; however, such fixed values frequently fail to generalize across individuals due to variations in facial structure, illumination, and driving conditions. This paper proposes a personalized driver drowsiness detection system that monitors eyelid movements, head position, and yawning behavior in real time and provides warnings when signs of fatigue are detected. The system employs driver-specific EAR and MAR thresholds, calibrated before driving, to improve classical metric-based detection. In addition, deep learning-based Convolutional Neural Network (CNN) models are integrated to enhance accuracy in challenging scenarios. The system is evaluated using publicly available datasets as well as a custom dataset collected under diverse lighting conditions, head poses, and user characteristics. Experimental results show that personalized thresholding improves detection accuracy by 2-3% compared to fixed thresholds, while CNN-based classification achieves 99.1% accuracy for eye state detection and 98.8% for yawning detection, demonstrating the effectiveness of combining classical metrics with deep learning for robust real-time driver monitoring.