Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Deep Learning-Based Driver Intention Recognition: Evaluating performance, complexity and uncertainty estimations
University of Skövde, School of Informatics. University of Skövde, Informatics Research Environment. Volvo Car Corporation, Sweden. (Skövde Artificial Intelligence Lab (SAIL))ORCID iD: 0000-0003-2135-6615
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep learning (DL) methods have advanced rapidly and are commonly applied in high-risk, resource-constrained environments such as advanced driver assistance systems (ADAS), where misclassifications can have serious consequences. With upcoming artificial intelligence (AI) legislation, it is essential to extensively evaluate and minimize the undesirable behavior of DL-based systems in such settings. An example is an ADAS that continuously evaluates whether a driver’s intended maneuvers are safe to execute given the current traffic context. Driver intention recognition (DIR), which predicts the maneuver a driver intends to perform in the near future, is a central DL-based component of such systems. Since deep neural networks (DNNs) do not inherently provide uncertainty estimates for their predictions, probabilistic deep learning (PDL) methods can be applied to improve the identification of scenarios where model outputs may be unreliable. In this thesis, we first review the current state of DIR research, focusing on the recent shift toward DL methods. We then examine how both established and novel PDL methods influence DIR performance. We evaluate the uncertainty estimations by analyzing their ability to distinguish between correct and incorrect predictions and by measuring their effectiveness in out-of-distribution (OOD) detection. Furthermore, we employ neural architecture search with multiple objectives and search strategies to explore how architectural complexity impacts DIR and OOD detection performance. Finally, we conduct a comparative experiment to evaluate human performance against that of DL-based models in video-based recognition of road user intentions.

Place, publisher, year, edition, pages
Skövde: University of Skövde , 2025. , p. xiii, 294
Series
Dissertation Series ; 66
National Category
Computer Sciences Computer graphics and computer vision Artificial Intelligence
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
URN: urn:nbn:se:his:diva-25867ISBN: 978-91-989080-5-3 (print)ISBN: 978-91-989080-6-0 (electronic)OAI: oai:DiVA.org:his-25867DiVA, id: diva2:2001935
Public defence
2025-11-12, University of Skövde, Building D, Room D107, Skövde, 13:15 (English)
Opponent
Supervisors
Note

Tre av nio delarbeten ("under submission"; övriga se rubriken Delarbeten/List of papers):

6. Koen Vellenga, H. Joe Steinhauer, Göran Falkman, Jonas Andersson, and Anders Sjögren (2025b). “Last Layer Hamiltonian Monte Carlo”

7. Koen Vellenga, H. Joe Steinhauer, Jonas Andersson, and Anders Sjögren (2025). “Latent Uncertainty Representations for Video-based Driver Action and Intention Recognition”

8. Koen Vellenga (2025). “Multi-Objective Architecture Search for Driver Action and Intention Recognition using Probabilistic Deep Neural Networks”

Available from: 2025-09-30 Created: 2025-09-29 Last updated: 2026-01-08Bibliographically approved
List of papers
1. Driver intention recognition: state-of-the-art review
Open this publication in new window or tab >>Driver intention recognition: state-of-the-art review
Show others...
2022 (English)In: IEEE Open Journal of Intelligent Transportation Systems, E-ISSN 2687-7813, Vol. 3, p. 602-616Article, review/survey (Refereed) Published
Abstract [en]

Every year worldwide more than one million people die and a further 50 million people are injured in traffic accidents. Therefore, the development of car safety features that actively support the driver in preventing accidents, is of utmost importance to reduce the number of injuries and fatalities. However, to establish this support it is necessary that the advanced driver assistance system (ADAS) understands the driver’s intended behavior in advance. The growing variety of sensors available for vehicles together with improved computer vision techniques, hence led to increased research directed towards inferring the driver’s intentions. This article reviews 64 driver intention recognition studies with regard to the maneuvers considered, the driving features included, the AI methods utilized, the achieved performance within the presented experiments, and the open challenges identified by the respected researchers. The article provides a high level analysis of the current technology readiness level of driver intention recognition technology to address the challenges to enable reliable driver intention recognition, such as the system architecture, implementation, and the purpose of the technology.

Place, publisher, year, edition, pages
IEEE, 2022
Keywords
Driver intentions, intention recognition, driver behavior, driving maneuvers
National Category
Computer Systems
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-21812 (URN)10.1109/ojits.2022.3197296 (DOI)000853832800001 ()2-s2.0-85147393634 (Scopus ID)
Projects
Intention recognition for real-time automotive 3D situation awareness
Funder
Vinnova, 2018-05012
Note

CC BY-NC-ND 4.0

CORRESPONDING AUTHOR: K. VELLENGA (e-mail: koen.vellenga@volvocars.com)

This work was supported by the Intention Recognition in Real Time for Automotive 3D Situation Awareness (IRRA) Project (https://www.vinnova.se/p/intention-recognition-i-realtid-for-automotive-3d-situation-awareness-irra/).

Available from: 2022-09-12 Created: 2022-09-12 Last updated: 2025-09-29Bibliographically approved
2. Surrogate Deep Learning to Estimate Uncertainties for Driver Intention Recognition
Open this publication in new window or tab >>Surrogate Deep Learning to Estimate Uncertainties for Driver Intention Recognition
Show others...
2023 (English)In: ICMLC 2023: Proceedings of 2023 15th International Conference on Machine Learning and Computing, Zhuhai, China, February 17-20, 2023, New York, NY, USA: Association for Computing Machinery (ACM), 2023, p. 252-258Conference paper, Published paper (Refereed)
Abstract [en]

Real-world applications of artificial intelligence that can potentially harm human beings should be able to express uncertainty about the made predictions. Probabilistic deep learning (DL) methods (e.g., variational inference [VI], VI last layer [VI-LL], Monte-Carlo [MC] dropout, stochastic weight averaging - Gaussian [SWA-G], and deep ensembles) can produce a predictive uncertainty but require expensive MC sampling techniques. Therefore, we evaluated if the probabilistic DL methods are uncertain when making incorrect predictions for an open-source driver intention recognition dataset and if a surrogate DL model can reproduce the uncertainty estimates. We found that all probabilistic DL methods are significantly more uncertain when making incorrect predictions at test time, but there are still instances where the models are very certain but completely incorrect. The surrogate DL models trained on the MC dropout and VI uncertainty estimates were capable of reproducing a significantly higher uncertainty estimate when making incorrect predictions.

Place, publisher, year, edition, pages
New York, NY, USA: Association for Computing Machinery (ACM), 2023
Keywords
Driver intention recognition, probabilistic deep learning, surrogate modeling, uncertainty quantification
National Category
Information Systems Other Computer and Information Science
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-22851 (URN)10.1145/3587716.3587758 (DOI)2-s2.0-85173817744 (Scopus ID)978-1-4503-9841-1 (ISBN)
Conference
15th International Conference on Machine Learning and Computing, Zhuhai, China, February 17-20, 2023
Projects
Intention recognition for real-time automotive 3D situation awareness
Funder
Vinnova, 2018-05012
Note

CC BY-NC-SA 4.0

CORRESPONDING AUTHOR: K. VELLENGA (e-mail: koen.vellenga@volvocars.com)

This work was supported by the Intention Recognition in Real Time for Automotive 3D Situation Awareness (IRRA) Project (https://www.vinnova.se/p/intention-recognition-i-realtid-for-automotive-3d-situation-awareness-irra/).

Available from: 2023-06-27 Created: 2023-06-27 Last updated: 2025-09-29Bibliographically approved
3. Evaluation of Video Masked Autoencoders' Performance and Uncertainty Estimations for Driver Action and Intention Recognition
Open this publication in new window or tab >>Evaluation of Video Masked Autoencoders' Performance and Uncertainty Estimations for Driver Action and Intention Recognition
2024 (English)In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), IEEE, 2024, p. 7429-7437Conference paper, Published paper (Refereed)
Abstract [en]

Traffic fatalities remain among the leading death causes worldwide. To reduce this figure, car safety is listed as one of the most important factors. To actively support human drivers, it is essential for advanced driving assistance systems to be able to recognize the driver's actions and intentions. Prior studies have demonstrated various approaches to recognize driving actions and intentions based on in-cabin and external video footage. Given the performance of self-supervised video pre-trained (SSVP) Video Masked Autoencoders (VMAEs) on multiple action recognition datasets, we evaluate the performance of SSVP VMAEs on the Honda Research Institute Driving Dataset for driver action recognition (DAR) and on the Brain4Cars dataset for driver intention recognition (DIR). Besides the performance, the application of an artificial intelligence system in a safety-critical environment must be capable to express when it is uncertain about the produced results. Therefore, we also analyze uncertainty estimations produced by a Bayes-by-Backprop last-layer (BBB-LL) and Monte-Carlo (MC) dropout variants of an VMAE. Our experiments show that an VMAE achieves a higher overall performance for both offline DAR and end-to-end DIR compared to the state-of-the-art. The analysis of the BBB-LL and MC dropout models show higher uncertainty estimates for incorrectly classified test instances compared to correctly predicted test instances.

Place, publisher, year, edition, pages
IEEE, 2024
Series
Proceedings IEEE Workshop on Applications of Computer Vision, ISSN 2472-6737, E-ISSN 2642-9381
National Category
Computer graphics and computer vision
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-23540 (URN)10.1109/WACV57701.2024.00726 (DOI)2-s2.0-85191986920 (Scopus ID)979-8-3503-1893-7 (ISBN)979-8-3503-1892-0 (ISBN)
Conference
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), January 4-8, 2024, Waikoloha, Hawaii, USA
Funder
Vinnova, 2018-05012
Available from: 2024-01-16 Created: 2024-01-16 Last updated: 2025-09-29Bibliographically approved
4. PT-HMC: Optimization-based Pre-Training with Hamiltonian Monte-Carlo Sampling for Driver Intention Recognition
Open this publication in new window or tab >>PT-HMC: Optimization-based Pre-Training with Hamiltonian Monte-Carlo Sampling for Driver Intention Recognition
Show others...
2024 (English)In: ACM Transactions on Probabilistic Machine Learning, E-ISSN 2836-8924, Vol. 1, no 1, article id 4Article in journal (Refereed) Published
Abstract [en]

Driver intention recognition (DIR) methods mostly rely on deep neural networks (DNNs). To use DNNs in asafety-critical real-world environment it is essential to quantify how confident the model is about the producedpredictions. Therefore, this study evaluates the performance and calibration of a temporal convolutionalnetwork (TCN) for multiple probabilistic deep learning (PDL) methods (Bayes-by-Backprop, Monte-Carlodropout, Deep ensembles, Stochastic Weight averaging - Gaussian, Multi SWA-G, cyclic Stochastic GradientHamiltonian Monte Carlo). Notably, we formalize an approach that combines optimization-based pre-trainingwith Hamiltonian Monte-Carlo (PT-HMC) sampling, aiming to leverage the strengths of both techniques. Ouranalysis, conducted on two pre-processed open-source DIR datasets, reveals that PT-HMC not only matchesbut occasionally surpasses the performance of existing PDL methods. One of the remaining challenges thatprohibits the integration of a PDL-based DIR system into an actual car is the computational requirements toperform inference. Therefore, future work could focus on optimizing PDL methods to be more computationallyefficient without sacrificing performance or the ability to estimate uncertainties.

Place, publisher, year, edition, pages
ACM Digital Library, 2024
Keywords
Driver Intention Recognition, Probabilistic Deep Learning, Bayesian Deep Learning, Uncertainty quantification, Hamiltonian Monte Carlo
National Category
Computer graphics and computer vision
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-24425 (URN)10.1145/3688573 (DOI)
Note

CC BY-SA 4.0

Koen Vellenga (corresponding author), University of Skövde, Skövde, Sweden and Volvo Car Corporation, Göteborg, Sweden

Available from: 2024-08-12 Created: 2024-08-12 Last updated: 2025-10-06Bibliographically approved
5. Designing deep neural networks for driver intention recognition
Open this publication in new window or tab >>Designing deep neural networks for driver intention recognition
Show others...
2025 (English)In: Engineering applications of artificial intelligence, ISSN 0952-1976, E-ISSN 1873-6769, Vol. 139, no Part B, article id 109574Article in journal (Refereed) Published
Abstract [en]

Driver intention recognition (DIR) studies increasingly rely on deep neural networks. Deep neural networks have achieved top performance for many different tasks. However, apart from image classifications and semantic segmentation for mobile phones, it is not a common practice for components of advanced driver assistance systems to explicitly analyze the complexity and performance of the network’s architecture. Therefore, this paper applies neural architecture search to investigate the effects of the deep neural network architecture on a real-world safety critical application with limited computational capabilities. We explore a pre-defined search space for three deep neural network layer types that are capable to handle sequential data (a long-short term memory, temporal convolution, and a time-series transformer layer), and the influence of different data fusion strategies on the driver intention recognition performance. A set of eight search strategies are evaluated for two driver intention recognition datasets. For the two datasets, we observed that there is no search strategy clearly sampling better deep neural network architectures. However, performing an architecture search improves the model performance compared to the original manually designed networks. Furthermore, we observe no relation between increased model complexity and better driver intention recognition performance. The result indicate that multiple architectures can yield similar performance, regardless of the deep neural network layer type or fusion strategy. However, the optimal complexity, layer type and fusion remain unknown upfront.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Driver intention recognition, Neural architecture search, Deep learning, Information fusion
National Category
Information Systems Computer Systems
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-24573 (URN)10.1016/j.engappai.2024.109574 (DOI)001356766800001 ()2-s2.0-85208661926 (Scopus ID)
Funder
University of SkövdeVinnova, 2018-05012
Note

CC BY 4.0

Corresponding author: Koen Vellenga

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Koen Vellenga reports financial support was provided by Volvo Car Corporation. Koen Vellenga reports article publishing charges was provided by University of Skövde. We got funding from Vinnova (Swedish innovation agency, reference number: 2018-05012).

Available from: 2024-09-25 Created: 2024-09-25 Last updated: 2025-09-29Bibliographically approved
6. AI vs. Humans: Comparing road user intention recognition performance
Open this publication in new window or tab >>AI vs. Humans: Comparing road user intention recognition performance
Show others...
2026 (English)In: Transportation Research Part F: Traffic Psychology and Behaviour, ISSN 1369-8478, E-ISSN 1873-5517, Vol. 118, article id 103491Article in journal (Refereed) Published
Abstract [en]

Anticipating the behavior of other road users is critical for safe driving. To anticipate the behavior of other road users in a timely manner, it is essential to recognize their intentions. Although artificial intelligence (AI)-based intention recognition systems for traffic scenarios have advanced significantly, their performance relative to human road user intention recognition (RUIR) remains largely unexplored. To address this gap, we conducted an experiment comparing the RUIR performance of human participants and a state-of-the-art end-to-end video recognition AI model on a set of 25 video scenarios. The selected scenarios offered a balanced representation of various road user types and a range of intention maneuvers. Among human participants (N=161), we found no statistically significant differences in RUIR performance with respect to age, self-perceived driving skill, annual driven kilometers, or years of driving experience. However, the average human participant exhibited slightly lower RUIR performance than the AI models.

Place, publisher, year, edition, pages
Elsevier, 2026
Keywords
Intention recognition, Road user, Artificial intelligence
National Category
Computer Sciences Computer graphics and computer vision Artificial Intelligence
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
urn:nbn:se:his:diva-26103 (URN)10.1016/j.trf.2025.103491 (DOI)001658037200001 ()2-s2.0-105026119576 (Scopus ID)
Note

CC BY 4.0

Corresponding author: koen.vellenga@his.se (K. Vellenga)

Available from: 2026-01-07 Created: 2026-01-07 Last updated: 2026-05-22Bibliographically approved

Open Access in DiVA

fulltext(30315 kB)2429 downloads
File information
File name FULLTEXT02.pdfFile size 30315 kBChecksum SHA-512
eb0ddaf9115f4815f00ee5c1acb6294ef9f4ad52c5fae800f0335feacdcc3c500031e52ea212823b0a99c962c06a2246c715971603dad18eaa108741254477e7
Type fulltextMimetype application/pdf

Authority records

Vellenga, Koen

Search in DiVA

By author/editor
Vellenga, Koen
By organisation
School of InformaticsInformatics Research Environment
Computer SciencesComputer graphics and computer visionArtificial Intelligence

Search outside of DiVA

GoogleGoogle Scholar
Total: 2431 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 4663 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf