Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Video-Audio Multimodal Fall Detection Method
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). Malmö University, Internet of Things and People (IOTAP).ORCID iD: 0000-0002-9464-7010
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). Malmö University, Internet of Things and People (IOTAP).ORCID iD: 0000-0003-0998-6585
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). Malmö University, Internet of Things and People (IOTAP).ORCID iD: 0000-0002-3797-4605
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). Malmö University, Internet of Things and People (IOTAP).
Show others and affiliations
2025 (English)In: PRICAI 2024: Trends in Artificial Intelligence: 21st Pacific Rim International Conference on Artificial Intelligence, PRICAI 2024, Kyoto, Japan, November 18–24, 2024, Proceedings, Part IV / [ed] Rafik Hadfi; Patricia Anthony; Alok Sharma; Takayuki Ito; Quan Bai, Springer, 2025, p. 62-75Conference paper, Published paper (Refereed)
Abstract [en]

Falls frequently present substantial safety hazards to those who are alone, particularly the elderly. Deploying a rapid and proficient method for detecting falls is a highly effective approach to tackle this concealed peril. The majority of existing fall detection methods rely on either visual data or wearable devices, both of which have drawbacks. This research presents a multimodal approach that integrates video and audio modalities to address the issue of fall detection systems and enhances the accuracy of fall detection in challenging environmental conditions. This multimodal approach, which leverages the benefits of attention mechanism in both video and audio streams, utilizes features from both modalities through feature-level fusion to detect falls in unfavorable conditions where visual systems alone are unable to do so. We assessed the performance of our multimodal fall detection model using Le2i and UP-Fall datasets. Additionally, we compared our findings with other fall detection methods. The outstanding results of our multimodal model indicate its superior performance compared to single fall detection models.

Place, publisher, year, edition, pages
Springer, 2025. p. 62-75
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 15284
Keywords [en]
Audio classification, Fall detection, Multimodal, Video classification, Video analysis, Detection methods, Detection models, Effective approaches, Multi-modal, Multi-modal approach, Performance, Safety hazards
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:mau:diva-72628DOI: 10.1007/978-981-96-0125-7_6ISI: 001540369300006Scopus ID: 2-s2.0-85210317498ISBN: 978-981-96-0124-0 (print)ISBN: 978-981-96-0125-7 (electronic)OAI: oai:DiVA.org:mau-72628DiVA, id: diva2:1919887
Conference
21st Pacific Rim International Conference on Artificial Intelligence, PRICAI 2024, Kyoto, Japan, November 18–24, 2024
Available from: 2024-12-10 Created: 2024-12-10 Last updated: 2025-09-18Bibliographically approved
In thesis
1. Context-aware learning for adaptive vision-based systems
Open this publication in new window or tab >>Context-aware learning for adaptive vision-based systems
2025 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis shows our investigation on scene understanding and object detection for surveillance applications, emphasizing context-aware computer vision models that enhance detection accuracy in complex environments while respecting privacy considerations. The research advances object detection by addressing key aspects such as variability across environments, contextual information, and multimodal data fusion. Through a comprehensive literature review, we examines the role of contextual information, such as spatial, scale, and temporal context, in improving detection performance. Furthermore, we introduce specialized object detection models designed for indoor and outdoor environments, demonstrating howscene-specific training enhances detection accuracy. We also explore hierarchical scene classification, analyzing how different levels contribute to scene recognition. Lastly, a multimodal fall detection method integrating video and audio is proposed, overcoming limitations of purely visual systems in obstructed or low-visibility conditions. The findings of all papers highlight the effectiveness of scene context, hierarchical classification, and multimodal fusion in developing robust, high-accuracy surveillance models suitable for real-world environments. 

Place, publisher, year, edition, pages
Malmö: Malmö University Press, 2025. p. 35
Series
Studies in Computer Science ; 34
Keywords
Object detection, Scene classification, Vision based systems, Multimodal learning, Context-aware learning
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:mau:diva-75404 (URN)10.24834/isbn.9789178776238 (DOI)978-91-7877-622-1 (ISBN)978-91-7877-623-8 (ISBN)
Presentation
2025-04-24, B1, Niagara, Malmö University, Malmö, 10:00 (English)
Opponent
Supervisors
Available from: 2025-04-16 Created: 2025-04-14 Last updated: 2025-12-23Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Jamali, MahtabDavidsson, PaulKhoshkangini, RezaMihailescu, Radu-Casian

Search in DiVA

By author/editor
Jamali, MahtabDavidsson, PaulKhoshkangini, RezaMihailescu, Radu-Casian
By organisation
Department of Computer Science and Media Technology (DVMT)Internet of Things and People (IOTAP)
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 540 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf