Malmö University Publications
Change search
Link to record
Permanent link

Direct link
Sarkheyli-Hägele, ArezooORCID iD iconorcid.org/0000-0001-6925-0444
Publications (10 of 22) Show all publications
Chen, W.-L., Zhou, K.-Q., Sarkheyli-Hägele, A., Qin, F., Hasikin, K., Kang, D.-W. & Zain, A. M. (2026). Cross-lingual transfer learning for knowledge graph acquisition: Paradigms, resources and challenges. Expert systems with applications, 303, Article ID 130434.
Open this publication in new window or tab >>Cross-lingual transfer learning for knowledge graph acquisition: Paradigms, resources and challenges
Show others...
2026 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 303, article id 130434Article in journal (Refereed) Published
Abstract [en]

Knowledge graphs play a pivotal role in structuring human knowledge within artificial intelligence systems. Nonetheless, knowledge distribution is markedly uneven across languages, and linguistic community activity can hinder the performance and scale. Cross-lingual transfer learning emerges as a predominant effective strategy to surmount linguistic barriers, facilitating knowledge transfer across natural languages. This paper reviews cross-lingual knowledge acquisition for knowledge graphs, offering the first systematic integration of cross-lingual transfer paradigms and resources in this field. It critically examines the state of research across subtasks (including named entity recognition, relation extraction, coreference resolution and entity linking). Despite the advancements facilitated by multilingual word embeddings, pre-trained language models and large language models, persistent challenges such as language bias-induced alignment difficulties and low transfer efficiency continue to impede progress. Enhancing model effectiveness through both paradigms and resources will benefit the future construction of multilingual or minor-language knowledge graphs.

Place, publisher, year, edition, pages
Elsevier, 2026
Keywords
Cross-lingual transfer learning, Information extraction, Knowledge acquisition, Knowledge graph, Low-resource languages
National Category
Natural Language Processing
Identifiers
urn:nbn:se:mau:diva-81481 (URN)10.1016/j.eswa.2025.130434 (DOI)001636556000001 ()2-s2.0-105024889704 (Scopus ID)
Available from: 2026-01-07 Created: 2026-01-07 Last updated: 2026-01-08Bibliographically approved
Sanogo, M., Gao, F., Littlefield, N., Siddiqui, I. A., Carlson, L. A., Rezaei, A., . . . Tafti, A. P. (2026). KneeXNet-2.5D: a clinically-oriented and explainable deep learning framework for MRI-based knee cartilage and meniscus segmentation. npj Health Systems, 3(1), 1-16, Article ID 18.
Open this publication in new window or tab >>KneeXNet-2.5D: a clinically-oriented and explainable deep learning framework for MRI-based knee cartilage and meniscus segmentation
Show others...
2026 (English)In: npj Health Systems, E-ISSN 3005-1959, Vol. 3, no 1, p. 1-16, article id 18Article in journal (Refereed) Published
Abstract [en]

Accurate segmentation of knee cartilage and meniscus in magnetic resonance imaging (MRI) is essential for the early detection and monitoring of complications such as cartilage erosion and osteoarthritis. Yet, manual annotation remains time-consuming, subjective, and inefficient for routine clinical use. In this study, we introduced KneeXNet-2.5D, a clinically oriented and explainable deep learning framework for accurate and efficient knee cartilage and meniscus segmentation in sagittal MRIs. Unlike traditional 3D segmentation methods, the proposed model employs a 2.5D architecture to capture inter-slice spatial context, achieving high segmentation accuracy while maintaining computational efficiency and optimal resource utilization. We further incorporated targeted image augmentation, including synthetic noise injection, to enhance the AI model robustness against medical imaging variability. The efficient design of the 2.5D model allows for reduced resource consumption, making it suitable for deployment in healthcare settings with limited computational infrastructure, particularly in low-resource hospitals and rural care environments. To enable open scientific research and ensure reproducibility, we constructed a gold-standard, manually segmented knee MRI dataset and publicly released it alongside the annotation guideline, source code, trained AI models, and a lightweight software application. An entropy-based AI explainability strategy was developed to highlight high-uncertainty regions that are most influential to model predictions, advancing transparency and interpretability. Clinical relevance and anatomical validity were further assessed through expert review by board-certified orthopedic surgeons. Together, these contributions demonstrate the AI model’s anatomical fidelity, interpretability, and readiness for integration into musculoskeletal imaging workflows.

Place, publisher, year, edition, pages
Springer Nature, 2026
National Category
Radiology and Medical Imaging
Identifiers
urn:nbn:se:mau:diva-83036 (URN)10.1038/s44401-026-00072-5 (DOI)2-s2.0-105030823523 (Scopus ID)
Available from: 2026-03-09 Created: 2026-03-09 Last updated: 2026-03-10Bibliographically approved
Sarkheyli-Hägele, A., Sarkheyli, A., Sarkheyli, E., Shahbazi, Z. & Johnsson, M. (2025). An Interactive and Adaptive AI-based Decision Support System for Dynamic Risk Management in Megaprojects. In: Feras M. Awaysheh; Sadi Alawadi (Ed.), 2025 3rd International Conference on Federated Learning Technologies and Applications (FLTA): . Paper presented at 3rd International Conference on Federated Learning Technologies and Applications (FLTA), Dubrovnik, Croatia, 14-17 October, 2025 (pp. 479-483). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>An Interactive and Adaptive AI-based Decision Support System for Dynamic Risk Management in Megaprojects
Show others...
2025 (English)In: 2025 3rd International Conference on Federated Learning Technologies and Applications (FLTA) / [ed] Feras M. Awaysheh; Sadi Alawadi, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 479-483Conference paper, Published paper (Refereed)
Abstract [en]

Megaprojects, as vital instruments of societal and infrastructural transformation, are often challenged by cost overruns, delays, and evolving risks—issues that can compromise both their immediate success and long-term sustainability. Traditional risk management approaches, grounded in static and reactive frameworks, fall short in addressing the complexity and dynamism of such large-scale initiatives. This position paperproposes an Interactive and Adaptive AI-based Decision SupportSystem (DSS) that integrates Interactive Machine Learning(IML) and Adaptive Deep Learning with domain expert input. Central to this approach is the concept of hybrid intelligence, where AI-driven analytics and human judgment are combined synergistically to enhance decision-making. The system embedsexpert-in-the-loop mechanisms to enable continuous, contextaware learning from both historical data and real-time feedback. This produces interpretable, real-time insights for proactive risk identification, assessment, and forecasting. The hybrid intelligence framework promotes greater resilience, adaptability, andtransparency—contributing to more sustainable, informed, andethically responsible governance of complex megaprojects.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Megaproject Risk Management, hybrid intelligence, Adaptive AI, Interactive Machine Learning, Decision Support System, Sustainable Infrastructure
National Category
Information Systems Artificial Intelligence
Research subject
Smart Cities and Communities, REBEL
Identifiers
urn:nbn:se:mau:diva-83063 (URN)10.1109/FLTA67013.2025.11336661 (DOI)2-s2.0-105033529293 (Scopus ID)979-8-3315-5670-9 (ISBN)
Conference
3rd International Conference on Federated Learning Technologies and Applications (FLTA), Dubrovnik, Croatia, 14-17 October, 2025
Available from: 2026-03-10 Created: 2026-03-10 Last updated: 2026-04-20Bibliographically approved
Biscalchin, A., Sarkheyli-Hägele, A., Eriksson, J. & Vogel, B. (2025). Designing Education for the AI Era: Principles for Integrating LLMs in Pedagogy. In: 2025 33rd International Conference on Software, Telecommunications and Computer Networks, SoftCOM 2025: . Paper presented at 33rd International Conference on Software, Telecommunications and Computer Networks, SoftCOM 2025, 18-20 Sep 2025, Split, Croatia. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Designing Education for the AI Era: Principles for Integrating LLMs in Pedagogy
2025 (English)In: 2025 33rd International Conference on Software, Telecommunications and Computer Networks, SoftCOM 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Large language models (LLMs) are reshaping education by altering how knowledge is accessed, interpreted, and constructed. This paper reviews 98 academic sources-including peer-reviewed articles, conference papers, and expert reports-to examine the pedagogical and institutional implications of LLM integration. We argue for a cautious but proactive redesign of education grounded in critical AI literacy, ethical safeguards, and task authenticity. LLMs are framed as intelligent interfaces that require new approaches to assessment, instruction, and learning design. The paper proposes actionable principles for AI-integrated education and calls for longitudinal research on the cognitive and developmental impact of scaffolded LLM use in adolescence.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Epistemic Agency, Generative AI in Education, Intelligent Interfaces, Large Language Models, Learning Task Design, Secondary Education Reform
National Category
Educational Sciences
Identifiers
urn:nbn:se:mau:diva-80841 (URN)10.23919/SoftCOM66362.2025.11197458 (DOI)2-s2.0-105021964982 (Scopus ID)9789532901436 (ISBN)
Conference
33rd International Conference on Software, Telecommunications and Computer Networks, SoftCOM 2025, 18-20 Sep 2025, Split, Croatia
Available from: 2025-11-25 Created: 2025-11-25 Last updated: 2025-11-26Bibliographically approved
Ostermann, M., Akin, E. & Sarkheyli-Hägele, A. (2025). Estimation Time of Escape (ETE) by Monitoring the Building Evacuation Flow. In: Stefan Nastic; Florian Michahelles; Sashko Ristov; Patrizio Dazzi; Florian Wolling (Ed.), IOT '25: Proceedings of the 15th International Conference on the Internet of Things: . Paper presented at IOT 2025: The 15th International Conference on the Internet of Things, Vienna Austria, November 18 - 21, 2025 (pp. 131-139). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Estimation Time of Escape (ETE) by Monitoring the Building Evacuation Flow
2025 (Undetermined)In: IOT '25: Proceedings of the 15th International Conference on the Internet of Things / [ed] Stefan Nastic; Florian Michahelles; Sashko Ristov; Patrizio Dazzi; Florian Wolling, Association for Computing Machinery (ACM) , 2025, p. 131-139Conference paper, Published paper (Refereed)
Abstract [en]

In emergency situations, accurately estimating an individual’s time of escape can greatly improve safety and support more informed decision-making. This paper presents a real-time framework for the Estimation Time of Escape (ETE) in indoor environments using computer vision and Internet of Things (IoT) technologies. Unlike prior approaches, this work emphasizes real-time applicability. By also including unobservable zones between observed areas, the complexity of the framework increases while also making it more robust to different scenarios and building structures. The ETE framework utilizes IP cameras in conjunction with a YOLOv11 object detection model and the ByteTrack tracking algorithm to detect and track individuals in multiple rooms. A homography-based perspective transformation is applied to map the tracked positions from the video to real-world positions on the floor. This enables accurate estimation of speed and direction. The framework operates in real-time and dynamically adapts to changing conditions, such as variations in walking speed or direction. It can also predict potential congestion near exits by analyzing comparative escape times. The framework is evaluated using multi-person scenarios, achieving a mean absolute error of up to 0.83s for two-person scenarios and an average of under 1.5s overall. The results show the framework’s robustness and real-world applicability for real-time evacuation monitoring. This framework offers a scalable and adaptable solution with potential applications in public safety, building evacuation systems, and smart city infrastructure.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2025
National Category
Control Engineering
Identifiers
urn:nbn:se:mau:diva-81265 (URN)10.1145/3770501.3770517 (DOI)2-s2.0-105025589333 (Scopus ID)9798400715952 (ISBN)
Conference
IOT 2025: The 15th International Conference on the Internet of Things, Vienna Austria, November 18 - 21, 2025
Available from: 2025-12-18 Created: 2025-12-18 Last updated: 2026-01-07Bibliographically approved
Biscalchin, A., Sarkheyli, E. & Sarkheyli-Hägele, A. (2025). Multi-Agent Foundation Models for Urban Mobility: The Malmö Elderly Case. In: 2025 3rd International Conference on Foundation and Large Language Models, FLLM 2025: . Paper presented at 2025 3rd International Conference on Foundation and Large Language Models, FLLM 2025, 25-28 Nov 2025, Vienna, Austria (pp. 870-875). Institute of Electrical and Electronics Engineers Inc.
Open this publication in new window or tab >>Multi-Agent Foundation Models for Urban Mobility: The Malmö Elderly Case
2025 (English)In: 2025 3rd International Conference on Foundation and Large Language Models, FLLM 2025, Institute of Electrical and Electronics Engineers Inc. , 2025, p. 870-875Conference paper, Published paper (Refereed)
Abstract [en]

Understanding travel behaviour is crucial for developing inclusive, adaptable transportation systems in ageing cities. Yet many public authorities lack the expertise and budget for advanced analytics. We present a modular, multi-agent framework that employs Large Language Models (LLMs) to lower the entry barrier to data-driven mobility analysis. The pipeline extracts interpretable, policy-oriented insights from unstructured survey data and outputs structured reports without manual coding. We demonstrate the approach using Malmö's 2023 travel survey for residents aged 65+, and benchmark its recommendations against those of (i) a single-agent LLM service (Single Agent), representative of current commercial offerings, and (ii) a human expert baseline (Human), both evaluated in a blinded expert review. The system achieves superior reasoning quality compared to the single-agent baseline, while performing slightly lower on interpretability and focus, and approaches expert-level quality overall - at a fraction of the cost and effort. Key constraints are the current lack of real-time data ingestion and dependence on proprietary commercial APIs. The study provides an open-source proof of concept showing how multi-agent LLMs can make urban mobility analytics more accessible, transparent, and timely. All code and evaluation materials are publicly available.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2025
Keywords
Aging Population, Decision Support, Large Language Models, Multi-Agent Systems, Travel Behavior, Urban Mobility
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-83962 (URN)10.1109/FLLM67465.2025.11391006 (DOI)2-s2.0-105035828644 (Scopus ID)9798331594091 (ISBN)
Conference
2025 3rd International Conference on Foundation and Large Language Models, FLLM 2025, 25-28 Nov 2025, Vienna, Austria
Available from: 2026-05-04 Created: 2026-05-04 Last updated: 2026-05-06Bibliographically approved
Shokrollahi, A., Karlsson, F., Malekian, R., Persson, J. A. & Sarkheyli-Hägele, A. (2025). Non-invasive occupancy estimation and space utilization in smart buildings: Leveraging machine learning with PIR sensors and booking data. Internet of Things: Engineering Cyber Physical Human Systems, 34, Article ID 101777.
Open this publication in new window or tab >>Non-invasive occupancy estimation and space utilization in smart buildings: Leveraging machine learning with PIR sensors and booking data
Show others...
2025 (English)In: Internet of Things: Engineering Cyber Physical Human Systems, E-ISSN 2542-6605, Vol. 34, article id 101777Article in journal (Refereed) Published
Abstract [en]

Occupancy estimation in smart buildings is essential for optimizing resource usage and enhancing operational efficiency. Existing estimation methods predominantly rely on cameras or advanced sensor fusion techniques, which, while accurate, are often expensive, invasive, and raise privacy concerns. Additionally, these approaches frequently require extra hardware, increasing installation complexity and operational costs. A significant gap in the literature lies in the limited use of existing smart building infrastructure, such as detection systems and booking data, for people counting. This study addresses these limitations by exclusively utilizing two binary PIR sensors (in-door and in-room) and booking data. Since PIR sensors and booking systems are already integrated into most smart building infrastructures, leveraging these existing resources helps reduce costs and simplifies implementation. The primary goal is to estimate the number of people between each in-door sensor trigger using machine learning models by incorporating people counting levels and time thresholds. Among the evaluated machine learning algorithms, the Extra Trees Classifier delivered strong performance, achieving 68.5% accuracy when the estimated occupancy differed from the actual count by at most one person, and 81.56% with a tolerance of two. These results are based on periods when the room was occupied. When both occupied and unoccupied periods were included, the accuracy was 96.10% for ±1 tolerance. Moreover, incorporating booking data enhanced people counting accuracy by 4%. The study also explores the method's ability to identify underutilization and overutilization by comparing estimated occupancy with booking records and seating capacity, thereby supporting enhanced space management in smart buildings.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Booking information, Machine learning, Occupancy estimation, People counting, PIR sensors, Smart buildings, Space utilization
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:mau:diva-80013 (URN)10.1016/j.iot.2025.101777 (DOI)001587516700001 ()2-s2.0-105017464561 (Scopus ID)
Available from: 2025-10-14 Created: 2025-10-14 Last updated: 2025-10-15Bibliographically approved
Shokrollahi, A., Karlsson, F., Malekian, R., Persson, J. A. & Sarkheyli-Hägele, A. (2025). Non-Invasive People Counting in Smart Buildings: Employing Machine Learning with Binary PIR Sensors. In: Ana Paula Rocha; Luc Steels; H. Jaap van den Herik (Ed.), Proceedings of the 14th International Conference on Agents and Artificial Intelligence: Volume 3: ICAART. Paper presented at 17th International Conference on Agents and Artificial Intelligence, Porto, Portugal, February 23-25, 2025 (pp. 394-405). INSTICC
Open this publication in new window or tab >>Non-Invasive People Counting in Smart Buildings: Employing Machine Learning with Binary PIR Sensors
Show others...
2025 (English)In: Proceedings of the 14th International Conference on Agents and Artificial Intelligence: Volume 3: ICAART / [ed] Ana Paula Rocha; Luc Steels; H. Jaap van den Herik, INSTICC , 2025, p. 394-405Conference paper, Published paper (Refereed)
Abstract [en]

People counting in smart buildings is crucial for the efficient management of building systems such as energy, space allocation, efficiency, and occupant comfort. This study investigates the use of two non-invasive binary Passive Infrared (PIR) sensors for estimating the number of people in seven office rooms with different people counting intervals. Previous studies often relied on sensor fusion or more complex signal-based PIR sensors, which increased hardware costs, raised privacy concerns, and added installation complexity. Our approach addresses these limitations by utilizing fewer sensors, reducing hardware costs, and simplifying installation, making it scalable and flexible for different room configurations, while also ensuring high consideration of privacy. Additionally, binary PIR sensors are typically part of smart building systems, eliminating the need for additional sensors. We employed several machine learning methods to analyze motion detected by binary PIR sensors, imp roving the accuracy of people counting estimates. We analyzed important features by extracting event count, duration, and density from sensor data, along with features from the room’s shape, to estimate the number of people. We used different machine learning models for estimating the number of people. Models like Gradient Boosting, XGBoost, MLP, and LGBM demonstrated superior performance for their strong ability to handle complex, non-linear relationships in sensor data, high-dimensional datasets, and imbalanced data, which are common challenges in people counting tasks using PIR sensors. These models were evaluated using performance metrics such as accuracy and F1-score. Additionally, the results show that features such as passage events and the number of detected events, combined with machine learning algorithms, can achieve good accuracy and reliability in people counting.

Place, publisher, year, edition, pages
INSTICC, 2025
Series
ICAART, ISSN 2184-3589, E-ISSN 2184-433X
Keywords
Smart Buildings, Occupancy Information, People Counting, Binary PIR Sensors, Machine Learning, Non-Invasive Sensors
National Category
Signal Processing
Identifiers
urn:nbn:se:mau:diva-75263 (URN)10.5220/0013141800003890 (DOI)2-s2.0-105001977209 (Scopus ID)978-989-758-737-5 (ISBN)
Conference
17th International Conference on Agents and Artificial Intelligence, Porto, Portugal, February 23-25, 2025
Available from: 2025-04-08 Created: 2025-04-08 Last updated: 2025-10-15Bibliographically approved
Xiang, Y., Zhou, K., Sarkheyli-Hägele, A., Yusoff, Y., Kang, D. & Zain, A. M. (2025). Parallel fault diagnosis using hierarchical fuzzy Petri net by reversible and dynamic decomposition mechanism. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 26(1), 93-108
Open this publication in new window or tab >>Parallel fault diagnosis using hierarchical fuzzy Petri net by reversible and dynamic decomposition mechanism
Show others...
2025 (English)In: FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, ISSN 2095-9184, Vol. 26, no 1, p. 93-108Article in journal (Refereed) Published
Abstract [en]

The state space explosion, a challenge analogous to that encountered in a Petri net (PN), has constrained the extensive study of fuzzy Petri nets (FPNs). Current reasoning algorithms employing FPNs, which operate through forward, backward, and bidirectional mechanisms, are examined. These algorithms streamline the inference process by eliminating irrelevant components of the FPN. However, as the scale of the FPN grows, the complexity of these algorithms escalates sharply, posing a significant challenge for practical applications. To address the state explosion issue, this work introduces a parallel bidirectional reasoning algorithm for an FPN that utilizes reverse and decomposition strategies to optimize the implementation process. The algorithm involves hierarchically dividing a large-scale FPN into two sub-FPNs, followed by a converse operation to generate the reversal sub-FPN for the right-sub-FPN. The detailed mapping between the original and reversed FPNs is thoroughly discussed. Parallel reasoning operations are then conducted on the left-sub-FPN and the resulting reversal right-sub-FPN, with the final result derived by computing the Euclidean distance between the outcomes from the output places of the two sub-FPNs. A case study is presented to illustrate the implementation process, demonstrating the algorithm's significant enhancement of inference efficiency and substantial reduction in execution time.

Abstract [zh]

基于可逆和动态分解机制的层次化FPN并行故障诊断

与Petri网类似,模糊Petri网(fuzzy Petri net, FPN)的研究同样受到状态空间爆炸问题的限制。目前,基于FPN的推理算法主要依赖于正向、反向和双向等机制。这些算法通过消除FPN中不相关的部分来简化推理过程。然而,随着规模的扩大,基于FPN的相关应用算法的复杂度迅速增加,这给基于FPN的推理算法的实际应用带来重大挑战。为解决状态爆炸问题,本文提出一种基于可逆和动态分解机制的FPN双向推理算法,以优化推理过程。该算法将层次化后的FPN分解为左右两个子网;然后,深入分析FPN原网与其逆网元素之间的对应关系,提出FPN逆网生成算法,用于生成右子网的逆网;最后,在左子网与右子网的逆网上同时执行推理算法,通过计算两子网输出位置之间的欧式距离得到最终结果。案例表明,本文提出的推理算法显著提高了推理效率,大幅缩短了执行时间。

Place, publisher, year, edition, pages
Springer, 2025
Keywords
Fuzzy Petri net (FPN), State explosion, Decomposition, Parallel, Bidirectional reasoning, TP301
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-72901 (URN)10.1631/FITEE.2400184 (DOI)001383481700001 ()2-s2.0-86000429977 (Scopus ID)
Available from: 2025-01-07 Created: 2025-01-07 Last updated: 2025-04-10Bibliographically approved
Sandelius, C., Pappas, A., Sarkheyli-Hägele, A., Heuer, A. & Johnsson, M. (2024). Leveraging Deep Learning for Approaching Automated Pre-Clinical Rodent Models. In: Francesco Marcelloni; Kurosh Madani; Niki van Stein; Joaquim Filipe (Ed.), Proceedings of the 16th International Joint Conference on Computational Intelligence - Volume 1: NCTA: . Paper presented at 16th International Joint Conference on Computational Intelligence, Porto, Portugal, November 20-22, 2024 (pp. 613-620). SciTePress
Open this publication in new window or tab >>Leveraging Deep Learning for Approaching Automated Pre-Clinical Rodent Models
Show others...
2024 (English)In: Proceedings of the 16th International Joint Conference on Computational Intelligence - Volume 1: NCTA / [ed] Francesco Marcelloni; Kurosh Madani; Niki van Stein; Joaquim Filipe, SciTePress, 2024, p. 613-620Conference paper, Published paper (Refereed)
Abstract [en]

We evaluate deep learning architectures for rat pose estimation using a six-camera system, focusing on ResNet and EfficientNet across various depths and augmentation techniques. Among the configurations tested, ResNet 152 with default augmentation provided the best performance when employing a multi-perspective network approach in the controlled experimental setup. It reached a Root Mean Squared Error (RMSE) of 8.74, 8.78, and 9.72 pixels for the different angles. The utilization of data augmentation revealed that less altering yields better performance. We propose potential areas for future research, including further refinement of model configurations, more in-depth investigation of inference speeds, and the possibility of transferring network weights to study other species, such as mice. The findings underscore the potential for deep learning solutions to advance preclinical research in behavioral neuroscience. We suggest building on this research to introduce behavioral recogniti on based on a 3D movement reconstruction, particularly emphasizing the motoric aspects of neurodegenerative diseases. This will allow for the correlation of observable behaviors with neuronal activity, contributing to a better understanding of the brain and aiding in developing new therapeutic strategies.

Place, publisher, year, edition, pages
SciTePress, 2024
Series
IJCCI, ISSN 2184-3236
Keywords
Deep Learning, Machine Learning, Computer Vision, Behavioral Neuroscience, Pre-Clinical Rodent Models
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-72778 (URN)10.5220/0013065600003837 (DOI)2-s2.0-85211432771 (Scopus ID)978-989-758-721-4 (ISBN)
Conference
16th International Joint Conference on Computational Intelligence, Porto, Portugal, November 20-22, 2024
Available from: 2024-12-16 Created: 2024-12-16 Last updated: 2024-12-16Bibliographically approved
Projects
Dynamic Intelligent Sensor Intensive Systems; Malmö University; Publications
Persson, J. A., Bugeja, J., Davidsson, P., Holmberg, J., Kebande, V. R., Mihailescu, R.-C., . . . Tegen, A. (2023). The Concept of Interactive Dynamic Intelligent Virtual Sensors (IDIVS): Bridging the Gap between Sensors, Services, and Users through Machine Learning. Applied Sciences, 13(11), Article ID 6516.
Internet of Things Master's Program; Malmö UniversitySituation Awareness-based Attention Guidance; Malmö University, Internet of Things and People (IOTAP) (Closed down 2024-12-31)Evacuation Assistance System; Malmö University
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-6925-0444

Search in DiVA

Show all publications