Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
AI on the Edge: Architectural Alternatives
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0003-3972-2265
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0002-7700-1816
Chalmers University of Technology. (Software Center)
2020 (English)In: Proceedings 46th Euromicro Conferenceon Software Engineering and Advanced Applications SEAA 2020 / [ed] Antonio Martini, Manuel Wimmer, Amund Skavhaug, IEEE, 2020, p. 21-28Conference paper, Published paper (Refereed)
Abstract [en]

Since the advent of mobile computing and IoT, a large amount of data is distributed around the world. Companies are increasingly experimenting with innovative ways of implementing edge/cloud (re)training of AI systems to exploit large quantities of data to optimize their business value. Despite the obvious benefits, companies face challenges as the decision on how to implement edge/cloud (re)training depends on factors such as the task intent, the amount of data needed for (re)training, edge-to-cloud data transfer, the available computing and memory resources. Based on action research in a software-intensive embedded systems company where we study multiple use cases as well as insights from our previous collaborations with industry, we develop a generic framework consisting of five architectural alternatives to deploy AI on the edge utilizing transfer learning. We validate the framework in four additional case companies and present the challenges they face in selecting the optimal architecture. The contribution of the paper is threefold. First, we develop a generic framework consisting of five architectural alternatives ranging from a centralized architecture where cloud (re)training is given priority to a decentralized architecture where edge (re)training is instead given priority. Second, we validate the framework in a qualitative interview study with four additional case companies. As an outcome of validation study, we present two variants to the architectural alternatives identified as part of the framework. Finally, we identify the key challenges that experts face in selecting an ideal architectural alternative.

Place, publisher, year, edition, pages
IEEE, 2020. p. 21-28
Series
Proceedings (EUROMICRO Conference on Software Engineering and Advanced Applications), ISSN 2640-592X, E-ISSN 2376-9521
Keywords [en]
Artificial Intelligence, Machine Learning, Deep Learning, Edge, Cloud, Transfer Learning, Action Research, Architectural alternatives
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:mau:diva-17930DOI: 10.1109/SEAA51224.2020.00015ISI: 000702094100004Scopus ID: 2-s2.0-85096567097ISBN: 978-1-7281-9532-2 (print)OAI: oai:DiVA.org:mau-17930DiVA, id: diva2:1458239
Conference
46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, 26-28 August 2020, Portorož, Slovenia
Available from: 2020-08-14 Created: 2020-08-14 Last updated: 2024-12-17Bibliographically approved
In thesis
1. Design Methods and Processes for ML/DL models
Open this publication in new window or tab >>Design Methods and Processes for ML/DL models
2021 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Context: With the advent of Machine Learning (ML) and especially Deep Learning (DL) technology, companies are increasingly using Artificial Intelligence (AI) in systems, along with electronics and software. Nevertheless, the end-to-end process of developing, deploying and evolving ML and DL models in companies brings some challenges related to the design and scaling of these models. For example, access to and availability of data is often challenging, and activities such as collecting, cleaning, preprocessing, and storing data, as well as training, deploying and monitoring the model(s) are complex. Regardless of the level of expertise and/or access to data scientists, companies in all embedded systems domain struggle to build high-performing models due to a lack of established and systematic design methods and processes.

Objective: The overall objective is to establish systematic and structured design methods and processes for the end-to-end process of developing, deploying and successfully evolving ML/DL models.

Method: To achieve the objective, we conducted our research in close collaboration with companies in the embedded systems domain using different empirical research methods such as case study, action research and literature review.

Results and Conclusions: This research provides six main results: First, it identifies the activities that companies undertake in parallel to develop, deploy and evolve ML/DL models, and the challenges associated with them. Second, it presents a conceptual framework for the continuous delivery of ML/DL models to accelerate AI-driven business in companies. Third, it presents a framework based on current literature to accelerate the end-to-end deployment process and advance knowledge on how to integrate, deploy and operationalize ML/DL models. Fourth, it develops a generic framework with five architectural alternatives for deploying ML/DL models at the edge. These architectural alternatives range from a centralized architecture that prioritizes (re)training in the cloud to a decentralized architecture that prioritizes (re)training at the edge. Fifth, it identifies key factors to help companies decide which architecture to choose for deploying ML/DL models. Finally, it explores how MLOps, as a practice that brings together data scientist teams and operations, ensures the continuous delivery and evolution of models. 

Place, publisher, year, edition, pages
Malmö: Malmö universitet, 2021. p. 204
Series
Studies in Computer Science ; 17
Keywords
Machine Learning, Deep Learning, Development, Deployment, Evolution
National Category
Computer Systems
Identifiers
urn:nbn:se:mau:diva-45026 (URN)10.24834/isbn.9789178771998 (DOI)978-91-7877-198-1 (ISBN)978-91-7877-199-8 (ISBN)
Presentation
(English)
Opponent
Supervisors
Note

Due to copyright reasons, the articles are not included in the fulltext online

Available from: 2021-08-20 Created: 2021-08-20 Last updated: 2024-03-26Bibliographically approved
2. Towards continuous development of MLOps practices
Open this publication in new window or tab >>Towards continuous development of MLOps practices
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: Digitalisation is transforming software-intensive embedded systems companies by focussing on business models that utilise software, data and AI (especially Machine learning and Deep Learning (DL)). However, despite these advancements, the majority of companies still struggle to transition their models from prototypes to fully functional operational systems. This highlights the need to optimise the end-to-end process of developing, deploying and evolving ML/DL models to ensure continuous value delivery.

Objective: This thesis is structured around three primary objectives. The first objective is to identify the need of MLOps (Machine Learning Operations). Building on this understanding, the second objective is to develop frameworks for the adoption of MLOps, aiming to standardise and streamline the processes of developing, deploying and evolving ML/DL models. Finally, the third objective is to adopt MLOps practices and assess the maturity of their adoption.

Method: To achieve these objectives, we conducted research in close collaboration with various companies and used a combination of different empirical research methods, such as case studies, action research, and literature reviews.

Results and Conclusions: First, the thesis identifies the activities carried out by practitioners in companies and the challenges they face when developing, deploying and evolving models. Second, it presents a conceptual framework with three parallel and concurrent activities that companies utilise in the process of developing, deploying and evolving models. Third, it introduces a framework based on current literature to accelerate and advance knowledge on the end-to-end deployment process. Fourth, it develops a generic framework with five architectural alternatives ranging from a centralised architecture to a decentralised architecture for deploying ML/DL models at the edge. It also identifies key factors that help companies overcome their dilemma to decide which architecture to choose for deploying ML/DL models. Five, it explores how MLOps, as a practice, brings together data scientist teams and operations to ensure the continuous delivery and evolution of models. Sixth, it presents the MLOps framework, maps companies to the MLOps maturity model, and validates the MLOps framework and maturity model with other companies. It also presents critical trade-offs that practitioners made when adopting MLOps. Seventh, it presents an MLOps taxonomy that helps companies determine their maturity stage and provide tailored MLOps practices to advance.

Place, publisher, year, edition, pages
Malmö: Malmö University Press, 2025. p. 248
Series
Studies in Computer Science ; 30
Keywords
MLOps, Development, Deployment, Evolution, ML/DL models, Frameworks
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-72797 (URN)10.24834/isbn.9789178775637 (DOI)978-91-7877-562-0 (ISBN)978-91-7877-563-7 (ISBN)
Public defence
2025-01-10, Auditorium C, Niagara, Malmö, 13:15 (English)
Opponent
Supervisors
Note

Paper H in dissertation as manuscript.

Available from: 2024-12-17 Created: 2024-12-17 Last updated: 2025-01-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

John, Meenu MaryOlsson, Helena Holmström

Search in DiVA

By author/editor
John, Meenu MaryOlsson, Helena Holmström
By organisation
Department of Computer Science and Media Technology (DVMT)
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 584 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf