Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Architecting AI Deployment: A Systematic Review of State-of-the-art and State-of-practice Literature
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0003-3972-2265
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0002-7700-1816
Chalmers University of Technology. (Software Center)
2020 (English)In: Software Business: 11th International Conference, ICSOB 2020, Karlskrona, Sweden, November 16–18, 2020, Proceedings / [ed] Eriks Klotins; Krzysztof Wnuk, Springer, 2020, p. 14-29Conference paper, Published paper (Refereed)
Abstract [en]

Companies across domains are rapidly engaged in shifting computational power and intelligence from centralized cloud to fully decentralized edges to maximize value delivery, strengthen security and reduce latency. However, most companies have only recently started pursuing this opportunity and are therefore at the early stage of the cloud-to-edge transition. To provide an overview of AI deployment in the context of edge/cloud/hybrid architectures, we conduct a systematic literature review and a grey literature review. To advance understanding of how to integrate, deploy, operationalize and evolve AI models, we derive a framework from existing literature to accelerate the end-to-end deployment process. The framework is organized into five phases: Design, Integration, Deployment, Operation and Evolution. We make an attempt to analyze the extracted results by comparing and contrasting them to derive insights. The contribution of the paper is threefold. First, we conduct a systematic literature review in which we review the contemporary scientific literature and provide a detailed overview of the state-of-the-art of AI deployment. Second, we review the grey literature and present the state-of-practice and experience of practitioners while deploying AI models. Third, we present a framework derived from existing literature for the end-to-end deployment process and attempt to compare and contrast SLR and GLR results.

Place, publisher, year, edition, pages
Springer, 2020. p. 14-29
Series
Lecture Notes in Business Information Processing, ISSN 1865-1348, E-ISSN 1865-1356 ; 407
Keywords [en]
Machine Learning, Deep Learning, Deployment, Systematic Literature Review, Grey Literature Review, Practices, Challenges
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:mau:diva-17122DOI: 10.1007/978-3-030-67292-8_2Scopus ID: 2-s2.0-85101368139ISBN: 978-3-030-67291-1 (print)ISBN: 978-3-030-67292-8 (electronic)OAI: oai:DiVA.org:mau-17122DiVA, id: diva2:1553907
Conference
11th International Conference on Software Business, ICSOB, Nov 17-18, 2020, Karlskrona, Sweden
Available from: 2021-05-11 Created: 2021-05-11 Last updated: 2024-12-17Bibliographically approved
In thesis
1. Design Methods and Processes for ML/DL models
Open this publication in new window or tab >>Design Methods and Processes for ML/DL models
2021 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Context: With the advent of Machine Learning (ML) and especially Deep Learning (DL) technology, companies are increasingly using Artificial Intelligence (AI) in systems, along with electronics and software. Nevertheless, the end-to-end process of developing, deploying and evolving ML and DL models in companies brings some challenges related to the design and scaling of these models. For example, access to and availability of data is often challenging, and activities such as collecting, cleaning, preprocessing, and storing data, as well as training, deploying and monitoring the model(s) are complex. Regardless of the level of expertise and/or access to data scientists, companies in all embedded systems domain struggle to build high-performing models due to a lack of established and systematic design methods and processes.

Objective: The overall objective is to establish systematic and structured design methods and processes for the end-to-end process of developing, deploying and successfully evolving ML/DL models.

Method: To achieve the objective, we conducted our research in close collaboration with companies in the embedded systems domain using different empirical research methods such as case study, action research and literature review.

Results and Conclusions: This research provides six main results: First, it identifies the activities that companies undertake in parallel to develop, deploy and evolve ML/DL models, and the challenges associated with them. Second, it presents a conceptual framework for the continuous delivery of ML/DL models to accelerate AI-driven business in companies. Third, it presents a framework based on current literature to accelerate the end-to-end deployment process and advance knowledge on how to integrate, deploy and operationalize ML/DL models. Fourth, it develops a generic framework with five architectural alternatives for deploying ML/DL models at the edge. These architectural alternatives range from a centralized architecture that prioritizes (re)training in the cloud to a decentralized architecture that prioritizes (re)training at the edge. Fifth, it identifies key factors to help companies decide which architecture to choose for deploying ML/DL models. Finally, it explores how MLOps, as a practice that brings together data scientist teams and operations, ensures the continuous delivery and evolution of models. 

Place, publisher, year, edition, pages
Malmö: Malmö universitet, 2021. p. 204
Series
Studies in Computer Science ; 17
Keywords
Machine Learning, Deep Learning, Development, Deployment, Evolution
National Category
Computer Systems
Identifiers
urn:nbn:se:mau:diva-45026 (URN)10.24834/isbn.9789178771998 (DOI)978-91-7877-198-1 (ISBN)978-91-7877-199-8 (ISBN)
Presentation
(English)
Opponent
Supervisors
Note

Due to copyright reasons, the articles are not included in the fulltext online

Available from: 2021-08-20 Created: 2021-08-20 Last updated: 2024-03-26Bibliographically approved
2. Towards continuous development of MLOps practices
Open this publication in new window or tab >>Towards continuous development of MLOps practices
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: Digitalisation is transforming software-intensive embedded systems companies by focussing on business models that utilise software, data and AI (especially Machine learning and Deep Learning (DL)). However, despite these advancements, the majority of companies still struggle to transition their models from prototypes to fully functional operational systems. This highlights the need to optimise the end-to-end process of developing, deploying and evolving ML/DL models to ensure continuous value delivery.

Objective: This thesis is structured around three primary objectives. The first objective is to identify the need of MLOps (Machine Learning Operations). Building on this understanding, the second objective is to develop frameworks for the adoption of MLOps, aiming to standardise and streamline the processes of developing, deploying and evolving ML/DL models. Finally, the third objective is to adopt MLOps practices and assess the maturity of their adoption.

Method: To achieve these objectives, we conducted research in close collaboration with various companies and used a combination of different empirical research methods, such as case studies, action research, and literature reviews.

Results and Conclusions: First, the thesis identifies the activities carried out by practitioners in companies and the challenges they face when developing, deploying and evolving models. Second, it presents a conceptual framework with three parallel and concurrent activities that companies utilise in the process of developing, deploying and evolving models. Third, it introduces a framework based on current literature to accelerate and advance knowledge on the end-to-end deployment process. Fourth, it develops a generic framework with five architectural alternatives ranging from a centralised architecture to a decentralised architecture for deploying ML/DL models at the edge. It also identifies key factors that help companies overcome their dilemma to decide which architecture to choose for deploying ML/DL models. Five, it explores how MLOps, as a practice, brings together data scientist teams and operations to ensure the continuous delivery and evolution of models. Sixth, it presents the MLOps framework, maps companies to the MLOps maturity model, and validates the MLOps framework and maturity model with other companies. It also presents critical trade-offs that practitioners made when adopting MLOps. Seventh, it presents an MLOps taxonomy that helps companies determine their maturity stage and provide tailored MLOps practices to advance.

Place, publisher, year, edition, pages
Malmö: Malmö University Press, 2025. p. 248
Series
Studies in Computer Science ; 30
Keywords
MLOps, Development, Deployment, Evolution, ML/DL models, Frameworks
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-72797 (URN)10.24834/isbn.9789178775637 (DOI)978-91-7877-562-0 (ISBN)978-91-7877-563-7 (ISBN)
Public defence
2025-01-10, Auditorium C, Niagara, Malmö, 13:15 (English)
Opponent
Supervisors
Note

Paper H in dissertation as manuscript.

Available from: 2024-12-17 Created: 2024-12-17 Last updated: 2025-01-07Bibliographically approved

Open Access in DiVA

fulltext(546 kB)1379 downloads
File information
File name FULLTEXT01.pdfFile size 546 kBChecksum SHA-512
4e159e6c992e135192b65e45bfe923caca0b138334bc1f3071c2b452e456ce108ac9668c83821b33c0e570ae6f9201fd811e2eab971f0c5977132068ee970f8a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

John, Meenu MaryOlsson, Helena Holmström

Search in DiVA

By author/editor
John, Meenu MaryOlsson, Helena Holmström
By organisation
Department of Computer Science and Media Technology (DVMT)
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 1382 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 186 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf