Malmö University Publications
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards continuous development of MLOps practices
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT).ORCID iD: 0000-0003-3972-2265
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: Digitalisation is transforming software-intensive embedded systems companies by focussing on business models that utilise software, data and AI (especially Machine learning and Deep Learning (DL)). However, despite these advancements, the majority of companies still struggle to transition their models from prototypes to fully functional operational systems. This highlights the need to optimise the end-to-end process of developing, deploying and evolving ML/DL models to ensure continuous value delivery.

Objective: This thesis is structured around three primary objectives. The first objective is to identify the need of MLOps (Machine Learning Operations). Building on this understanding, the second objective is to develop frameworks for the adoption of MLOps, aiming to standardise and streamline the processes of developing, deploying and evolving ML/DL models. Finally, the third objective is to adopt MLOps practices and assess the maturity of their adoption.

Method: To achieve these objectives, we conducted research in close collaboration with various companies and used a combination of different empirical research methods, such as case studies, action research, and literature reviews.

Results and Conclusions: First, the thesis identifies the activities carried out by practitioners in companies and the challenges they face when developing, deploying and evolving models. Second, it presents a conceptual framework with three parallel and concurrent activities that companies utilise in the process of developing, deploying and evolving models. Third, it introduces a framework based on current literature to accelerate and advance knowledge on the end-to-end deployment process. Fourth, it develops a generic framework with five architectural alternatives ranging from a centralised architecture to a decentralised architecture for deploying ML/DL models at the edge. It also identifies key factors that help companies overcome their dilemma to decide which architecture to choose for deploying ML/DL models. Five, it explores how MLOps, as a practice, brings together data scientist teams and operations to ensure the continuous delivery and evolution of models. Sixth, it presents the MLOps framework, maps companies to the MLOps maturity model, and validates the MLOps framework and maturity model with other companies. It also presents critical trade-offs that practitioners made when adopting MLOps. Seventh, it presents an MLOps taxonomy that helps companies determine their maturity stage and provide tailored MLOps practices to advance.

Place, publisher, year, edition, pages
Malmö: Malmö University Press, 2025. , p. 248
Series
Studies in Computer Science ; 30
Keywords [en]
MLOps, Development, Deployment, Evolution, ML/DL models, Frameworks
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:mau:diva-72797DOI: 10.24834/isbn.9789178775637ISBN: 978-91-7877-562-0 (print)ISBN: 978-91-7877-563-7 (electronic)OAI: oai:DiVA.org:mau-72797DiVA, id: diva2:1922011
Public defence
2025-01-10, Auditorium C, Niagara, Malmö, 13:15 (English)
Opponent
Supervisors
Note

Paper H in dissertation as manuscript.

Available from: 2024-12-17 Created: 2024-12-17 Last updated: 2025-01-07Bibliographically approved
List of papers
1. Developing ML/DL Models: A Design Framework
Open this publication in new window or tab >>Developing ML/DL Models: A Design Framework
2020 (English)In: Proceedings 2020 IEEE/ACM International Conferenceon Software and System Processes ICSSP 2020, ACM Digital Library, 2020, p. 1-10Conference paper, Published paper (Refereed)
Abstract [en]

Artificial Intelligence is becoming increasingly popular with organizations due to the success of Machine Learning and Deep Learning techniques. Using these techniques, data scientists learn from vast amounts of data to enhance behaviour in software-intensive systems. Despite the attractiveness of these techniques, however, there is a lack of systematic and structured design process for developing ML/DL models. The study uses a multiple-case study approach to explore the different activities and challenges data scientists face when developing ML/DL models in software-intensive embedded systems. In addition, we have identified seven different phases in the proposed design process leading to effective model development based on the case study. Iterations identified between phases and events which trigger these iterations optimize the design process for ML/DL models. Lessons learned from this study allow data scientists and engineers to develop high-performance ML/DL models and also bridge the gap between high demand and low supply of data scientists.

Place, publisher, year, edition, pages
ACM Digital Library, 2020
Keywords
Machine Learning, Deep Learning, Artificial Intelligence, Design, Software Engineering
National Category
Engineering and Technology
Identifiers
urn:nbn:se:mau:diva-17137 (URN)10.1145/3379177.3388892 (DOI)001039139300001 ()2-s2.0-85092522299 (Scopus ID)978-1-4503-7512-2 (ISBN)
Conference
ICSSP '20: International Conference on Software and System Processes, June 26-28, 2020, Seoul, Republic of Korea
Available from: 2020-04-28 Created: 2020-04-28 Last updated: 2024-12-17Bibliographically approved
2. Towards an AI-driven business development framework: A multi-case study
Open this publication in new window or tab >>Towards an AI-driven business development framework: A multi-case study
2023 (English)In: Journal of Software: Evolution and Process, ISSN 2047-7473, E-ISSN 2047-7481, Vol. 35, no 6, article id e2432Article in journal (Refereed) Published
Abstract [en]

Artificial intelligence (AI) and the use of machine learning (ML) and deep learning (DL) technologies are becoming increasingly popular in companies. These technologies enable companies to leverage big quantities of data to improve system performance and accelerate business development. However, despite the appeal of ML/DL, there is a lack of systematic and structured methods and processes to help data scientists and other company roles and functions to develop, deploy and evolve models. In this paper, based on multi-case study research in six companies, we explore practices and challenges practitioners experience in developing ML/DL models as part of large software-intensive embedded systems. Based on our empirical findings, we derive a conceptual framework in which we identify three high-level activities that companies perform in parallel with the development, deployment and evolution of models. Within this framework, we outline activities, iterations and triggers that optimize model design as well as roles and company functions. In this way, we provide practitioners with a blueprint for effectively integrating ML/DL model development into the business to achieve better results than other (algorithmic) approaches. In addition, we show how this framework helps companies solve the challenges we have identified and discuss checkpoints for terminating the business case.

Place, publisher, year, edition, pages
John Wiley & Sons, 2023
Keywords
AI-driven business development framework, artificial intelligence, challenges, deep learning, iterations and triggers, machine learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-50450 (URN)10.1002/smr.2432 (DOI)000760593100001 ()2-s2.0-85125909057 (Scopus ID)
Available from: 2022-03-07 Created: 2022-03-07 Last updated: 2024-12-17Bibliographically approved
3. Architecting AI Deployment: A Systematic Review of State-of-the-art and State-of-practice Literature
Open this publication in new window or tab >>Architecting AI Deployment: A Systematic Review of State-of-the-art and State-of-practice Literature
2020 (English)In: Software Business: 11th International Conference, ICSOB 2020, Karlskrona, Sweden, November 16–18, 2020, Proceedings / [ed] Eriks Klotins; Krzysztof Wnuk, Springer, 2020, p. 14-29Conference paper, Published paper (Refereed)
Abstract [en]

Companies across domains are rapidly engaged in shifting computational power and intelligence from centralized cloud to fully decentralized edges to maximize value delivery, strengthen security and reduce latency. However, most companies have only recently started pursuing this opportunity and are therefore at the early stage of the cloud-to-edge transition. To provide an overview of AI deployment in the context of edge/cloud/hybrid architectures, we conduct a systematic literature review and a grey literature review. To advance understanding of how to integrate, deploy, operationalize and evolve AI models, we derive a framework from existing literature to accelerate the end-to-end deployment process. The framework is organized into five phases: Design, Integration, Deployment, Operation and Evolution. We make an attempt to analyze the extracted results by comparing and contrasting them to derive insights. The contribution of the paper is threefold. First, we conduct a systematic literature review in which we review the contemporary scientific literature and provide a detailed overview of the state-of-the-art of AI deployment. Second, we review the grey literature and present the state-of-practice and experience of practitioners while deploying AI models. Third, we present a framework derived from existing literature for the end-to-end deployment process and attempt to compare and contrast SLR and GLR results.

Place, publisher, year, edition, pages
Springer, 2020
Series
Lecture Notes in Business Information Processing, ISSN 1865-1348, E-ISSN 1865-1356 ; 407
Keywords
Machine Learning, Deep Learning, Deployment, Systematic Literature Review, Grey Literature Review, Practices, Challenges
National Category
Computer Systems
Identifiers
urn:nbn:se:mau:diva-17122 (URN)10.1007/978-3-030-67292-8_2 (DOI)2-s2.0-85101368139 (Scopus ID)978-3-030-67291-1 (ISBN)978-3-030-67292-8 (ISBN)
Conference
11th International Conference on Software Business, ICSOB, Nov 17-18, 2020, Karlskrona, Sweden
Available from: 2021-05-11 Created: 2021-05-11 Last updated: 2024-12-17Bibliographically approved
4. AI on the Edge: Architectural Alternatives
Open this publication in new window or tab >>AI on the Edge: Architectural Alternatives
2020 (English)In: Proceedings 46th Euromicro Conferenceon Software Engineering and Advanced Applications SEAA 2020 / [ed] Antonio Martini, Manuel Wimmer, Amund Skavhaug, IEEE, 2020, p. 21-28Conference paper, Published paper (Refereed)
Abstract [en]

Since the advent of mobile computing and IoT, a large amount of data is distributed around the world. Companies are increasingly experimenting with innovative ways of implementing edge/cloud (re)training of AI systems to exploit large quantities of data to optimize their business value. Despite the obvious benefits, companies face challenges as the decision on how to implement edge/cloud (re)training depends on factors such as the task intent, the amount of data needed for (re)training, edge-to-cloud data transfer, the available computing and memory resources. Based on action research in a software-intensive embedded systems company where we study multiple use cases as well as insights from our previous collaborations with industry, we develop a generic framework consisting of five architectural alternatives to deploy AI on the edge utilizing transfer learning. We validate the framework in four additional case companies and present the challenges they face in selecting the optimal architecture. The contribution of the paper is threefold. First, we develop a generic framework consisting of five architectural alternatives ranging from a centralized architecture where cloud (re)training is given priority to a decentralized architecture where edge (re)training is instead given priority. Second, we validate the framework in a qualitative interview study with four additional case companies. As an outcome of validation study, we present two variants to the architectural alternatives identified as part of the framework. Finally, we identify the key challenges that experts face in selecting an ideal architectural alternative.

Place, publisher, year, edition, pages
IEEE, 2020
Series
Proceedings (EUROMICRO Conference on Software Engineering and Advanced Applications), ISSN 2640-592X, E-ISSN 2376-9521
Keywords
Artificial Intelligence, Machine Learning, Deep Learning, Edge, Cloud, Transfer Learning, Action Research, Architectural alternatives
National Category
Engineering and Technology
Identifiers
urn:nbn:se:mau:diva-17930 (URN)10.1109/SEAA51224.2020.00015 (DOI)000702094100004 ()2-s2.0-85096567097 (Scopus ID)978-1-7281-9532-2 (ISBN)
Conference
46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, 26-28 August 2020, Portorož, Slovenia
Available from: 2020-08-14 Created: 2020-08-14 Last updated: 2024-12-17Bibliographically approved
5. AI Deployment Architecture: Multi-Case Study for Key Factor Identification
Open this publication in new window or tab >>AI Deployment Architecture: Multi-Case Study for Key Factor Identification
2020 (English)In: 2020 27th Asia-Pacific Software Engineering Conference (APSEC), IEEE, 2020, Vol. 1, p. 395-404Conference paper, Published paper (Refereed)
Abstract [en]

Machine learning and deep learning techniques are becoming increasingly popular and critical for companies as part of their systems. However, although the development and prototyping of ML/DL systems are common across companies, the transition from prototype to production-quality deployment models are challenging. One of the key challenges is how to determine the selection of an optimal architecture for AI deployment. Based on our previous research, and to offer support and guidance to practitioners, we developed a framework in which we present five architectural alternatives for AI deployment ranging from centralized to fully decentralized edge architectures. As part of our research, we validated the framework in software-intensive embedded system companies and identified key challenges they face when deploying ML/DL models. In this paper, and to further advance our research on this topic, we identify factors that help practitioners determine what architecture to select for the ML/D L model deployment. For this, we conducted a follow-up study involving interviews and workshops in seven case companies in the embedded systems domain. Based on our findings, we identify three key factors and develop a framework in which we outline how prioritization and trade-offs between these result in certain architecture. The contribution of the paper is threefold. First, we identify key factors critical for AI system deployment. Second, we present the architecture selection framework that explains how prioritization and trade-offs between key factors result in the selection of a certain architecture. Third, we discuss additional factors that may or may not influence the selection of an optimal architecture.

Place, publisher, year, edition, pages
IEEE, 2020
Series
Proceedings - Asia Pacific Software Engineering Conference, ISSN 1530-1362, E-ISSN 2640-0715
Keywords
Artificial Intelligence, Machine Learning, Deep Learning, Edge, Cloud, Architecture, Deployment
National Category
Computer Engineering
Identifiers
urn:nbn:se:mau:diva-42167 (URN)10.1109/APSEC51365.2020.00048 (DOI)000662668700041 ()2-s2.0-85102359323 (Scopus ID)978-1-7281-9553-7 (ISBN)978-1-7281-9554-4 (ISBN)
Conference
27TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, 1 - 4 December 2020 - Singapore
Available from: 2021-05-11 Created: 2021-05-11 Last updated: 2024-12-17Bibliographically approved
6. Towards MLOps: A Framework and Maturity Model
Open this publication in new window or tab >>Towards MLOps: A Framework and Maturity Model
2021 (English)In: Proceedings - 2021 47th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2021, IEEE, 2021, p. 334-341Conference paper, Published paper (Refereed)
Abstract [en]

The adoption of continuous software engineering practices such as DevOps (Development and Operations) in business operations has contributed to significantly shorter software development and deployment cycles. Recently, the term MLOps (Machine Learning Operations) has gained increasing interest as a practice that brings together data scientists and operations teams. However, the adoption of MLOps in practice is still in its infancy and there are few common guidelines on how to effectively integrate it into existing software development practices. In this paper, we conduct a systematic literature review and a grey literature review to derive a framework that identifies the activities involved in the adoption of MLOps and the stages in which companies evolve as they become more mature and advanced. We validate this framework in three case companies and show how they have managed to adopt and integrate MLOps in their large-scale software development companies. The contribution of this paper is threefold. First, we review contemporary literature to provide an overview of the state-of-the-art in MLOps. Based on this review, we derive an MLOps framework that details the activities involved in the continuous development of machine learning models. Second, we present a maturity model in which we outline the different stages that companies go through in evolving their MLOps practices. Third, we validate our framework in three embedded systems case companies and map the companies to the stages in the maturity model. 

Place, publisher, year, edition, pages
IEEE, 2021
Keywords
Framework, GLR, Maturity Model, MLOps, SLR, Validation Study, Embedded systems, Machine learning, Continuous software engineerings, Framework models, Machine learning operation, Machine-learning, Software engineering practices, Software design
National Category
Software Engineering
Identifiers
urn:nbn:se:mau:diva-48501 (URN)10.1109/SEAA53835.2021.00050 (DOI)000766051900042 ()2-s2.0-85119201202 (Scopus ID)9781665427050 (ISBN)
Conference
2021 47th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2021, 1-3 Sept. 2021, Palermo, Italy
Available from: 2021-12-29 Created: 2021-12-29 Last updated: 2024-12-17Bibliographically approved
7. Advancing MLOps from Ad hoc to Kaizen
Open this publication in new window or tab >>Advancing MLOps from Ad hoc to Kaizen
2023 (English)In: 2023 49th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Institute of Electrical and Electronics Engineers (IEEE), 2023Conference paper, Published paper (Refereed)
Abstract [en]

Companies across various domains increasingly adopt Machine Learning Operations (MLOps) as they recognise the significance of operationalising ML models. Despite growing interest from practitioners and ongoing research, MLOps adoption in practice is still in its initial stages. To explore the adoption of MLOps, we employ a multi-case study in seven companies. Based on empirical findings, we propose a maturity model outlining the typical stages companies undergo when adopting MLOps, ranging from Ad hoc to Kaizen. We identify five dimensions associated with each stage of the maturity model as part of our MLOps framework. We also map these seven companies to the identified stages in the maturity model. Our study serves as a roadmap for companies to assess their current state of MLOps, identify gaps and overcome obstacles to successfully adopting MLOps.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Series
Proceedings (EUROMICRO Conference on Software Engineering and Advanced Applications), ISSN 2640-592X, E-ISSN 2376-9521
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-64891 (URN)10.1109/seaa60479.2023.00023 (DOI)2-s2.0-85183329155 (Scopus ID)979-8-3503-4235-2 (ISBN)979-8-3503-4236-9 (ISBN)
Conference
2023 49th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Durres, Albania, 06-08 September 2023
Available from: 2024-01-09 Created: 2024-01-09 Last updated: 2024-12-17Bibliographically approved
8. An empirical guide to mlops adoption: Framework,maturity model, and taxonomy
Open this publication in new window or tab >>An empirical guide to mlops adoption: Framework,maturity model, and taxonomy
(English)Manuscript (preprint) (Other academic)
National Category
Computer Sciences
Identifiers
urn:nbn:se:mau:diva-72799 (URN)
Available from: 2024-12-17 Created: 2024-12-17 Last updated: 2024-12-17Bibliographically approved
9. Exploring Trade-offs in MLOps Adoption
Open this publication in new window or tab >>Exploring Trade-offs in MLOps Adoption
2023 (English)In: Proceedings of the 2023 30th asia-pacific software engineering conference , ASPEC 2023, IEEE Computer Society Digital Library, 2023, p. 369-375Conference paper, Published paper (Refereed)
Abstract [en]

Machine Learning Operations (MLOps) play a crucial role in the success of data science projects in companies. However, despite its obvious benefits, several companies struggle to adopt MLOps practices and face difficulty in deciding how to deploy and evolve ML models. To gain a deeper understanding of these challenges, we conduct a multi-case study involving nine practitioners from seven companies. Based on our empirical results, we identify the key trade-offs we see companies make when adopting MLOps. We categorise these trade-offs into four concerns of the BAPO model: Business, Architecture, Process, and Organisation. Finally, we provide suggestions to mitigate the identified trade-offs. By identifying and detailing these trade-offs and the implications of these, this research helps companies to ensure the successful adoption of MLOps.

Place, publisher, year, edition, pages
IEEE Computer Society Digital Library, 2023
Series
Asia-Pacific Software Engineering Conference, ISSN 1530-1362
Keywords
MLOps, Trade-offs, BAPO model, Multi-case study
National Category
Software Engineering
Identifiers
urn:nbn:se:mau:diva-69983 (URN)10.1109/APSEC60848.2023.00047 (DOI)001207000500038 ()2-s2.0-85190507464 (Scopus ID)979-8-3503-4417-2 (ISBN)
Conference
30th Asia-Pacific Software Engineering Conference (APSEC), DEC 04-07, 2023, Seoul, SOUTH KOREA
Available from: 2024-07-30 Created: 2024-07-30 Last updated: 2024-12-17Bibliographically approved

Open Access in DiVA

fulltext(5196 kB)38 downloads
File information
File name FULLTEXT02.pdfFile size 5196 kBChecksum SHA-512
ff43999e5c48dfd044ac92c96da9e980c427e006abe44e0cf217cb3cd6c5e6c516add1f0dfb6492d769fa67b5323286d0125cfa77636b44294361fcfdd22aef8
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

John, Meenu Mary

Search in DiVA

By author/editor
John, Meenu Mary
By organisation
Department of Computer Science and Media Technology (DVMT)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 38 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 521 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf