Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
AI Deployment Architecture: Multi-Case Study for Key Factor Identification
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0003-3972-2265
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0002-7700-1816
Chalmers University. (Software Center)
2020 (English)In: 2020 27th Asia-Pacific Software Engineering Conference (APSEC), IEEE, 2020, Vol. 1, p. 395-404Conference paper, Published paper (Refereed)
Abstract [en]

Machine learning and deep learning techniques are becoming increasingly popular and critical for companies as part of their systems. However, although the development and prototyping of ML/DL systems are common across companies, the transition from prototype to production-quality deployment models are challenging. One of the key challenges is how to determine the selection of an optimal architecture for AI deployment. Based on our previous research, and to offer support and guidance to practitioners, we developed a framework in which we present five architectural alternatives for AI deployment ranging from centralized to fully decentralized edge architectures. As part of our research, we validated the framework in software-intensive embedded system companies and identified key challenges they face when deploying ML/DL models. In this paper, and to further advance our research on this topic, we identify factors that help practitioners determine what architecture to select for the ML/D L model deployment. For this, we conducted a follow-up study involving interviews and workshops in seven case companies in the embedded systems domain. Based on our findings, we identify three key factors and develop a framework in which we outline how prioritization and trade-offs between these result in certain architecture. The contribution of the paper is threefold. First, we identify key factors critical for AI system deployment. Second, we present the architecture selection framework that explains how prioritization and trade-offs between key factors result in the selection of a certain architecture. Third, we discuss additional factors that may or may not influence the selection of an optimal architecture.

Place, publisher, year, edition, pages
IEEE, 2020. Vol. 1, p. 395-404
Series
Proceedings - Asia Pacific Software Engineering Conference, ISSN 1530-1362, E-ISSN 2640-0715
Keywords [en]
Artificial Intelligence, Machine Learning, Deep Learning, Edge, Cloud, Architecture, Deployment
National Category
Computer Engineering
Identifiers
URN: urn:nbn:se:mau:diva-42167DOI: 10.1109/APSEC51365.2020.00048ISI: 000662668700041Scopus ID: 2-s2.0-85102359323ISBN: 978-1-7281-9553-7 (electronic)ISBN: 978-1-7281-9554-4 (print)OAI: oai:DiVA.org:mau-42167DiVA, id: diva2:1553912
Conference
27TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, 1 - 4 December 2020 - Singapore
Available from: 2021-05-11 Created: 2021-05-11 Last updated: 2024-02-05Bibliographically approved
In thesis
1. Design Methods and Processes for ML/DL models
Open this publication in new window or tab >>Design Methods and Processes for ML/DL models
2021 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Context: With the advent of Machine Learning (ML) and especially Deep Learning (DL) technology, companies are increasingly using Artificial Intelligence (AI) in systems, along with electronics and software. Nevertheless, the end-to-end process of developing, deploying and evolving ML and DL models in companies brings some challenges related to the design and scaling of these models. For example, access to and availability of data is often challenging, and activities such as collecting, cleaning, preprocessing, and storing data, as well as training, deploying and monitoring the model(s) are complex. Regardless of the level of expertise and/or access to data scientists, companies in all embedded systems domain struggle to build high-performing models due to a lack of established and systematic design methods and processes.

Objective: The overall objective is to establish systematic and structured design methods and processes for the end-to-end process of developing, deploying and successfully evolving ML/DL models.

Method: To achieve the objective, we conducted our research in close collaboration with companies in the embedded systems domain using different empirical research methods such as case study, action research and literature review.

Results and Conclusions: This research provides six main results: First, it identifies the activities that companies undertake in parallel to develop, deploy and evolve ML/DL models, and the challenges associated with them. Second, it presents a conceptual framework for the continuous delivery of ML/DL models to accelerate AI-driven business in companies. Third, it presents a framework based on current literature to accelerate the end-to-end deployment process and advance knowledge on how to integrate, deploy and operationalize ML/DL models. Fourth, it develops a generic framework with five architectural alternatives for deploying ML/DL models at the edge. These architectural alternatives range from a centralized architecture that prioritizes (re)training in the cloud to a decentralized architecture that prioritizes (re)training at the edge. Fifth, it identifies key factors to help companies decide which architecture to choose for deploying ML/DL models. Finally, it explores how MLOps, as a practice that brings together data scientist teams and operations, ensures the continuous delivery and evolution of models. 

Place, publisher, year, edition, pages
Malmö: Malmö universitet, 2021. p. 204
Series
Studies in Computer Science ; 17
Keywords
Machine Learning, Deep Learning, Development, Deployment, Evolution
National Category
Computer Systems
Identifiers
urn:nbn:se:mau:diva-45026 (URN)10.24834/isbn.9789178771998 (DOI)978-91-7877-198-1 (ISBN)978-91-7877-199-8 (ISBN)
Presentation
(English)
Opponent
Supervisors
Note

Due to copyright reasons, the articles are not included in the fulltext online

Available from: 2021-08-20 Created: 2021-08-20 Last updated: 2024-03-26Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

John, Meenu MaryOlsson, Helena Holmström

Search in DiVA

By author/editor
John, Meenu MaryOlsson, Helena Holmström
By organisation
Department of Computer Science and Media Technology (DVMT)
Computer Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 282 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf