Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
AI on the Edge: Architectural Alternatives
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0003-3972-2265
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). (Software Center)ORCID iD: 0000-0002-7700-1816
Chalmers University of Technology. (Software Center)
2020 (English)In: Proceedings 46th Euromicro Conferenceon Software Engineering and Advanced Applications SEAA 2020 / [ed] Antonio Martini, Manuel Wimmer, Amund Skavhaug, IEEE, 2020, p. 21-28Conference paper, Published paper (Refereed)
Abstract [en]

Since the advent of mobile computing and IoT, a large amount of data is distributed around the world. Companies are increasingly experimenting with innovative ways of implementing edge/cloud (re)training of AI systems to exploit large quantities of data to optimize their business value. Despite the obvious benefits, companies face challenges as the decision on how to implement edge/cloud (re)training depends on factors such as the task intent, the amount of data needed for (re)training, edge-to-cloud data transfer, the available computing and memory resources. Based on action research in a software-intensive embedded systems company where we study multiple use cases as well as insights from our previous collaborations with industry, we develop a generic framework consisting of five architectural alternatives to deploy AI on the edge utilizing transfer learning. We validate the framework in four additional case companies and present the challenges they face in selecting the optimal architecture. The contribution of the paper is threefold. First, we develop a generic framework consisting of five architectural alternatives ranging from a centralized architecture where cloud (re)training is given priority to a decentralized architecture where edge (re)training is instead given priority. Second, we validate the framework in a qualitative interview study with four additional case companies. As an outcome of validation study, we present two variants to the architectural alternatives identified as part of the framework. Finally, we identify the key challenges that experts face in selecting an ideal architectural alternative.

Place, publisher, year, edition, pages
IEEE, 2020. p. 21-28
Series
Proceedings (EUROMICRO Conference on Software Engineering and Advanced Applications), ISSN 2640-592X, E-ISSN 2376-9521
Keywords [en]
Artificial Intelligence, Machine Learning, Deep Learning, Edge, Cloud, Transfer Learning, Action Research, Architectural alternatives
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:mau:diva-17930DOI: 10.1109/SEAA51224.2020.00015ISI: 000702094100004Scopus ID: 2-s2.0-85096567097ISBN: 978-1-7281-9532-2 (print)OAI: oai:DiVA.org:mau-17930DiVA, id: diva2:1458239
Conference
46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, 26-28 August 2020, Portorož, Slovenia
Available from: 2020-08-14 Created: 2020-08-14 Last updated: 2023-07-06Bibliographically approved
In thesis
1. Design Methods and Processes for ML/DL models
Open this publication in new window or tab >>Design Methods and Processes for ML/DL models
2021 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Context: With the advent of Machine Learning (ML) and especially Deep Learning (DL) technology, companies are increasingly using Artificial Intelligence (AI) in systems, along with electronics and software. Nevertheless, the end-to-end process of developing, deploying and evolving ML and DL models in companies brings some challenges related to the design and scaling of these models. For example, access to and availability of data is often challenging, and activities such as collecting, cleaning, preprocessing, and storing data, as well as training, deploying and monitoring the model(s) are complex. Regardless of the level of expertise and/or access to data scientists, companies in all embedded systems domain struggle to build high-performing models due to a lack of established and systematic design methods and processes.

Objective: The overall objective is to establish systematic and structured design methods and processes for the end-to-end process of developing, deploying and successfully evolving ML/DL models.

Method: To achieve the objective, we conducted our research in close collaboration with companies in the embedded systems domain using different empirical research methods such as case study, action research and literature review.

Results and Conclusions: This research provides six main results: First, it identifies the activities that companies undertake in parallel to develop, deploy and evolve ML/DL models, and the challenges associated with them. Second, it presents a conceptual framework for the continuous delivery of ML/DL models to accelerate AI-driven business in companies. Third, it presents a framework based on current literature to accelerate the end-to-end deployment process and advance knowledge on how to integrate, deploy and operationalize ML/DL models. Fourth, it develops a generic framework with five architectural alternatives for deploying ML/DL models at the edge. These architectural alternatives range from a centralized architecture that prioritizes (re)training in the cloud to a decentralized architecture that prioritizes (re)training at the edge. Fifth, it identifies key factors to help companies decide which architecture to choose for deploying ML/DL models. Finally, it explores how MLOps, as a practice that brings together data scientist teams and operations, ensures the continuous delivery and evolution of models. 

Place, publisher, year, edition, pages
Malmö: Malmö universitet, 2021. p. 204
Series
Studies in Computer Science ; 17
Keywords
Machine Learning, Deep Learning, Development, Deployment, Evolution
National Category
Computer Systems
Identifiers
urn:nbn:se:mau:diva-45026 (URN)10.24834/isbn.9789178771998 (DOI)978-91-7877-198-1 (ISBN)978-91-7877-199-8 (ISBN)
Presentation
(English)
Opponent
Supervisors
Note

Due to copyright reasons, the articles are not included in the fulltext online

Available from: 2021-08-20 Created: 2021-08-20 Last updated: 2024-03-26Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

John, Meenu MaryOlsson, Helena Holmström

Search in DiVA

By author/editor
John, Meenu MaryOlsson, Helena Holmström
By organisation
Department of Computer Science and Media Technology (DVMT)
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 531 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf