Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
On the Impact of ML use cases on Industrial Data Pipelines
Chalmers Univ Technol, Gothenburg, Sweden..
Chalmers Univ Technol, Gothenburg, Sweden..
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT).ORCID iD: 0000-0002-7700-1816
CEVT, Gothenburg, Sweden..
2021 (English)In: 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2021), IEEE, 2021, p. 463-472Conference paper, Published paper (Refereed)
Abstract [en]

The impact of the Artificial Intelligence revolution is undoubtedly substantial in our society, life, firms, and employment. With data being a critical element, organizations are working towards obtaining high-quality data to train their AI models. Although data, data management, and data pipelines are part of industrial practice even before the introduction of ML models, the significance of data increased further with the advent of ML models, which force data pipeline developers to go beyond the traditional focus on data quality. The objective of this study is to analyze the impact of ML use cases on data pipelines. We assume that the data pipelines that serve ML models are given more importance compared to the conventional data pipelines. We report on a study that we conducted by observing software teams at three companies as they develop both conventional(Non-ML) data pipelines and data pipelines that serve ML-based applications. We study six data pipelines from three companies and categorize them based on their criticality and purpose. Further, we identify the determinants that can be used to compare the development and maintenance of these data pipelines. Finally, we map these factors in a two-dimensional space to illustrate their importance on a scale of low, moderate, and high.

Place, publisher, year, edition, pages
IEEE, 2021. p. 463-472
Series
Asia-Pacific Software Engineering Conference, ISSN 1530-1362
Keywords [en]
Data Pipelines, criticality, ML-influenced, conventional, ML characteristics, determinants
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:mau:diva-53120DOI: 10.1109/APSEC53868.2021.00053ISI: 000802192700046ISBN: 978-1-6654-3784-4 OAI: oai:DiVA.org:mau-53120DiVA, id: diva2:1672526
Conference
28th Asia-Pacific Software Engineering Conference (APSEC), DEC 06-09, 2021, ELECTR NETWORK
Available from: 2022-06-20 Created: 2022-06-20 Last updated: 2022-06-20Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Olsson, Helena Holmström

Search in DiVA

By author/editor
Olsson, Helena Holmström
By organisation
Department of Computer Science and Media Technology (DVMT)
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 9 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf