Malmö University Publications
Deep Reinforcement Learning for Multiple Agents in a Decentralized Architecture: A Case Study in the Telecommunication Domain
Chalmers University of Technology, Gothenburg, Sweden.
Ericsson, Ericsson Research, Gothenburg, Sweden.
Ericsson, Ericsson Research, Gothenburg, Sweden.
Ericsson, Ericsson Research, Gothenburg, Sweden.
2023 (English). In: 2023 IEEE 20th International Conference on Software Architecture Companion (ICSA-C), IEEE Computer Society, 2023, p. 183-186. Conference paper, Published paper (Refereed).
Abstract [en]

Deep reinforcement learning has developed significantly in recent years, and it is now applied not only in simulators and games but also in embedded systems. However, when implemented in a real-world context, reinforcement learning frequently proves unstable and incapable of adapting to realistic situations, particularly when directing a large number of agents. In this paper, we develop a decentralized architecture for reinforcement learning that allows multiple agents, running on devices of the same kind but in varied environments, to learn optimal control policies on their own. For such multiple agents, a traditional centralized learning algorithm usually requires costly or time-consuming effort to develop the best regulating policy and is incapable of scaling to a large-scale system. To address this issue, we propose a decentralized reinforcement learning algorithm (DecRL) and an information exchange scheme for each individual device, in which each agent shares its individual learning experience and information with the other agents based on local model training. We incorporate the algorithm into each agent in the proposed collaborative architecture and validate it in the telecommunication domain under emergency conditions, in which a macro base station (BS) is broken due to a natural disaster, and three unmanned aerial vehicles carrying BSs (UAV-BSs) are deployed to provide temporary coverage for mission-critical (MC) users in the disaster area. Based on the findings, we show that the proposed decentralized reinforcement learning algorithm can successfully support multi-agent learning, while further enhancing learning speed and service quality.
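
The record contains no code, but the scheme the abstract outlines, local training on each device plus periodic information exchange among peers, can be sketched in a few lines. The sketch below is a hypothetical illustration and not the paper's DecRL implementation: tabular Q-learning stands in for the deep networks, simple averaging of value estimates stands in for the paper's exchange scheme, and all names (DecAgent, merge_from, toy_env_step) are invented for illustration.

import random
from collections import defaultdict

class DecAgent:
    """One learning agent; a hypothetical stand-in, not the paper's DecRL."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, eps=0.1):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        # Tabular value estimates stand in for the deep network.
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def act(self, state):
        # Epsilon-greedy action selection.
        if random.random() < self.eps:
            return random.randrange(self.n_actions)
        values = self.q[state]
        return values.index(max(values))

    def learn(self, s, a, r, s_next):
        # Standard one-step Q-learning update from local experience.
        target = r + self.gamma * max(self.q[s_next])
        self.q[s][a] += self.alpha * (target - self.q[s][a])

    def merge_from(self, peer_tables):
        # Information exchange: average estimates with peer snapshots.
        # (A naive placeholder for the paper's exchange scheme.)
        states = set(self.q)
        for table in peer_tables:
            states |= set(table)
        default = [0.0] * self.n_actions
        for s in states:
            for a in range(self.n_actions):
                vals = [self.q[s][a]] + [t.get(s, default)[a] for t in peer_tables]
                self.q[s][a] = sum(vals) / len(vals)

def toy_env_step(state, action, bias):
    # Toy dynamics: same device behaviour, per-agent reward bias,
    # mimicking "devices of the same kind in varied environments".
    next_state = (state + action + 1) % 5
    reward = 1.0 if next_state == bias else 0.0
    return next_state, reward

agents = [DecAgent(n_actions=2) for _ in range(3)]  # e.g. three UAV-BS agents
for episode in range(200):
    for i, agent in enumerate(agents):
        s = 0
        for _ in range(20):
            a = agent.act(s)
            s_next, r = toy_env_step(s, a, bias=i + 1)
            agent.learn(s, a, r, s_next)
            s = s_next
    if episode % 10 == 0:
        # Periodic decentralized exchange: freeze copies of all tables
        # first so the merge order does not bias the result.
        snapshots = [{s: list(v) for s, v in agent.q.items()} for agent in agents]
        for i, agent in enumerate(agents):
            agent.merge_from(snapshots[:i] + snapshots[i + 1:])

Averaging is only one possible exchange rule; the actual scheme, and the deep-network function approximation it coordinates, are defined in the full text of the paper.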

Place, publisher, year, edition, pages
IEEE Computer Society, 2023, p. 183-186.
Series
IEEE International Conference on Software Architecture Workshops, ISSN 2768-427X
Keywords [en]
Reinforcement Learning, Machine Learning, Software Engineering, Emergency Communication Network, Multi-Agent, Decentralized Architecture
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:mau:diva-61865
DOI: 10.1109/ICSA-C57050.2023.00048
ISI: 000990534100032
Scopus ID: 2-s2.0-85159142701
ISBN: 978-1-6654-6459-8 (electronic)
OAI: oai:DiVA.org:mau-61865
DiVA id: diva2:1788081
Conference
IEEE 20th International Conference on Software Architecture (ICSA), March 13-17, 2023, L'Aquila, Italy
Available from: 2023-08-15. Created: 2023-08-15. Last updated: 2023-08-15. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text
Scopus

Authority records

Olsson, Helena Holmström
