Malmö University Publications
Deep Learning Sensor-fusion-based odometry for autonomous robot navigation
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT).
2024 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Odometry estimation plays a key role in autonomous navigation systems. While monocular odometry estimation has received considerable research attention, sensor fusion techniques for Stereo Visual Odometry (SVO) have been relatively neglected because their computational demands pose practical challenges. However, recent advancements in hardware, particularly the integration of CPUs with dedicated artificial intelligence units, have alleviated these concerns. This thesis explores the enhancement of autonomous robot navigation through the integration of attention mechanisms with stereo images, particularly in environments where GPS signals are unreliable or absent. The core of this study is the development of a novel sensor fusion model that uses one image of the stereo pair to calculate attention weights for the other image, and combines the result with inertial data to improve odometry estimates. A set of ablation experiments on the KITTI dataset was conducted with different architectures and sensor fusion strategies to find the best setup. The results demonstrate the effectiveness of the proposed methods, particularly the use of early fusion techniques and attention mechanisms, which significantly enhance the accuracy of navigation paths relative to the ground truth. Furthermore, the Stereo Attention-based Visual Inertial Odometry model (SATVIO) was compared to state-of-the-art methods to demonstrate its performance. Despite limitations that restricted extensive training, the findings suggest that, with further optimization and extended training, SATVIO could match or surpass current state-of-the-art approaches in visual inertial odometry.
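
The idea of using one stereo image to weight the other, then combining the result with inertial data, can be illustrated with a minimal sketch. This is not the SATVIO architecture from the thesis: the function names (`cross_attention`, `fuse_with_imu`), the choice of the left image as the query side, the mean pooling, the omission of learned projections, and all numeric values are assumptions made purely for illustration.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(query_feats, context_feats):
    """Scaled dot-product attention between two sets of patch features.

    Queries come from one image, keys and values from the other.
    Learned projection matrices are omitted to keep the sketch short.
    """
    dim = len(query_feats[0])
    attended, weights = [], []
    for q in query_feats:
        # Similarity of this query patch to every patch of the other image
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim)
                  for k in context_feats]
        w = softmax(scores)
        weights.append(w)
        # Weighted sum of the other image's features
        attended.append([sum(wi * v[d] for wi, v in zip(w, context_feats))
                         for d in range(dim)])
    return attended, weights

def fuse_with_imu(attended, imu_window):
    """Mean-pool attended visual features and concatenate flattened IMU samples."""
    n = len(attended)
    pooled = [sum(col) / n for col in zip(*attended)]
    imu_flat = [x for sample in imu_window for x in sample]
    return pooled + imu_flat

# Toy inputs (hypothetical values): 3 left patches, 2 right patches, 2 IMU samples
left = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
right = [[1.0, 0.0], [0.0, 1.0]]
imu = [[0.0, 0.0, 9.8, 0.01, 0.0, 0.0], [0.1, 0.0, 9.8, 0.01, 0.0, 0.0]]

attended, weights = cross_attention(left, right)
fused = fuse_with_imu(attended, imu)
```

In a full model, the fused vector would typically be passed to a learned regressor that predicts the relative pose between frames; that stage is left out here.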

Place, publisher, year, edition, pages
2024. p. 43
Keywords [en]
Stereo Visual Inertial Odometry, Sensor Fusion, Deep Learning, Attention Mechanism
National Category
Computer Sciences; Computer and Information Sciences
Identifiers
URN: urn:nbn:se:mau:diva-69714
OAI: oai:DiVA.org:mau-69714
DiVA, id: diva2:1880957
Educational program
TS Computer Science: Applied Data Science
Supervisors
Examiners
Available from: 2024-07-25. Created: 2024-07-02. Last updated: 2024-07-25. Bibliographically approved

Open Access in DiVA

fulltext (11571 kB), 32 downloads
File information
File name: FULLTEXT01.pdf
File size: 11571 kB
Checksum (SHA-512):
6c33a16d8480519aad9a6cb105ca0fd2e53ca5a99464ab892779b03342d284e98eb84173e3ef53cfda979bb16cfd9da7ac6f36d452f9eb35be88f4d86c33f1bb
Type: fulltext
Mimetype: application/pdf

Author
Doorshi, Raoof