Högskolan i Skövde

his.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Utvärdering av autonom lager-navigering genom generella förstärknings- och imitationsinlärningsalgoritmer: En jämförande studie av PPO, BC och GAIL metoder för autonom lager-navigering
Högskolan i Skövde, Institutionen för informationsteknologi.
2025 (Svenska)Självständigt arbete på grundnivå (kandidatexamen), 20 poäng / 30 hpStudentuppsats (Examensarbete)Alternativ titel
Evaluation of Autonomous Warehouse Navigation through General Reinforcement and Imitation Learning Algorithms : A Comparative Study of PPO, BC and GAIL Methods for Autonomous Warehouse Navigation (Engelska)
Abstract [en]

As reinforcement learning (RL) algorithms advance and warehouse automation becomes increasingly important for efficient logistics operations, developing autonomous navigation for robots is a key interest. This study evaluates two machine-learning paradigms within a simulated warehouse environment. First, an RL algorithm called Proximal Policy Optimization (PPO) is evaluated against combined methods that are pre-trained via imitation learning (IL) algorithms and subsequently fine-tuned with PPO (IL + RL). Second, two IL algorithms called Behavioral Cloning (BC), and Generative Adversarial Imitation Learning (GAIL) are evaluated against each other to assess their stand alone and combined navigation performance. Together, these experiments show both the benefit of combining IL with RL fine-tuning versus standalone RL, and the comparative value of IL algorithms when used in dependently (BC versus GAIL) and combined (BC + GAIL) for robot navigation.The autonomous agent is controlled by a neural network, specifically a multi layer perceptron (MLP). Performance metrics, namely mean reward and sample efficiency are tracked at multiple training milestones. The results show that one method combining BC + PPO (IL +RL) consistently outperforms the PPO (RL) method, even with a low amount of demonstration data used. Also, for the standalone IL evaluations, it shows that BC performs overall better than GAIL for this given game-engine-based environment and MLP complexity. These findings give insight into the generalizability of PPO, GAIL, and BC algorithms outside of domain-specific simulators and the advantages and limitations of both standalone and sequential training methods in autonomous warehouse navigation.

Ort, förlag, år, upplaga, sidor
2025. , s. 44
Nyckelord [en]
Artificial Intelligence, Godot Agents, Reinforcement learning, Imitation learning, Autonomous navigation, Proximal Policy Optimization, Behavioral Cloning
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:his:diva-25569OAI: oai:DiVA.org:his-25569DiVA, id: diva2:1985389
Ämne / kurs
Informationsteknologi
Utbildningsprogram
Datavetenskap - inriktning systemutveckling, 180 hp
Handledare
Examinatorer
Tillgänglig från: 2025-07-24 Skapad: 2025-07-24 Senast uppdaterad: 2025-09-29Bibliografiskt granskad

Open Access i DiVA

fulltext(2218 kB)133 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 2218 kBChecksumma SHA-512
107cc267102339198c8953d5157609c57fb448f4d54fcc863408906c8006ad47d6eadd99a300b3cf9d3926b3b6ed5f838a13fbd035e170263c416ae95ac1c48c
Typ fulltextMimetyp application/pdf

Av organisationen
Institutionen för informationsteknologi
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 133 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 158 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf