Högskolan i Skövde

his.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Utvärdering av autonom lager-navigering genom generella förstärknings- och imitationsinlärningsalgoritmer: En jämförande studie av PPO, BC och GAIL metoder för autonom lager-navigering
Högskolan i Skövde, Institutionen för informationsteknologi.
2025 (svensk)Independent thesis Basic level (degree of Bachelor), 20 poäng / 30 hpOppgaveAlternativ tittel
Evaluation of Autonomous Warehouse Navigation through General Reinforcement and Imitation Learning Algorithms : A Comparative Study of PPO, BC and GAIL Methods for Autonomous Warehouse Navigation (engelsk)
Abstract [en]

As reinforcement learning (RL) algorithms advance and warehouse automation becomes increasingly important for efficient logistics operations, developing autonomous navigation for robots is a key interest. This study evaluates two machine-learning paradigms within a simulated warehouse environment. First, an RL algorithm called Proximal Policy Optimization (PPO) is evaluated against combined methods that are pre-trained via imitation learning (IL) algorithms and subsequently fine-tuned with PPO (IL + RL). Second, two IL algorithms called Behavioral Cloning (BC), and Generative Adversarial Imitation Learning (GAIL) are evaluated against each other to assess their stand alone and combined navigation performance. Together, these experiments show both the benefit of combining IL with RL fine-tuning versus standalone RL, and the comparative value of IL algorithms when used in dependently (BC versus GAIL) and combined (BC + GAIL) for robot navigation.The autonomous agent is controlled by a neural network, specifically a multi layer perceptron (MLP). Performance metrics, namely mean reward and sample efficiency are tracked at multiple training milestones. The results show that one method combining BC + PPO (IL +RL) consistently outperforms the PPO (RL) method, even with a low amount of demonstration data used. Also, for the standalone IL evaluations, it shows that BC performs overall better than GAIL for this given game-engine-based environment and MLP complexity. These findings give insight into the generalizability of PPO, GAIL, and BC algorithms outside of domain-specific simulators and the advantages and limitations of both standalone and sequential training methods in autonomous warehouse navigation.

sted, utgiver, år, opplag, sider
2025. , s. 44
Emneord [en]
Artificial Intelligence, Godot Agents, Reinforcement learning, Imitation learning, Autonomous navigation, Proximal Policy Optimization, Behavioral Cloning
HSV kategori
Identifikatorer
URN: urn:nbn:se:his:diva-25569OAI: oai:DiVA.org:his-25569DiVA, id: diva2:1985389
Fag / kurs
Informationsteknologi
Utdanningsprogram
Computer Science - Specialization in Systems Development
Veileder
Examiner
Tilgjengelig fra: 2025-07-24 Laget: 2025-07-24 Sist oppdatert: 2025-09-29bibliografisk kontrollert

Open Access i DiVA

fulltext(2218 kB)134 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 2218 kBChecksum SHA-512
107cc267102339198c8953d5157609c57fb448f4d54fcc863408906c8006ad47d6eadd99a300b3cf9d3926b3b6ed5f838a13fbd035e170263c416ae95ac1c48c
Type fulltextMimetype application/pdf

Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 134 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 159 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf