Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning a DFT-based sequence with reinforcement learning: A NAO implementation
University of Skövde, School of Humanities and Informatics. University of Skövde, The Informatics Research Centre. (Interaction Lab)
University of Skövde, School of Humanities and Informatics. University of Skövde, The Informatics Research Centre. (Interaction Lab)ORCID iD: 0000-0002-7236-997X
University of Skövde, School of Humanities and Informatics. University of Skövde, The Informatics Research Centre. (Interaction Lab)ORCID iD: 0000-0002-1525-0745
2012 (English)In: Paladyn - Journal of Behavioral Robotics, ISSN 2080-9778, E-ISSN 2081-4836, Vol. 3, no 4, p. 181-187Article in journal (Refereed) Published
Abstract [en]

The implementation of sequence learning in robotic platforms offers several challenges. Deciding when to stop one action and continue to the next requires a balance between stability of sensory information and, of course, the knowledge about what action is required next. The work presented here proposes a starting point for the successful execution and learning of dynamic sequences. Making use of the NAO humanoid platform we propose a mathematical model based on dynamic field theory and reinforcement learning methods for obtaining and performing a sequence of elementary motor behaviors. Results from the comparison of two reinforcement learning methods applied to sequence generation, for both simulation and implementation, are provided.

Place, publisher, year, edition, pages
Springer, 2012. Vol. 3, no 4, p. 181-187
Keywords [en]
sequences, neural dynamics, reinforcement learning, humanoid
National Category
Computer graphics and computer vision
Research subject
Interaction Lab (ILAB)
Identifiers
URN: urn:nbn:se:his:diva-10147DOI: 10.2478/s13230-013-0109-5Scopus ID: 2-s2.0-84920931610OAI: oai:DiVA.org:his-10147DiVA, id: diva2:759148
Note

CC BY-NC-ND 3.0

Available from: 2014-10-29 Created: 2014-10-29 Last updated: 2025-09-29Bibliographically approved

Open Access in DiVA

fulltext(1008 kB)565 downloads
File information
File name FULLTEXT01.pdfFile size 1008 kBChecksum SHA-512
c6204bf6f472f9f4053ac22fab2963af7598a0da9f0a0643955b4a092fd88024f86ce9c99204f418d198dec9baeac45a7b6131f2ba53bc725afd0324629bce2e
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Durán, BorisLee, GaussLowe, Robert

Search in DiVA

By author/editor
Durán, BorisLee, GaussLowe, Robert
By organisation
School of Humanities and InformaticsThe Informatics Research Centre
In the same journal
Paladyn - Journal of Behavioral Robotics
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
Total: 565 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 1831 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf