In many application areas, Deep Reinforcement Learning (DRL) has led to breakthroughs. In Curriculum Learning, the machine learning algorithm is not presented with examples in random order, but in a meaningful order of increasing difficulty. This has been used in many application areas to further improve the results of learning systems or to reduce their learning time. Such approaches range from learning plans created manually by domain experts to those created automatically. The automated creation of learning plans is one of the biggest challenges. In this work, we investigate an approach called Double Deep Reinforcement Learning (DDRL), in which a trainer learns in parallel and analogously to the student in order to automatically create a learning plan for the student. Three reward functions based on the learner's reward, Friendly, Adversarial, and Dynamic, are compared. The domain for evaluation is kicking with variable distance, direction, and relative ball position in the SimSpark simulated soccer environment. As a result, Statistic Curriculum Learning (SCL) performs better than a random curriculum with respect to training time and result quality. DDRL reaches a quality comparable to the baseline and significantly outperforms it in shorter trainings in the distance-direction subdomain, reducing the number of required training cycles by almost 50%.
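The three trainer reward schemes can be illustrated with a minimal sketch. The function names and formulas below are assumptions for illustration only, not the authors' implementation; the paper only states that the trainer's reward is based on the learner's reward.

```python
# Hypothetical sketch of the three trainer reward schemes (Friendly,
# Adversarial, Dynamic). All formulas here are illustrative assumptions.

def friendly_reward(student_reward: float) -> float:
    """Trainer is rewarded when the student succeeds."""
    return student_reward

def adversarial_reward(student_reward: float) -> float:
    """Trainer is rewarded when the student struggles,
    pushing the curriculum toward harder tasks."""
    return -student_reward

def dynamic_reward(student_reward: float, progress: float) -> float:
    """Blend friendly and adversarial behavior depending on the
    student's training progress (0.0 = start, 1.0 = end)."""
    return (1.0 - progress) * student_reward + progress * (-student_reward)
```

Under this sketch, a Dynamic trainer starts out friendly (easy tasks early) and becomes increasingly adversarial as training progresses.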
Team description papers of magmaOffenburg are incremental in the sense that each year we address a different aspect of our team and the tools around it. In this year's team description paper we focus on the architecture of the software, which is a main factor in keeping the code maintainable even after 15 years of development. We also describe how we ensure that the code follows this architecture.
This paper presents the new Deep Reinforcement Learning (DRL) library RL-X and its application to the RoboCup Soccer Simulation 3D League and classic DRL benchmarks. RL-X provides a flexible and easy-to-extend codebase with self-contained single directory algorithms. Through the fast JAX-based implementations, RL-X can reach up to 4.5x speedups compared to well-known frameworks like Stable-Baselines3.
Sweaty has already participated several times in RoboCup soccer competitions (Adult Size). The current work is focused on coordinating the play of two robots. Moreover, we are working on stabilizing the gait by adding additional sensor information. Ongoing work includes optimizing the control strategy by balancing between impedance and position control. By minimizing jerk, the gait and overall gameplay should improve significantly.
Sweaty has already participated several times in RoboCup soccer competitions (Adult Size). Now the work is focused on stabilizing the gait. Moreover, we would like to overcome the constraints of a ZMP algorithm that requires a horizontal footplate as a precondition for simplifying the equations. In addition, we would like to switch between impedance and position control with a fuzzy-like algorithm that might help to minimize jerks when Sweaty's feet touch the ground.
Due to the Covid-19 pandemic, the RoboCup WorldCup 2021 was held completely remotely. For this competition the Webots simulator (https://cyberbotics.com/) was used, so all teams needed to transfer their robots to the simulation. This paper describes our experiences during this process as well as a genetic learning approach that improved our walk engine to allow more stable and faster movement in the simulation. To scale the learning runs easily, we used a Docker setup. The resulting movement was one of the outstanding features that ultimately led to the championship title.
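The genetic learning approach can be sketched as a simple evolutionary loop over walk-engine parameters. Everything below is an illustrative assumption: the real fitness function would run a simulated walk in Webots (inside a Docker container) and score speed and stability, and the actual parameter set and GA settings are not stated in the abstract.

```python
# Minimal sketch of a genetic optimization loop for walk-engine parameters.
# The parameter set, fitness function, and GA settings are assumptions.
import random

def fitness(params):
    # Placeholder fitness: in practice this would launch a Webots simulation
    # and measure walk speed and stability. Here we score distance to a
    # hypothetical "good" parameter vector (e.g. step length/height/frequency).
    target = [0.3, 0.05, 1.2]
    return -sum((p - t) ** 2 for p, t in zip(params, target))

def mutate(params, sigma=0.05):
    # Gaussian mutation of every parameter.
    return [p + random.gauss(0, sigma) for p in params]

def evolve(pop_size=20, generations=50, n_params=3, seed=42):
    random.seed(seed)
    population = [[random.uniform(0, 2) for _ in range(n_params)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 4]  # elitist selection
        population = parents + [mutate(random.choice(parents))
                                for _ in range(pop_size - len(parents))]
    return max(population, key=fitness)

best = evolve()
```

Because each candidate is evaluated independently, the evaluations parallelize naturally, which is where a containerized (Docker) setup helps: many simulator instances can score candidates concurrently.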