OPUS 4 | Search

Double Deep Reinforcement Learning (2023)

In many application areas, Deep Reinforcement Learning (DRL) has led to breakthroughs. In Curriculum Learning, the Machine Learning algorithm is not randomly presented with examples, but in a meaningful order of increasing difficulty. This has been used in many application areas to further improve the results of learning systems or to reduce their learning time. Such approaches range from learning plans created manually by domain experts to those created automatically. The automated creation of learning plans is one of the biggest challenges.In this work, we investigate an approach in which a trainer learns in parallel and analogously to the student to automatically create a learning plan for the student during this Double Deep Reinforcement Learning (DDRL). Three Reward functions, Friendly, Adversarial, and Dynamic based on the learner’s reward are compared. The domain for evaluation is kicking with variable distance, direction and relative ball position in the SimSpark simulated soccer environment.As a result, Statistic Curriculum Learning (SCL) performs better than a random curriculum with respect to training time and result quality. DDRL reaches a comparable quality as the baseline and outperforms it significantly in shorter trainings in the distance-direction subdomain reducing the number of required training cycles by almost 50%.

Learning Backswing Kicks with Deep Reinforcement Learning (2022)

Bohlinger, Nico ; Dorer, Klaus

The magmaOffenburg 2011 RoboCup 3D Simulation Team (2011)

Dorer, Klaus ; Glaser, Stefan ; Raffeiner, Simon ; Shahi, Rajit ; Schindler, Ingo

This paper describes the magmaOffenburg 3D simulation team trying to qualify for RoboCup 2011. While last year’s TDP focused on the tool set created for 3D simulation in this year we describe the further improvement in this tools as well as some new features we implemented focusing on heterogeneous robot models which seem to be used in RoboCup 2012. An additional tool was written to simply generate situation-dependent strategies. Furthermore some tools, described last year, are now integrated in one single GUI to easy things up.

The magmaOffenburg 2016 RoboCup 3D Simulation Team (2016)

Dorer, Klaus ; Fischer, Jens ; Glaser, Stefan ; Nguyen, Duy ; Obrecht, Michael ; Weiler, David

After having described many different aspects of our team software in previous years, in this paper we take the freedom to describe the magmaChallenge framework provided by the magmaOffenburg team. The framework is used as a benchmark tool to run different challenges like the running challenge in 2014 or the kick accuracy challenge in 2015. This description should serve as a documentation to simplify the maintenance by the community and to add new benchmarks in the future.

The magmaOffenburg 2023 RoboCup 3D Simulation Team (2023)

Biehl, Tobias ; Bohlinger, Nico ; Braun, Hannes ; Dorer, Klaus ; Glaser, Stefan ; Grommelt, Patrick ; Portugall, Markus ; Scholz, Jannes ; Weiss, Louis ; Wolffram, Maren

Team description papers of magmaOffenburg are incremental in the sense that each year we address a different topic of our team and the tools around our team. In this year’s team description paper we focus on the architecture of the software. It is a main factor for being able to keep the code maintainable even after 15 years of development. We also describe how we make sure that the code follows this architecture.

The magmaOffenburg 2012 RoboCup 3D Simulation Team (2012)

Dorer, Klaus ; Glaser, Stefan

This paper describes the magmaOffenburg 3D simulation team trying to qualify for RoboCup 2012. While last year’s TDP focused on the tool set created for 3D simulation and the support for heterogeneous robot models, this year we focus on the different ways how robot behavior can be defined in the magmaOffenburg framework and how those behaviors can be improved by learning.

An efficient orientation and center of mass based bipedal balancing engine (2012)

Dorer, Klaus ; Glaser, Stefan

The Sweaty 2014 RoboCup Humanoid Adult Size Team Description (2014)

Dietsche, Armin ; Dorer, Klaus ; Fehrenbach, Michael ; Frei, Waldemar ; Glaser, Stefan ; Hirtes, Sabine ; Hochberg, Ulrich ; Ismail, Ibrahim ; Jahn, Nils-Malte ; Kirn, Rudi ; Koger, Raphael ; Niederhofer, Matthias ; Sadeghi, Mehdi ; Scharffenberg, Manuel ; Tropmann, Igor ; Tziallas, Efstratios ; Waltersberger, Bernd ; Wülker, Michael ; Venkataramana, Sneha

This paper describes the new Sweaty humanoid adult size robot trying to qualify for the RoboCup 2014 adult size humanoid competition. The robot is built from scratch to eventually allow it to run. One characteristic is that to prevent the motors from overheating, water evaporation is used for cooling. The robot is literally sweating which has given it its name. Another characteristic is, that the motors are not directly connected to the frame but by means of beams. This allows a variable transmission ratio depending on the angle.

The Sweaty 2020 RoboCup Humanoid Extended Abstract (2020)

Schnekenburger, Fabian ; Glaser, Stefan ; Wülker, Michael ; Hochberg, Ulrich ; Dorer, Klaus

Vermeidung der Abhängigkeitsdivergenz zwischen Design und Implementierung in Java (2010)

Cornelis, Jens ; Dorer, Klaus

Die Einhaltung der innerhalb der Designphase festgelegten Architektur eines Softwareprojektes muss w ̈ahrend der Entwicklungsphase sichergestellt werden. Dieses Papier beschreibt eine Erweiterung des Eclipse-Plugins JDepend4Eclipse, die die Verwaltung von Regels ̈atzen erlaubt und die Pr ̈ufung auf in einem Projekt vorhandene, unerlaubte Abh ̈angigkeiten auf Knopfdruck innerhalb der Entwicklungsumgebung vornimmt. Die Erweiterung des Plugins wird bereits erfolgreich in internen Projekten der Hochschule Offenburg eingesetzt und soll demn ̈achst ̈offentlich verf ̈ugbar sein.

Human Walk with mixed Step- and Pathplanning (2014)

Dorer, Klaus ; Grossmann, Stefan

RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup (2023)

Bohlinger, Nico ; Dorer, Klaus

This paper presents the new Deep Reinforcement Learning (DRL) library RL-X and its application to the RoboCup Soccer Simulation 3D League and classic DRL benchmarks. RL-X provides a flexible and easy-to-extend codebase with self-contained single directory algorithms. Through the fast JAX-based implementations, RL-X can reach up to 4.5x speedups compared to well-known frameworks like Stable-Baselines3.

The magmaOffenburg 2013 RoboCup 3D Simulation Team (2013)

Dorer, Klaus ; Glaser, Stefan

This paper describes the magmaOffenburg 3D simulation team trying to qualify for RoboCup 2013. While last year’s TDP focused on different ways how robot behavior can be defined in the magmaOffenburg framework this year we focus on how we statistically evaluate new features on distributed systems. We also show some results gained through such analysis.

Teaching Practical Machine Learning Concepts to Professionals and Students: An Integrated and Interdisciplinary Qualification Project (2020)

Hagen, Tobias ; Lauer, Tobias ; Sänger, Volker ; Dorer, Klaus ; Trahasch, Stephan

Machine learning (ML) has become highly relevant in applications across all industries, and specialists in the field are sought urgently. As it is a highly interdisciplinary field, requiring knowledge in computer science, statistics and the relevant application domain, experts are hard to find. Large corporations can sweep the job market by offering high salaries, which makes the situation for small and medium enterprises (SME) even worse, as they usually lack the capacities both for attracting specialists and for qualifying their own personnel. In order to meet the enormous demand in ML specialists, universities now teach ML in specifically designed degree programs as well as within established programs in science and engineering. While the teaching almost always uses practical examples, these are somewhat artificial or outdated, as real data from real companies is usually not available. The approach reported in this contribution aims to tackle the above challenges in an integrated course, combining three independent aspects: first, teaching key ML concepts to graduate students from a variety of existing degree programs; second, qualifying working professionals from SME for ML; and third, applying ML to real-world problems faced by those SME. The course was carried out in two trial periods within a government-funded project at a university of applied sciences in south-west Germany. The region is dominated by SME many of which are world leaders in their industries. Participants were students from different graduate programs as well as working professionals from several SME based in the region. The first phase of the course (one semester) consists of the fundamental concepts of ML, such as exploratory data analysis, regression, classification, clustering, and deep learning. In this phase, student participants and working professionals were taught in separate tracks. Students attended regular classes and lab sessions (but were also given access to e-learning materials), whereas the professionals learned exclusively in a flipped classroom scenario: they were given access to e-learning units (video lectures and accompanying quizzes) for preparation, while face-to-face sessions were dominated by lab experiments applying the concepts. Prior to the start of the second phase, participating companies were invited to submit real-world problems that they wanted to solve with the help of ML. The second phase consisted of practical ML projects, each tackling one of the problems and worked on by a mixed team of both students and professionals for the period of one semester. The teams were self-organized in the ways they preferred to work (e.g. remote vs. face-to-face collaboration), but also coached by one of the teaching staff. In several plenary meetings, the teams reported on their status as well as challenges and solutions. In both periods, the course was monitored and extensive surveys were carried out. We report on the findings as well as the lessons learned. For instance, while the program was very well-received, professional participants wished for more detailed coverage of theoretical concepts. A challenge faced by several teams during the second phase was a dropout of student members due to upcoming exams in other subjects.

The magmaOffenburg 2017 RoboCup 3D Simulation Team (2017)

Baur, Martin ; Dorer, Klaus ; Fischer, Jens ; Nguyen, Duy ; Schmider, Carmen ; Weiler, David

In this TDP we describe a new tool created for testing the strategy layer of our soccer playing agents. It is a complete 2D simulator that simulates the games based on the decisions of 22 agents. With this tool, debugging the decision and strategy layer of our agents is much more efficient than before due to various interaction methods and complete control over the simulation. In the future, the tool could also serve as a measure to run simulations of game series much faster than with the 3D simulator. This way, the impact of different play strategies could be evaluated much faster than before.

The Sweaty 2019 RoboCup Humanoid Adult Size Team Description (2019)

Dorer, Klaus ; Hochberg, Ulrich ; Wülker, Michael

Sweaty has already participated four times in RoboCup soccer competitions (Adult Size) and came second three times. While 2016 Sweaty needed a lot of luck to be finalist, 2017 Sweaty was a serious adversary in the preliminary rounds. In 2018 Sweaty showed up in the final with some lack of experience and room for improvements, but not without any chance. This paper describes the intended improvements of the humanoid adult size robot Sweaty in order to qualify for the RoboCup 2019 adult size competition.

The magmaOffenburg 2019 RoboCup 3D Simulation Team (2019)

Dorer, Klaus ; Fischer, Jens ; Schmider, Carmen ; Weiler, David

Team description papers of magmaOffenburg are incremental in the sense that each year we address a different topic of our team and the tools around our team. In this year’s team description paper we address our team formation strategy including set plays and how it is tested using our simulator.

Learning a Walk Behavior Utilizing Toes From Scratch (2019)

Fischer, Jens ; Dorer, Klaus

The magmaOffenburg 2022 RoboCup 3D Simulation Team (2022)

Bohlinger, Nico ; Braun, Hannes ; Dorer, Klaus ; Ehlers, Lukas ; Huber, Danny ; Huber, Hannes ; Glaser, Stefan ; Schillings, Rico ; Scholz, Jannes ; Wolffram, Maren

Team description papers of magmaOffenburg are incremental in the sense that each year we address a different topic of our team and the tools around our team. In this year’s team description paper we address our approach to learn a model free kick with Nao toe using deep reinforcement learning.

The magmaOffenburg 2020 RoboCup 3D Simulation Team (2020)

Bohlinger, Nico ; Braun, Hannes ; Dorer, Klaus ; Fischer, Jens ; Schmider, Carmen ; Seiler, Jannik ; Weiler, David

Team description papers of magmaOffenburg are incremental in the sense that each year we address a different topic of our team and the tools around our team. In this year’s team description paper we address our approach to learn a model free walk with Nao toe using genetic algorithms.

Author(s)
Title
Additional Person(s)
Referee(s)
Abstract
Fulltext

Open Access

Refine

Author

Year of publication

Document Type

Conference Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

Open Access

59 search hits