Cazzoli, Lorenzo
(2017)
Generalize policy on supporting user scenario.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Informatica [LM-DM270]
Documenti full-text disponibili:
Abstract
In this thesis we present a way of combining previously learned robot be- havior policies of different users. The main idea is to combine a set of policies, in tabular representation, into a final sub-optimal solution for the problem all users have contributed to. We assume that the features/differences of users are unknown and need to be extracted from the different policies generated from same user. This information is used to weight the importance of a set of actions to sum up two policies.
The proposed approach has been tested on a virtual environment finding out that the combined policy works as a general policy suitable for all users, as it always selects actions that are satisfying the users at the border of the defined sensorial possibilities.
All the assumptions has been finally verified on a real environment finding out all the limitations of the proposed model.
Abstract
In this thesis we present a way of combining previously learned robot be- havior policies of different users. The main idea is to combine a set of policies, in tabular representation, into a final sub-optimal solution for the problem all users have contributed to. We assume that the features/differences of users are unknown and need to be extracted from the different policies generated from same user. This information is used to weight the importance of a set of actions to sum up two policies.
The proposed approach has been tested on a virtual environment finding out that the combined policy works as a general policy suitable for all users, as it always selects actions that are satisfying the users at the border of the defined sensorial possibilities.
All the assumptions has been finally verified on a real environment finding out all the limitations of the proposed model.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Cazzoli, Lorenzo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
Curriculum C: Sistemi e reti
Ordinamento Cds
DM270
Parole chiave
Multimodal Interaction,Human-Robot-Interaction,Reanforcement Learning,Assistive robotics,Generalize Policy
Data di discussione della Tesi
20 Dicembre 2017
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Cazzoli, Lorenzo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
Curriculum C: Sistemi e reti
Ordinamento Cds
DM270
Parole chiave
Multimodal Interaction,Human-Robot-Interaction,Reanforcement Learning,Assistive robotics,Generalize Policy
Data di discussione della Tesi
20 Dicembre 2017
URI
Statistica sui download
Gestione del documento: