dc.contributor.advisor | Vouros, George | |
dc.contributor.advisor | Βούρος, Γεώργιος | |
dc.contributor.author | Koliou, Natalia | |
dc.contributor.author | Κολιού, Ναταλία | |
dc.date.accessioned | 2025-01-13T09:19:08Z | |
dc.date.available | 2025-01-13T09:19:08Z | |
dc.date.issued | 2024-12 | |
dc.identifier.uri | https://dione.lib.unipi.gr/xmlui/handle/unipi/17293 | |
dc.description | Not available until 09/01/2026 | |
dc.format.extent | 53 | el |
dc.language.iso | en | el |
dc.publisher | University of Piraeus | el |
dc.rights | Attribution-NonCommercial-NoDerivatives 3.0 Greece | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/gr/ | * |
dc.title | Ranking joint policies in dynamic games using evolutionary dynamics | el |
dc.type | Master Thesis | el |
dc.contributor.department | School of Information and Communication Technologies. Department of Digital Systems | el |
dc.description.abstractEN | Game-theoretic solution concepts, such as the Nash equilibrium, have been key to finding stable joint actions in multi-player games. However, it has been shown that the dynamics of agents’ interactions, even in simple two-player games with few strategies, are incapable of reaching Nash equilibria, exhibiting complex and unpredictable behavior. Instead, evolutionary approaches can describe the long-term persistence of strategies and filter out transient ones, accounting for the long-term dynamics of agents’ interactions. Our goal is to identify agents’ joint strategies that result in stable behavior, being resistant to changes, while also accounting for agents’ payoffs, in dynamic games. Towards this goal, we propose transforming dynamic games into their empirical forms by considering agents’ strategies instead of agents’ actions, and applying the evolutionary methodology α-Rank to evaluate and rank strategy profiles according to their long-term dynamics. This methodology not only allows us to identify joint strategies that are strong through agents’ long-term interactions, but also provides a descriptive, transparent framework regarding the high ranking of these strategies. Experiments report on agents that aim to collaboratively solve a stochastic version of the graph coloring problem. We consider different styles of play as strategies to define the empirical game, and train policies realizing these strategies, using the DQN algorithm. Then we run simulations to generate the payoff matrix required by α-Rank to rank joint strategies. | el |
dc.corporate.name | National Center of Scientific Research "Demokritos" | el |
dc.contributor.master | Artificial Intelligence | el |
dc.subject.keyword | Evolutionary dynamics | el |
dc.subject.keyword | Stochastic games | el |
dc.subject.keyword | Deep reinforcement learning | el |
dc.subject.keyword | Ranking joint policies | el |
dc.date.defense | 2024-12-23 | |