| dc.contributor.advisor | Vouros, George | |
| dc.contributor.advisor | Βούρος, Γεώργιος | |
| dc.contributor.author | Kyriazopoulos, Christos | |
| dc.contributor.author | Κυριαζόπουλος, Χρήστος | |
| dc.date.accessioned | 2025-12-18T10:25:11Z | |
| dc.date.available | 2025-12-18T10:25:11Z | |
| dc.date.issued | 2025-12-17 | |
| dc.identifier.uri | https://dione.lib.unipi.gr/xmlui/handle/unipi/18721 | |
| dc.format.extent | 59 | el |
| dc.language.iso | en | el |
| dc.publisher | Πανεπιστήμιο Πειραιώς | el |
| dc.rights | Αναφορά Δημιουργού-Μη Εμπορική Χρήση-Όχι Παράγωγα Έργα 3.0 Ελλάδα | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/gr/ | * |
| dc.title | Diffusion models in offline reinforcement learning | el |
| dc.type | Master Thesis | el |
| dc.contributor.department | Σχολή Τεχνολογιών Πληροφορικής και Επικοινωνιών. Τμήμα Ψηφιακών Συστημάτων | el |
| dc.description.abstractEN | Offline reinforcement learning (RL) trains decision-making agents from fixed datasets, without interacting with the environment during training. This thesis investigates how diffusion models can be integrated into offline RL, leveraging their ability to capture complex, multimodal distributions and to generate action or trajectory sequences via iterative denoising. We study diffusion-based methods in a real-world airplane trajectory dataset, focusing on goal-reaching constraints and generalization across varying dataset complexity. Overall, the findings indicate that diffusion models can generate feasible trajectories while accommodating domain-specific constraints, supporting their role as a flexible and robust approach for offline RL and constrained planning. | el |
| dc.corporate.name | National Center of Scientific Research "Demokritos" | el |
| dc.contributor.master | Τεχνητή Νοημοσύνη - Artificial Intelligence | el |
| dc.subject.keyword | Offline Reinforcement Learning (Offline RL) | el |
| dc.subject.keyword | Diffusion Models | el |
| dc.subject.keyword | Trajectory generation | el |
| dc.subject.keyword | Diffusion policies | el |
| dc.subject.keyword | Behavior cloning | el |
| dc.date.defense | 2025-12-15 | |