Αυτόματη απόδειξη μαθηματικών θεωρημάτων στη Lean 4 με χρήση μεγάλων γλωσσικών μοντέλων

Τζίμας, Βύρων

dc.contributor.advisor	Σωτηρόπουλος, Διονύσιος
dc.contributor.author	Τζίμας, Βύρων
dc.date.accessioned	2025-10-03T11:47:36Z
dc.date.available	2025-10-03T11:47:36Z
dc.date.issued	2025-08
dc.identifier.uri	https://dione.lib.unipi.gr/xmlui/handle/unipi/18162
dc.description.abstract	Η παρούσα διπλωματική εργασία εστιάζει στην ανάπτυξη, υλοποίηση και αξιολόγηση ενός συστήματος αυτόματης απόδειξης μαθηματικών θεωρημάτων στη γλώσσα Lean 4, αξιοποιώντας Μεγάλα Γλωσσικά Μοντέλα (Large Language Models – LLMs) σε συνδυασμό με εξειδικευμένες τεχνικές ανάκτησης (retriever) και ανακατάταξης (reranker) μαθηματικών προτάσεων. Η ερευνητική προσέγγιση που ακολουθήθηκε επιδιώκει να συνδυάσει τη γλωσσική ισχύ των LLMs με την αυστηρότητα και την ακρίβεια των τυπικών αποδεικτικών συστημάτων, ώστε να παραχθεί ένα πλαίσιο ικανό να επιλύει θεωρήματα με υψηλό βαθμό αυτοματοποίησης. Στο θεωρητικό μέρος της εργασίας παρουσιάζεται το επιστημονικό υπόβαθρο της αυτοματοποιημένης απόδειξης θεωρημάτων, αναλύονται οι κύριες κατηγορίες συστημάτων και εξετάζονται οι σύγχρονες ερευνητικές κατευθύνσεις που ενσωματώνουν βαθιά μάθηση στην αποδεικτική διαδικασία. Ιδιαίτερη έμφαση δίνεται στο σύστημα του Lean 4 και της βιβλιοθήκης Mathlib, καθώς και σε εργαλεία όπως το LeanDojo, τα οποία επιτρέπουν την αλληλεπίδραση των LLMs με το περιβάλλον απόδειξης και την επαλήθευση των παραγόμενων βημάτων. Στο πρακτικό μέρος περιγράφεται η κατασκευή ενός εξειδικευμένου corpus θεωρημάτων γραμμικής άλγεβρας, η εκπαίδευση του retriever για την ανάκτηση των πιο συναφών premises και η ανάπτυξη του reranker για την ανακατάταξή τους με βάση τη σημασιολογική ομοιότητα. Στη συνέχεια, ενσωματώνεται το LLM DeepSeek, το οποίο παράγει δυναμικά Lean τακτικές ακολουθώντας προσεκτικά διαμορφωμένα prompts. Η εκτέλεση και επαλήθευση των παραγόμενων τακτικών πραγματοποιείται μέσω του LeanDojo API, διασφαλίζοντας την εγκυρότητα και την αλληλουχία των αποδεικτικών βημάτων. Η αξιολόγηση του συστήματος πραγματοποιήθηκε μετρώντας το ποσοστό επιτυχίας της ολοκλήρωσης των μαθηματικών αποδείξεων και αναλύοντας την ποιότητα και αποτελεσματικότητα των παραγόμενων τακτικών. Τα αποτελέσματα καταδεικνύουν ότι η συνδυαστική προσέγγιση retriever–reranker–LLM βελτιώνει ουσιαστικά την απόδοση σε σχέση με μεθόδους που βασίζονται αποκλειστικά σε εκπαιδευμένα LLMs. Τέλος η εργασία αυτή συμβάλλει στην κατανόηση και στην πρακτική υλοποίηση μεθόδων που συνδυάζουν LLMs και συστήματα τυπικής επαλήθευσης, ανοίγοντας τον δρόμο για την ανάπτυξη πιο αποδοτικών, επεκτάσιμων και πλήρως αυτοματοποιημένων συστημάτων απόδειξης μαθηματικών θεωρημάτων στο μέλλον.	el
dc.format.extent	83	el
dc.language.iso	el	el
dc.publisher	Πανεπιστήμιο Πειραιώς	el
dc.rights	Αναφορά Δημιουργού - Μη Εμπορική Χρήση - Παρόμοια Διανομή 3.0 Ελλάδα	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/gr/	*
dc.title	Αυτόματη απόδειξη μαθηματικών θεωρημάτων στη Lean 4 με χρήση μεγάλων γλωσσικών μοντέλων	el
dc.title.alternative	Automated mathematical theorem proving in Lean 4 with large language models	el
dc.type	Master Thesis	el
dc.contributor.department	Σχολή Τεχνολογιών Πληροφορικής και Επικοινωνιών. Τμήμα Πληροφορικής	el
dc.description.abstractEN	This thesis focuses on the development, implementation, and evaluation of an automated mathematical theorem proving system in the Lean 4 language, leveraging Large Language Models (LLMs) in combination with specialized techniques for premise retrieval (retriever) and re-ranking (reranker). The proposed research approach aims to combine the linguistic capabilities of LLMs with the rigor and precision of formal proof systems, resulting in a framework capable of solving theorems with a high degree of automation. The theoretical part of the thesis presents the scientific background of automated theorem proving, analyzes the main categories of proof systems, and examines modern research directions that integrate deep learning into the proving process. Special emphasis is placed on the Lean 4 system and the Mathlib library, as well as on tools such as LeanDojo, which enable interaction between LLMs and the proof environment, facilitating the execution and verification of generated proof steps. In the practical part, the construction of a specialized corpus of linear algebra theorems is described, followed by the training of the retriever to identify the most relevant premises and the development of the reranker to reorder them based on semantic similarity. Subsequently, the DeepSeek LLM is integrated to dynamically generate Lean tactics using carefully crafted prompts. The execution and verification of the generated tactics are performed through the LeanDojo API, ensuring both the validity and coherence of the proof steps. The evaluation of the system involved measuring the success rate of theorem completions and analyzing the quality and efficiency of the generated tactics. The results demonstrate that the combined retriever–reranker–LLM approach significantly outperforms methods relying solely on pre-trained LLMs. Finally, this work contributes to the understanding and practical implementation of methods that integrate LLMs with formal verification systems, paving the way for more efficient, scalable, and fully automated mathematical theorem proving systems in the future.	el
dc.contributor.master	Προηγμένα Συστήματα Πληροφορικής - Ανάπτυξη Λογισμικού και Τεχνητής Νοημοσύνης	el
dc.subject.keyword	Lean 4	el
dc.subject.keyword	Mathlib	el
dc.subject.keyword	Αυτόματη απόδειξη θεωρημάτων	el
dc.subject.keyword	Μεγάλα γλωσσικά μοντέλα	el
dc.subject.keyword	LLMs	el
dc.subject.keyword	Retriever	el
dc.subject.keyword	Reranker	el
dc.subject.keyword	LeanDojo	el
dc.subject.keyword	Τεχνητή νοημοσύνη	el
dc.date.defense	2025-08

Files in this item

Name:: Thesis_Vyron_Tzimas_mpsp2351.pdf
Size:: 2.229Mb
Format:: PDF
Description:: Μεταπτυχιακή διατριβή

View/Open

This item appears in the following Collection(s)

Τμήμα Πληροφορικής
Department of Informatics

Show simple item record

Except where otherwise noted, this item's license is described as
Αναφορά Δημιουργού - Μη Εμπορική Χρήση - Παρόμοια Διανομή 3.0 Ελλάδα