Συλλογή εργαστηριακών ασκήσεων στο Hadoop
Collection of Hadoop lab exercises
Keywords
Data management; Linux
Abstract
This dissertation is a collection of laboratory exercises intended to help students understand the main functions of Hadoop. Hadoop is a framework for faster and more efficient data management, an advantage that becomes most visible with big data. It can be used easily on the Linux operating system, and also on Windows, either from the command prompt or through an IDE such as Eclipse. In this work we first reviewed the necessary theoretical aspects of Hadoop and then turned to its practical implementation. We installed Hadoop on a single-node cluster and on a multi-node cluster, and finally solved well-known problems such as WordCount. The exercises are drawn from a pool of foreign institutions as well as from a domestic lab.