Distributed Programming using Hadoop


Speaker


Abstract

In this workshop, Evert Lammers, consultant at SARA, the national high performance computing and e-Science Support Center, will give a short introduction to Hadoop, a software framework for distributed programming. The framework and computing cluster are freely available for researchers and students at universities, as their projects often require a lot of computational power. After a brief introduction to Hadoop, Evert Lammers will continue with a hands-on session for distributed programming and computing in Hadoop by analyzing a large data set. Some Java knowledge is preferred, as Hadoop is based on Java. Note that also MATLAB code can be run in a distributed way using Hadoop. Don't forget to bring your laptop!