Apache Hadoop Ecosystem Cheat Sheet

Reading Time: 6 minutes

Hadoop is a framework for running applications on large clusters built of commodity hardware. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. Apache Hadoop has been in development for nearly 15 years. The term “Hadoop” refers to the Hadoop ecosystem or collection of… Continue reading Apache Hadoop Ecosystem Cheat Sheet

Published
Categorized as learn Tagged

Choose the right Hadoop solution

Reading Time: 8 minutes

Hadoop ecosystem is open-source with plenty of add-on packages. This takes away the infrastructure and software management aspect of the implementation. Though this adds dependency on the Hadoop host. Commercial distributions enable businesses to enjoy the power of Hadoop minus all the headaches. The commercial element generally means you have to pay to get in… Continue reading Choose the right Hadoop solution

Published
Categorized as learn Tagged