HADOOP & CLOUD COMPUTING PRACTICAL QA
1. What is Hadoop ? Ans: Hadoop is the framework to process and analyze Big Data. Hadoop’s popularity speaks for itself. A huge volume of data is considered as Big Data. Apache Hadoop was born out as a solution to Big Data. Hadoop was created by Doug Cutting and Mike Cafarella. Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming model. 2. What is Big data ? Ans: Big data is a collection of data sets so large and complex that your legacy IT systems cannot handle them. Now, the question arises what is considered as huge? Many terabytes, petabytes, exabytes of data. Now, the other question is how we can decide the data is Big Data or not? How can we say the problem needs a Big Data solution or not? If the problem satisfies the three factors. The four factors are...