Today's Question:  What does your personal desk look like?        GIVE A SHOUT

 ALL


  Hadoop or Spark: Which One is Better?

What is Hadoop?Hadoop is one of the widely used Apache-based frameworks for big data analysis. It allows distributed processing of large data set over the computer clusters. Its scalable feature leverages the power of one to thousands of system for computing and storage purpose. A complete Hadoop framework comprised of various modules such as:Hadoop Yet Another Resource Negotiator (YARNMapReduce (Distributed processing engine)Hadoop Distributed File System (HDFS)Hadoop CommonThese four modules lie in the heart of the core Hadoop framework. There are many more modules available over the interne...

2,446 0       COMPARISON HADOOP SPARK


  Top 5 Reasons Not to Use Hadoop for Analytics

As a former diehard fan of Hadoop, I LOVED the fact that you can work on up to Petabytes of data.  I loved the ability to scale to thousands of nodes to process a large computation job.  I loved the ability to store and load data in a very flexible format.  In many ways, I loved Hadoop, until I tried to deploy it for analytics.   That’s when I became disillusioned with Hadoop (it just "ain't all that").At Quantivo, we’ve explored many ways to deploy Hadoop to answer analytical queries (trust me – I made every attempt to include it in my day job).&n...

5,149 0       CLOUD COMPUTING HADOOP ANALYTICS