Tag Archives: Hadoop

Hadoop Ecosystem – A Quick Glance

What do Pig, Kangaroo, Eagle, and Phoenix have in common? Hadoop! We got some interesting technologies with curious names in Hadoop ecosystem. Azkaban is bloody wicked. H20 and Sparkling Water compete in the same space. Rethink, Couch, Dynamo, and Gemfire … Continue reading

Posted in Hadoop | Tagged | Leave a comment

Decision Matrix for Big Data Tools and Technologies

Image | Posted on by | Tagged , , , | Leave a comment

Introducing Pivotal HD 2.0

Pivotal HD 2.0 is a commercial distribution of Apache Hadoop 2.2. Along with that it brings an in-memory, SQL database to Hadoop through seamless integration, Pivotal GemFire XD – a SQL compliant, in-memory database designed for real-time analytics for Big … Continue reading

Posted in Hadoop Distribution | Tagged , , | Leave a comment

EMC Isilon and RainStor for Big Data Management

Big Data creates petabytes of data that organizations can readily mine to discover patterns and trends. Although Hadoop provides a comparatively inexpensive way to manage massive amounts of data, it is difficult to manage as the Hadoop cluster grows big. … Continue reading

Posted in Hadoop | Tagged , , | Leave a comment

Enterprise Infrastructure for Hadoop

Hadoop sandboxes rely on commodity hardware with direct attached storage (DAS). These implementations make it difficult to scale out on storage separately as Hadoop requires three or more copies of data residing within the internal drive of a server unit. … Continue reading

Posted in Hadoop | Tagged , | Leave a comment

Hosting Big Data

Rackspace recently introduced its new Big Data hosting options – customize your configuration for managing big data platform, run Hadoop on the public cloud, or configure your own private cloud. Rackspace eliminates the complex process of building and maintaining a … Continue reading

Posted in Hadoop | Tagged , | Leave a comment

Difference between MapReduce 1.0 and MapReduce 2.0

Apache Hadoop, introduced in 2005 has a core MapReduce processing engine to support distributed processing of large-scale data workloads. Several years later, there are major changes to the core MapReduce so that Hadoop framework not just supports MapReduce but other … Continue reading

Posted in Hadoop | Tagged | 2 Comments

Self-Service Data Access – Pivotal DD

Enterprise data resides in heterogeneous systems and of different data types. IT has its challenges to consolidate data in the right time. Also, many times it is difficult to know what data sources are required to access data. Pivotal DD … Continue reading

Posted in Big Data, Hadoop | Tagged , , | Leave a comment

Virtualizing Hadoop

HDFS, the “storage” and MapReduce, the “compute” are combined in traditional Hadoop model. If this Hadoop model is directly translated into a VM, it will affect the ability to scale up and down as the lifecycle of VM is tightly … Continue reading

Posted in Hadoop | Tagged , | Leave a comment


Recently I heard “moving content into Hadoop” – although I did not further question their motive, I was wondering seriously about “effective solutions” on Hadoop for the day-to-day business problems. Hadoop is not a magic wand to wipe away all … Continue reading

Posted in Conceptual, Hadoop | Tagged | Leave a comment