Category Archives: Hadoop

Hadoop Ecosystem – A Quick Glance

Posted on April 11, 2017 by Mercy Beckham

What do Pig, Kangaroo, Eagle, and Phoenix have in common? Hadoop! We got some interesting technologies with curious names in Hadoop ecosystem. Azkaban is bloody wicked. H20 and Sparkling Water compete in the same space. Rethink, Couch, Dynamo, and Gemfire … Continue reading →

Posted in Hadoop | Tagged Hadoop | Leave a comment

Decision Matrix for Big Data Tools and Technologies

Posted on February 11, 2015 by Mercy Beckham

Posted in Big Data, Hadoop | Tagged Analytics, Big Data, Decision Matrix, Hadoop | Leave a comment

EHC Use Case – Hadoop as a Service

Posted on September 29, 2014 by Mercy Beckham

Hadoop can handle extremely large, unstructured data sets efficiently and at affordable cost, makes it a valuable technology for enterprises across a number of applications and fields. Market Analysis predicts that the market for Hadoop MapReduce is forecast to grow … Continue reading →

Posted in Big Data, Cloud, Hadoop, Hybrid Cloud | Tagged Big Data Extensions, EMC Federation, EMC Hybrid Cloud, Hadoop as a Service, Pivotal, Pivotal HD, VMware | Leave a comment

Cruising Data Lakes at Supersonic Speeds

Posted on July 16, 2014 by Mercy Beckham

Traditional workloads or second platform workloads for organizations go into File Shares on NAS, HPC on SAN, or Backup/Archive workloads to tape. They typically work with SMB, NFS, or FTP protocols. Emerging workloads like Hadoop referred as third platform pushes … Continue reading →

Posted in Big Data, Hadoop | Tagged 3rd Platform, Big Data, data lake, Isilon, OneFS | Leave a comment

EMC Isilon and RainStor for Big Data Management

Posted on March 4, 2014 by Mercy Beckham

Big Data creates petabytes of data that organizations can readily mine to discover patterns and trends. Although Hadoop provides a comparatively inexpensive way to manage massive amounts of data, it is difficult to manage as the Hadoop cluster grows big. … Continue reading →

Posted in Hadoop | Tagged Hadoop, Isilon, RainStor | Leave a comment

Enterprise Infrastructure for Hadoop

Posted on February 5, 2014 by Mercy Beckham

Hadoop sandboxes rely on commodity hardware with direct attached storage (DAS). These implementations make it difficult to scale out on storage separately as Hadoop requires three or more copies of data residing within the internal drive of a server unit. … Continue reading →

Posted in Hadoop | Tagged Hadoop, VCE | Leave a comment

Hosting Big Data

Posted on December 11, 2013 by Mercy Beckham

Rackspace recently introduced its new Big Data hosting options – customize your configuration for managing big data platform, run Hadoop on the public cloud, or configure your own private cloud. Rackspace eliminates the complex process of building and maintaining a … Continue reading →

Posted in Hadoop | Tagged Hadoop, Isilon | Leave a comment

Difference between MapReduce 1.0 and MapReduce 2.0

Posted on November 11, 2013 by Mercy Beckham

Apache Hadoop, introduced in 2005 has a core MapReduce processing engine to support distributed processing of large-scale data workloads. Several years later, there are major changes to the core MapReduce so that Hadoop framework not just supports MapReduce but other … Continue reading →

Posted in Hadoop | Tagged Hadoop | 2 Comments

Self-Service Data Access – Pivotal DD

Posted on November 1, 2013 by Mercy Beckham

Enterprise data resides in heterogeneous systems and of different data types. IT has its challenges to consolidate data in the right time. Also, many times it is difficult to know what data sources are required to access data. Pivotal DD … Continue reading →

Posted in Big Data, Hadoop | Tagged Big Data, Hadoop, Pivotal | Leave a comment

Virtualizing Hadoop

Posted on October 23, 2013 by Mercy Beckham

HDFS, the “storage” and MapReduce, the “compute” are combined in traditional Hadoop model. If this Hadoop model is directly translated into a VM, it will affect the ability to scale up and down as the lifecycle of VM is tightly … Continue reading →

Posted in Hadoop | Tagged Hadoop, Isilon | Leave a comment

Category Archives: Hadoop

Hadoop Ecosystem – A Quick Glance

Decision Matrix for Big Data Tools and Technologies

EHC Use Case – Hadoop as a Service

Cruising Data Lakes at Supersonic Speeds

EMC Isilon and RainStor for Big Data Management

Enterprise Infrastructure for Hadoop

Hosting Big Data

Difference between MapReduce 1.0 and MapReduce 2.0

Self-Service Data Access – Pivotal DD

Virtualizing Hadoop

BLOG RSS

Follow Blog via Email

Disclaimer

Recent Posts

Categories

Links

Archives

Meta