Disclaimer: The postings on this site are my own and do not necessarily represent the opinions of CapTech Ventures, Inc.
Category Archives: Hadoop
Recently I heard the phrase “moving content into Hadoop”. Although I did not question the motive further, I wondered seriously about “effective solutions” on Hadoop for day-to-day business problems. Hadoop is not a magic wand to wipe away all … Continue reading
I gave the Hadoop elephant an Indian makeover with red and gold, in acrylics on paper.
Hadoop has a poor out-of-the-box programming model. Applications often become spaghetti code in the form of scripts calling Hadoop command-line applications. Spring aims to simplify Hadoop applications by leveraging several Spring ecosystem projects. Spring for Apache … Continue reading
HAWQ is a modern distributed and parallel query processor on top of HDFS that gives enterprises the best of both worlds: high-performance query processing with SQL, and scalable open storage. When the data is directly stored on HDFS, it provides … Continue reading
Greenplum introduced its first Hadoop distribution, GPHD (Greenplum Hadoop Distribution), in 2011, removing the need to build out a Hadoop cluster from scratch. In February this year, Pivotal – Greenplum announced the first product, Pivotal HD, to expand the capabilities of … Continue reading
Pivotal HD is a full Apache Hadoop distribution with Pivotal add-ons and native integration with the Greenplum database, bringing together both NoSQL and SQL access layers to multi-structured data stored within the Pivotal HDFS. This distribution is the … Continue reading