Disclaimer
The postings on this site are my own and do not necessarily represent the opinions of CapTech Ventures, Inc.
Category Archives: Hadoop
Hadoopable?
Recently I heard someone talk about “moving content into Hadoop”. Although I did not question the motive further, I seriously wondered about “effective solutions” on Hadoop for day-to-day business problems. Hadoop is not a magic wand to wipe away all … Continue reading
Hadoop Invades My Desk
I gave the Hadoop elephant an Indian makeover in red and gold, with acrylics on paper.
Spring for Apache Hadoop
Hadoop has a poor out-of-the-box programming model. Applications often become spaghetti code in the form of scripts calling Hadoop command-line applications. Spring aims to simplify Hadoop applications by leveraging several Spring ecosystem projects. Spring for Apache … Continue reading
HAWQ Soars Higher
HAWQ is a modern distributed and parallel query processor on top of HDFS that gives enterprises the best of both worlds: high-performance query processing with SQL, and scalable open storage. When the data is directly stored on HDFS, it provides … Continue reading
Delta of Hadoop Distributions
Greenplum introduced its first Hadoop distribution, GPHD (Greenplum Hadoop Distribution), in 2011, removing the need to build out a Hadoop cluster from scratch. In February this year, Pivotal – Greenplum announced the first product, Pivotal HD, to expand the capabilities of … Continue reading
Introduction to Pivotal HD
Pivotal HD is a full Apache Hadoop distribution with Pivotal add-ons and native integration with the Greenplum database, bringing together both NoSQL and SQL access layers to multi-structured data stored within the Pivotal HDFS. This distribution is the … Continue reading
Hadoop Install on Windows Server 2012
My installation notes for Cygwin and Hadoop on Windows Server 2012: https://github.com/mercyp/Hadoop
