Disclaimer
The postings on this site are my own and do not necessarily represent the opinions of CapTech Ventures, Inc.
Category Archives: Big Data
Ethics in Big Data Analytics
Here are the slides from my session yesterday on Ethics in Big Data Analytics for DAMA Philadelphia.
Posted in Big Data, Conceptual, ethics, Philosophy
Tagged Analytics, Big Data, ethics
Thou Shall Not Covet
Both my husband and my sister are huge fans of crime stories. On those rare family vacations, they like to binge-watch “ID”. Last week, while I was ironing his shirts, there was this crime story on TV in which … Continue reading
Primum Non Nocere
I want to discuss one of the lesser-known studies in the world. The study, named the “Mushroom Trial”, was spearheaded by my loving mother, and the subjects were my immediate family. As much as she loves to try … Continue reading
Posted in Big Data, Conceptual, ethics
Tagged bias, Big Data, discrimination, ethics
Presentations
Trying to pick out all my data science presentations and consolidate them here.
Posted in Big Data
EHC Use Case – Hadoop as a Service
Hadoop can handle extremely large, unstructured data sets efficiently and at affordable cost, which makes it a valuable technology for enterprises across a number of applications and fields. Market analysis forecasts that the market for Hadoop MapReduce will grow … Continue reading
Posted in Big Data, Cloud, Hadoop, Hybrid Cloud
Tagged Big Data Extensions, EMC Federation, EMC Hybrid Cloud, Hadoop as a Service, Pivotal, Pivotal HD, VMware
Cruising Data Lakes at Supersonic Speeds
Traditional workloads, or second platform workloads, for organizations go into file shares on NAS, HPC on SAN, or backup/archive workloads to tape. They typically work with the SMB, NFS, or FTP protocols. Emerging workloads like Hadoop, referred to as third platform, push … Continue reading
Self-Service Data Access – Pivotal DD
Enterprise data resides in heterogeneous systems and comes in different data types. IT struggles to consolidate data at the right time. Also, it is often difficult to know which data sources are required to access data. Pivotal DD … Continue reading
Prepping for Data Scientist Associate
I come from a content management background handling terabytes of content. The content lifecycle starts with capture/create, continues through versioning, managing, and publishing, and ends with archival and retention. Content falls through information rights, compliance, governance, and retention either at the organization level or … Continue reading
EMC Kazeon – Dark Data Explorer
Dark matter in astronomy and cosmology is a type of matter that hypothetically accounts for a large part of the total mass in the universe. It neither emits nor absorbs light or other electromagnetic radiation, so it cannot be observed … Continue reading