Hadoop and Open Source

Business process management is not at the center of an IoT strategy

October 21, 2014
An Internet of Things (IoT) strategy requires a whole lot more than a business process management (BPM) engine to succeed. Read More

Critical success factors in deploying Hadoop across industries

October 20, 2014
Big data (data from many sources, of varying formats, both structured and unstructured) means different things in different industries. But as different as their needs and usage of big data may be, there is one commonality among all industries: the opportunity to plumb big data for better, more informed perspectives on their customers, products, partners, competitors and strategies.  Read More

Collaborations and correlations in the common cause

October 16, 2014
I'm impressed with initiatives in the U.S. data scientist community to volunteer their time to worthy causes at home and abroad. Clearly, most of the data scientists who participate in communities such as New York-based DataKind have day jobs to pay the bills. But they see larger humanitarian causes (reuniting refugees, curing infectious diseases, feeding hungry populations and guaranteeing civil rights to the disenfranchised for example) that can benefit from the smartest data scientists applying their best efforts and most sophisticated tools to the task. To sustain the engagement of the data science community in these common causes, what's needed is for people and institutions to open source all of their decision-support assets: data, analytics, tools, platforms and, of course, expertise. Read More

UDF: Tiny acronym, big deal in big data

October 16, 2014
It’s easy to understand why big data is intimidating as the whole gang of JSON, SQL, YARN and UDFs (user-defined functions) invoke glazed eyes and thoughts of extraterrestrial technologies. So, even to the uninitiated, what is the big deal about UDFs? Read More

10 reasons to love IBM InfoSphere BigInsights for Hadoop

October 15, 2014
IBM InfoSphere BigInsights is an industry-standard Hadoop offering that combines the best of open source software with enterprise-grade features. Let’s talk about some key features of the distribution and the top ten reasons to love it— Read More

Big data for social good

October 15, 2014
Agile development requires agile fingers, and big problems require smart minds working together to solve them. Big data has opened the door for everyone to collaborate, think bigger and drive change in the world. Accessible and open services make harnessing the power of big data for these changes a reality. Read More

Discrimination drives the need for ethics in big data

October 13, 2014
Big data and analytics are profoundly affecting the world around us. One of the focal points of my postings has been how big data and analytics affects, specifically, our personal privacy. An old and perhaps far too familiar twist on this has risen to the forefront of discussion and that is the issue of whether big data and analytics will be used to discriminate against the less fortunate (or perhaps even “the one percent”). Read More

Legos for grown-ups: Building with IBM Bluemix

October 8, 2014
In the world of big data and analytics, cloud capabilities offer a great opportunity and shorten the path between ideas and action. IBM’s new Bluemix platform for developers continues to grow, offering users new capabilities every day that they can attach, build and deploy for web and mobile app success. Read More

Why is analyzing text so hard?

September 30, 2014
How can we take documents, email or social media (and beyond) to show what they represent? Read More

Why is big data talked about so much?

September 29, 2014
Big data presents a tremendous opportunity to alter the ways we think and do business. Hadoop, analytics and other technologies will be front and center at this year’s Strata/Hadoop World event in NYC on October 15 through 17 and we will be there to discuss text analytics, Hadoop use cases, SQL-on-Hadoop, machine learning and much more. Will you join us? Read More

Pages