Data Scientists

Collaborations and correlations in the common cause

October 16, 2014
I'm impressed with initiatives in the U.S. data scientist community to volunteer their time to worthy causes at home and abroad. Clearly, most of the data scientists who participate in communities such as New York-based DataKind have day jobs to pay the bills. But they see larger humanitarian causes (reuniting refugees, curing infectious diseases, feeding hungry populations and guaranteeing civil rights to the disenfranchised for example) that can benefit from the smartest data scientists applying their best efforts and most sophisticated tools to the task. To sustain the engagement of the data science community in these common causes, what's needed is for people and institutions to open source all of their decision-support assets: data, analytics, tools, platforms and, of course, expertise. Read More

Big data for social good

October 15, 2014
Agile development requires agile fingers, and big problems require smart minds working together to solve them. Big data has opened the door for everyone to collaborate, think bigger and drive change in the world. Accessible and open services make harnessing the power of big data for these changes a reality. Read More

Discrimination drives the need for ethics in big data

October 13, 2014
Big data and analytics are profoundly affecting the world around us. One of the focal points of my postings has been how big data and analytics affects, specifically, our personal privacy. An old and perhaps far too familiar twist on this has risen to the forefront of discussion and that is the issue of whether big data and analytics will be used to discriminate against the less fortunate (or perhaps even “the one percent”). Read More

Distributing data science brainpower more equitably among the haves and have-nots

October 9, 2014
Data scientists, like anybody else, tend to gravitate to where the jobs are, especially those that fetch higher salaries, offer the resources needed to achieve their dreams and promise more rewarding career paths. For that reason, larger employers with well-established, amply funded big data initiatives tend to have an advantage over smaller organizations when it comes to recruiting the best and brightest data scientists. In order to more equitably distribute data scientist expertise among the haves and have-nots, the requisite skills, tools and platforms need to become more widely available at low or no cost. Read More

Legos for grown-ups: Building with IBM Bluemix

October 8, 2014
In the world of big data and analytics, cloud capabilities offer a great opportunity and shorten the path between ideas and action. IBM’s new Bluemix platform for developers continues to grow, offering users new capabilities every day that they can attach, build and deploy for web and mobile app success. Read More

Who is the chief data officer?

Getting to know today’s hero of data and analytics

October 3, 2014
Chief data officers are leading the charge to transform their organizations to be data-driven and capitalize on the tremendous opportunities data and analytics are creating for data leaders. Hailing from a variety of technical and analytical backgrounds, CDOs have the unique opportunity to shape their new emerging role. Read More

Observations, transformations and conversations from the IBM Chief Data Officer Summit

October 1, 2014
Chief data officers and data professionals shared best practices and new insights about transforming the way their companies view data and how to maximize competitive edge in the marketplace today at the Summit last week. Read More

Why is analyzing text so hard?

September 30, 2014
How can we take documents, email or social media (and beyond) to show what they represent? Read More

Why is big data talked about so much?

September 29, 2014
Big data presents a tremendous opportunity to alter the ways we think and do business. Hadoop, analytics and other technologies will be front and center at this year’s Strata/Hadoop World event in NYC on October 15 through 17 and we will be there to discuss text analytics, Hadoop use cases, SQL-on-Hadoop, machine learning and much more. Will you join us? Read More

Data science's limitations in addressing global warming

September 25, 2014
Global climate data is massive, diverse and often internally inconsistent. Researchers who attempt to use data science to understand, predict and control global warming find themselves challenged by methodological limitations that frustrate their attempts to fathom this sprawling mosaic. Chief issues include historically thin sources, rampant auto-correlations and heterogeneous data provenance. Tackling global warming requires a harmonious balance between theory-driven domain science and data-driven statistical analysis. Read More

Pages