Hadoop

Developing real-time applications through open source

Length: 17:44
July 13, 2014

IBM has a long and successful history with open source, from running Linux on IBM PCs to contributing initial codebase for Eclipse. We believe a mix of open source and closed source is the best way to drive adoption in the marketplace. Having the full support of a vendor like IBM can lower risk while open source can help achieve customer requirements. In this podcast, Mike Spicer, lead architect for InfoSphere Streams, talk about IBM’s latest contribute to open source—the new GitHub project from InfoSphere Streams.

Learn more about this project at github.com/ibmstreams. Also visit Streams Dev site for developers at developer.ibm.com/streamsdev

Managing big data with SQL-on-Hadoop

Length: 13:23
June 1, 2014

The relational database world has decades of research and experience optimizing SQL access and providing the needed capabilities for mission critical environments. Now, new SQL-on-Hadoop offerings are popping up, which is helping IT departments leverage their existing expertise to move into the big data world.

Jennifer McGinn, product marketing manager for IBM big data, described the benefits of SQL-on-Hadoop and talked about what IBM BigInsights with its integrated Big SQL have to offer.

Learn more about InfoSphere BigInsights and try out the product with a free download at ibm.co/quickstart

To learn more about the technical side of Big SQL, download the Technical White Paper – “SQL-on-Hadoop Without Compromise” from bit.ly/bigsqlhadoop

Follow Jennifer on Twitter @JenMcGinnChi, and follow podcast host David Pittman @TheSocialPitt

For more information about the IBM big data platform and products, visit www.ibm.com/bigdata. For more podcasts, blogs, videos, infographics and other resources, visit www.IBMBigDataHub.com.

Real-time analytics in the cloud

Length: 17:31
May 12, 2014

Cloud computing is gaining momentum, and for good reason. Some people consider it for overall IT cost savings, others are drawn to the promise of reduced capital expense and other people are looking to solve pressing IT issues. Kimberly Madia, product marketing manager for IBM big data, joined us to explain how cloud computing is enabling real-time analytics as a service, and what she sees developing in the area of “analytics everywhere.”

You can test out a free version of IBM InfoSphere Streams from ibm.co/streamqs and experiment with real-time analytics.

Follow Kimberly on Twitter @madiakc and follow our host, David Pittman, @TheSocialPitt.

For more information about the IBM big data platform and products, visit www.ibm.com/bigdataFor more podcasts, blogs, videos, infographics and other resources, visit www.IBMBigDataHub.com.

How in-Hadoop analytics are changing the game

Length: 7:28
February 24, 2014

Big Data without analytics is just data, but how do you perform the analytics? Christy Maver, product marketing manager for InfoSphere BigInsights, answers that question and gives examples of how in-Hadoop analytics are changing the game.

For more information about the IBM big data platform and products, visit www.ibm.com/bigdata.

For more podcasts, blogs, videos, infographics and other resources, visit www.IBMBigDataHub.com.

Data scientists: Hire an individual or team?

Length: 14:05
February 13, 2014

Most data science positions require a combination of technical and business skils. Gregory Piatetsky-Shapiro, the editor and publisher of KDnuggets.com, and a well-known expert in business analytics, data mining and data science, joined David Pittman to talk about that mix and the elusive “data scientist unicorn.” He also shared interesting findings from a poll he conducted on KDnuggets about what people look for when hiring data scientists. As his photo illustrates, he also demonstrated the evolution from data miners to data scientists.

For more information about the IBM big data platform and products, visit www.ibm.com/bigdata.

For more podcasts, blogs, videos, infographics and other resources, visit www.IBMBigDataHub.com.

The Important Difference Between Real-Time and Customer-Time

Length: 15:08
October 22, 2013

Technology vendors often tout their "real-time" products - but what does "real time" really mean? And is it what you need? Tom Deutsch compares real time to "customer time," which he says is the more meaningful measure: delivering the performance in the amount of time that it's needed and can be used. He also talks about in-memory solutions such as SAP Hana and describes the positives and negatives of in-memory systems.

Tom wrote about this topic in IBM Data Magazine; read that post here.

For more information about the IBM big data platform and products, visit www.ibm.com/bigdata.

For more podcasts, blogs, videos, infographics and other resources, visit www.IBMBigDataHub.com.

Debunking Big Data Myths

Length: 32:01
August 1, 2012

With so many conversations about big data, it is inevitable that there will be some mis-information and misunderstandings, some of which take on mythical proportions. James Kobielus, IBM big data evangelist, sets the record straight.