Seth Dobrin is vice president and CDO, IBM Analytics, platform development, at IBM. In this podcast, Dobrin shares experiences using Apache Spark for data science transformation and some thoughts on a larger vision for data science transformation at scale.
Steven Astorino is Vice President, Development, IBM Private Cloud Analytics Platform. In this podcast, he discusses how machine learning is driving the evolution of data science in strategic business initiatives.
In this white paper, discover how programmers and data scientists can use SparkR to transform R into a tool for big data analytics, taking advantage of parallel processing and near-linear scaling to tackle much larger challenges than would normally be possible with other methods.
Holden Karau is a software engineer at IBM, an active open source contributor and coauthor of Learning Spark (O'Reilly Media, February 2015) and the soon to be released High Performance Spark (O'Reilly Media, March 2017). In this podcast, Karau examines how to effectively search logs from Apache
Nick Pentreath is a principal engineer at IBM, a member of the Apache Spark project management committee (PMC) and author of Machine Learning with Spark (Packt Publishing, December 2014). In this podcast, Pentreath covers the basics of feature hashing and how to use it for all feature types in
Today’s businesses need a culture of collaboration that empowers knowledge workers to glean cognitive insights from data that help transform and modernize operations. See how cloud-based platforms and solutions enable data scientists and other experts to exploit artificial intelligence, machine
Technology is great on its own, but unless it’s doing something for your business and creating real value for the organization, it’s not really serving a point. IBM’s Kevin McIntyre talks about how the DataFirst Method helps the data professional stay focused on data.
Emily Curtin is a software engineer at The Weather Company (now IBM) working on the data engineering platform team. Robbie Strickland is vice president, engines and pipelines, IBM Watson Data Platform, at IBM. In this podcast, they give a technical overview of how Parquet works and how recent
This podcast discusses how embracing the concept of the new killer apps involves people and roles in an organization that need to coalesce around a rational set of business goals in working with data. New teams of people are forming that are data hungry.