Why has IBM created its own distribution of Apache Hadoop and Apache Spark, and what makes it stand out from the competition? We asked Prasad Pandit, program director, product management, Hadoop and open analytics systems, at IBM to give us a tour of the reference architecture for IBM Open Platform
The combination of Jupyter Notebooks, Apache Hadoop and Apache Spark has become a killer app for data practitioners. It unlocks the ability to explore, visualize and experiment with both structured and unstructured data sets with great ease and efficiency. We spoke recently with Chris Snow at IBM
SparkOscope helps Apache Spark developers take advantage of the job-level information available through the existing Spark Web UI; minimizes source code pollution; and extends the Spark Web UI with a palette of system-level metrics about the server, virtual machine or container related to each
The inability of lines of business to not serve requests because they have to wait for IT provisioning can lead to a proliferation of analytics silos that can cause a loss of control of data. See how the next big stage of analytics with integrated Apache Spark helps organizations understand the
Data science seems to be experiencing a renaissance when it comes to advanced open source tools. Get a glimpse into creative application development with IPython Notebooks, Jupyter Notebooks, Apache Spark, the PixieDust open source library and more at IBM Insight at World of Watson 2016.
IBM extended Big SQL, which was formerly exclusive to the IBM Open Platform (IOP), to the Hortonworks Data Platform (HDP) in September 2016. I recently spoke with Berni Schiefer, an IBM fellow in the IBM Analytics group, to learn more about the offering and the ongoing IBM focus on SQL.
Historical application of vector mathematics and the study of unstructured text data can be an important approach to understanding and actualizing the value of data. See how mathematical exploration of text data can unearth insight that translates into enhanced decision making.
IBM Insight at World of Watson 2016, 24–27 October 2016, at Mandalay Bay in Las Vegas, Nevada, is the only place to be for people who work with data. Take a look at this list of top-ten reasons you wont’ want to miss out on one of the most intriguing and innovative events of the year.
Advances in tools and the capability to work with cloud-based data sets are dramatically changing the nature of data science workloads. Take a look at one data scientist’s quest to learn more about performing data science analysis in the cloud.
Nancy Hensley, director of offering management for IBM Analytics speaks with Rob Thomas, vice president of development for analytics, at IBM, on the subject of business transformation, leading to a discussion of the data maturity curve.
In this video, released at the IBM DataFirst launch event, discover how seamless integration of IBM Project DataWorks, along with existing Bluemix-based services, can help data scientists, business analysts, data engineers and application developers engage in collaborative work efforts.
Despite big data’s hype, a significant number of organizations are still in a holding pattern—either locked in planning, hesitant to get started or wanting to avoid Apache Hadoop and Apache Spark projects. Complexity and a shortage of skills can exacerbate the situation. Increasingly, organizations
The concluding week of September 2016 offered much excitement in New York City, the backdrop for Strata + Hadoop World 2016 and several key IBM announcements, including the launch of a cloud-based, self-service environment for data science teams. Enjoy some key highlights captured from this