This is the fourth in a series of blogs on analytics and the cloud. Read our introduction to the series. This blog concerns itself with the rise of open source software and how it is used for a whole host of analytical purposes. However, as will be seen in this blog, there are significant gaps in
Although NoSQL database technology has been around for a long time (longer than SQL, in fact), its popularity did not really take off until the advent of Web 2.0, when companies such as Google and Amazon began using it. Market Research Media forecasts the NoSQL market to be $3.4 billion by
This white paper discusses the advantages of using the PySpark API, which enables the use of Python to interact with the Spark programming model. It starts with a basic description of Spark and then describes PySpark, its benefits, and when it is appropriate to use instead of "pandas" open source
This is the second in a series of blogs on analytics and the cloud. We consider the rise of the Internet of Things (IoT), the analytics applied to IoT data, and how the cloud can be used to drive value out of instrumenting a very wide range of ‘things’.
This is the first in a series of blogs that looks at what is driving analytics onto the cloud, which challenges will need to be overcome over the next five years, and how they will be tackled.
In this white paper, discover how programmers and data scientists can use SparkR to transform R into a tool for big data analytics, taking advantage of parallel processing and near-linear scaling to tackle much larger challenges than would normally be possible with other methods.
Now that we’re into the swing of 2017, the time is ripe for the year’s first CrowdChat to explore the goals, challenges and strategies that CDOs and CIOs are focused on for their organizations. Get involved and share your thoughts in this kick-off IBM Big Data CrowdChat.
Learn how IBM SPSS Statistics can enhance the value that statistical analysis adds to a business, and find out how you can tap into the power of high-performance statistical modeling in your own organization.
Why has IBM created its own distribution of Apache Hadoop and Apache Spark, and what makes it stand out from the competition? We asked Prasad Pandit, program director, product management, Hadoop and open analytics systems, at IBM to give us a tour of the reference architecture for IBM Open Platform
From self-service analytics to the cloud, chief data officers (CDOs) had a wealth of information at their fingertips on the first day of IBM Insight at World of Watson 2016. Catch the high points of some of Monday’s most relevant sessions for CDOs in this quick recap.
Data science seems to be experiencing a renaissance when it comes to advanced open source tools. Get a glimpse into creative application development with IPython Notebooks, Jupyter Notebooks, Apache Spark, the PixieDust open source library and more at IBM Insight at World of Watson 2016.