Big Data University Surpasses 100,000 members!

Senior Big Data and Cloud Program Manager, IM Cloud Computing, IBM

This past August, Big Data University (BDU) reached a milestone: 100,000 registered users! From its humble beginnings in early 2011, we continue to see rapid growth in the number of registrations. As of the time this blog post was written, we are now at 118,500 users. The chart below says it all!


The success of our courses is largely due to the fact that most of them include hands-on exercises; so it's not just listening to theory, but actually getting your hands dirty! Hadoop Fundamentals I is by far our most popular course. This course was updated to version 2 this past July, and within 4 months 12,000 people registered! We are receiving excellent feedback about the courses, and this is not only attracting more users to the site, but also more volunteers and content contributors to create new courses. For example, we recently launched eight new courses thanks to new contributors:

Introduction to Data Analysis using R - This course teaches you how to perform data analysis using the R language. R is a powerful open source language that is ideal for analyzing both structured and unstructured data. The course focus is on performing analysis of unstructured data files. 

Using HBase for Real-time Access to your Big Data - This course introduces you to HBase, the open source Hadoop database used for random, real-time read/writes to your Big Data. The course will cover fundamental concepts of HBase, the basics of HBase client APIs, administrative and advanced features, and more! 

Moving Data into Hadoop - This course describes techniques for moving data into Hadoop. There are a variety of ways to get data into Hadoop from simple Hadoop shell commands to more sophisticated processes. Several techniques are presented, but two - Sqoop and Flume - are covered in greater detail. This course complements well the Hadoop Fundamentals I course.

Introduction to MapReduce Programming - This course explains the use of the mapper and reducer classes that make up a MapReduce application and where they get invoked in the application process. You are walked through the development of a MapReduce application using a development environment similar to Eclipse. This course is another great complement to the Hadoop Fundamentals I course.

Controlling Hadoop Jobs using Oozie - This course gives an overview of Oozie and how it is able to control Hadoop jobs. It begins with looking at the components required to code a workflow as well as optional components such as case statements, forks, and joins. That is followed by using the Oozie coordinator in order to schedule a workflow. 

Streams Developer Workshop - This course has been created in support of a live Webcast for developers about IBM InfoSphere Streams 3.2. The event will be held on November 14th, 2013 (10am - 6pm EST) and you can register by enrolling in this course. Webcast details and supporting materials is provided in the course. After the event, video recordings will be available.

Using Hive for Data Warehousing - Writing MapReduce programs to analyze your Big Data can get complex. Hive can help make querying your data much easier. Apache Hive, first created at Facebook, is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. This course will get you started so that you can use Hive for Data Warehousing tasks on your Big Data projects.

Iniciación a Big Data en español - Aplicaciones al mundo de los negocios - This course (in Spanish only) is first of a kind for BDU. Not only is it unique because it's one of two BDU courses in Spanish, but it is BDU's first fee-based course. BDU has no intention of making courses created by volunteers chargeable, but we do invite professionals or teachers to host their big data related courses in our site, and decide whether to charge a fee or not.

If you visited Big Data University in the past and have not looked at it in a while, I invite you to take a peek at its new look. Send us feedback through the Help button on the side of the page, and we will review it for future upgrades to the landing page. The Help button is also an excellent source of information for typical problems like "I forgot my user ID and password."

Next in our plan is to gamify some of our courses. We want to hide secret phrases in different courses that can only be unlocked after completing given activities. Once all phrases are found, and corresponding courses completed (by passing the course test), you can submit them to us and we will send you a prize!

Check out Big Data University today, and happy learning!