Blogs

How do you Hadoop?

Portfolio Marketing, Hadoop/BigInsights, IBM Analytics

Knowing what you’re trying to accomplish with big data says a lot about you. So who are you, and what are your big data requirements? Answer these five questions to learn more about what kind of Apache Hadoop user you probably are, and what you likely need from Hadoop to achieve successful business outcomes.

http://www.ibmbigdatahub.com/sites/default/files/quizstart.jpg

1. Which attribute best describes your approach to big data?

A  The freedom to choose with open source

B  The ability to know more about your data and enhance your decision making

C  The power to achieve the deepest possible insight on all data

D  All of the above

2. Which statement best describes your primary Hadoop optimization priority?

A  The need to empower business users without programming experience to conduct predictive analytics

B  The need to build quantifiable value from my organization’s data

C  The need to glean predictive insights and improve outcomes

D  All of the above

3. Which of these Hadoop benefits excites you most?

A  Scalability that allows for adding nodes as needed without requiring changes

B  Affordability that means realizing low cost/terabyte that is at a fraction of traditional options

C  Flexibility that provides for storing data without a schema and adding the data later as needed

D  All of the above

4. How do you imagine a Hadoop environment working for you?

A  As a platform for storing data that could not previously be stored efficiently

B  As a platform for new exploratory analytics applications

C  As a large-scale data ingestion and extract-transform-load (ETL) layer that complements existing databases

D  All of the above

5. Which Hadoop experience sounds most attractive?

A  An industry-compliant Hadoop distribution that gives you confidence and allows you to quickly begin collecting new data

B  A solution composed entirely of open source Hadoop-related projects

C  Hadoop packaging that provides critical capabilities for your enterprise Hadoop product ecosystem for data science, management, security and integration teams

D  All of the above

http://www.ibmbigdatahub.com/sites/default/files/a_networkengineer.jpg

How did you answer these questions? If you mostly chose option A, you’re most likely a network engineer. For you, Hadoop is a platform to support the business and its users—but that result is easier said than done. Network engineers are tasked with the weighty responsibility of allocating resources and optimizing workflows. To more rapidly scale a Hadoop platform for large numbers of users and clusters, network engineers can consider the IBM InfoSphere BigInsights Enterprise Management package with its enterprise features for parallel computing and application grid management. Another alternative is the IBM InfoSphere BigInsights cloud solution. That option allows for leaving the tough stuff for IBM to sort out while still getting all the workflow optimization that is needed.

http://www.ibmbigdatahub.com/sites/default/files/b_businessanalyst.jpg

If you mostly chose mostly B, you’re likely a business analyst. For you, Hadoop is a landing zone for data where you can quickly explore and analyze new data types—and deliver fast, highly informed business decisions. Getting it right gets the accolades, but you have to reach those conclusions quickly. Business analysts typically need to work with—and visualize—big data without the constraints of writing code. Consider BigInsights Data Analyst, which includes Big SQL and BigSheets. It fully supports query and visualization. Plus, it comes with sophisticated SQL support and a spreadsheet-style interface for exploring and visualizing big data.

http://www.ibmbigdatahub.com/sites/default/files/c_datascientist.jpg

If you chose mostly answer C, you’re likely a data scientist. For data scientists, big data analytics on Hadoop means the power to scrutinize huge volumes of data that conventional analytics and business intelligence solutions can’t touch. Detecting and unearthing both the micro and the macro takes powerful tools. As a result, IBM BigInsights Data Scientist has all the features that BigInsights Data Analyst has and also includes distributed R support for CRAN packages, machine learning, text and geospatial analytics.

http://www.ibmbigdatahub.com/sites/default/files/d_vip.jpg

Those who chose mostly answer D are quite likely seriously important people with a visionary understanding of the value of big data. This type of user along with colleagues who chose the other answers are determined to derive high-value insight from big data. Learn more about starting your Hadoop journey.