Thursday, January 21, 2021
Techiexpert.com
No Result
View All Result
  • Login
  • Register
  • Home
  • Tech news
  • Startups
  • AI
  • IOT
  • Big Data
  • Cloud
  • Data Analytics
  • ML
  • Blogging
Techiexpert.com
No Result
View All Result

Why doesn’t Tableau perform as well on Hadoop? And how to make it faster

Brahmajeet Desai by Brahmajeet Desai
August 8, 2019
in Tech news
Reading Time: 3min read
A A
0
Why doesn’t Tableau perform as well on Hadoop? And how to make it faster
9
SHARES
130
VIEWS
Share on FacebookShare on Twitter

Tableau, one of the most widely used business intelligence tools, is designed to simplify analysis for business users by letting them interact with their data visually, using drag-and-drop operations, without the need to write complex code or SQL queries. Depending upon their requirements, users can create their dashboards and have real-time conversations with data that lives across multiple systems and platforms.

But what happens when the size of data rises to big data proportions? Do the users still have the same agility as they had when analyzing smaller datasets?

Tableau users often report slowdowns when they connect directly to Hadoop and try to perform complex analysis on large volumes of data.

Why Tableau slows down 

Tableau is designed to enable the Cycle of Visual Analysis: get data, view data, ask questions, get answers, repeat. This cycle becomes challenging on Hadoop. The main reason is that Tableau generates and executes a SQL statement for every interaction and on every visualization. As the size of data increases, direct queries from Tableau to Hadoop take very long to return – sometimes minutes and hours instead of seconds. Additionally, when the number of dimensions and cardinalities increase, it takes even longer to fetch query results.  

ADVERTISEMENT

From the user’s perspective, as the analysis deepens, they want to drill down, drill up, or drill across their data to get useful business insights. However, Tableau has to connect to Hadoop to fetch the results for each interaction. This leads to a lag in dashboard refreshes, and interactivity suffers. Though Tableau is the perfect tool to help business users visualize their data, it is not designed to handle big data analysis.

Optimize Tableau performance  

Analysts and data experts have tried several approaches to maximize the performance of Tableau on Hadoop. They put in massive amounts of effort to tune their queries. Instead of pulling detailed reports, they make do with summary reports. Sometimes they reduce the size of their datasets to smaller subsets and then run queries on them instead of running them on the entire dataset. These approaches are restrictive and cannot meet the analytical needs of a growing business.

Another popular method is to pull data out of the big data platform in the form of extracts, and then make Tableau work on it. However, there are several scalability and performance limitations on the amount of data that can be processed this way. Also, this approach is resource-heavy and introduces latency as the data is not live.

Over the years, several enhancements have been made to Tableau to speed up big data analytics such as adding query optimizations capabilities, providing named connectors for Hadoop and other big data platforms, and many more. However, it is still difficult to match the speed at which the size of the data is growing.  

Instant analytics on Hadoop   

An innovative way to improve the performance of Tableau on Hadoop is to create a BI acceleration layer between the big data platform and Tableau that enables quick access to big data. Once this layer is in place, Tableau can connect to this layer instead of connecting directly to the big data platform.  

This layer is built using OLAP on big data technology. In this approach, data is pre-aggregated using the processing and storage capacity of Hadoop. Once the OLAP cubes are ready, queries become incredibly light-weight, and response times are instant, even for the most complex queries. Tableau can connect to this layer using standard connectors. Another key advantage of this approach is that this layer is transparent to the end-users. They can continue using their Tableau interface for big data analytics and enjoy the same speed and interactivity as before, without limitations on the amount of data they can bring into their visualizations.

Conclusion

Tableau was initially developed to work on relational technologies. Therefore, it is unfair to expect it to deal with massive volumes of data on Hadoop. By creating a BI acceleration layer on Hadoop, you can quickly scale up and use Tableau’s compelling visualizations for analyzing big data with high performance and unlimited scalability.

Tags: BigData analyticsHadoopTableau
Share4Tweet2Share1Pin1
Brahmajeet Desai

Brahmajeet Desai

Related Posts

What are some of the new innovations in Telehealth?
Tech news

Getting Started With Telehealth Technology: Here’s What Therapists Need to Know

January 21, 2021
10 CMDB Tools That Will Revolutionize 2021
Tech news

10 CMDB Tools That Will Revolutionize 2021

January 21, 2021
What You Need for An Effective Home Office Setup
Tech news

8 Advantages Coworking Spaces Have Over Traditional Offices

January 21, 2021
Best books on Windows 10 for beginners
Tech news

Best books on Windows 10 for beginners

January 20, 2021
Identity Verification Service: A Counter-Fraud Technology For Online Businesses
Tech news

Top 6 Identity Verification Methods to Verify the Identity of Customer

January 20, 2021
Shift Your Conferences Into Webinars: The Dawn Of A New Era
Tech news

Boost Business Growth by Using Webinar Platforms

January 19, 2021

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

I agree to the Terms & Conditions and Privacy Policy.

Latest Stories

What are some of the new innovations in Telehealth?
Tech news

Getting Started With Telehealth Technology: Here’s What Therapists Need to Know

by Sony T
January 21, 2021
10 CMDB Tools That Will Revolutionize 2021
Tech news

10 CMDB Tools That Will Revolutionize 2021

by Sony T
January 21, 2021
What You Need for An Effective Home Office Setup
Tech news

8 Advantages Coworking Spaces Have Over Traditional Offices

by Sony T
January 21, 2021
Best books on Windows 10 for beginners
Tech news

Best books on Windows 10 for beginners

by Sony T
January 20, 2021
5 Areas of Your Business That Can Be Streamlined By Big Data
Big Data

5 Areas to Streamline Your Business with Big Data

by Srikanth
January 20, 2021
Load More
Techiexpert.com

© 2020 All Rights Reserved

  • Terms of use
  • Privacy Policy
  • About Us
  • Contact us
  • Write For Us
  • Cookie Policy

  • Login
  • Sign Up
No Result
View All Result
  • Home
  • Tech news
  • Startups
  • AI
  • IOT
  • Big Data
  • Cloud
  • Data Analytics
  • ML
  • Blogging

© 2020 All Rights Reserved

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms below to register

*By registering into our website, you agree to the Terms & Conditions and Privacy Policy.
All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.