The document discusses choosing data visualization tools for data scientists. It outlines a decision analysis process that identifies strategic objectives, data scientist profiles, product features, functional objectives, and measures. Alternatives are then scored against the measures and objectives. The top alternatives are Tableau and Plotly, which both enable data exploration but have different tradeoffs regarding data sources and programming languages. RShiny and Bokeh also score similarly but differ in design customization and data connectivity.
Labmatrix is a software application that manages the operational aspects of collaborative clinical and translational research programs, including patient recruiting, consenting, sample management (biobanking), experimental characterization of the samples and tracking of patient clinical profiles.
Qiagram is a collaborative visual data exploration environment that enables investigator-initiated, hypothesis-driven data exploration, allowing business users as well as IT professionals to easily ask complex questions against complex data sets.
Tekslate.com is the Industry leader in providing Informatica Data Quality Training across the globe. Our online training methodology focus on hands on experience of Informatica Data Quality.
Labmatrix is a software application that manages the operational aspects of collaborative clinical and translational research programs, including patient recruiting, consenting, sample management (biobanking), experimental characterization of the samples and tracking of patient clinical profiles.
Qiagram is a collaborative visual data exploration environment that enables investigator-initiated, hypothesis-driven data exploration, allowing business users as well as IT professionals to easily ask complex questions against complex data sets.
Tekslate.com is the Industry leader in providing Informatica Data Quality Training across the globe. Our online training methodology focus on hands on experience of Informatica Data Quality.
Tableau’s predictive modeling feature allows users to leverage powerful statistical models to build and update predictive models efficiently while giving them the flexibility to select their predictors, collaborate on the model results within other table calculations, and comprehend and examine a large volume of data. Go through this presentation to discover how Tableau’s predictive modeling feature allows users to leverage powerful statistical models to build and update predictive models efficiently.
International Journal of Computational Engineering Research(IJCER) is an intentional online Journal in English monthly publishing journal. This Journal publish original research work that contributes significantly to further the scientific knowledge in engineering and Technology
Dallas datascienceconference jasongeng-v3Haoran Du
Jason Geng's presentation at Dallas Data Science Conference 2017 (www.dsassn.org/dallas)
Research of Data Science Project Lifecycle, Skillsets and Gaps between the industry and curriculums provided by universities.
Choosing the right software for your research study : an overview of leading ...Merlien Institute
Choosing the right software for your research study : an overview of leading CAQDAS packages by Christina Silver. This presentation is part of the proceedings of the International workshop on Computer-Aided Qualitative Research organised by Merlien Institute. This workshop was held on the 4-5 June in Utrecht, The Netherlands
This presentation "Tableau interview questions and answers" will help you to get prepared for Tableau job interviews. Tableau has become a mission-critical data visualization tool that helps people quickly understand data. The usefulness and popularity of Tableau make it a necessary skill for anyone working with data. As a reflection of the growing importance of data and tools for understanding it, the number of jobs requiring Tableau skills has increased dramatically since 2014.If you’re moving into the field of data analytics or you’re moving up the ladder and need Tableau skills, you’ll probably be interviewing for a job someday soon. We’re here to help, with the key Tableau job interview questions along with their best answers for you to think about ahead of time.
Some of the Tableau interview questions discussed in this presentation are mentioned below. Click on the time stamps to directly jump to that particular question.
1. What are the datatypes supported in Tableau?
2. What do you understand by dimensions and measures?
3. What do you understand by Discrete and Continuous in Tableau?
4. What are filters? Name the different filters in Tableau.
5. There are three customer segments in the Superstore dataset. What percent of the total profits is associated with the Corporate segment?
6. What are the different joins in Tableau? Give example
7. What is the difference between Join and Blending?
8. What is the difference b/w Live and Extract?
9. What is a Calculated Field? How will you create one?
10. How can you display top five and last five sales in the same view ?
11. Is there any difference between Sets and Groups, in Tableau?
12. What is a Parameter in Tableau? Give an example.
13. What is the difference between Tree maps and Heat maps?
14. What is the difference b/w .twbx and .twb?
15. Explain the difference b/w Tableau worksheet, dashboard, story, and workbook?
16. What do you understand by Blended Axis?
17. What is the use of dual axis? How do you create one?
18. What will the following function return? - Left(3, “Tableau”)
19. How do you handle Null and other special values?
20. Find the top product subcategories by Sales within each delivery method. Which sub-category is ranked #2 for first class ship mode?
21. Find the customer with the lowest overall profit. What is his/her profit ratio?
22. What is the Rank function in Tableau?
23. How can you embed a webpage in a dashboard?
24. Design a view to show region wise profit and sales?
25. How can you optimize the performance of a dashboard?
26. Which visualization will be used in the given scenarios:
27. What will you do if some country/province (any geographical entity) is missing and displaying a null when you use map view?
28. What is LOD expression?
29. How can you calculate daily profit measure using LOD?
30. How can you schedule a workbook in Tableau after publishing it?
Learn more at: https://www.simplilearn.com/
What is the Best Data Visualization Tool: Power BI or Tableau?Digital Dialogue
Business Intelligence (BI) is a vital field of computer science that involves a range of processes and technologies aimed at gathering, storing, analyzing, and providing access to data to enhance business decision-making. Read more here!
Agile Testing Days 2017 Intoducing AgileBI Sustainably - ExcercisesRaphael Branger
"We now do Agile BI too” is often heard in todays BI community. But can you really "create" agile in Business Intelligence projects? This presentation shows that Agile BI doesn't necessarily start with the introduction of an iterative project approach. An organisation is well advised to establish first the necessary foundations in regards to organisation, business and technology in order to become capable of an iterative, incremental project approach in the BI domain.
In this session you learn which building blocks you need to consider. In addition you will see what a meaningful sequence to these building blocks is. Selected aspects like test automation, BI specific design patterns as well as the Disciplined Agile Framework will be explained in more and practical details.
Tableau’s predictive modeling feature allows users to leverage powerful statistical models to build and update predictive models efficiently while giving them the flexibility to select their predictors, collaborate on the model results within other table calculations, and comprehend and examine a large volume of data. Go through this presentation to discover how Tableau’s predictive modeling feature allows users to leverage powerful statistical models to build and update predictive models efficiently.
International Journal of Computational Engineering Research(IJCER) is an intentional online Journal in English monthly publishing journal. This Journal publish original research work that contributes significantly to further the scientific knowledge in engineering and Technology
Dallas datascienceconference jasongeng-v3Haoran Du
Jason Geng's presentation at Dallas Data Science Conference 2017 (www.dsassn.org/dallas)
Research of Data Science Project Lifecycle, Skillsets and Gaps between the industry and curriculums provided by universities.
Choosing the right software for your research study : an overview of leading ...Merlien Institute
Choosing the right software for your research study : an overview of leading CAQDAS packages by Christina Silver. This presentation is part of the proceedings of the International workshop on Computer-Aided Qualitative Research organised by Merlien Institute. This workshop was held on the 4-5 June in Utrecht, The Netherlands
This presentation "Tableau interview questions and answers" will help you to get prepared for Tableau job interviews. Tableau has become a mission-critical data visualization tool that helps people quickly understand data. The usefulness and popularity of Tableau make it a necessary skill for anyone working with data. As a reflection of the growing importance of data and tools for understanding it, the number of jobs requiring Tableau skills has increased dramatically since 2014.If you’re moving into the field of data analytics or you’re moving up the ladder and need Tableau skills, you’ll probably be interviewing for a job someday soon. We’re here to help, with the key Tableau job interview questions along with their best answers for you to think about ahead of time.
Some of the Tableau interview questions discussed in this presentation are mentioned below. Click on the time stamps to directly jump to that particular question.
1. What are the datatypes supported in Tableau?
2. What do you understand by dimensions and measures?
3. What do you understand by Discrete and Continuous in Tableau?
4. What are filters? Name the different filters in Tableau.
5. There are three customer segments in the Superstore dataset. What percent of the total profits is associated with the Corporate segment?
6. What are the different joins in Tableau? Give example
7. What is the difference between Join and Blending?
8. What is the difference b/w Live and Extract?
9. What is a Calculated Field? How will you create one?
10. How can you display top five and last five sales in the same view ?
11. Is there any difference between Sets and Groups, in Tableau?
12. What is a Parameter in Tableau? Give an example.
13. What is the difference between Tree maps and Heat maps?
14. What is the difference b/w .twbx and .twb?
15. Explain the difference b/w Tableau worksheet, dashboard, story, and workbook?
16. What do you understand by Blended Axis?
17. What is the use of dual axis? How do you create one?
18. What will the following function return? - Left(3, “Tableau”)
19. How do you handle Null and other special values?
20. Find the top product subcategories by Sales within each delivery method. Which sub-category is ranked #2 for first class ship mode?
21. Find the customer with the lowest overall profit. What is his/her profit ratio?
22. What is the Rank function in Tableau?
23. How can you embed a webpage in a dashboard?
24. Design a view to show region wise profit and sales?
25. How can you optimize the performance of a dashboard?
26. Which visualization will be used in the given scenarios:
27. What will you do if some country/province (any geographical entity) is missing and displaying a null when you use map view?
28. What is LOD expression?
29. How can you calculate daily profit measure using LOD?
30. How can you schedule a workbook in Tableau after publishing it?
Learn more at: https://www.simplilearn.com/
What is the Best Data Visualization Tool: Power BI or Tableau?Digital Dialogue
Business Intelligence (BI) is a vital field of computer science that involves a range of processes and technologies aimed at gathering, storing, analyzing, and providing access to data to enhance business decision-making. Read more here!
Agile Testing Days 2017 Intoducing AgileBI Sustainably - ExcercisesRaphael Branger
"We now do Agile BI too” is often heard in todays BI community. But can you really "create" agile in Business Intelligence projects? This presentation shows that Agile BI doesn't necessarily start with the introduction of an iterative project approach. An organisation is well advised to establish first the necessary foundations in regards to organisation, business and technology in order to become capable of an iterative, incremental project approach in the BI domain.
In this session you learn which building blocks you need to consider. In addition you will see what a meaningful sequence to these building blocks is. Selected aspects like test automation, BI specific design patterns as well as the Disciplined Agile Framework will be explained in more and practical details.
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
For many IT experts, big data analytics tools and technologies are now a top priority. Let's find out the top big data analytics tools in this slide to initialize and advance the process of big data analysis.
How Can Business Analytics Dashboard Help Data Analysts.pdfGrow
The Business analytics dashboard provides a centralized platform that empowers data analysts to explore, analyze, and visualize data more efficiently and intuitively, ultimately driving better decision-making. You can discover how these intuitive Business Intelligence dashboard tools enhance every aspect of data analysis and streamline a data analyst's job. Revolutionize how data analysts function and make the data analysis journey successful by visiting Grow.com.
Rafael Oliveira Bitcoin diz que, na prática, os cientistas de dados trabalham em uma variedade de tarefas, como limpeza de dados, exploração de dados, engenharia de recursos, modelagem e avaliação.
Slide notes for "The Rise of Self-service Business Intelligence"skewdlogix
These notes are for “The Rise of Self-service Business Intelligence” at the second annual Data Management Conference (DMC) – Canada in Edmonton, AB – DAMA Edmonton – September 2016.
Intro of Key Features of SoftCAAT BI Softwarerafeq
This presentation provides a brief overview of SoftCAAT BI with use cases. SoftCAAT BI is a Data Analytics/BI/MIS software specially designed for performing analytics in the assignments of Assurance, Compliance, Consulting and Fraud Investigations.
The Rise of AI and Machine Learning in Power BI (1).pdfSparity1
Learn about the rise of AI and ML in Power BI and how AI-powered business Intelligence tools can improve data analysis, the new additions of Copilot and Microsoft Fabric that leverages AI to revolutionize data analysis and reporting in Power BI.
In today's data-driven world, understanding business analytics is essential for success. upGrad Campus offers top-notch online education in business analytics. Our expert-led courses provide hands-on experience and practical knowledge, ensuring you're equipped with the skills needed to excel in the field. Ready to unlock new opportunities with business analytics? Join upGrad Campus and take the first step towards a brighter future!
ow Do Data Analysis Tools Make Data Preparation Easier?Grow
Explore how data analysis tools simplify data preparation, automating cleansing, integration, and transformation tasks. Discover how advanced BI software streamlines data import, cleaning, and consolidation, saving time and improving accuracy in data analysis. For more information, visit Grow.com
Unifying the big data analytics stack by enabling ETL, OLAP, visualization, and collaboration via a single interface. Get an End To End implementation of The Modern Analytics Architecture.
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxAPTRON Solutions Noida
In a world overflowing with data, the ability to extract meaningful information is a valuable skill. Data Analytics Training in Noida at APTRON Solutions is your gateway to a rewarding career in this ever-evolving field. Our commitment to excellence, practical approach, and industry connections make us the ideal choice for aspiring data analysts in Noida. Join us today and embark on a journey towards becoming a proficient data analyst ready to tackle the challenges of tomorrow's data-driven world. Your future in data analytics starts here at APTRON Solutions Noida!
https://t.ly/_xoaj
Exploring the impact and evolution of Advanced Analytics Tools.pdfStats Statswork
The impact and evolution of advanced analytics tools have transformed how businesses operate, offering unprecedented insights and decision-making capabilities. Statstwork has been at the forefront of this evolution, providing cutting-edge solutions that leverage big data, machine learning, and AI. These tools enable companies to analyze vast amounts of data in real-time, identify trends, and predict future outcomes with high accuracy. As a result, businesses can optimize their operations, enhance customer experiences, and drive innovation. The continuous advancement of these tools promises even greater efficiencies and opportunities, making them indispensable in the modern data-driven landscape.
For more information contact:
https://www.statswork.com
& https://www.statswork.com/contact-us/
Contact our Experts:
Our Email id: info@statswork.com
Contact No: +91 8754467066
Exploring the impact and evolution of Advanced Analytics Tools.pdfStats Statswork
The impact and evolution of advanced analytics tools have transformed how businesses operate, offering unprecedented insights and decision-making capabilities. Statstwork has been at the forefront of this evolution, providing cutting-edge solutions that leverage big data, machine learning, and AI. These tools enable companies to analyze vast amounts of data in real-time, identify trends, and predict future outcomes with high accuracy. As a result, businesses can optimize their operations, enhance customer experiences, and drive innovation. The continuous advancement of these tools promises even greater efficiencies and opportunities, making them indispensable in the modern data-driven landscape.
For more information contact:
https://www.statswork.com
& https://www.statswork.com/contact-us/
Contact our Experts:
Our Email id: info@statswork.com
Contact No: +91 8754467066
Choosing a Data Visualization Tool for Data Scientists Report
1. 1 | P a g e
Choosing Data Visualization Tools for Data Scientists
By: Heather R. Gilley
Introduction
Part of becomingan operational businessintelligence(BI) office isbeingable tocommunicatethe keyinsightsderived
fromdata acquisitionandanalysis. Toeffectivelycommunicatethese insights,the rightdatascientist,product,andtool
needtobe pairedtogetherforthe task.Currently,the BIoffice isstaffedwithdatascientistswhoregularlyreceive
requestsfordatavisualizationssuchasreports,dashboards,andanalytical updates.The challengetheyface nowisto
choose the righttool or tools.Usingthe approachoutlinedbelow,adecisionanalysiswasconducted todeterminehow
each data visualizationtool alternative scoredagainstthe objectives.
Method
1. Identifystrategicobjectivesforchoosingadatavisualizationtool byelicitingthe decisionmakerandreferencing
keydocuments
2. Developdatascientist profilestoidentifynecessarytool featuresthatsupportthe variousskillsetsassociated
withdata scientists
3. Identifythe productfeaturesthatneedtobe supportedbythe datavisualizationtool
4. Alignproductfeaturesanddatascientistskill setstothe functionalobjectives
5. Constructperformance measures thataccuratelygauge the functional objectives
6. Determine the importanceof eachobjective andmeasure
7. Analyze resultsforalternativesandconductsensitivityanalysisof the measures
To choose the right data visualizationtool,datascientistskillsandproductfeaturesneedtobe identifiedand
incorporatedintothe model. Datascientistshave varyingskillsetsanddifferentproductsshare keyfeaturesthatthe
data visualizationtool mustbe able tomanage.These skillsandfeatures are developedintothe decisionanalytical
model aspart of the requirements.Todevelopthe functional objectivesandmeasures,keystrategicdocuments, job
positionrequirements, marketresearch,anddiscussionswiththe decisionmaker were usedtoidentifydata
visualizationtool featuresthataligntothe data scientistprofilesandproduct features.i
2. 2 | P a g e
Key Product Features
The businessintelligence office oftenistaskedwithcreating datavisualizations tocommunicate analytical results.These
data visualizationproductsfluctuatedependingonthe customer,function,andrequirements,buteachof these
productsfall somewherewithinthe spectrumof interactive tostaticandexplanatorytoexploratory. Aninteractive
productoffersthe audience the chance toviewindividual piecesof dataon the chart and filter/sorttheirviewtofind
more insights,whileastaticproducthas a single messagebeingconveyedinone image.Anexplanatoryproduct
providesthe audience withastorythat leadsthe userto the final results,while anexploratorydatavisualizations
provide the audience withaproductthatis meantto be analyzedformultiplestorylines. The followinggraphillustrates
howsome of the mostcommonlyrequestedproductsfallonthe spectrum.
Explanatory
Interactive
Exploratory
Static
Infographic
ReportDashboard
Interactive
Chart
Figure 1: Product Scope
3. 3 | P a g e
Data Scientist Profiles
While the productsare one part of the decisionmodel,the decisioncentersonchoosingthe righttool forthe data
scientist.The purpose of the profiles istoidentifythe tool features thatwill bestenable the datascientists’ skill sets.
Many articlesrecognize thatthere are a varietyof data scientistsandskill sets1
available.The followingdatascientist
profiles were developed basedon marketresearch,currentemployees,and organizational requirements.
Alternatives
There are manyoptionsfordata visualizationtoolsandeachone seemstoserve aseparate purpose.The business
intelligence office hasidentifiedsix alternativesfordatavisualizationtools. Currently,the teamhastemporarylicenses
for all of the alternativesinordertotestthe tools’capabilitiesagainsttheirdatasets. The clientisnotopposedto
choosingmore thanone alternative dependingonthe analytical resultsof the decisionmodel. Fordetailedinformation
for eachalternative,refertothe Alternativestabof the Data VisualizationTool DecisionModel excel workbook.
Data VisualizationTool Alternatives:
D3.js A JavaScriptlibrarythatenablesdeveloperstocreate complex,customdatavisualizationsonthe web
RShiny A R libraryand serverthatenablesRdata visualizationstobe interactive andavailable viaaHTML
framework
Bokeh A data visualizationforpythonthatcreateschartsfromD3 visualsandthe pythondata
Plot.ly A webapplication thatautomaticallycreatesvisualizationsfromavarietyof filestypesand
programminglanguages
Tableau A data visualizationtool thatoffersaneasy-to-useuserinterfacetocreate complex graphicsandcharts
Kibana An opensource datavisualizationanddashboardingtool thatconnectstothe NoSQLdatabase,elastic
search
Strategic Objective
The strategicobjective wasformedusingthe documentationforthe BIprogram, VANDL. The goal of VANDLis to
developdatascience people,skills,andtoolsforthe intelligence community.Partof thisgoal includesthe development
of toolsfortheirdata storage,analytics,andvisualizationsuite.These toolshave theirownstrategicobjectivetobe
considered, thatobjective istodesignforusability,extensibility,scalability,andaffordability.The current focusisthe
1
Top skill sets for data scientists and Analyzing the Analyzers
Domain
Data Scientist
Features:
1. Knowledgeable of the subject
matter and is able to add
context to the analysis for
insightful findings
2. General analysis(regression,
correlation, frequency
distributions)
3. Uses built-in tools foranalysis
Mathematical & Statistician
Data Scientist
Features:
1. Knowledgeable about complex statistical
modeling and analysis (ex. customer opinion
modeling, classification, text analysis, natural
language processing, etc.)
2. Builds, tests, and analyzes models utilizing
statisticalprogramminglanguages suchas,
python and R
3. Uses built-in tools and statistical programming
language librariesto buildvisualizations
Developer
Data Scientist
Features:
1. Knowledgeable in programming,
computer science, anddatabases
2. Creates connections between the
data andthe tools
3. Transforms data to enable
profiles 1 and 2 to perform
analysis andcommunicate results
4. Creates highlycustomized
interactive solutions
Figure 2: Data Scientist Profiles
4. 4 | P a g e
visualizationtool suite.Usingthe organizationsdocumentationandconversationswiththe decisionmaker,the following
strategicobjective wasidentifiedforchoosingdatavisualizationtools.
Choose a tool or tools that enable data scientists to manipulate, analyze, interpret, and visualize data
Functional Objectives
Functional objectivesare specificandmeasureablepartsof the strategicobjective.Since the productsanddatascientists
are ‘the who’and‘the what’that determine whichtool ischosen;those componentsare incorporatedintothe
functional objectives. The followingtable outlinesanddefinesthe functionalobjectives.
Table 1: Functional Objective Definitions
Functional Objectives Description
Be flexibleenoughtoaccommodate
differentproducttypes
The data visualizationscreatedfollow intoone of fourcategories,dashboards,
reports,charts,or infographics.Eachproducthas differentrequirementsthat
will be capturedinthe measures.
Enablesstatistical analysisand
discovery
It iseasiertorecognize patternsand identifyimportantinsightswhendata
scientistsare able tovisuallyanalyzethe data.Inaddition,beingable tovisually
representanalysisplaysakeyrole inidentifyingandcommunicatinganalytical
insight.
Enableshighlycustomizedsolutions Some solutionsneedmore advanceddatavisualizations,byhavingatool that
goesbeyondbasicbar charts,line charts,and pie charts the data scientistscan
create a visualizationthatmeetsthose needs
Highusability Noteveryone hasthe skill settocode solutions.Toolswithadvancedintuitive
GUIs enable datascientiststoquicklycreate datavisualizations.
Scaleswithbigdata projects The customerexperience businessintelligence office hasalarge data setthat is
rapidlygrowing,the selectedtool mustbe able toscale withthe incomingdata.
Measures
Measureswere createdtogauge how well analternative scoresagainstafunctional objective andultimatelythe
strategicobjective. These measureswere createdby reviewingexistingdocumentationand creatinganaffinitydiagram
to visuallymapobjectivesandmeasures. The scale defineshow the measure isgaugedandthe range determinesthe
scope for the scores.Measuresthatare gaugedusing a Likertscale are qualitative, scoreswere determinedby
interviewingthe datascientistswhohave testedthe alternate datavisualizationtools andbyelicitingthe decisionmaker
wheneverpossible.These measures were determinedtobe independentof eachother. The followingtable definesthe
measures,theirunits,andtheirscale.
Table 2: Measure Definition and Scale
Measure Description Scale Range
Analytical Capability Level of analysisbuiltintouser
interface
Levelsdefinedbythe Likertscale Analytical
Capability
ChartingCapability Chartingcapabilityallowsthe userto
create complex charts
Levelsdefinedbythe Likertscale Charting
Capability
ProgrammingCapability Programmingcapabilityallowsthe user
to customize the productsappearance
and functionality
Levels definedbythe Likertscale Programming
Capability
DesignCapability Capabilitytochange the appearance of
the product
Levelsdefinedbythe Likertscale Design
Capability
5. 5 | P a g e
Measure Description Scale Range
Numberof Supported
ProgrammingLanguages
The numberof programminglanguages
the tool is able toprocess
Countof the programming
languagesthe tool isable to
support
Numberof
Supported
Programming
Languages
GUI Toolswithuserinterfacesvstoolswith
interactive developmentenvironment
Levelsdefinedbythe Likertscale GUI
Interactive Product
Capability
How well the tool enablesproductsto
be interactive
Levelsdefinedbythe Likertscale Interactive
Product
Capability
Numberof Supported
File Types
The numberof filestypesthatthe tool
allowstobe importedandexported
Levelsdefinedbythe Likertscale Numberof
Supported
File Types
Data connectors The numberof data sourcesthe tool
can use
Countof featuresthatallowsthe
tool to connectto differentdata
sources
Data
connectors
AccessControl The layersof useraccesscontrol that
can be appliedtothe productsandthe
data behindthe products
Levelsdefinedbythe Likertscale Access
Control
Cost The yearlytotal cost per userto keepa
tool
Total costper userper year Cost
Data Size The quantityof data the tool isable to
ingestandchart. Thisexactamount
variesacrossdatasets;however
differenttoolsare able toscale to
differentlevels
Levelsdefinedbythe Likertscale 1 to 5
Analytical Approach
In orderto evaluate the alternatives,measureswere appliedtothe functional objectives.These measureswere
identifiedasindicatorsof the functional objectives becausetheyoverlapwiththe featuresnecessarytoaccommodate
the differentdatascientistskillsetsandthe differentdatavisualizationrequirements. A cardsort activitywasconducted
to ensure the datascientistprofilesandproductrequirements alignedwiththe functional objectivesandmeasures.
Mapping Objectives and Measures to Data Scientist Profiles
Functional objectives andmeasures were created usingthe featuresof the datascientistprofiles.Some measures,such
as numberof supportedprogramminglanguages,were identifiedascrossprofile requirementstobe flexible enoughto
accommodate differentproduct types.The abilitytobe flexibleenoughtoaccommodate differentproducttypestakes
intoconsiderationthatdatascientistshave differentskillstosupportthe same products. The followingdiagram
indicateshowthe profiles alignedtothe functional objectivesandmeasures.
6. 6 | P a g e
Figure 3: How Data Scientist Profiles Align to Functional Objectives
7. 7 | P a g e
Decision Model Structure
Afteridentifyingthe strategicobjectives,the functional objectives,andthe measures, the decisionmodel forchoosingadata visualizationtool ortoolscanbe
depictedinthe followingdiagram:
Figure 4: Decision Model Hierarchy
8. 8 | P a g e
Scoring the Alternatives
Once the model hasbeendefined,the alternativesare evaluatedandscoredagainstthe independentmeasures. The
informationfoundonthe alternativeswasthroughindependentresearchandfeedbackfromthe BIdata scientists
testingthe alternatives. Some of the measureswere identifiedasbeingmore subjective,these measureswere scoredon
a Likertscale, with1 being‘doesnothave capability’and5 being‘capabilityhighlyexceedexpectations’, tocreate
consistencybetweenscores. The remainingmeasurescouldbe quantifiedbyeithercountordollaramount.
Late intothe developmentof the decision model,itwasidentifiedthatmore in-depthinformationonalternativeswas
available throughcommercialresearchconductedbyIn-Q-Tel.Thiscompanyidentifies,adapts,anddeliversinnovative
technological solutionstothe intelligence communityandiscurrentlyconductingresearchondatavisualizationtoolsfor
data scientists.Afterthisdiscovery,the decisionmakerdeterminedthatthisinformationwillbe implementedintothe
secondphase of the decisionmodel,in the future alongwithanyotheridentifiedimprovements.
Determining Weights
To determine the weightsforthe measures,the swingweightmethodwasapplied. The firststepwastodetermine
swingweightsistoidentifythe bestandworstalternativesthatcouldexist.Nextstepwastoelicit the decisionmaker
for howthe measuresshouldbe ranked.Duringthistime the decisionmakerwasunavailable,soadditional team
memberswere consultedtodetermine howtorankeach measure. Finally, the weightswerecalculatedusingthe
identifiedranks. The followingtableshowsthe worst/bestalternative andtheircorrespondingweights.
Table 3: Swing Weights
Worst Best Rank Weight Weight
InteractiveProductCapability(IP) 1 5 1.00 0.132 Total RankWeight 7.55 WIP
AnalyticalCapability(AN) 0 581 0.95 0.126 WeightIP = 0.1325
Charting Capability(CH) 1 5 0.85 0.113
DataSize (DS) 1 5 0.80 0.106
Numberof Supported Programming
Languages(PL)
0 4 0.75 0.099
DataConnectors(DC) 2 40 0.70 0.093
Access Control(AC) 0 5 0.60 0.079
Programming Capability(PC) 1 5 0.55 0.073
GUI (G) 1 5 0.45 0.060
Design Capability(DC) 1 5 0.40 0.053
Numberof Supported FileTypes (FT) 1 5 0.30 0.040
Cost(C) 1999 0 0.20 0.026
9. 9 | P a g e
Analysis and Computation
Whenthe buildingblocksof the decisionmodel were established,the modelwasbuiltintoExcel andLogical Decisions
for Windows. Logical DecisionsforWindowswasusedtobuildthe model forcalculatingthe subjectivegoal of choosing
a data visualizationtool thatenablesdatascientists.Excel wasusedtocalculate the resultsforeachdatascientisttype.
The followingchartshowsthe rankedresultsforeachalternativeandhow theyscore againstthe functional objectives.
Figure 5: Alternatives Ranked by Goal: Choose Data Visualization Tool
From the resultswe cansee that there are alternativeswithverysimilarscores:Tableau&Plot.lyand Bokeh&RShiny.
The followingsectionshighlightthose differencesandthe tradeoffsof choosingone tool overanother.
Comparing Alternatives
Plot.ly vs. Tableau
Figure 6: Plot.ly vs. Tableau Tornado Diagram
Plot.ly andTableauscoredvarysimilarly.Bothtoolsare capable of creatingproductswithinthe BusinessIntelligence
Office’sscope andprovide aplatformfordatascientiststoexplore variousdatasets,butwithdifferenttradeoffs.Plot.ly
allowsdatascientiststouse multipledatamanipulationtoolssuchasPython,R,andExcel to create advanced
10. 10 | P a g e
visualizationsand conductadvancedanalyticsinacollaborative setting. Tableaurequires eachdatascientist tolearn
theirspreadsheetlanguage asopposedtousingthe skillsetstheyalreadypossess.Thisisanadvantage forPlot.lyasit
allowsdatascientistswithdesperate skillsetsto collaboratively use the same tool. However, Tableauisable toconnect
to a largernumberof data sources andis able toprocessdatasetsthat qualifyas“bigdata”. Since the governmentisone
of the largestproducersof data,thisis an importantrequirementtoconsider.
RShiny vs. Bokeh
Figure 7: RShiny vs. Bokeh Tornado Diagram
BokehandRShinyhad the same score,but inthe diagram above youcan see the tradeoffsof choosingone tool over
another. Aspart of the R library,RShinyissupportedbyamultitude of statistical programminglibraries. Also,the RShiny
package includes RShinyServer,whichisable toconnectto many differentdatasources.However,RShinyrequiresthe
userto implementaCSS file tochange the styles. Bokehallowsthe datascientisttoutilizedesign optionstoenhance
products,improve communications,andisalsosupported bymultiplestatistical programminglibraries,butnotasmany
as R.
11. 11 | P a g e
Alternative Results for Data Scientist Profiles
Each of the data scientistprofileshave correspondingfunctional objectives,asoutlinedinthe DataScientistProfiles
section,tochoose the besttool for eachdata scientistskillset.The followingsectionsoutline the resultsforeach
alternative asitrelatestothe data scientistprofiles:
Domain Data Scientist
Figure 8: Alternatives Ranked by Domain Data Scientist Profile
The domaindata scientistisfocusedoncreatingdifferentcustomizedproducttypeswithausable tool.Tableauand
Plot.lybothscoredhighlywiththe domaindatascientist.These alternativesofferintuitiveuserinterfacesthatallowa
data scientistto quicklycreate highlyinteractive chartsthatcan be usedfor communicationsoranalysis. While Tableau
offersmore designcapabilities,Plot.ly’sabilitytosupportmultiple programminglanguagesenablesdomaindata
scientiststocollaborate withotherdatascientistsmore easily.
12. 12 | P a g e
Mathematical & Statistician Data Scientist
Figure 9: Alternatives Ranked by Mathematical & Statistician Data Scientist Profile
The mathematical &statisticiandatascientistisconcernedwithbeing able toconductmore complex statisticalanalysis
on large datasetsandbeingable tocommunicate those results. Plot.lyisafairlynew technologythatisstill developing
theircapabilities andcurrentlyisunable tohandle datasetsthatqualifyas“big data”. Plot.lyintendstoexpandtheir
abilitytoingestandprocesslarge data sets;however,Tableaucurrentlyhasthatcapabilitybuiltintotheirsoftware.
Developer Data Scientist
Figure 10: Alternatives Ranked by Developer Data Scientist Profile
The developerdatascientistisresponsible foracquiringandtransformingthe dataintoadatasetthat is usable forother
data scientists;therefore,theyare more concernedwithscalabilityandcustomizability.Asnotedinthe mathematical &
statisticiandatascientistprofile,Tableauisthe besttool available forscalingwiththe dataquantity.
13. 13 | P a g e
Sensitivity Analysis
The resultsof the decisionmodel are more sensitivetosome measures overothers. The followingchartshowsthe
resultsof the sensitivityanalysisforthe differentmeasuresinthe decisionmodel:
0
{0}
0.495839701
{5}
0.515707251
{5}
0.509084734
{5}
0.55544235
{5}
0.495839701
{0}
0.5753099
{5}
0.569536424
{40}
0.595177449
{4}
0.495839701
{5}
0.495839701
{5}
0.621667516
{5}
0.528952284
{5}
0
{0}
0.469349635
{1}
0.475972151
{1}
0.456104602
{1}
0.495839701
{1}
0.422992019
{1999}
0.495839701
{1}
0.476821192
{1}
0.495839701
{1}
0.389879436
{1}
0.38325692
{1}
0.495839701
{1}
0.396501953
{1}
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Score
Data Size (DS)
Number of Supported File Types (FT)
Design Capability (DE)
GUI (G)
Cost (C)
Access Control (AC)
Data Connectors (DC)
Number of Supported Programming Languages (PL)
Programming Capability (PC)
Charting Capability (CH)
Analytical Capability (AN)
Interactive Product Capability (IP)
Choose a Data Visualization Tool
14. 14 | P a g e
Conclusion
Consistently,TableauandPlot.lyemerge ashighlyrankedalternatives.Tableauwaschosenasthe bestoptionforthe
overall objective,the mathematical &statisticiandatascientistprofile,andthe developerdatascientistprofile;while
Plot.lywaschosenasthe bestoptionfor the domaindatascientistprofile.Theseoptionshave differenttradeoffs
dependingthe datascientistneedsand the productrequirements.The datascientistprofilesare leaningtowards
Tableauandto gain a more granular insightintothe besttool dependingonproductrequirementsthe model needsto
be refinedevenfurther.Thismodel isstillfairlyhigh-level andiscurrentlyunderreview bythe decisionmaker togain
that level of granularity.
Thisdecisionmodel wasformedbyelicitingthe projectteam membersandreferringtothe project’skeystrategic
documents.Ideally,the decisionmakerwouldhave been elicitedconsistentlythroughoutthe process;however,he was
absentdue to a familyemergencyforthe majorityof the projectduration. Recently,the decisionmakerreturnedtothe
projectand he iscurrentlyreviewingthe resultsof the analysis. These changeswillbe incorporatedintothe future
model alongwithanyotheridentifiedchangesmade bythe decisionmaker.
Duringthe reviewprocessadditional resourceswere identifiedforrefiningthe decisionmodel.In-Q-Telconductedanin-
depthstudyof over50 data visualizationtoolswithnumerousattributesidentified.The decisionmakerprovideda
documentwithquantifiablemeasuresfordashboardproductrequirements.The studyandthe measureswillbe
reviewedtodetermineif theyneedtobe incorporatedintothe advanceddecisionmodelorif the resultsof thiscurrent
studyisenoughto drive a decision.
i Note, duringthe analysis process,the decision maker suddenly needed to be absent for an extended period of time due to a fa mily
emergency. The decision maker returned towards the end of the initiativeand has identified areas for further analysis.