SlideShare a Scribd company logo
4/6/2015 5 things should know about Excel about every data scientist
http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 1/4
32 minutes ago
Hi dude , Here I want to share you one easy way to get retail products for Microsoft Office
,Windows 7,windows 8,windows server 2012 R2, Click Here [http://www.windowskeyoffer.com/]  to our
store ,we provide genuine licenses with lower price .
Microsoft Excel has been one weapon for decades. I lists the go­to Excel
skills data scientists should master. 
5 things should know about Excel about every data
scientist 
When is the last time you opened Microsoft Excel to do some data science? If it's been a
while, you're missing out.
It's hard to have a discussion about data science tools (and there's a lot of that going on) ,
and the rest of the darlings in the corral of data science favorites, but what about Excel? I'm
sorry  if  Excel's  not  sophisticated  enough  for  your  data  science  needs  ­­  or  so  you  think.
Microsoft Excel has been a secret weapon of mine for decades ­­ it has been my ubiquitous
data tool ­­ and becoming a data scientist didn't stop me one bit from using it. Here are five
things about Excel that every data scientist should know.
[http://www.techrepublic.com/blog/10­things/10­steps­to­adding­a­timeline­to­an­excel­2013­
pivottable/]
Inserting a pivot table in a sheet in Excel 2013.
 Image: Screenshot by Susan Harkins/TechRepublic
4/6/2015 5 things should know about Excel about every data scientist
http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 2/4
[https://www.blogger.com/null]
Named ranges are a quick way to create a makeshift database in Excel. In simple terms, a
named range is a table of data that has a label for easy reference. No need to get fancy:
column  headings  across  the  top  row  and  then  rows  of  data  below,  following  the  typical
structure of any data table.
There are several ways to assign your custom name to the table, but I find it easiest to just
click in the top left corner (where the cell reference is displayed) and start typing. I typically
think of these as lookup tables, so I usually use the "lkp" prefix when naming them. Put your
primary key in the leftmost column and then use the VLOOKUP function anywhere in your
workbook to find any value in your table.
As soon as you have your named range in place, you can sort and filter with one click of the
filter  button.  This  is  a  fast  and  easy  way  to  explore  your  data  set  and  possibly  highlight
interesting rows or cells.
Once the range is in filter mode, it's good to inspect the filter drop­downs to get a sense of
the data in your data set. Excel's okay with combining types, so you can quickly spot data
errors  just  by  looking  at  the  different  values  in  the  filter  drop­down.  There's  also  an
extremely  powerful  Advanced  Filtering  functionality  that  allows  you  to  filter  your  data  set
based on criteria you specify in another range.
Pivot tables are a quick and easy way to slice and dice data. Although not as fully functional
as  a  full­blown  business  intelligence  tool,  pivot  tables  in  Excel  do  a  respectable  job  of
quickly cross­tabulating data and calculating counts, sums, and other aggregate metrics.
With your named range in place (are you getting the sense of how fundamental these are
when working with Excel?), click the pivot table button and then tell Excel where you want it
to go. For small jobs, I'll just put the pivot table next to the named range; for larger jobs, I'll
give  the  pivot  table  its  own  sheet.  Now  just  drag  and  drop  columns,  rows,  and  values
(metrics) to dynamically create your cross­tab analysis. It's not Business Objects, but it's not
bad for a spreadsheet tool.
Conditional  formatting  is  fun,  and  I  hope  Microsoft  expands  this  functionality  a  bit  in  the
future.  As  its  name  implies,  this  feature  allows  you  to  format  cells  based  on  criteria  you
specify (instead of static formatting where the cell always holds the same formatting).
1: Named ranges
2: Sorting and filtering
3: Pivot tables
4: Conditional formatting
4/6/2015 5 things should know about Excel about every data scientist
http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 3/4
Posted 32 minutes ago by Matti Vuorela
For  instance,  you  could  tell  Excel  to  format/highlight  all  cells  in  a  named  range  that  are
above a certain value. And if you want to get fancy, you can tell Excel to format cells based
on a formula that involves other cells. Excel has some built­in formats that make it easy to
quickly create a heat map or even an icon overlay. However, you're limited on the icons you
can select, and you cannot easily extract the exact color from a heat map. Overall though, it
does the trick for most situations.
We come to the most powerful feature Excel has to offer: Visual Basic. That's right, Visual
Basic. I know what you're thinking ­­ you're far too advanced for Visual Basic, right?
Visual Basic and Excel are awesome in the hands of a data scientist. You already know how
to program, so picking up Visual Basic won't be hard. And Visual Basic opens up a whole
new  world  of  creative  solutions  with  Excel  ­­  everything  from  creating  your  own  Excel­
based neural network, to Monte Carlo simulations, to anything else you can dream up.
Excel does have its limits, so don't push it. For the hard­core work, you're much better off
with R or Python. But don't discount Excel for a quick prototype or proof­of­concept.
Although Excel isn't a top resume­building skill for data scientists, you'd be remiss if you
didn't learn its ins and outs. Over and above the obvious features, which handle statistical
and  mathematical  formulae  pretty  well,  Excel  is  a  respectable  data  management  and
programming tool.
First learn the basics of named ranges and filtering, and then move on to more advanced
features like pivot tables and conditional formatting. Finally, learn Visual Basic for Excel. It's
really not hard to pick up one more language, and it's well worth the trouble. And don't worry
­­ nobody will take away your data scientist badge for learning Excel.
5: Visual Basic
Summary
 0 Add a comment
4/6/2015 5 things should know about Excel about every data scientist
http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 4/4
Sign out
  Notify me
Enter your comment...
Comment as:  Matti Vuorela (Google)
Publish
  Preview

More Related Content

Similar to 5 things should know about excel about every data scientist

RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automationsRPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
Diana Gray, MBA
 
Data Science for non techies using KNIME 08 Weeks Training
Data Science for non techies using KNIME 08 Weeks TrainingData Science for non techies using KNIME 08 Weeks Training
Data Science for non techies using KNIME 08 Weeks Training
Ali Raza Anjum
 
RPA Summer School Session 3.1: Your first Excel and Word automations
RPA Summer School Session 3.1: Your first Excel and Word automationsRPA Summer School Session 3.1: Your first Excel and Word automations
RPA Summer School Session 3.1: Your first Excel and Word automations
Cristina Vidu
 
Msoffice
MsofficeMsoffice
Msoffice
john wick
 
Microsoft Office Introduction
Microsoft Office IntroductionMicrosoft Office Introduction
Microsoft Office Introduction
Anitha Rao
 
microsoftoffice-introduction-200729052822.pdf
microsoftoffice-introduction-200729052822.pdfmicrosoftoffice-introduction-200729052822.pdf
microsoftoffice-introduction-200729052822.pdf
ditebogo nkoana
 
Empowerment Technologies Lecture 5 (Philippines SHS)
Empowerment Technologies Lecture 5 (Philippines SHS)Empowerment Technologies Lecture 5 (Philippines SHS)
Empowerment Technologies Lecture 5 (Philippines SHS)
John Bosco Javellana, MAEd.
 
Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
   Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...   Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
karunatomar3
 
Word 365
Word 365Word 365
Word 365
Bhavyapratap2
 
Should I stay or should I go?
Should I stay or should I go?Should I stay or should I go?
Should I stay or should I go?
Markus Flechtner
 
Excel Homework Help
Excel Homework HelpExcel Homework Help
Excel Homework Help
Stat Analytica
 
Production use of AI/ML Systems
Production use of AI/ML SystemsProduction use of AI/ML Systems
Production use of AI/ML Systems
Ekrem Aksoy
 
From Lab to Factory: Creating value with data
From Lab to Factory: Creating value with dataFrom Lab to Factory: Creating value with data
From Lab to Factory: Creating value with data
Peadar Coyle
 
Session 3.2 Your first excel and word automations
Session 3.2 Your first excel and word automationsSession 3.2 Your first excel and word automations
Session 3.2 Your first excel and word automations
Cristina Vidu
 
Office 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
Office 365 Productivity Tips -- Mayhem in Minneapolis, The RematchOffice 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
Office 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
Christian Buckley
 
Information Governance and ediscovery in office 365 ediscovery deep dive
Information Governance and ediscovery in office 365 ediscovery deep diveInformation Governance and ediscovery in office 365 ediscovery deep dive
Information Governance and ediscovery in office 365 ediscovery deep dive
bilgore
 
Automating With Excel An Object Oriented Approach
Automating  With  Excel    An  Object  Oriented  ApproachAutomating  With  Excel    An  Object  Oriented  Approach
Automating With Excel An Object Oriented Approach
Razorleaf Corporation
 
Excel syllabusppt
Excel syllabuspptExcel syllabusppt
Excel syllabusppt
kunalj13
 
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data ModelerThe Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
DATAVERSITY
 
Excel vs Tableau the comparison you should know
Excel vs Tableau  the comparison you should knowExcel vs Tableau  the comparison you should know
Excel vs Tableau the comparison you should know
Stat Analytica
 

Similar to 5 things should know about excel about every data scientist (20)

RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automationsRPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
RPA Summer School StudioX Session 3 AMER: Your first Excel and Word automations
 
Data Science for non techies using KNIME 08 Weeks Training
Data Science for non techies using KNIME 08 Weeks TrainingData Science for non techies using KNIME 08 Weeks Training
Data Science for non techies using KNIME 08 Weeks Training
 
RPA Summer School Session 3.1: Your first Excel and Word automations
RPA Summer School Session 3.1: Your first Excel and Word automationsRPA Summer School Session 3.1: Your first Excel and Word automations
RPA Summer School Session 3.1: Your first Excel and Word automations
 
Msoffice
MsofficeMsoffice
Msoffice
 
Microsoft Office Introduction
Microsoft Office IntroductionMicrosoft Office Introduction
Microsoft Office Introduction
 
microsoftoffice-introduction-200729052822.pdf
microsoftoffice-introduction-200729052822.pdfmicrosoftoffice-introduction-200729052822.pdf
microsoftoffice-introduction-200729052822.pdf
 
Empowerment Technologies Lecture 5 (Philippines SHS)
Empowerment Technologies Lecture 5 (Philippines SHS)Empowerment Technologies Lecture 5 (Philippines SHS)
Empowerment Technologies Lecture 5 (Philippines SHS)
 
Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
   Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...   Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
Mastering Microsoft Excel: Attitude Academy’s MS Excel Classes in Yamuna V...
 
Word 365
Word 365Word 365
Word 365
 
Should I stay or should I go?
Should I stay or should I go?Should I stay or should I go?
Should I stay or should I go?
 
Excel Homework Help
Excel Homework HelpExcel Homework Help
Excel Homework Help
 
Production use of AI/ML Systems
Production use of AI/ML SystemsProduction use of AI/ML Systems
Production use of AI/ML Systems
 
From Lab to Factory: Creating value with data
From Lab to Factory: Creating value with dataFrom Lab to Factory: Creating value with data
From Lab to Factory: Creating value with data
 
Session 3.2 Your first excel and word automations
Session 3.2 Your first excel and word automationsSession 3.2 Your first excel and word automations
Session 3.2 Your first excel and word automations
 
Office 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
Office 365 Productivity Tips -- Mayhem in Minneapolis, The RematchOffice 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
Office 365 Productivity Tips -- Mayhem in Minneapolis, The Rematch
 
Information Governance and ediscovery in office 365 ediscovery deep dive
Information Governance and ediscovery in office 365 ediscovery deep diveInformation Governance and ediscovery in office 365 ediscovery deep dive
Information Governance and ediscovery in office 365 ediscovery deep dive
 
Automating With Excel An Object Oriented Approach
Automating  With  Excel    An  Object  Oriented  ApproachAutomating  With  Excel    An  Object  Oriented  Approach
Automating With Excel An Object Oriented Approach
 
Excel syllabusppt
Excel syllabuspptExcel syllabusppt
Excel syllabusppt
 
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data ModelerThe Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
 
Excel vs Tableau the comparison you should know
Excel vs Tableau  the comparison you should knowExcel vs Tableau  the comparison you should know
Excel vs Tableau the comparison you should know
 

Recently uploaded

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
Data Hops
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 

Recently uploaded (20)

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 

5 things should know about excel about every data scientist

  • 1. 4/6/2015 5 things should know about Excel about every data scientist http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 1/4 32 minutes ago Hi dude , Here I want to share you one easy way to get retail products for Microsoft Office ,Windows 7,windows 8,windows server 2012 R2, Click Here [http://www.windowskeyoffer.com/]  to our store ,we provide genuine licenses with lower price . Microsoft Excel has been one weapon for decades. I lists the go­to Excel skills data scientists should master.  5 things should know about Excel about every data scientist  When is the last time you opened Microsoft Excel to do some data science? If it's been a while, you're missing out. It's hard to have a discussion about data science tools (and there's a lot of that going on) , and the rest of the darlings in the corral of data science favorites, but what about Excel? I'm sorry  if  Excel's  not  sophisticated  enough  for  your  data  science  needs  ­­  or  so  you  think. Microsoft Excel has been a secret weapon of mine for decades ­­ it has been my ubiquitous data tool ­­ and becoming a data scientist didn't stop me one bit from using it. Here are five things about Excel that every data scientist should know. [http://www.techrepublic.com/blog/10­things/10­steps­to­adding­a­timeline­to­an­excel­2013­ pivottable/] Inserting a pivot table in a sheet in Excel 2013.  Image: Screenshot by Susan Harkins/TechRepublic
  • 2. 4/6/2015 5 things should know about Excel about every data scientist http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 2/4 [https://www.blogger.com/null] Named ranges are a quick way to create a makeshift database in Excel. In simple terms, a named range is a table of data that has a label for easy reference. No need to get fancy: column  headings  across  the  top  row  and  then  rows  of  data  below,  following  the  typical structure of any data table. There are several ways to assign your custom name to the table, but I find it easiest to just click in the top left corner (where the cell reference is displayed) and start typing. I typically think of these as lookup tables, so I usually use the "lkp" prefix when naming them. Put your primary key in the leftmost column and then use the VLOOKUP function anywhere in your workbook to find any value in your table. As soon as you have your named range in place, you can sort and filter with one click of the filter  button.  This  is  a  fast  and  easy  way  to  explore  your  data  set  and  possibly  highlight interesting rows or cells. Once the range is in filter mode, it's good to inspect the filter drop­downs to get a sense of the data in your data set. Excel's okay with combining types, so you can quickly spot data errors  just  by  looking  at  the  different  values  in  the  filter  drop­down.  There's  also  an extremely  powerful  Advanced  Filtering  functionality  that  allows  you  to  filter  your  data  set based on criteria you specify in another range. Pivot tables are a quick and easy way to slice and dice data. Although not as fully functional as  a  full­blown  business  intelligence  tool,  pivot  tables  in  Excel  do  a  respectable  job  of quickly cross­tabulating data and calculating counts, sums, and other aggregate metrics. With your named range in place (are you getting the sense of how fundamental these are when working with Excel?), click the pivot table button and then tell Excel where you want it to go. For small jobs, I'll just put the pivot table next to the named range; for larger jobs, I'll give  the  pivot  table  its  own  sheet.  Now  just  drag  and  drop  columns,  rows,  and  values (metrics) to dynamically create your cross­tab analysis. It's not Business Objects, but it's not bad for a spreadsheet tool. Conditional  formatting  is  fun,  and  I  hope  Microsoft  expands  this  functionality  a  bit  in  the future.  As  its  name  implies,  this  feature  allows  you  to  format  cells  based  on  criteria  you specify (instead of static formatting where the cell always holds the same formatting). 1: Named ranges 2: Sorting and filtering 3: Pivot tables 4: Conditional formatting
  • 3. 4/6/2015 5 things should know about Excel about every data scientist http://windowskeyoffer.blogspot.kr/2015/04/5­things­should­know­about­excel­about.html 3/4 Posted 32 minutes ago by Matti Vuorela For  instance,  you  could  tell  Excel  to  format/highlight  all  cells  in  a  named  range  that  are above a certain value. And if you want to get fancy, you can tell Excel to format cells based on a formula that involves other cells. Excel has some built­in formats that make it easy to quickly create a heat map or even an icon overlay. However, you're limited on the icons you can select, and you cannot easily extract the exact color from a heat map. Overall though, it does the trick for most situations. We come to the most powerful feature Excel has to offer: Visual Basic. That's right, Visual Basic. I know what you're thinking ­­ you're far too advanced for Visual Basic, right? Visual Basic and Excel are awesome in the hands of a data scientist. You already know how to program, so picking up Visual Basic won't be hard. And Visual Basic opens up a whole new  world  of  creative  solutions  with  Excel  ­­  everything  from  creating  your  own  Excel­ based neural network, to Monte Carlo simulations, to anything else you can dream up. Excel does have its limits, so don't push it. For the hard­core work, you're much better off with R or Python. But don't discount Excel for a quick prototype or proof­of­concept. Although Excel isn't a top resume­building skill for data scientists, you'd be remiss if you didn't learn its ins and outs. Over and above the obvious features, which handle statistical and  mathematical  formulae  pretty  well,  Excel  is  a  respectable  data  management  and programming tool. First learn the basics of named ranges and filtering, and then move on to more advanced features like pivot tables and conditional formatting. Finally, learn Visual Basic for Excel. It's really not hard to pick up one more language, and it's well worth the trouble. And don't worry ­­ nobody will take away your data scientist badge for learning Excel. 5: Visual Basic Summary  0 Add a comment