SlideShare a Scribd company logo
1 of 47
BUSINESS INTELLIGENCE
& ADVANCED ANALYTICS


  The Search for Patterns,
  Waldo, and Black Swans
  Barrett Peterson, C.P.A.
  ICPAS Fox River Trail Chapter, June 28, 2012
WHY
BUSINESS
INTELLIGENCE?



                Information

          Good Data Good Analysis
HISTORY AND
BACKGROUND
A LITTLE BACKGROUND

HISTORY       • Computer based business
A trip down     intelligence systems is an idea
memory
lane            that is middle aged – about 40 .
                Previously described as:

                – Decision support systems [DSS]
                – Executive information systems [EIS]
                – Management information systems [MIS]
A LITTLE BACKGROUND
              • Internet Development
                  – ARAPNET and others – 1960s
                  – Internet Protocols – 1982, presumably by Al Gore
History       • IBM researcher Edgar Codd credited with development
                of relational data base theory in 1970.
Important     • IBM’s Donald Chamberlin and Raymond Boyce develop
Technology      structured query language [SQL] in the early 1970s to
Inventions      manipulate and retrieve data from IBM’s early relational
                data base management system
              • World Wide Web and 1st web browser invented by Tim
                Berners-Lee in 1990 by combining the internet,
                hypertext mark-up language, and Uniform Resource
                Locator [URL] system. Became Nexus.
              • Mosaic, designed by Marc Andressen became the first
                commercial web browser [Netscape].
              • Development of big data enabling database designs and
                high speed processing during the last 15 years.
A LITTLE BACKGROUND
            • Development of the primary infrastructure
                – Database design
                – Processing and Storage Hardware
History         – Server Development and Massively Parallel Processing
            • Improved telecommunications speed
Drivers
            • Hardware miniaturization, capacity, and speed
Enabling
                – Memory [RAM] capacity
BI and
                – Storage capacity and transfer speed
Advanced
                – Bus speed
Analytics
                – Video processing capacity and speed
            • Increased hardware speed and capacity
            • Digital formats for sensors, cameras, RFID, and other data
                collection sources
            • Mobile computing
            • “Cloud” capability exploits many of these developments
A LITTLE BACKGROUND
                    •   Analytics
TERMINOLOGY         •   Business Intelligence
A consultant’s
                    •   Knowledge Management
collection of       •   Content Management
confusing names -
a sampler           •   Data Mining
                    •   Big Data
                    •   Data Integration
                    •   Gameification
                    •   Blob [Binary Large Object]
A LITTLE BACKGROUND
           • CPU speed and power
              – Moore’s law
Drivers
              – Multi-core chips
And           – Solid State Memory
Enablers   • Storage improvement and cost reduction
              – Greatly increased capacity
of            – Greatly increased access/transfer speed
Big           – Greatly reduced cost
           • Data collection from a wide range of devices
Data
           • Data communications – speed and volume
           • Database management techniques and
             software
           • Application speed and power
BUSINESS
INTELLIGENCE
AND ADVANCED
  ANALYTICS
   DEFINED
TONIGHT’S CRITICAL DEFINITIONS


Business       A system comprised of “computer”
Intelligence   hardware and software to:

               • Collect, “clean”, filter, and integrate data
               • Store data [hardware and software]
               • Provide knowledge management,
                 analytical , and presentation tools to
                 translate data into decision useful
                 information
TONIGHT’S CRITICAL DEFINITIONS
               • Prehistoric – Mainframe Era
                   – DSS, EIS, MIS
                   – Hierarchical Master Data Files
Business       • The Current Era [Primarily] – Business Intelligence
                   – Primarily “structured” data [data that can be
Intelligence         represented in relational /dimensional tables or flat
                     files], and BLOB [binary large object] formats
Generations        – Analysis of “known” patterns
                   – Presented in tables, simple charts, and dashboards
               • Emerging – Big Data and Advanced Analytics
                   – to discover new, changing, or variable patterns
                   – A wide variety of “unstructured” digital data
                     formats added to “structured” data
                   – Emerging storage structures
                   – “Exploratory” analytics
                   – Zoomable User Interface [ZUIs]
                   – Solid State Memory and Solid-State Drives
THE HARDWARE
AND SOFTWARE
  ELEMENTS OF
    BUSINESS
 INTELLIGENCE
BUSINESS INTELLIGENCE ELEMENTS
              • Computer – CPU, Memory, and Operating System Software
              • Data Collection
                  – Master Data Management
                  – Collection Processes and Devices
                  – Data Cleansing Processes and Software
Principal     • Data Storage
Components        – Physical Devices and Storage Management Software
                  – Data Management and Integration
for               – Database Software Storage
                       • Relational – Traditional ERP/Transaction systems
Maximum                • Dimensional – Traditional Data Warehouse, including
Application               associated BLOB
                       • Distributed , Multiple Server, Storage Systems
                       • NoSQL [Not Only SQL] Distributed Operational Stores
                       • Hadoop for Highly Parallel Processing and Intensive Data
                          Analytics Applications
              • Middleware Software
              • Business Intelligence Application Software
                  – OLAP, Dashboard, and Chart Reports
                  – Statistical Analysis and Presentation Tools
BUSINESS INTELLIGENCE ELEMENTS
               • Data Governance and Management
                   –   Uniform terminology
                   –   Uniform meaning
DATA               –   Uniform units of measure
ISSUES:            –   Metadata
               • Data Structure and Attributes
THE
                   – Structured - Relational/Dimensional
CORNERESTONE       – Unstructured
                   – Rate of change, context, and other attributes
               • Data Collection and Preparation
                   – Filtering, particularly “Big Data”
                   – Extract, Transform, Load [ETL] for “structured data
               • Data Base File Systems
               • Data Storage and Retrieval
                   – Capacity
                   – Access/Retrieval speed
BUSINESS INTELLIGENCE ELEMENTS
             • Metadata management
                 – Business definitions , rules, sources
                 – Technical attributes, such as type, scale,
                    transformation methods
MASTER           – Processing requirements – filtering, ETL, aggregation,
                    summarization
DATA         • Data Definitions and data dictionaries
GOVERNANCE       – Name
                 – Unit(s) of measure
AND          • Data collection and filtering or transforming requirements
MANAGEMENT       – Sources – internal and external
                 – Context addition/filtering requirements
             • Data integration specifications
                 – Multiple platforms and applications
                 – Mapping to intermediate data marts
             • Privacy requirements
                 – Personal Identifying data
                 – Laws: HIPPA, Privacy act
BUSINESS INTELLIGENCE ELEMENTS

               • Data Structures
                  – “Structured” Data , principally text and
Data                numbers capable of incorporation in relational
                    or dimensional tables
Structures        – “Unstructured” Data, not suitable for relational
and                 tables, many in newer data formats
Attributes     • Big Data Attributes
Are Critical      – Both “structured” and “unstructured”
                  – The four major “Vs” of big data
Drivers
                      •   Volume - huge
                      •   Velocity – fast changing, unlike structured
                      •   Variety – format and content
                      •   Variability – lacks the consistency of structured
                          data
BUSINESS INTELLIGENCE ELEMENTS

             • Content Structure – Traditional Financial Data
                 – Numerical
Data             – Sign/Debit or Credit
Structures       – Text Descriptions
             • Database Management Structures
IT               – Legacy Systems: Hierarchical and Network
Lingo            – Transaction Systems: Relational
                      • Relations [Tables]. Attribute [columns], Instance [Rows]
                      • Rules: no duplicate rows; single value for attributes
                 – Warehouse Systems: Dimensional
                      • Facts [data items, usually a dollar amount or unit count]
                      • Measures – dollar or count for facts
                      • Dimensions – groups of hierarchies and descriptors of
                        various aspects or context for the facts/measures
             • Microsoft Office and Similar File Formats
             • Photography and Art
Business Intelligence Elements

RELATIONAL
TABLE
ILLUSTRATION




               “Tuple” is borrowed from mathematics
               and set theory and is used in database
               design to refer to the attributes of an
               “item” or “value” [row], the subject or
               title of the table. Value examples include
               customers, vendors, orders, product SKUs
BUSINESS INTELLIGENCE ELEMENTS


MATH
CAN BE
COMPLICATED
BUSINESS INTELLIGENCE ELEMENTS
              • Numbers and words/letters
                  – Relational/Dimensional
                  – Spreadsheets
                  – Word Processing documents
DATA          • Sound and Music
              • Photo
FILE          • Video
TYPE          • Video Game
              • CAD Design
CATEGORIES,   • Graphical
ALMOST            – PDF
                  – Raster, Vector Graphics
ENGLISH
                  – Statistical Visualization
              • Scientific
              • Signal
              • XML [Web based mark-up formats]
              • Geo-Location
              • Web Logs
BUSINESS INTELLIGENCE ELEMENTS
              • Collection
                   – Company transaction/ERP systems
                   – Purchased, such as Nielsen, IRI
DATA               – Vendor supplied, such as bank transactions
COLLECTION    • Filtering
AND                – Adding context such as date or location
                   – Eliminating “chatter” from high volume data
PREPARATION        – Error correction
              • Aggregation & Integration
DATA COLLECTION - RFID




RFID tag    RFID tag reader
DATA COLLECTION




Various sensors   Surveillance Camera
DATA FILTERING AND CLEANSING IS IMPORTANT
BUSINESS INTELLIGENCE ELEMENTS
          • Relational – SQL
          • Dimensional – SQL, OLAP
DATA      • Binary Large Object [BLOB] – binary data,
BASE        most often photos, video, audio, or PDF files
FILE      • Massively Parallel-Processing [MPP]
SYSTEMS   • Apache Hadoopp Distributed File System
            [HDFS] – Java
             – Google File System [GFS], used solely by Google
             – Google Map Reduce
          • Amazon S3 filesystem [used by Amazon]
          • NoSQL
          • Resource Description Framework [RDF]
            Databases, like Big Data
BUSINESS INTELLIGENCE ELEMENTS
             • Significant Originators
                 – Google MapReduce
                 – Google File System [GFS]
SELECT           – Amazon S3 filesystem
BIG DATA     • Continuing Developments
DATABASE         – Apache Software Foundation
MANAGEMENT             • Apache Cassandra distributed database management
                         system
SYSTEMS
                       • Apache Hadoop software framework to support
                         data-intensive distributed applications
                       • Apache Hive, a data warehouse structure built on
                         Hadoop
                       • Pig - high level programming language for creating
                         MapReduce programs with Hadoop
                 – Significant to Technology Development
                       • Facebook
                       • Yahoo
                       • LinkedIn [Project Voldemort]
BUSINESS INTELLIGENCE ELEMENTS
                 • Convergence aspect of mainframes and
                   servers
COMPUTER         • Massively parallel , multiple server,
HARDWARE           distributed processing, in multiple data
CONSIDERATIONS     centers – grid computing
                 • Multi-core , high capacity, lower power
                   consumption, CPUs
                 • Memory servers for RAM employing
                   DRAM comprised of Fully Buffered Direct
                   Inline Memory Modules [FBDIMM]
                 • Solid state flash drive storage
                 • Greatly improved., and less costly, hard
                   drive storage
BI CONFIGURATION SIZES




Small – BI, but
 not Big Data              Large – IBM Sequoia At
   capable        Medium       Livermore Labs
BUSINESS INTELLIGENCE ELEMENTS
            • Data Storage Terminology
               – Memory – CPU direct connected, often called RAM
               – Storage – not directly connected to the CPU
DATA
            • Data Storage Device Types
STORAGE        – Memory
HARDWARE/            • DRAM – based
                     • Flash memory – based Solid-State Drives [SSDs]
SOFTWARE
               – Storage
                     • Hard Disk Drives [HDD]
                     • Optical Drives – CDs, DVDs
            • Data Storage Systems
               – Direct Attached
               – Network Attached Storage [NAS]
               – Storage Area Network [SAN]
               – pNFS – Parallel Network file systems
BUSINESS INTELLIGENCE ELEMENTS
              • Traditional Reporting Systems
                  – ERP systems, including extract and presentation tools
                  – Downloads to Excel and similar programs for analysis
                      using functions and pivot tables
BI            • Presentation Tools
APPLICATION   • Specialized Analytics
SOFTWARE          – IBM InfoSphere BigInsights and InfoSphere Streams
                  – IBM Netezza
                  – ParAccel Analytic Database
                  – EMC Greenplum
                  – SAS High Performance Computing
                  – Information Builders WebFocus
              • Exploratory Tools, like IBM SPSS [originally Statistical Package
                for the Social Sciences]
                  – Data mining with specialized algorithms
                  – Statistical analysis and related charting software
BUSINESS INTELLIGENCE ELEMENTS
              •   BI Reporting
              •   Predictive Analytics
ADVANCED
ANALYTICS     •   Data Exploration - correlation
APPLICATION   •   Data Visualization - graphical
TYPES         •   Instrumentation Analytics
              •   Content Analytics
              •   Web Analytics
              •   Functional Applications
              •   Industry Applications
BUSINESS INTELLIGENCE ELEMENTS



USE
STATISTICAL
TECHNIQUES
APPROPRIATELY
ALGORITHMS CAN BE TREACHEROUS


DATA
MODELS
HAVE
LIMITS
BI AND ADVANCE ANALYTICS OUTPUT ILLUSTRATIONS
EXAMPLES OF
   USES
•   Sales and Operations Planning
           •   Financial Instruments Modeling
           •   Production Control
           •   Online Retail
           •   Economics and Policy Development
SELECTED
           •   Agriculture/Farming
EXAMPLES
           •   Weather Analysis/Prediction
OF USES
           •   Environmental Impact Assessment
           •   Healthcare Diagnosis and Records Management
           •   Genomic Analytics and Pharmaceutical and Medical
               Research
           •   Natural Resource Exploration
           •   Research Physics
           •   Road, Rail Traffic Management
           •   Security Surveillance
           •   Astronomy
           •   Logistics Management, Including GPS Tracking
           •   Electrical and Telecommunications Grids Mgmt
           •   Social Media – Facebook, LinkedIn, Google+, Twitter,
               YouTube, Pinterest
           •   TV shows – Star Trek, Person of Interest
•   Retail
                  – Amazon
                  – Dell
                  – Delta Sonic Car Washes
           •   Data Services
                  – IBM
SELECTED          – Google
USERS             – Amazon
           •   Financial Services
           •   Manufacturing
                  – McCain Foods – Frozen foods
                  – Boeing
           •   Transportation and Logistics
                  – Logistics – UPS, FedEx
                  – Rail – UP, CSX, TTX
                  – Air – United, AMR, Southwest
           •   Social Media
                  – LinkedIn
                  – Facebook
           •   Medicine and Health
                  – Center for Disease Control (CDC)
                  – J. Craig Venter Institute
           •   Science
                  – Livermore Labs
SELECTED EXAMPLES OF USE
         • Technical Elements
            – Direct on-line access
AMAZON      – Amazon specialized “Big Data”
              database
            – Distributed and extremely large data
              centers
            – Highly automated, high technology
              warehouses
            – High supplier [vendors] integration
         • User Benefits
            – Favorable prices
            – Suggested associated purchases
            – Individual interest advertising
SELECTED EXAMPLES OF USE
       • Technical Elements
          – Web driven order entry and custom
DELL        purchase configuration
          – Tracking of sales correspondence with
            promotional offers
          – Supplier re-order integration
       • User Benefits
          – Ability to customize purchase
          – Reasonable cost
          – Prompt delivery
SELECTED EXAMPLES OF USE
         • Technical components
            – Shared component and assembly designs
BOEING      – More detailed quality specifications and
              product tolerances
            – Control of assembly schedule
            – “Real time” exchange of technical
              information
            – Dissemination of best practices
         • Customer benefits
            – Faster deliveries
            – Increased product quality
            – Reduced defects
SELECTED EXAMPLES OF USE
                 • Techniques employed
                    – Collect cellphone and GPS signals, traffic
NEW                    cameras, and roadside sensors
JERSEY              – Identify accidents, traffic jams, and road damage
DEPARTMENT
                    – Emergency vehicles can be dispatched
OF
                    – Update traffic websites
TRANSPORTATION
                    – Sends messages to drivers’ GPS devices and
                       cellphones
                    – Uses supercomputers running Intrix application
                 • Benefits
                    – Eliminates traffic congestion faster
                    – More timely relief for accident victims
                    – Facilitate road paving scheduling
SELECTED EXAMPLES OF USE
           • Technical Elements
               – General LinkedIn Structure
                   • Personal Profile
LINKEDIN           • Individual Connections
                   • Groups
                   • Company Searches
                   • Questions and Answers
               – Attached application partners
                   • Amazon – Reading List
                   • Slideshare
           • User Benefits
               – Networking with professional contacts
               – Personal branding capabilities
               – Business Development
               – Job Search enhancement
LINKEDIN PROFILE PAGE SAMPLE
Facebook Page Sample
TRENDS
• More, bigger, faster – big data gets bigger
• Cloud services continue to expand
• Mobile computing expands
• Hadoop becomes more common
• Interactive data visualization will expand
• Social media type platforms will increase
  their prominence
• Analytics skills demands will increase
RESOURCES
• Books
   • Competing on Analytics, Davenport & Harris
   • Analytics at Work, Davenport, Harris, & Morison
   • The Data Asset, Fisher
   • Data Strategy, Adelman, Moss, Abai
• Websites
   • The Data Warehouse Institute – tdwi.org
   • IBM data analytics: www.ibm.com, smarter planet
SUMMARY
WHY USE BI AND ADVANCED ANALYTICS


INSIGHT
FROM
DATA

More Related Content

What's hot

Seminar datawarehousing
Seminar datawarehousingSeminar datawarehousing
Seminar datawarehousingKavisha Uniyal
 
Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Agnes Molnar
 
Metadata in data warehouse
Metadata in data warehouseMetadata in data warehouse
Metadata in data warehouseSiddique Ibrahim
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaRadhika Kotecha
 
White Paper - Data Warehouse Documentation Roadmap
White Paper -  Data Warehouse Documentation RoadmapWhite Paper -  Data Warehouse Documentation Roadmap
White Paper - Data Warehouse Documentation RoadmapDavid Walker
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 

What's hot (8)

Seminar datawarehousing
Seminar datawarehousingSeminar datawarehousing
Seminar datawarehousing
 
Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010
 
Metadata in data warehouse
Metadata in data warehouseMetadata in data warehouse
Metadata in data warehouse
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
White Paper - Data Warehouse Documentation Roadmap
White Paper -  Data Warehouse Documentation RoadmapWhite Paper -  Data Warehouse Documentation Roadmap
White Paper - Data Warehouse Documentation Roadmap
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 

Similar to Business Intelligence Data Analytics June 28 2012 Icpas V4 Final 20120625 8am

Bp presentation business intelligence and advanced data analytics september ...
Bp presentation business intelligence  and advanced data analytics september ...Bp presentation business intelligence  and advanced data analytics september ...
Bp presentation business intelligence and advanced data analytics september ...Barrett Peterson
 
Management information system database management
Management information system database managementManagement information system database management
Management information system database managementOnline
 
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...Pedro Mac Dowell Innecco
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsGDi Techno Solutions
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsAseda Owusua Addai-Deseh
 
Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introductionMurli Jha
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game ChangerCaserta
 
Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Martin Bém
 

Similar to Business Intelligence Data Analytics June 28 2012 Icpas V4 Final 20120625 8am (20)

Bp presentation business intelligence and advanced data analytics september ...
Bp presentation business intelligence  and advanced data analytics september ...Bp presentation business intelligence  and advanced data analytics september ...
Bp presentation business intelligence and advanced data analytics september ...
 
Foundations of business intelligence databases and information management
Foundations of business intelligence databases and information managementFoundations of business intelligence databases and information management
Foundations of business intelligence databases and information management
 
Chapter 5 data resource management
Chapter 5  data resource managementChapter 5  data resource management
Chapter 5 data resource management
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
 
Management information system database management
Management information system database managementManagement information system database management
Management information system database management
 
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno Solutions
 
Lecture1
Lecture1Lecture1
Lecture1
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Chapter5
Chapter5Chapter5
Chapter5
 
Dw 07032018-dr pl pradhan
Dw 07032018-dr pl pradhanDw 07032018-dr pl pradhan
Dw 07032018-dr pl pradhan
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
 
Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introduction
 
BI Introduction
BI IntroductionBI Introduction
BI Introduction
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer
 
Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24
 

More from Barrett Peterson

BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...
BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...
BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...Barrett Peterson
 
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...Barrett Peterson
 
Bp presentation ifrs large and small icpas chicago south presentation marc…
Bp presentation ifrs large and small icpas chicago south presentation marc…Bp presentation ifrs large and small icpas chicago south presentation marc…
Bp presentation ifrs large and small icpas chicago south presentation marc…Barrett Peterson
 
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...Barrett Peterson
 
20050722 Barrett Peterson Special Rreport
20050722 Barrett Peterson Special Rreport20050722 Barrett Peterson Special Rreport
20050722 Barrett Peterson Special RreportBarrett Peterson
 
FP&A For Sales And Operations For Sales And Operations Planning Icpas Ju...
FP&A For Sales And Operations For Sales And Operations Planning  Icpas Ju...FP&A For Sales And Operations For Sales And Operations Planning  Icpas Ju...
FP&A For Sales And Operations For Sales And Operations Planning Icpas Ju...Barrett Peterson
 
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010Barrett Peterson
 
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...Barrett Peterson
 
Bp April 12 2010 Presentation Accounting Principles Under Development What ...
Bp April 12 2010 Presentation Accounting Principles Under Development   What ...Bp April 12 2010 Presentation Accounting Principles Under Development   What ...
Bp April 12 2010 Presentation Accounting Principles Under Development What ...Barrett Peterson
 

More from Barrett Peterson (9)

BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...
BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...
BP Presentation Retirement Benefits Promises and Challenges North Shore, Nove...
 
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...
BP Presentation. Strategic Cost Management - A Profitability Tool, ICPAS Nort...
 
Bp presentation ifrs large and small icpas chicago south presentation marc…
Bp presentation ifrs large and small icpas chicago south presentation marc…Bp presentation ifrs large and small icpas chicago south presentation marc…
Bp presentation ifrs large and small icpas chicago south presentation marc…
 
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...
Bp Presentation, Ifrs Large And Small Icpas North Shore Presentation November...
 
20050722 Barrett Peterson Special Rreport
20050722 Barrett Peterson Special Rreport20050722 Barrett Peterson Special Rreport
20050722 Barrett Peterson Special Rreport
 
FP&A For Sales And Operations For Sales And Operations Planning Icpas Ju...
FP&A For Sales And Operations For Sales And Operations Planning  Icpas Ju...FP&A For Sales And Operations For Sales And Operations Planning  Icpas Ju...
FP&A For Sales And Operations For Sales And Operations Planning Icpas Ju...
 
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010
Strategic Cost Management – A Profitability Tool, Bp, Fla, November 20, 2010
 
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...
Bp Presentation, Fasb And Iasb Convergence, And Ofrs For Sm Es, Finance Leade...
 
Bp April 12 2010 Presentation Accounting Principles Under Development What ...
Bp April 12 2010 Presentation Accounting Principles Under Development   What ...Bp April 12 2010 Presentation Accounting Principles Under Development   What ...
Bp April 12 2010 Presentation Accounting Principles Under Development What ...
 

Business Intelligence Data Analytics June 28 2012 Icpas V4 Final 20120625 8am

  • 1. BUSINESS INTELLIGENCE & ADVANCED ANALYTICS The Search for Patterns, Waldo, and Black Swans Barrett Peterson, C.P.A. ICPAS Fox River Trail Chapter, June 28, 2012
  • 2. WHY BUSINESS INTELLIGENCE? Information Good Data Good Analysis
  • 4. A LITTLE BACKGROUND HISTORY • Computer based business A trip down intelligence systems is an idea memory lane that is middle aged – about 40 . Previously described as: – Decision support systems [DSS] – Executive information systems [EIS] – Management information systems [MIS]
  • 5. A LITTLE BACKGROUND • Internet Development – ARAPNET and others – 1960s – Internet Protocols – 1982, presumably by Al Gore History • IBM researcher Edgar Codd credited with development of relational data base theory in 1970. Important • IBM’s Donald Chamberlin and Raymond Boyce develop Technology structured query language [SQL] in the early 1970s to Inventions manipulate and retrieve data from IBM’s early relational data base management system • World Wide Web and 1st web browser invented by Tim Berners-Lee in 1990 by combining the internet, hypertext mark-up language, and Uniform Resource Locator [URL] system. Became Nexus. • Mosaic, designed by Marc Andressen became the first commercial web browser [Netscape]. • Development of big data enabling database designs and high speed processing during the last 15 years.
  • 6. A LITTLE BACKGROUND • Development of the primary infrastructure – Database design – Processing and Storage Hardware History – Server Development and Massively Parallel Processing • Improved telecommunications speed Drivers • Hardware miniaturization, capacity, and speed Enabling – Memory [RAM] capacity BI and – Storage capacity and transfer speed Advanced – Bus speed Analytics – Video processing capacity and speed • Increased hardware speed and capacity • Digital formats for sensors, cameras, RFID, and other data collection sources • Mobile computing • “Cloud” capability exploits many of these developments
  • 7. A LITTLE BACKGROUND • Analytics TERMINOLOGY • Business Intelligence A consultant’s • Knowledge Management collection of • Content Management confusing names - a sampler • Data Mining • Big Data • Data Integration • Gameification • Blob [Binary Large Object]
  • 8. A LITTLE BACKGROUND • CPU speed and power – Moore’s law Drivers – Multi-core chips And – Solid State Memory Enablers • Storage improvement and cost reduction – Greatly increased capacity of – Greatly increased access/transfer speed Big – Greatly reduced cost • Data collection from a wide range of devices Data • Data communications – speed and volume • Database management techniques and software • Application speed and power
  • 10. TONIGHT’S CRITICAL DEFINITIONS Business A system comprised of “computer” Intelligence hardware and software to: • Collect, “clean”, filter, and integrate data • Store data [hardware and software] • Provide knowledge management, analytical , and presentation tools to translate data into decision useful information
  • 11. TONIGHT’S CRITICAL DEFINITIONS • Prehistoric – Mainframe Era – DSS, EIS, MIS – Hierarchical Master Data Files Business • The Current Era [Primarily] – Business Intelligence – Primarily “structured” data [data that can be Intelligence represented in relational /dimensional tables or flat files], and BLOB [binary large object] formats Generations – Analysis of “known” patterns – Presented in tables, simple charts, and dashboards • Emerging – Big Data and Advanced Analytics – to discover new, changing, or variable patterns – A wide variety of “unstructured” digital data formats added to “structured” data – Emerging storage structures – “Exploratory” analytics – Zoomable User Interface [ZUIs] – Solid State Memory and Solid-State Drives
  • 12. THE HARDWARE AND SOFTWARE ELEMENTS OF BUSINESS INTELLIGENCE
  • 13. BUSINESS INTELLIGENCE ELEMENTS • Computer – CPU, Memory, and Operating System Software • Data Collection – Master Data Management – Collection Processes and Devices – Data Cleansing Processes and Software Principal • Data Storage Components – Physical Devices and Storage Management Software – Data Management and Integration for – Database Software Storage • Relational – Traditional ERP/Transaction systems Maximum • Dimensional – Traditional Data Warehouse, including Application associated BLOB • Distributed , Multiple Server, Storage Systems • NoSQL [Not Only SQL] Distributed Operational Stores • Hadoop for Highly Parallel Processing and Intensive Data Analytics Applications • Middleware Software • Business Intelligence Application Software – OLAP, Dashboard, and Chart Reports – Statistical Analysis and Presentation Tools
  • 14. BUSINESS INTELLIGENCE ELEMENTS • Data Governance and Management – Uniform terminology – Uniform meaning DATA – Uniform units of measure ISSUES: – Metadata • Data Structure and Attributes THE – Structured - Relational/Dimensional CORNERESTONE – Unstructured – Rate of change, context, and other attributes • Data Collection and Preparation – Filtering, particularly “Big Data” – Extract, Transform, Load [ETL] for “structured data • Data Base File Systems • Data Storage and Retrieval – Capacity – Access/Retrieval speed
  • 15. BUSINESS INTELLIGENCE ELEMENTS • Metadata management – Business definitions , rules, sources – Technical attributes, such as type, scale, transformation methods MASTER – Processing requirements – filtering, ETL, aggregation, summarization DATA • Data Definitions and data dictionaries GOVERNANCE – Name – Unit(s) of measure AND • Data collection and filtering or transforming requirements MANAGEMENT – Sources – internal and external – Context addition/filtering requirements • Data integration specifications – Multiple platforms and applications – Mapping to intermediate data marts • Privacy requirements – Personal Identifying data – Laws: HIPPA, Privacy act
  • 16. BUSINESS INTELLIGENCE ELEMENTS • Data Structures – “Structured” Data , principally text and Data numbers capable of incorporation in relational or dimensional tables Structures – “Unstructured” Data, not suitable for relational and tables, many in newer data formats Attributes • Big Data Attributes Are Critical – Both “structured” and “unstructured” – The four major “Vs” of big data Drivers • Volume - huge • Velocity – fast changing, unlike structured • Variety – format and content • Variability – lacks the consistency of structured data
  • 17. BUSINESS INTELLIGENCE ELEMENTS • Content Structure – Traditional Financial Data – Numerical Data – Sign/Debit or Credit Structures – Text Descriptions • Database Management Structures IT – Legacy Systems: Hierarchical and Network Lingo – Transaction Systems: Relational • Relations [Tables]. Attribute [columns], Instance [Rows] • Rules: no duplicate rows; single value for attributes – Warehouse Systems: Dimensional • Facts [data items, usually a dollar amount or unit count] • Measures – dollar or count for facts • Dimensions – groups of hierarchies and descriptors of various aspects or context for the facts/measures • Microsoft Office and Similar File Formats • Photography and Art
  • 18. Business Intelligence Elements RELATIONAL TABLE ILLUSTRATION “Tuple” is borrowed from mathematics and set theory and is used in database design to refer to the attributes of an “item” or “value” [row], the subject or title of the table. Value examples include customers, vendors, orders, product SKUs
  • 20. BUSINESS INTELLIGENCE ELEMENTS • Numbers and words/letters – Relational/Dimensional – Spreadsheets – Word Processing documents DATA • Sound and Music • Photo FILE • Video TYPE • Video Game • CAD Design CATEGORIES, • Graphical ALMOST – PDF – Raster, Vector Graphics ENGLISH – Statistical Visualization • Scientific • Signal • XML [Web based mark-up formats] • Geo-Location • Web Logs
  • 21. BUSINESS INTELLIGENCE ELEMENTS • Collection – Company transaction/ERP systems – Purchased, such as Nielsen, IRI DATA – Vendor supplied, such as bank transactions COLLECTION • Filtering AND – Adding context such as date or location – Eliminating “chatter” from high volume data PREPARATION – Error correction • Aggregation & Integration
  • 22. DATA COLLECTION - RFID RFID tag RFID tag reader
  • 23. DATA COLLECTION Various sensors Surveillance Camera
  • 24. DATA FILTERING AND CLEANSING IS IMPORTANT
  • 25. BUSINESS INTELLIGENCE ELEMENTS • Relational – SQL • Dimensional – SQL, OLAP DATA • Binary Large Object [BLOB] – binary data, BASE most often photos, video, audio, or PDF files FILE • Massively Parallel-Processing [MPP] SYSTEMS • Apache Hadoopp Distributed File System [HDFS] – Java – Google File System [GFS], used solely by Google – Google Map Reduce • Amazon S3 filesystem [used by Amazon] • NoSQL • Resource Description Framework [RDF] Databases, like Big Data
  • 26. BUSINESS INTELLIGENCE ELEMENTS • Significant Originators – Google MapReduce – Google File System [GFS] SELECT – Amazon S3 filesystem BIG DATA • Continuing Developments DATABASE – Apache Software Foundation MANAGEMENT • Apache Cassandra distributed database management system SYSTEMS • Apache Hadoop software framework to support data-intensive distributed applications • Apache Hive, a data warehouse structure built on Hadoop • Pig - high level programming language for creating MapReduce programs with Hadoop – Significant to Technology Development • Facebook • Yahoo • LinkedIn [Project Voldemort]
  • 27. BUSINESS INTELLIGENCE ELEMENTS • Convergence aspect of mainframes and servers COMPUTER • Massively parallel , multiple server, HARDWARE distributed processing, in multiple data CONSIDERATIONS centers – grid computing • Multi-core , high capacity, lower power consumption, CPUs • Memory servers for RAM employing DRAM comprised of Fully Buffered Direct Inline Memory Modules [FBDIMM] • Solid state flash drive storage • Greatly improved., and less costly, hard drive storage
  • 28. BI CONFIGURATION SIZES Small – BI, but not Big Data Large – IBM Sequoia At capable Medium Livermore Labs
  • 29. BUSINESS INTELLIGENCE ELEMENTS • Data Storage Terminology – Memory – CPU direct connected, often called RAM – Storage – not directly connected to the CPU DATA • Data Storage Device Types STORAGE – Memory HARDWARE/ • DRAM – based • Flash memory – based Solid-State Drives [SSDs] SOFTWARE – Storage • Hard Disk Drives [HDD] • Optical Drives – CDs, DVDs • Data Storage Systems – Direct Attached – Network Attached Storage [NAS] – Storage Area Network [SAN] – pNFS – Parallel Network file systems
  • 30. BUSINESS INTELLIGENCE ELEMENTS • Traditional Reporting Systems – ERP systems, including extract and presentation tools – Downloads to Excel and similar programs for analysis using functions and pivot tables BI • Presentation Tools APPLICATION • Specialized Analytics SOFTWARE – IBM InfoSphere BigInsights and InfoSphere Streams – IBM Netezza – ParAccel Analytic Database – EMC Greenplum – SAS High Performance Computing – Information Builders WebFocus • Exploratory Tools, like IBM SPSS [originally Statistical Package for the Social Sciences] – Data mining with specialized algorithms – Statistical analysis and related charting software
  • 31. BUSINESS INTELLIGENCE ELEMENTS • BI Reporting • Predictive Analytics ADVANCED ANALYTICS • Data Exploration - correlation APPLICATION • Data Visualization - graphical TYPES • Instrumentation Analytics • Content Analytics • Web Analytics • Functional Applications • Industry Applications
  • 33. ALGORITHMS CAN BE TREACHEROUS DATA MODELS HAVE LIMITS
  • 34. BI AND ADVANCE ANALYTICS OUTPUT ILLUSTRATIONS
  • 35. EXAMPLES OF USES
  • 36. Sales and Operations Planning • Financial Instruments Modeling • Production Control • Online Retail • Economics and Policy Development SELECTED • Agriculture/Farming EXAMPLES • Weather Analysis/Prediction OF USES • Environmental Impact Assessment • Healthcare Diagnosis and Records Management • Genomic Analytics and Pharmaceutical and Medical Research • Natural Resource Exploration • Research Physics • Road, Rail Traffic Management • Security Surveillance • Astronomy • Logistics Management, Including GPS Tracking • Electrical and Telecommunications Grids Mgmt • Social Media – Facebook, LinkedIn, Google+, Twitter, YouTube, Pinterest • TV shows – Star Trek, Person of Interest
  • 37. Retail – Amazon – Dell – Delta Sonic Car Washes • Data Services – IBM SELECTED – Google USERS – Amazon • Financial Services • Manufacturing – McCain Foods – Frozen foods – Boeing • Transportation and Logistics – Logistics – UPS, FedEx – Rail – UP, CSX, TTX – Air – United, AMR, Southwest • Social Media – LinkedIn – Facebook • Medicine and Health – Center for Disease Control (CDC) – J. Craig Venter Institute • Science – Livermore Labs
  • 38. SELECTED EXAMPLES OF USE • Technical Elements – Direct on-line access AMAZON – Amazon specialized “Big Data” database – Distributed and extremely large data centers – Highly automated, high technology warehouses – High supplier [vendors] integration • User Benefits – Favorable prices – Suggested associated purchases – Individual interest advertising
  • 39. SELECTED EXAMPLES OF USE • Technical Elements – Web driven order entry and custom DELL purchase configuration – Tracking of sales correspondence with promotional offers – Supplier re-order integration • User Benefits – Ability to customize purchase – Reasonable cost – Prompt delivery
  • 40. SELECTED EXAMPLES OF USE • Technical components – Shared component and assembly designs BOEING – More detailed quality specifications and product tolerances – Control of assembly schedule – “Real time” exchange of technical information – Dissemination of best practices • Customer benefits – Faster deliveries – Increased product quality – Reduced defects
  • 41. SELECTED EXAMPLES OF USE • Techniques employed – Collect cellphone and GPS signals, traffic NEW cameras, and roadside sensors JERSEY – Identify accidents, traffic jams, and road damage DEPARTMENT – Emergency vehicles can be dispatched OF – Update traffic websites TRANSPORTATION – Sends messages to drivers’ GPS devices and cellphones – Uses supercomputers running Intrix application • Benefits – Eliminates traffic congestion faster – More timely relief for accident victims – Facilitate road paving scheduling
  • 42. SELECTED EXAMPLES OF USE • Technical Elements – General LinkedIn Structure • Personal Profile LINKEDIN • Individual Connections • Groups • Company Searches • Questions and Answers – Attached application partners • Amazon – Reading List • Slideshare • User Benefits – Networking with professional contacts – Personal branding capabilities – Business Development – Job Search enhancement
  • 45. TRENDS • More, bigger, faster – big data gets bigger • Cloud services continue to expand • Mobile computing expands • Hadoop becomes more common • Interactive data visualization will expand • Social media type platforms will increase their prominence • Analytics skills demands will increase
  • 46. RESOURCES • Books • Competing on Analytics, Davenport & Harris • Analytics at Work, Davenport, Harris, & Morison • The Data Asset, Fisher • Data Strategy, Adelman, Moss, Abai • Websites • The Data Warehouse Institute – tdwi.org • IBM data analytics: www.ibm.com, smarter planet
  • 47. SUMMARY WHY USE BI AND ADVANCED ANALYTICS INSIGHT FROM DATA