This deck contains insights on Impact of Business Intelligence, Visualization, Artificial Intelligence, Machine Learning, Deep Learning, Use Cases and how to get started...
The document discusses opportunities for using big data in statistics. It describes how large amounts of digital data are being generated daily and how traditional tools cannot handle this volume of data. Significant knowledge is hidden in big data that can help address important issues. The document outlines how statistics play a key role in economic and political decisions and proposes using big data, such as telecom data, as a new source for statistics to enrich decision making. This would provide a low-cost, endless source of data. The document advocates designing systems to support various analysis techniques and tailoring approaches to specific domains using open standards.
The document discusses big data analytics. It begins by defining big data as large datasets that are difficult to capture, store, manage and analyze using traditional database management tools. It notes that big data is characterized by the three V's - volume, variety and velocity. The document then covers topics such as unstructured data, trends in data storage, and examples of big data in industries like digital marketing, finance and healthcare.
Big data introduction - Big Data from a Consulting perspective - SogetiEdzo Botjes
Big data introduction - Sogeti - Consulting Services - Business Technology - 20130628 v5
This is a small introduction to the topic Big Data and a small vision on how to enable a (big) company in using big data and embed it into the organisation.
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
A large transportation company needed help optimizing their transportation model to reduce costs. Teradata developed a transportation optimization model and user interface tool that takes forecasted volumes and determines the most optimal transportation modes and routes to deliver products to customers while considering capacities, constraints, and business rules. The tool selects the lowest cost solutions for each material/customer pair and allows users to conduct "what-if" analysis of scenarios to further reduce total costs.
Looking at what is driving Big Data. Market projections to 2017 plus what is are customer and infrastructure priorities. What drove BD in 2013 and what were barriers. Introduction to Business Analytics, Types, Building Analytics approach and ten steps to build your analytics platform within your company plus key takeaways.
Big Data and Hadoop Training batch in Pune is scheduled to commence on December 7th, 2013.This batch will be as per a new revamped four day schedule, contents and focus, based on feedback from participants of earlier courses. The training is conducted in a workshop like environment with an effective blend of hands-on practicals and assignments to augment the fundamental theory covered.
About the Faculty:
He is a Doctorate in Engineering and an industry veteran with more than twenty five years experience in launching new technologies, products and businesses. He has been involved in acquiring five patents for the company that he has worked for.
Big Data Analytics – Why?
Data is now generated by more sources and at ever increasing rates. Examples include Social Media sites, GPS based tracking systems, point of sale equipment, etc. The ability to process such data can provide that essential edge required for business success. Demand for Big Data professionals is rapidly increasing. Knowledge of Big Data can provide an advantage leading to faster professional advancement
About this course
This course on Big Data Analytics for Business is a combination of essential fundamentals, practical techniques, hands-on sessions on Hadoop, and case studies to cement all this together.
By completing this course you will be able to …
Understand fundamentals of analytics: Descriptive, Predictive and Prescriptive Analytics
Know what ‘Big Data’, Map Reduce and Hadoop are all about
Get a grip on the structure of Big Data applications
Effectively use Big Data techniques like Map Reduce and tools like Hadoop, Hive, Hbase, Pig
Choose the most appropriate tools to solve Big Data problems
Identify, propose and lead Big Data projects in your organizations
Course Content -
What is Big Data?
Overview of Big Data tools and techniques
In-depth coverage of Map-reduce techniques to manage Big Data
Hadoop - In Depth
HDFS – In Depth
Installing and managing Hadoop – Hands-on
Introduction to Hadoop Clusters
Hands-on session using native installation and Amazon EMR implementation of Hadoop
The Hadoop ecosystem: Pig, HIVE, HBase, Pig, SQOOP and Flume
Analytics: Descriptive, Predictive and Prescriptive
What is Big Data Analytics
Introducing Analytics in the enterprise: Case Studies
Trends in Big Data Analytics
The course takes a "hands-on" approach to ensure that the basics are understood very well and assimilated concepts are applied in practice.
Essential pre-requisite for practitioner course: Java programming language.
Note: Basic Java Module for participants those who are new to Java.
The document discusses opportunities for using big data in statistics. It describes how large amounts of digital data are being generated daily and how traditional tools cannot handle this volume of data. Significant knowledge is hidden in big data that can help address important issues. The document outlines how statistics play a key role in economic and political decisions and proposes using big data, such as telecom data, as a new source for statistics to enrich decision making. This would provide a low-cost, endless source of data. The document advocates designing systems to support various analysis techniques and tailoring approaches to specific domains using open standards.
The document discusses big data analytics. It begins by defining big data as large datasets that are difficult to capture, store, manage and analyze using traditional database management tools. It notes that big data is characterized by the three V's - volume, variety and velocity. The document then covers topics such as unstructured data, trends in data storage, and examples of big data in industries like digital marketing, finance and healthcare.
Big data introduction - Big Data from a Consulting perspective - SogetiEdzo Botjes
Big data introduction - Sogeti - Consulting Services - Business Technology - 20130628 v5
This is a small introduction to the topic Big Data and a small vision on how to enable a (big) company in using big data and embed it into the organisation.
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
A large transportation company needed help optimizing their transportation model to reduce costs. Teradata developed a transportation optimization model and user interface tool that takes forecasted volumes and determines the most optimal transportation modes and routes to deliver products to customers while considering capacities, constraints, and business rules. The tool selects the lowest cost solutions for each material/customer pair and allows users to conduct "what-if" analysis of scenarios to further reduce total costs.
Looking at what is driving Big Data. Market projections to 2017 plus what is are customer and infrastructure priorities. What drove BD in 2013 and what were barriers. Introduction to Business Analytics, Types, Building Analytics approach and ten steps to build your analytics platform within your company plus key takeaways.
Big Data and Hadoop Training batch in Pune is scheduled to commence on December 7th, 2013.This batch will be as per a new revamped four day schedule, contents and focus, based on feedback from participants of earlier courses. The training is conducted in a workshop like environment with an effective blend of hands-on practicals and assignments to augment the fundamental theory covered.
About the Faculty:
He is a Doctorate in Engineering and an industry veteran with more than twenty five years experience in launching new technologies, products and businesses. He has been involved in acquiring five patents for the company that he has worked for.
Big Data Analytics – Why?
Data is now generated by more sources and at ever increasing rates. Examples include Social Media sites, GPS based tracking systems, point of sale equipment, etc. The ability to process such data can provide that essential edge required for business success. Demand for Big Data professionals is rapidly increasing. Knowledge of Big Data can provide an advantage leading to faster professional advancement
About this course
This course on Big Data Analytics for Business is a combination of essential fundamentals, practical techniques, hands-on sessions on Hadoop, and case studies to cement all this together.
By completing this course you will be able to …
Understand fundamentals of analytics: Descriptive, Predictive and Prescriptive Analytics
Know what ‘Big Data’, Map Reduce and Hadoop are all about
Get a grip on the structure of Big Data applications
Effectively use Big Data techniques like Map Reduce and tools like Hadoop, Hive, Hbase, Pig
Choose the most appropriate tools to solve Big Data problems
Identify, propose and lead Big Data projects in your organizations
Course Content -
What is Big Data?
Overview of Big Data tools and techniques
In-depth coverage of Map-reduce techniques to manage Big Data
Hadoop - In Depth
HDFS – In Depth
Installing and managing Hadoop – Hands-on
Introduction to Hadoop Clusters
Hands-on session using native installation and Amazon EMR implementation of Hadoop
The Hadoop ecosystem: Pig, HIVE, HBase, Pig, SQOOP and Flume
Analytics: Descriptive, Predictive and Prescriptive
What is Big Data Analytics
Introducing Analytics in the enterprise: Case Studies
Trends in Big Data Analytics
The course takes a "hands-on" approach to ensure that the basics are understood very well and assimilated concepts are applied in practice.
Essential pre-requisite for practitioner course: Java programming language.
Note: Basic Java Module for participants those who are new to Java.
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
We are in the middle of a data flood and we need to figure out how to tame it without drowning. Most of what has been written about Big Data is focused on selling hardware and services. But what about a Big Data Strategy that guides hardware and software decisions? While virtually every major organization is faced with the challenge of figuring out the approach for and the requirements of this new development, jumping into the fray hastily and unprepared will only reproduce the same dismal IT project results as previously experienced. Join Dr. Peter Aiken as he will debunk a number of misconceptions about Big Data as your un-typical IT project. He will provide guidance on how to establish realistic Big Data management plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers without getting lost in the hype.
Takeaways:
- The means by which Big Data techniques can complement existing data management practices
- The prototyping nature of practicing Big Data techniques
- The distinct ways in which utilizing Big Data can generate business value
- Bigger Data isn’t always Better Data
This document summarizes a presentation on creating value from big data and business analytics. It discusses research conducted with three large UK organizations: a mobile telecom operator, television company, and integrated transport authority. The research aimed to understand how these organizations leverage their big data and analytics capabilities to create business value. Key findings included using mobile network data for location-based services and public alerts, predictive modeling of viewers for targeted advertising, and travel data analysis to improve transport operations and customer experiences.
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
Monday was another great conference by MinneAnalytics! #MinneFRAMA was a great success with over 1,100 attendees at Science Museum of Minnesota. Alison Rempel Brown is a great host! A Teradata colleague told me that her post about my presentation "blew up" with hits and she got over 2K views, and 60+ likes. I'm proud to be a part of this great #datascience organization brining #machinelearning and #artificialintelligence #analytics to our #bigdata clients. If you want my slides, here they are.
Simplifying Big Data Analytics for the BusinessTeradata Aster
Tasso Argyros, Co-Founder & Co-President, Teradata Aster presents at the 2012 Big Analytics Roadshow.
The opportunity exists for organizations in every industry to unlock the power of iterative, big data analysis with new applications such as digital marketing optimization and social network analysis to improve their bottom line. Big data analysis is not just the ability to analyze large volumes of data, but the ability to analyze more varieties of data by performing more complex analysis than is possible with more traditional technologies. This session will demonstrate how to bring the science of data to the art of business by empowering more business users and analysts with operationalized insights that drive results. See how data science is making emerging analytic technologies more accessible to businesses while providing better manageability to enterprise architects across retail, financial services, and media companies.
What is the impact of Big Data on Analytics from a Data Science perspective.
Presented at the Big Data and Analytics Summit 2014, Nasscom by Mamatha Upadhyaya.
Slide 2: Etymology: The etymology of the term ‘Big Data’ can be traced back to the mid-1990s, when it was first used by John Mashey to refer to handling and analysis of massive datasets. However, by 2013, ‘Big Data’ was already being declared obsolescent as a meaningful term by some, as it was too wide ranging and vague in definition (e.g. de Goes, 2013).
Side 6: Vagaries: Kitchin argues that it is velocity and these additional key characteristics that set Big Data apart and make them a “disruptive innovation – one that radically changes the nature of data and what can be done with them” (Kitchin, 2014). However, there is no one characteristic profile that all Big Data fit and they can take multiple forms.
Slide 8: Ethics: Several ethical questions have been raised about the scope of data being generated and retained; such as those concerning privacy, informed consent, and protection from harm.
These questions raise wider issues about what kinds of data should be combined and analysed, and the purposes to which the resulting information should be put.
Slide 9: Inequalities: Challenges of inequality have also been posed:
Whose data traces will be analysed? It is likely that only those who are better off will be represented (as they are more likely to use social media, etc.)
Access and use of open data is unlikely to be equally available to everyone due to existing structural inequalities (Eynon, 2013)
Slide 11: What do Big Data actually tell us? Eynon (2013) argues that Big Data is concerned with capturing and examining patterns, and tells us more about what people actually do than about what they say they do. However, this is not sufficient for all kinds of social science research. We need to understand the meanings of behaviours which cannot be inferred simply from tracking specific patterns.
In order that Big Data are used appropriately, we need to ensure understanding of what kinds of research can or cannot be carried out using them. Big Data should not be seen as [a] “technical fix” for research, but should be used to empower, support and facilitate practice and critical research.
This document provides an overview of big data, including its definition, characteristics, categories, sources, storage, analytics, challenges and opportunities. Big data is large and complex datasets that are difficult to process using traditional database management tools. It is characterized by the 5 V's - volume, variety, velocity, value and veracity. Big data comes from both internal and external sources and can be structured, unstructured or semi-structured. It requires specialized storage technologies like Hadoop and NoSQL databases. Analytics on big data uses techniques like machine learning, regression analysis and social network analysis to gain insights. The growth of big data presents both challenges in processing diverse and voluminous data as well as opportunities to generate value.
Big Data and The Future of Insight - Future FoundationForesight Factory
As Big Data sweeps through consumer-facing businesses, we ask:
- If Big Data is truly a revolution, then what (and whom) will it eliminate or elevate?
- What value will still be derived from conventional market research and brand-building techniques?
- If every brand is backed by Big Data, can every brand prosper?
For more information please contact info@futurefoundation.net or visit www.futurefoundation.net
GGV Capital: Venture Investing and the Cloud (2012)GGV Capital
This document discusses venture investing in cloud computing. It provides an overview of why VCs continue to see opportunities in the cloud sector. The presentation agenda covers trends disrupting the cloud like mobile and big data, as well as opportunities in serving small and medium businesses. The document concludes with advice for cloud startups on effectively approaching VCs for funding, emphasizing differentiation, market size, scalability, financial model, and chemistry over legal terms.
This document discusses big data and provides an overview of key concepts:
- Big data is defined as datasets that are too large or complex for traditional data management tools to handle. It is characterized by volume, velocity, and variety.
- Big data comes from a variety of sources like social media, sensors, web logs, and transaction systems. It is growing rapidly due to the digitization of information.
- Big data can be used for applications like enhancing customer insights, optimizing operations, and extending security and intelligence capabilities. Example use cases are described.
- Architecting solutions for big data requires handling its scale and integrating diverse data types and sources. Both traditional and new analytics approaches are needed.
Gayatri Patel, eBay, presents at the Big Analytics 2012 Roadshow
The wonders of what data can do for an organization is measured in the productivity and competitiveness of their team's decisions. Some believe more data is the key. Agreed...but good decisions require more than just deriving intelligence from big data. In this dynamic market, the need to socialize and evolve ideas with other teams, quickly correlate information across sources, and test ideas to fail fast early are strong enablers to gain competitive footing. eBay¹s analytic and technology advancements garners insights and approaches that continue to help our employees tell their "data stories" and make better decisions.
Big data refers to extremely large data sets that are too large to be processed using traditional data processing applications. It is characterized by high volume, variety, and velocity. Examples of big data sources include social media, jet engines, stock exchanges, and more. Big data can be structured, unstructured, or semi-structured. Key characteristics include volume, variety, velocity, and variability. Analyzing big data can provide benefits like improved customer service, better operational efficiency, and more informed decision making for organizations in various industries.
1. The document discusses various applications and uses of big data across different domains like government, healthcare, transportation, and more. It also covers big data techniques, platforms, analytics capabilities, and challenges.
2. Key topics covered include how cities can use mobile apps to get real-time road bump data, how insurance companies can price policies more granularly, and how companies can better hire and retain employees through recruiting analytics.
3. The summary also mentions concepts like data science, decision making, data types, use cases, capabilities, challenges, and roles like data scientists that are important in the big data field.
The document discusses how data has become a central business asset and strategic advantage. It notes that the growth of data from sources like the Internet of Things means that variety, not just volume or velocity, will be important. New business processes will revolve around data, which will become more valuable over the next decade. It also provides examples of how companies like eBay and Groupon have used data for competitive advantages like identifying top sellers.
Applications of Big Data Analytics in BusinessesT.S. Lim
The document discusses big data and big data analytics. It begins with definitions of big data from various sources that emphasize the large volumes of structured and unstructured data. It then discusses key aspects of big data including the three Vs of volume, variety, and velocity. The document also provides examples of big data applications in various industries. It explains common analytical methods used in big data including linear regression, decision trees, and neural networks. Finally, it discusses popular tools and frameworks for big data analytics.
Big data refers to the massive amounts of digital data being created every day from various sources such as social media, sensors, photos, videos, and online activities. This data is characterized by its volume, velocity, variety, and veracity. New technologies allow businesses and organizations to analyze these large, diverse, and complex data sets to gain insights and add value in many ways such as improving customer targeting, optimizing processes, enhancing health research, bolstering security efforts, and upgrading city infrastructure. While big data is transforming many industries, its full potential is just beginning to be realized.
What is Big Data?
Big Data Laws
Why Big Data?
Industries using Big Data
Current process/SW in SCM
Challenges in SCM industry
How Big data can solve the problems?
Migration to Big data for an SCM industry
This document provides an overview of big data in various industries. It begins by defining big data and explaining the three V's of big data - volume, variety, and velocity. It then discusses examples of big data in digital marketing, financial services, and healthcare. For digital marketing, it discusses database marketers as pioneers of big data and how big data is transforming digital marketing. For financial services, it discusses how big data is used for fraud detection and credit risk management. It also provides details on algorithmic trading and how it crunches complex interrelated big data. Overall, the document outlines how big data is being leveraged across industries to improve operations, increase revenues, and achieve competitive advantages.
Big data comes from a variety of sources such as sensors, social media, digital pictures, purchase transactions, and cell phone GPS signals. The volume of data created each day is vast, with over 2.5 quintillion bytes created in the last two years alone. Big data has four characteristics - volume, variety, velocity and value. It refers to both the large amount of data and the different types of structured and unstructured data. This data is generated and moves around at high speeds. While big data brings value, it can be difficult to analyze and extract useful insights from due to its scale and complexity. Technologies like Hadoop, HDFS, and MapReduce help process and analyze big data across large clusters of servers in a
Big data refers to large and complex data sets that are difficult to manage and analyze using traditional data management tools. It is generated from various sources like social media, scientific instruments, mobile devices, and sensor technology. Big data provides opportunities for insights and smart solutions but also poses challenges in processing, analyzing, and gaining insights from such large volumes of data. For managers in India, big data is highly relevant as the Indian analytics industry is growing rapidly and is expected to reach $16 billion by 2025, with big data being a major driver of growth in industries. Digitalization is also expanding the big data market in India as internet and smartphone usage increases across more regions of the country.
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
We are in the middle of a data flood and we need to figure out how to tame it without drowning. Most of what has been written about Big Data is focused on selling hardware and services. But what about a Big Data Strategy that guides hardware and software decisions? While virtually every major organization is faced with the challenge of figuring out the approach for and the requirements of this new development, jumping into the fray hastily and unprepared will only reproduce the same dismal IT project results as previously experienced. Join Dr. Peter Aiken as he will debunk a number of misconceptions about Big Data as your un-typical IT project. He will provide guidance on how to establish realistic Big Data management plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers without getting lost in the hype.
Takeaways:
- The means by which Big Data techniques can complement existing data management practices
- The prototyping nature of practicing Big Data techniques
- The distinct ways in which utilizing Big Data can generate business value
- Bigger Data isn’t always Better Data
This document summarizes a presentation on creating value from big data and business analytics. It discusses research conducted with three large UK organizations: a mobile telecom operator, television company, and integrated transport authority. The research aimed to understand how these organizations leverage their big data and analytics capabilities to create business value. Key findings included using mobile network data for location-based services and public alerts, predictive modeling of viewers for targeted advertising, and travel data analysis to improve transport operations and customer experiences.
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
Monday was another great conference by MinneAnalytics! #MinneFRAMA was a great success with over 1,100 attendees at Science Museum of Minnesota. Alison Rempel Brown is a great host! A Teradata colleague told me that her post about my presentation "blew up" with hits and she got over 2K views, and 60+ likes. I'm proud to be a part of this great #datascience organization brining #machinelearning and #artificialintelligence #analytics to our #bigdata clients. If you want my slides, here they are.
Simplifying Big Data Analytics for the BusinessTeradata Aster
Tasso Argyros, Co-Founder & Co-President, Teradata Aster presents at the 2012 Big Analytics Roadshow.
The opportunity exists for organizations in every industry to unlock the power of iterative, big data analysis with new applications such as digital marketing optimization and social network analysis to improve their bottom line. Big data analysis is not just the ability to analyze large volumes of data, but the ability to analyze more varieties of data by performing more complex analysis than is possible with more traditional technologies. This session will demonstrate how to bring the science of data to the art of business by empowering more business users and analysts with operationalized insights that drive results. See how data science is making emerging analytic technologies more accessible to businesses while providing better manageability to enterprise architects across retail, financial services, and media companies.
What is the impact of Big Data on Analytics from a Data Science perspective.
Presented at the Big Data and Analytics Summit 2014, Nasscom by Mamatha Upadhyaya.
Slide 2: Etymology: The etymology of the term ‘Big Data’ can be traced back to the mid-1990s, when it was first used by John Mashey to refer to handling and analysis of massive datasets. However, by 2013, ‘Big Data’ was already being declared obsolescent as a meaningful term by some, as it was too wide ranging and vague in definition (e.g. de Goes, 2013).
Side 6: Vagaries: Kitchin argues that it is velocity and these additional key characteristics that set Big Data apart and make them a “disruptive innovation – one that radically changes the nature of data and what can be done with them” (Kitchin, 2014). However, there is no one characteristic profile that all Big Data fit and they can take multiple forms.
Slide 8: Ethics: Several ethical questions have been raised about the scope of data being generated and retained; such as those concerning privacy, informed consent, and protection from harm.
These questions raise wider issues about what kinds of data should be combined and analysed, and the purposes to which the resulting information should be put.
Slide 9: Inequalities: Challenges of inequality have also been posed:
Whose data traces will be analysed? It is likely that only those who are better off will be represented (as they are more likely to use social media, etc.)
Access and use of open data is unlikely to be equally available to everyone due to existing structural inequalities (Eynon, 2013)
Slide 11: What do Big Data actually tell us? Eynon (2013) argues that Big Data is concerned with capturing and examining patterns, and tells us more about what people actually do than about what they say they do. However, this is not sufficient for all kinds of social science research. We need to understand the meanings of behaviours which cannot be inferred simply from tracking specific patterns.
In order that Big Data are used appropriately, we need to ensure understanding of what kinds of research can or cannot be carried out using them. Big Data should not be seen as [a] “technical fix” for research, but should be used to empower, support and facilitate practice and critical research.
This document provides an overview of big data, including its definition, characteristics, categories, sources, storage, analytics, challenges and opportunities. Big data is large and complex datasets that are difficult to process using traditional database management tools. It is characterized by the 5 V's - volume, variety, velocity, value and veracity. Big data comes from both internal and external sources and can be structured, unstructured or semi-structured. It requires specialized storage technologies like Hadoop and NoSQL databases. Analytics on big data uses techniques like machine learning, regression analysis and social network analysis to gain insights. The growth of big data presents both challenges in processing diverse and voluminous data as well as opportunities to generate value.
Big Data and The Future of Insight - Future FoundationForesight Factory
As Big Data sweeps through consumer-facing businesses, we ask:
- If Big Data is truly a revolution, then what (and whom) will it eliminate or elevate?
- What value will still be derived from conventional market research and brand-building techniques?
- If every brand is backed by Big Data, can every brand prosper?
For more information please contact info@futurefoundation.net or visit www.futurefoundation.net
GGV Capital: Venture Investing and the Cloud (2012)GGV Capital
This document discusses venture investing in cloud computing. It provides an overview of why VCs continue to see opportunities in the cloud sector. The presentation agenda covers trends disrupting the cloud like mobile and big data, as well as opportunities in serving small and medium businesses. The document concludes with advice for cloud startups on effectively approaching VCs for funding, emphasizing differentiation, market size, scalability, financial model, and chemistry over legal terms.
This document discusses big data and provides an overview of key concepts:
- Big data is defined as datasets that are too large or complex for traditional data management tools to handle. It is characterized by volume, velocity, and variety.
- Big data comes from a variety of sources like social media, sensors, web logs, and transaction systems. It is growing rapidly due to the digitization of information.
- Big data can be used for applications like enhancing customer insights, optimizing operations, and extending security and intelligence capabilities. Example use cases are described.
- Architecting solutions for big data requires handling its scale and integrating diverse data types and sources. Both traditional and new analytics approaches are needed.
Gayatri Patel, eBay, presents at the Big Analytics 2012 Roadshow
The wonders of what data can do for an organization is measured in the productivity and competitiveness of their team's decisions. Some believe more data is the key. Agreed...but good decisions require more than just deriving intelligence from big data. In this dynamic market, the need to socialize and evolve ideas with other teams, quickly correlate information across sources, and test ideas to fail fast early are strong enablers to gain competitive footing. eBay¹s analytic and technology advancements garners insights and approaches that continue to help our employees tell their "data stories" and make better decisions.
Big data refers to extremely large data sets that are too large to be processed using traditional data processing applications. It is characterized by high volume, variety, and velocity. Examples of big data sources include social media, jet engines, stock exchanges, and more. Big data can be structured, unstructured, or semi-structured. Key characteristics include volume, variety, velocity, and variability. Analyzing big data can provide benefits like improved customer service, better operational efficiency, and more informed decision making for organizations in various industries.
1. The document discusses various applications and uses of big data across different domains like government, healthcare, transportation, and more. It also covers big data techniques, platforms, analytics capabilities, and challenges.
2. Key topics covered include how cities can use mobile apps to get real-time road bump data, how insurance companies can price policies more granularly, and how companies can better hire and retain employees through recruiting analytics.
3. The summary also mentions concepts like data science, decision making, data types, use cases, capabilities, challenges, and roles like data scientists that are important in the big data field.
The document discusses how data has become a central business asset and strategic advantage. It notes that the growth of data from sources like the Internet of Things means that variety, not just volume or velocity, will be important. New business processes will revolve around data, which will become more valuable over the next decade. It also provides examples of how companies like eBay and Groupon have used data for competitive advantages like identifying top sellers.
Applications of Big Data Analytics in BusinessesT.S. Lim
The document discusses big data and big data analytics. It begins with definitions of big data from various sources that emphasize the large volumes of structured and unstructured data. It then discusses key aspects of big data including the three Vs of volume, variety, and velocity. The document also provides examples of big data applications in various industries. It explains common analytical methods used in big data including linear regression, decision trees, and neural networks. Finally, it discusses popular tools and frameworks for big data analytics.
Big data refers to the massive amounts of digital data being created every day from various sources such as social media, sensors, photos, videos, and online activities. This data is characterized by its volume, velocity, variety, and veracity. New technologies allow businesses and organizations to analyze these large, diverse, and complex data sets to gain insights and add value in many ways such as improving customer targeting, optimizing processes, enhancing health research, bolstering security efforts, and upgrading city infrastructure. While big data is transforming many industries, its full potential is just beginning to be realized.
What is Big Data?
Big Data Laws
Why Big Data?
Industries using Big Data
Current process/SW in SCM
Challenges in SCM industry
How Big data can solve the problems?
Migration to Big data for an SCM industry
This document provides an overview of big data in various industries. It begins by defining big data and explaining the three V's of big data - volume, variety, and velocity. It then discusses examples of big data in digital marketing, financial services, and healthcare. For digital marketing, it discusses database marketers as pioneers of big data and how big data is transforming digital marketing. For financial services, it discusses how big data is used for fraud detection and credit risk management. It also provides details on algorithmic trading and how it crunches complex interrelated big data. Overall, the document outlines how big data is being leveraged across industries to improve operations, increase revenues, and achieve competitive advantages.
Big data comes from a variety of sources such as sensors, social media, digital pictures, purchase transactions, and cell phone GPS signals. The volume of data created each day is vast, with over 2.5 quintillion bytes created in the last two years alone. Big data has four characteristics - volume, variety, velocity and value. It refers to both the large amount of data and the different types of structured and unstructured data. This data is generated and moves around at high speeds. While big data brings value, it can be difficult to analyze and extract useful insights from due to its scale and complexity. Technologies like Hadoop, HDFS, and MapReduce help process and analyze big data across large clusters of servers in a
Big data refers to large and complex data sets that are difficult to manage and analyze using traditional data management tools. It is generated from various sources like social media, scientific instruments, mobile devices, and sensor technology. Big data provides opportunities for insights and smart solutions but also poses challenges in processing, analyzing, and gaining insights from such large volumes of data. For managers in India, big data is highly relevant as the Indian analytics industry is growing rapidly and is expected to reach $16 billion by 2025, with big data being a major driver of growth in industries. Digitalization is also expanding the big data market in India as internet and smartphone usage increases across more regions of the country.
This document discusses data science and the growing field of big data. It notes that data science uses scientific methods and processes to extract knowledge and insights from structured and unstructured data. It provides some key facts about the massive amount of data being generated every day from various sources like social media, internet transactions, sensors and devices. The document also discusses the differences between data science and computer science, with data science focusing more on analyzing large datasets to answer questions and find insights, while computer science focuses more on software development and engineering.
Big data comes from a variety of sources such as sensors, social media, digital pictures, purchase transactions, and cell phone GPS signals. The volume of data created each day is vast, with 2.5 quintillion bytes created daily, 90% of which has been created in just the last two years. Big data is characterized by its volume, variety, velocity and value. It requires new tools like Hadoop and MapReduce to store and analyze data across distributed systems. When dealing with big data, once complex modeling can sometimes be replaced by simple counting techniques due to the large amount of data available. Companies are beginning to generate value from big data through new insights and business models.
The document discusses big data, its history, technologies, and uses. It begins with an introduction to big data and defines it using the 3Vs/4Vs model, describing the volume, velocity, variety and increasingly veracity of data. It then discusses big data technologies like Hadoop, databases, reporting, dashboards and real-time analytics. Examples are given of how big data is used, such as understanding customers, optimizing business processes, improving health outcomes, and improving security and law enforcement. Requirements for big data analytics are also mentioned, including data management, analytics applications, and business interpretation.
Data science is a multi-disciplinary field that uses scientific methods and processes to extract knowledge and insights from large amounts of structured and unstructured data. As the amount of data in the world grows exponentially, the need for data scientists to analyze this big data and discover useful patterns will also grow dramatically. By 2025, it is estimated that there will be over 200 zettabytes of cloud data storage worldwide and data science jobs are projected to be the highest paid and most in-demand jobs of the future.
From Paris Hilton to Walmart: welcome to the Big Data RevolutionWilliam Visterin
Presentatie op het CPBB Convention van BNP. De presentatie bleek een van de drukst bijgewoonde van de hele conventie. Big data op een erg toegankelijke manier.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
Big data refers to large and complex datasets that are difficult to analyze using traditional methods. It is characterized by high volume, velocity, and variety of data from numerous sources. Big data analytics uses tools like Hadoop and Spark to extract meaningful insights from large, unstructured datasets in real-time. This allows companies to gain valuable business insights, reduce costs, enhance customer experience, innovate products, and make faster decisions.
The document outlines an agenda for a Big Data breakfast event hosted by Rocket Fuel, including welcome remarks, a panel discussion on big data and AI, and a presentation by the CEO of Rocket Fuel on how the company uses big data and artificial intelligence for digital media. The event features speakers from Rocket Fuel and other companies discussing topics like the growth of big data, applications of big data in marketing, and how big data is changing the advertising industry.
2015 is knocking on the door and will be an exciting and surprising year for the BI industry. However, not everything will be a surprise for Panorama as we are always on top of the latest trends influencing the Business Intelligence community.
• What will the future hold for the industry?
• What are our BI experts thoughts, predictions and internal assessments on what new directions the Business Intelligence community will see in the coming year?
• Countdown of the most important trends in the industry
This document discusses how big data can enable the travel and tourism industries. It defines big data as large datasets characterized by their volume, velocity, variety, and veracity. Big data comes from a variety of sources as people leave digital traces online and through mobile technologies. The benefits of big data for businesses include improved customer experience personalization, optimized marketing and products, predictive analytics, and risk management. The big data market is expected to double from 2014 to 2018. Future developments include improvements in data processing, centralized data repositories, and analytics solutions in the public cloud to reduce costs and security risks. Big data can deliver business insights, innovation, better customer relationships, and continuously improved experiences for the tourism industry.
Susan Etlinger is an industry analyst who focuses on data and analytics. She has authored two reports on social media ROI and social analytics. She advises clients on measurement strategies and extracting insights from social data. She also works with technology companies to refine their strategies.
Big data refers to very large data sets that cannot be analyzed using traditional techniques. It is characterized by volume, velocity, and variety. Analyzing big data can help solve problems and generate value. The amount of data is growing exponentially from various sources like customer transactions, photos, and genome sequencing. This growth is driving changes in analytics approaches and capabilities.
Data science brings together techniques from computer science, statistics, mathematics, and domain knowledge to extract insights from
Defining Data Science
• What Does a Data Science Professional Do?
• Data Science in Business
• Use Cases for Data Science
• Installation of R and R studio
The document discusses technologies to support analytics for IoT data streams. It describes how IoT data will grow exponentially and analyzing data streams in real-time can provide 100x more value than analyzing stored data. Examples are given of how Hertz and retailers are leveraging real-time analysis of IoT data streams to improve customer experiences and increase revenue.
Integrate Big Data into Your Organization with Informatica and PerficientPerficient, Inc.
This document discusses how Perficient, an IT consulting firm, can help clients integrate big data into their organizations at lower total costs. It provides an overview of Perficient's services and solutions expertise in areas like business intelligence, customer experience, enterprise resource planning, and mobile platforms. The document also profiles Perficient with details on its history, locations, colleagues, and partnership model. Finally, it outlines an agenda for an event on balancing innovation and costs with big data, including discussions on PowerCenter Big Data Edition and what customers are doing with Informatica and big data.
This document provides an overview of data science, big data, and the data preprocessing steps involved in data science projects. It defines data science as extracting meaningful insights from large, structured and unstructured data using scientific methods, technologies and algorithms. It also defines big data in terms of the volume, variety and velocity of data. The document outlines common data sources that generate big data and applications of big data such as in finance, healthcare, transportation and more. It concludes by describing the key steps in data preprocessing: data cleaning, transformation and reduction to prepare raw data for analysis.
Forecast to contribute £216 billion to the UK economy via business creation, efficiency and innovation, and generate 360,000 new jobs by 2020, big data is a key area for recruiters.
In this QuickView:
- Big data in numbers
- Top 10 industries hiring big data professionals
- Top 10 qualifications sought by hirers
- Top 10 database and BI skills sought by hirers
- Getting started in big data: popular big data techniques and vendors
MBA-TU-Thailand:BigData for business startup.stelligence
This document provides an overview of big data presented by Santisook Limpeeticharoenchot. It begins with an introduction to big data, covering definitions, characteristics involving volume, velocity, variety and veracity. Examples of big data sources like machine data, sensor data, and internet of things data are described. The use of big data analytics in industries like manufacturing, healthcare, and transportation is discussed. Finally, the document touches on data visualization, different types of analytics, and how companies can use big data to better understand customers and optimize business processes.
Similar to BI, AI/ML, Use Cases, Business Impact and how to get started (20)
This document provides an overview of the steps to configure a Data Loss Prevention (DLP) policy in Office 365:
1. Create a new DLP policy and give it a name and description.
2. Choose the locations to include or exclude such as specific sites or users' OneDrives.
3. Configure the policy settings including whether to block or allow sensitive information types and what actions to take if sensitive info is detected, such as blocking access. Advanced settings allow creating custom rules.
4. Turn the policy on or off immediately and review before closing. Custom expressions can also be created to match specific content formats not covered by the default templates.
Power BI dashboards allow users to visualize, analyze, and gain insights from data. They provide interactive reports and key metrics displayed visually on a single screen or page. Dashboards in Power BI bring together multiple reports and visuals to give stakeholders a high-level overview of key performance indicators and business metrics.
This website provides information on popular Azure solutions across different categories including compute, web and mobile applications, data and storage, analytics, internet of things, networking, media and content delivery networks, hybrid integration, identity and access management, developer services, and management.
Digital asset management using SharePoint 2013Karthick S
The document discusses using Microsoft SharePoint 2013 to implement a digital asset management solution. It outlines requirements like centralized storage and tagging/searching capabilities. The solution would leverage out-of-the-box SharePoint features and custom components. An effective DAM solution specifies metadata, storage, access controls, and policies for digital assets throughout their lifecycle. SharePoint is a suitable platform as it provides document management, metadata, permissions, and search capabilities.
This document provides an overview of business intelligence and analytics. It discusses how transaction data is processed through data warehousing to build multi-dimensional data that can be analyzed using BI tools. Dashboards can then visualize this data through various formats like charts, KPIs, scorecards, and maps. The document traces the evolution of BI from being developed only by experts to now enabling self-service for end users through tools like Excel Power BI, SharePoint Power BI, and SQL Server PowerPivot and Power BI.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
Dandelion Hashtable: beyond billion requests per second on a commodity serverAntonios Katsarakis
This slide deck presents DLHT, a concurrent in-memory hashtable. Despite efforts to optimize hashtables, that go as far as sacrificing core functionality, state-of-the-art designs still incur multiple memory accesses per request and block request processing in three cases. First, most hashtables block while waiting for data to be retrieved from memory. Second, open-addressing designs, which represent the current state-of-the-art, either cannot free index slots on deletes or must block all requests to do so. Third, index resizes block every request until all objects are copied to the new index. Defying folklore wisdom, DLHT forgoes open-addressing and adopts a fully-featured and memory-aware closed-addressing design based on bounded cache-line-chaining. This design offers lock-free index operations and deletes that free slots instantly, (2) completes most requests with a single memory access, (3) utilizes software prefetching to hide memory latencies, and (4) employs a novel non-blocking and parallel resizing. In a commodity server and a memory-resident workload, DLHT surpasses 1.6B requests per second and provides 3.5x (12x) the throughput of the state-of-the-art closed-addressing (open-addressing) resizable hashtable on Gets (Deletes).
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
A Comprehensive Guide to DeFi Development Services in 2024Intelisync
DeFi represents a paradigm shift in the financial industry. Instead of relying on traditional, centralized institutions like banks, DeFi leverages blockchain technology to create a decentralized network of financial services. This means that financial transactions can occur directly between parties, without intermediaries, using smart contracts on platforms like Ethereum.
In 2024, we are witnessing an explosion of new DeFi projects and protocols, each pushing the boundaries of what’s possible in finance.
In summary, DeFi in 2024 is not just a trend; it’s a revolution that democratizes finance, enhances security and transparency, and fosters continuous innovation. As we proceed through this presentation, we'll explore the various components and services of DeFi in detail, shedding light on how they are transforming the financial landscape.
At Intelisync, we specialize in providing comprehensive DeFi development services tailored to meet the unique needs of our clients. From smart contract development to dApp creation and security audits, we ensure that your DeFi project is built with innovation, security, and scalability in mind. Trust Intelisync to guide you through the intricate landscape of decentralized finance and unlock the full potential of blockchain technology.
Ready to take your DeFi project to the next level? Partner with Intelisync for expert DeFi development services today!
Skybuffer SAM4U tool for SAP license adoptionTatiana Kojar
Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool.
SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.
This presentation provides valuable insights into effective cost-saving techniques on AWS. Learn how to optimize your AWS resources by rightsizing, increasing elasticity, picking the right storage class, and choosing the best pricing model. Additionally, discover essential governance mechanisms to ensure continuous cost efficiency. Whether you are new to AWS or an experienced user, this presentation provides clear and practical tips to help you reduce your cloud costs and get the most out of your budget.
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframePrecisely
Inconsistent user experience and siloed data, high costs, and changing customer expectations – Citizens Bank was experiencing these challenges while it was attempting to deliver a superior digital banking experience for its clients. Its core banking applications run on the mainframe and Citizens was using legacy utilities to get the critical mainframe data to feed customer-facing channels, like call centers, web, and mobile. Ultimately, this led to higher operating costs (MIPS), delayed response times, and longer time to market.
Ever-changing customer expectations demand more modern digital experiences, and the bank needed to find a solution that could provide real-time data to its customer channels with low latency and operating costs. Join this session to learn how Citizens is leveraging Precisely to replicate mainframe data to its customer channels and deliver on their “modern digital bank” experiences.
In the realm of cybersecurity, offensive security practices act as a critical shield. By simulating real-world attacks in a controlled environment, these techniques expose vulnerabilities before malicious actors can exploit them. This proactive approach allows manufacturers to identify and fix weaknesses, significantly enhancing system security.
This presentation delves into the development of a system designed to mimic Galileo's Open Service signal using software-defined radio (SDR) technology. We'll begin with a foundational overview of both Global Navigation Satellite Systems (GNSS) and the intricacies of digital signal processing.
The presentation culminates in a live demonstration. We'll showcase the manipulation of Galileo's Open Service pilot signal, simulating an attack on various software and hardware systems. This practical demonstration serves to highlight the potential consequences of unaddressed vulnerabilities, emphasizing the importance of offensive security practices in safeguarding critical infrastructure.
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
2. What do I aim to cover…
• Trends…
• Data Modeling, Visualization
• Impact on Business
• AI/ML, Data Science
• Impact on Business
• How to get started
3. Beforewemoveahead…
“If we have data, let’s look at data. If all we have are opinions, let’s go with mine.”
-Jim Barkesdale, CEO of Netscape
Clive Humby, UK Mathematician and architect of Tesco’s Clubcard, 2006 (widely credited
as the first to coin the phrase): “Data is the new oil. It’s valuable, but if unrefined it cannot
really be used. It has to be changed into gas, plastic, chemicals, etc to create a valuable
entity that drives profitable activity; so must data be broken down, analyzed for it to have
value.”
Peter Sondergaard, SVP Gartner, 2011: “Information is the oil of the 21st century, and
analytics is the combustion engine.
Piero Scaruffi, cognitive scientist and author of “History of Silicon Valley”, 2016: “The
difference between oil and data is that the product of oil does not generate more oil
(unfortunately), whereas the product of data (self-driving cars, drones, wearables, etc) will
generate more data (where do you normally drive, how fast/well you drive, who is with you,
etc).”
4. The data volumes are exploding, more data has been created in the past two years than in the entire previous history of
the human race.
Data is growing faster than ever before and by the year 2020, about 1.7 megabytes of new information will be created
every second for every human being on the planet.
By then, our accumulated digital universe of data will grow from 4.4 zettabyets today to around 44 zettabytes, or
44 trillion gigabytes.
Every second we create new data. For example, we perform 40,000 search queries every second (on Google alone),
which makes it 3.5 searches per day and 1.2 trillion searches per year.
In a recent month, over 1 billion people (10,000 Lakhs) used Facebook FB +0% in a single day.
Facebook users send on average 31.25 million messages and view 2.77 million videos every minute.
We are seeing a massive growth in video and photo data, where every minute up to 300 hours of video are uploaded to
YouTube alone.
In the coming year, a staggering 1 trillion photos (1 lakh crores) will be taken and billions of them will be shared online.
By next year, nearly 80% of photos will be taken on smart phones.
This year, over 1.4 billion smart phones will be shipped - all packed with sensors capable of collecting all kinds of data,
not to mention the data the users create themselves.
By 2020, we will have over 6.1 billion smartphone users globally (overtaking basic fixed phone subscriptions).
1
2
3
4
5
6
7
8
9
10
5. Within five years there will be over 50 billion smart connected devices in the world, all developed to collect,
analyze and share data.
By 2020, at least a third of all data will pass through the cloud (a network of servers connected over the Internet).
Distributed computing (performing computing tasks using a network of computers in the cloud) is very
real. Google GOOGL +0% uses it every day to involve about 1,000 computers in answering a single search query,
which takes no more than 0.2 seconds to complete.
Estimates suggest that by better integrating big data, healthcare could save as much as $300 billion a year — that’s
equal to reducing costs by $1000 a year for every man, woman, and child.
The White House has already invested more than $200 million in big data projects.
For a typical Fortune 1000 company, just a 10% increase in data accessibility will result in more than $65 million
additional net income.
Retailers who leverage the full power of big data could increase their operating margins by as much as 60%.
73% of organizations have already invested or plan to invest in data related projects
11
12
13
14
15
16
17
18
6. 19 And one of my favourite facts:
At the moment less than 0.5%of all data is ever analysed
and used, just imagine the potential here.
8. Questions that are in CxOs’ minds…
How do I make sense out
of all the data collected?
How do I gain insights
about my business?
Am I using the right tools
& technologies?
Is Cloud the right choice
for the future?
Can it predict my future?
Am I able to visualize the
data available easily?
Can data help me make
my decisions?
Am I collecting right,
enough and all data?
Am I collecting
usable data?
Is the data collected usable
by my Analysts directly?
10. Data Terminologies…
Data
Data
Warehouse
Data
Modeling
Big Data
Data
Science
Data
Mining
Database
Electronic method to
store data… in a
schematic fashion
Multi-dimensional
way to store essential
data – to suit data
mining & analytics
Refers to finding
insights / intelligence /
facts hidden in data,
now replaced more
with Data Science
A fast emerging field
combining Math & Stat
techniques to find
insights, patterns &
predictions
Extract, transform
data to suit for easy
& rapid visualization
Data running in excess of
petabytes usually –
combining both
structured, unstructured
data
12. • High level overview of DW
• Primarily for business users to
get an idea of the DW
• Very less technical details
• Extension of CDM
• Entities & Relationships are
included
• Attributes, PKs, FKs are defined
• LDM and CDM are independent of
DB Tech
• PDM represents the actual DB
• Entities = tables, Attributes =
columns
• PDM is different for different DBs
• Data Types differ from SQL to Oracle
to DB2 for example
• PDM also includes Views,
Procedures, Indexes
• Then using DDL Statements – all are
created into the DB
21. •What we are selling (products)
•When we are selling (year)
•Where we are selling (country)
•How we are selling (order type)
22.
23. Benefits…
Helps organizations
identify key trends
Enables swift action
Meaningful
interpretation of the
data, cut the clutter
Increase productivity &
sales (focus on doing
your core business and
not data analysis)
Tell a story
Faster ad-hoc data
analysis
Self-service capabilities
to end users
Reduced burden on IT
33. Your journey… (with no real end)
Keep learning…
•Understand terminologies
•ETL, DW, Schema, Relationships, Visualization
•Download and try with sample datasets
•Online courses (edx, coursera, udemy, guvi)
•Certifications (MS, AWS, G)
•Medium.Com, AnalyticsIndiaMag
•Attend Meetups
•Attend Industry Events
•Build Connections
Do BI Projects
Work with Demo DataSources
Become good with Excel
Demo Environments
Interact with Team Members, Managers
Interact with Customers
Learn the domain
First get strong with data management
Then Visualization
Take part in Real Projects
Data Science / AI / ML Projects
Learn languages (P), tools
Study Real World problems solved
Interact with Customers
Perform EDA
Learn the domain
Work with demo datasets
Participate in Kaggle contests
Volunteer to solve real world problems
Work on real projects