This document provides an overview of big data and big data analytics. It defines big data as large, complex datasets that grow quickly in volume and variety. Big data analytics involves examining these large datasets to find patterns and useful information. The challenges of big data include increased storage needs and handling diverse data formats. Hadoop is a framework that allows distributed processing of big data across clusters of computers. Common big data analytics tools include MapReduce, Spark, HBase and Hive. The benefits of big data analytics include improved decision making, customer service and efficiency.
Agile Data Science is a lean methodology that is adopted from Agile Software Development. At the core it centers around people, interactions, and building minimally viable products to ship fast and often to solicit customer feedback. In this presentation, I describe how this work was done in the past with examples. Get started today with our help by visiting http://www.alpinenow.com
This document provides an introduction to big data and data science from Amity Institute of Information Technology. It defines big data and data science, highlighting that big data is a subset of data science. The key differences between big data and data science are described. Examples of applications of big data in various domains like social media, healthcare, finance, ecommerce and education are outlined. Finally, the skills required to become a data scientist or big data specialist are summarized.
Big Data is the lastest cashcow. Data Analytics has now a crucial role for industries. This article describes as to what is Big Data and Analytics and how a Chartered Accountant will be able to provide value in this field.
This document discusses the rise of big data and analytics used to analyze large volumes of data. It notes that while terabytes used to be considered big data, petabytes are now common as organizations seek to analyze more transaction details, web data, and machine-generated data. To handle larger volumes, vendors have created specialized analytical platforms that can analyze structured data faster than general databases. The document also discusses how new technologies like Hadoop help analyze unstructured data and how businesses need tools to help both casual and power users analyze data.
Oracle is a leading technology company focused on database software and cloud computing. It generates revenue from software licenses and cloud services. While Oracle faces competition from other large tech companies, its strengths include consulting services, global sales channels, and expertise in data storage and applications. The rise of big data presents both opportunities and challenges for Oracle to leverage new types and volumes of customer information through its products.
Big data analytics (BDA) involves examining large, diverse datasets to uncover hidden patterns, correlations, trends, and insights. BDA helps organizations gain a competitive advantage by extracting insights from data to make faster, more informed decisions. It supports a 360-degree view of customers by analyzing both structured and unstructured data sources like clickstream data. Businesses can leverage techniques like machine learning, predictive analytics, and natural language processing on existing and new data sources. BDA requires close collaboration between IT, business users, and data scientists to process and analyze large datasets beyond typical storage and processing capabilities.
This document provides an overview of key concepts in data analytics including:
- The sources and nature of data as well as classifications like structured, semi-structured, and unstructured data.
- The need for data analytics to gather hidden insights, generate reports, perform market analysis, and improve business requirements.
- The stages of the data analytics lifecycle including discovery, data preparation, model planning, model building, and communicating results.
- Popular tools used in data analytics like R, Python, Tableau, and SAS.
This document provides an overview of big data and big data analytics. It defines big data as large, complex datasets that grow quickly in volume and variety. Big data analytics involves examining these large datasets to find patterns and useful information. The challenges of big data include increased storage needs and handling diverse data formats. Hadoop is a framework that allows distributed processing of big data across clusters of computers. Common big data analytics tools include MapReduce, Spark, HBase and Hive. The benefits of big data analytics include improved decision making, customer service and efficiency.
Agile Data Science is a lean methodology that is adopted from Agile Software Development. At the core it centers around people, interactions, and building minimally viable products to ship fast and often to solicit customer feedback. In this presentation, I describe how this work was done in the past with examples. Get started today with our help by visiting http://www.alpinenow.com
This document provides an introduction to big data and data science from Amity Institute of Information Technology. It defines big data and data science, highlighting that big data is a subset of data science. The key differences between big data and data science are described. Examples of applications of big data in various domains like social media, healthcare, finance, ecommerce and education are outlined. Finally, the skills required to become a data scientist or big data specialist are summarized.
Big Data is the lastest cashcow. Data Analytics has now a crucial role for industries. This article describes as to what is Big Data and Analytics and how a Chartered Accountant will be able to provide value in this field.
This document discusses the rise of big data and analytics used to analyze large volumes of data. It notes that while terabytes used to be considered big data, petabytes are now common as organizations seek to analyze more transaction details, web data, and machine-generated data. To handle larger volumes, vendors have created specialized analytical platforms that can analyze structured data faster than general databases. The document also discusses how new technologies like Hadoop help analyze unstructured data and how businesses need tools to help both casual and power users analyze data.
Oracle is a leading technology company focused on database software and cloud computing. It generates revenue from software licenses and cloud services. While Oracle faces competition from other large tech companies, its strengths include consulting services, global sales channels, and expertise in data storage and applications. The rise of big data presents both opportunities and challenges for Oracle to leverage new types and volumes of customer information through its products.
Big data analytics (BDA) involves examining large, diverse datasets to uncover hidden patterns, correlations, trends, and insights. BDA helps organizations gain a competitive advantage by extracting insights from data to make faster, more informed decisions. It supports a 360-degree view of customers by analyzing both structured and unstructured data sources like clickstream data. Businesses can leverage techniques like machine learning, predictive analytics, and natural language processing on existing and new data sources. BDA requires close collaboration between IT, business users, and data scientists to process and analyze large datasets beyond typical storage and processing capabilities.
This document provides an overview of key concepts in data analytics including:
- The sources and nature of data as well as classifications like structured, semi-structured, and unstructured data.
- The need for data analytics to gather hidden insights, generate reports, perform market analysis, and improve business requirements.
- The stages of the data analytics lifecycle including discovery, data preparation, model planning, model building, and communicating results.
- Popular tools used in data analytics like R, Python, Tableau, and SAS.
This document summarizes a report on big data analytics and the use of analytical platforms. It describes how companies have been dealing with large volumes of data for decades but that data volumes are growing exponentially due to new types of structured, semi-structured, and unstructured data from sources like the web, social media, sensors and machine data. New analytical platforms and technologies are needed to efficiently store, manage and analyze this diverse new "big data". The report is based on a survey of 302 BI professionals and interviews with industry experts regarding their use of analytical platforms for big data analytics.
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...Experfy
Gartner, IBM, Accenture and many others have asserted that 80% or more of the world’s information is unstructured – and inherently hard to analyze. What does that mean? And what is required to extract insight from unstructured data?
Unstructured data is infinitely variable in quality and format, because it is produced by humans who can be fastidious, unpredictable, ill-informed, or even cynical, but always unique, not standard in any way. Recent advances in natural language processing provides the notion that unstructured content can be included in data analysis.
Serious growth and value companies are committed to data. The exponential growth of Big Data has posed major challenges in data governance and data analysis. Good data governance is pivotal for business growth.
Therefore, it is of paramount importance to slice and dice Big Data that addresses data governance and data analysis issues. In order to support high quality business decision making, it is important to fully harness the potential of Big Data by implementing proper Data Migration, Data Ingestion, Data Management, Data Analysis, Data Visualization and Data Virtualization tools.
Check it out: https://www.experfy.com/training/courses/march-towards-big-data-big-data-implementation-migration-ingestion-management-visualization
Big Data, Big Thinking: Untapped OpportunitiesSAP Technology
The document discusses a webinar by SAP and Ernst & Young on big data. It explores big data adoption trends, how organizations can leverage big data to improve business performance and manage risks, and common use cases across industries like retail, transportation, and government. The webinar provides guidance on how organizations can get started with big data initiatives by identifying executive sponsors, use cases, architectural gaps, and building a business case to justify investment.
Business analytics (BA) is the practice of iterative, methodical exploration of an organization's data, with an emphasis on statistical analysis. BA is used by companies committed to data-driven decision-making to gain insights that inform business decisions and can be used to automate and optimize business processes. BA techniques break down into basic business intelligence, which involves collecting and preparing data, and deeper statistical analysis. True data science involves more custom coding and open-ended questions compared to most business analysts.
Big data refers to extremely large data sets that are too large to be processed using traditional data processing applications. It is characterized by high volume, variety, and velocity. Examples of big data sources include social media, jet engines, stock exchanges, and more. Big data can be structured, unstructured, or semi-structured. Key characteristics include volume, variety, velocity, and variability. Analyzing big data can provide benefits like improved customer service, better operational efficiency, and more informed decision making for organizations in various industries.
This document discusses business analytics and data analytics capabilities. It covers key concepts like data warehouses, data marts, ETL processes, business intelligence, data mining techniques, and how organizations can use analytics to gain insights from data to support decision making and gain a competitive advantage. The document provides examples of how companies like IHG and retailers use analytics to improve operations and customer understanding.
Big Data Developer Career Path: Job & Interview PreparationIntellipaat
Youtube link : https://www.youtube.com/watch?v=iggl879a0s8
Intellipaat Big Data Hadoop Training: https://intellipaat.com/big-data-hadoop-training/
Read complete Big Data Hadoop tutorial here: https://intellipaat.com/blog/tutorial/hadoop-tutorial/
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
This document discusses top big data analytics tools and emerging trends in big data analytics. It defines big data analytics as examining large data sets to find patterns and business insights. The document then covers several open source and commercial big data analytics tools, including Jaspersoft and Talend for reporting, Skytree for machine learning, Tableau for visualization, and Pentaho and Splunk for reporting. It emphasizes that tool selection is just one part of a big data project and that evaluating business value is also important.
Modern Analytics And The Future Of Quality And Performance ExcellenceICFAI Business School
This document discusses modern business analytics and its applications. It defines analytics as using data, technology and analysis to help managers make better decisions. It outlines common analytics tools like Excel, SPSS and R. It traces the history and evolution of analytics from the 1950s to today. It describes the three main disciplines of analytics as business intelligence, quantitative methods, and statistics. It discusses descriptive, predictive and prescriptive analytics approaches. Finally, it discusses challenges and advantages of modern analytics for quality and strategic management.
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptxRATISHKUMAR32
The presentation contain the business profiles in big data analytics. through this ppt user can learn about the different case studies such as facebook and walmart. This ppt contain the information and seven characteristics that are required to learn the basics of big data.
A study on web analytics with reference to select sports websitesBhanu Prakash
This document is a project report submitted by Y. Bhanu Prakash to GITAM Institute of Management in partial fulfillment of the degree of Bachelor of Business Administration in Business Analytics. The report is on the topic of web analytics with reference to select sports websites. It includes declarations by the student and certification by the guide, as well as acknowledgements. The report will consist of 5 chapters - an introduction to analytics, a profile of Alexa.com, methodology, analysis and interpretation of data, and observations and conclusions.
Now companies are in the middle of a renovation that forces them to be analytics-driven to
continue being competitive. Data analysis provides a complete insight about their business. It
also gives noteworthy advantages over their competitors. Analytics-driven insights compel
businesses to take action on service innovation, enhance client experience, detect irregularities in
process and provide extra time for product or service marketing. To work on analytics driven
activities, companies require to gather, analyse and store information from all possible sources.
Companies should bring appropriate tools and workflows in practice to analyse data rapidly and
unceasingly. They should obtain insight from data analysis result and make changes in their
business process and practice on the basis of gained result. It would help to be more agile than
their previous process and function.
Big data refers to huge amounts of data from various sources that traditional data management systems cannot handle. It is characterized by volume, velocity, variety, and veracity. Handling big data requires expertise in security, management, and analytics. Data scientists use descriptive, diagnostic, predictive, and prescriptive analytics techniques on big data to create business insights and decisions using business intelligence tools. While big data offers opportunities, it also poses risks like bad data, security issues, and costs if not properly analyzed and managed.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze growing amounts of structured data. New technologies also analyze semi-structured data like web logs and machine data. While reports and dashboards serve casual users, more advanced analytics are needed for power users to fully leverage big data.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze growing amounts of structured data. New technologies also analyze semi-structured data like web logs and machine data. While reports and dashboards serve casual users, more advanced analytics are needed for power users to fully leverage big data.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 business intelligence professionals and interviews. Most companies have implemented analytical platforms that provide higher performance than traditional databases to analyze growing structured data. New technologies also analyze complex unstructured data like web logs. However, most business intelligence environments still do not unite reporting and analysis. This report proposes a unified architecture using various analytical tools to make more data accessible for both routine and complex analytics.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze structured data that is growing in size. New technologies also analyze semi-structured and unstructured data like web traffic and sensor data. The growth in data types and volumes is fueled by ability to store and analyze more data enabled by technology advances. Organizations seek to better understand customers by bringing more data sources together through analytics.
Big Data Tools PowerPoint Presentation SlidesSlideTeam
The document discusses big data analysis requirements and tools. It covers where big data comes from both internally and externally. It then discusses tools for analyzing big data such as BI tools, in-database analytics, Hadoop, decision management, and discovery tools. Techniques for analyzing big data like classification tree analysis, genetic algorithms, regression analysis, machine learning, and sentiment analysis are also covered. The key benefits and a successful implementation roadmap for big data in an organization are summarized.
Borys Pratsiuk is the Head of R&D at an unnamed company. He has over 15 years of experience in engineering roles related to Android development, embedded systems, and solid state electronics. He holds a PhD in Solid State Electronics from Kiev Polytechnic Institute and has worked in both academic and industry roles in South Korea and Ukraine. The presentation discusses big data, analytics, artificial intelligence and machine learning applications across various industries. It provides examples of deep learning solutions developed for clients in areas like computer vision, natural language processing, predictive analytics and process automation. The presentation emphasizes Ciklum's full-service approach to developing and deploying deep learning solutions from data collection and modeling to deployment and ongoing support.
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsVictor Morales
K8sGPT is a tool that analyzes and diagnoses Kubernetes clusters. This presentation was used to share the requirements and dependencies to deploy K8sGPT in a local environment.
This document summarizes a report on big data analytics and the use of analytical platforms. It describes how companies have been dealing with large volumes of data for decades but that data volumes are growing exponentially due to new types of structured, semi-structured, and unstructured data from sources like the web, social media, sensors and machine data. New analytical platforms and technologies are needed to efficiently store, manage and analyze this diverse new "big data". The report is based on a survey of 302 BI professionals and interviews with industry experts regarding their use of analytical platforms for big data analytics.
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...Experfy
Gartner, IBM, Accenture and many others have asserted that 80% or more of the world’s information is unstructured – and inherently hard to analyze. What does that mean? And what is required to extract insight from unstructured data?
Unstructured data is infinitely variable in quality and format, because it is produced by humans who can be fastidious, unpredictable, ill-informed, or even cynical, but always unique, not standard in any way. Recent advances in natural language processing provides the notion that unstructured content can be included in data analysis.
Serious growth and value companies are committed to data. The exponential growth of Big Data has posed major challenges in data governance and data analysis. Good data governance is pivotal for business growth.
Therefore, it is of paramount importance to slice and dice Big Data that addresses data governance and data analysis issues. In order to support high quality business decision making, it is important to fully harness the potential of Big Data by implementing proper Data Migration, Data Ingestion, Data Management, Data Analysis, Data Visualization and Data Virtualization tools.
Check it out: https://www.experfy.com/training/courses/march-towards-big-data-big-data-implementation-migration-ingestion-management-visualization
Big Data, Big Thinking: Untapped OpportunitiesSAP Technology
The document discusses a webinar by SAP and Ernst & Young on big data. It explores big data adoption trends, how organizations can leverage big data to improve business performance and manage risks, and common use cases across industries like retail, transportation, and government. The webinar provides guidance on how organizations can get started with big data initiatives by identifying executive sponsors, use cases, architectural gaps, and building a business case to justify investment.
Business analytics (BA) is the practice of iterative, methodical exploration of an organization's data, with an emphasis on statistical analysis. BA is used by companies committed to data-driven decision-making to gain insights that inform business decisions and can be used to automate and optimize business processes. BA techniques break down into basic business intelligence, which involves collecting and preparing data, and deeper statistical analysis. True data science involves more custom coding and open-ended questions compared to most business analysts.
Big data refers to extremely large data sets that are too large to be processed using traditional data processing applications. It is characterized by high volume, variety, and velocity. Examples of big data sources include social media, jet engines, stock exchanges, and more. Big data can be structured, unstructured, or semi-structured. Key characteristics include volume, variety, velocity, and variability. Analyzing big data can provide benefits like improved customer service, better operational efficiency, and more informed decision making for organizations in various industries.
This document discusses business analytics and data analytics capabilities. It covers key concepts like data warehouses, data marts, ETL processes, business intelligence, data mining techniques, and how organizations can use analytics to gain insights from data to support decision making and gain a competitive advantage. The document provides examples of how companies like IHG and retailers use analytics to improve operations and customer understanding.
Big Data Developer Career Path: Job & Interview PreparationIntellipaat
Youtube link : https://www.youtube.com/watch?v=iggl879a0s8
Intellipaat Big Data Hadoop Training: https://intellipaat.com/big-data-hadoop-training/
Read complete Big Data Hadoop tutorial here: https://intellipaat.com/blog/tutorial/hadoop-tutorial/
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
This document discusses top big data analytics tools and emerging trends in big data analytics. It defines big data analytics as examining large data sets to find patterns and business insights. The document then covers several open source and commercial big data analytics tools, including Jaspersoft and Talend for reporting, Skytree for machine learning, Tableau for visualization, and Pentaho and Splunk for reporting. It emphasizes that tool selection is just one part of a big data project and that evaluating business value is also important.
Modern Analytics And The Future Of Quality And Performance ExcellenceICFAI Business School
This document discusses modern business analytics and its applications. It defines analytics as using data, technology and analysis to help managers make better decisions. It outlines common analytics tools like Excel, SPSS and R. It traces the history and evolution of analytics from the 1950s to today. It describes the three main disciplines of analytics as business intelligence, quantitative methods, and statistics. It discusses descriptive, predictive and prescriptive analytics approaches. Finally, it discusses challenges and advantages of modern analytics for quality and strategic management.
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptxRATISHKUMAR32
The presentation contain the business profiles in big data analytics. through this ppt user can learn about the different case studies such as facebook and walmart. This ppt contain the information and seven characteristics that are required to learn the basics of big data.
A study on web analytics with reference to select sports websitesBhanu Prakash
This document is a project report submitted by Y. Bhanu Prakash to GITAM Institute of Management in partial fulfillment of the degree of Bachelor of Business Administration in Business Analytics. The report is on the topic of web analytics with reference to select sports websites. It includes declarations by the student and certification by the guide, as well as acknowledgements. The report will consist of 5 chapters - an introduction to analytics, a profile of Alexa.com, methodology, analysis and interpretation of data, and observations and conclusions.
Now companies are in the middle of a renovation that forces them to be analytics-driven to
continue being competitive. Data analysis provides a complete insight about their business. It
also gives noteworthy advantages over their competitors. Analytics-driven insights compel
businesses to take action on service innovation, enhance client experience, detect irregularities in
process and provide extra time for product or service marketing. To work on analytics driven
activities, companies require to gather, analyse and store information from all possible sources.
Companies should bring appropriate tools and workflows in practice to analyse data rapidly and
unceasingly. They should obtain insight from data analysis result and make changes in their
business process and practice on the basis of gained result. It would help to be more agile than
their previous process and function.
Big data refers to huge amounts of data from various sources that traditional data management systems cannot handle. It is characterized by volume, velocity, variety, and veracity. Handling big data requires expertise in security, management, and analytics. Data scientists use descriptive, diagnostic, predictive, and prescriptive analytics techniques on big data to create business insights and decisions using business intelligence tools. While big data offers opportunities, it also poses risks like bad data, security issues, and costs if not properly analyzed and managed.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze growing amounts of structured data. New technologies also analyze semi-structured data like web logs and machine data. While reports and dashboards serve casual users, more advanced analytics are needed for power users to fully leverage big data.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze growing amounts of structured data. New technologies also analyze semi-structured data like web logs and machine data. While reports and dashboards serve casual users, more advanced analytics are needed for power users to fully leverage big data.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 business intelligence professionals and interviews. Most companies have implemented analytical platforms that provide higher performance than traditional databases to analyze growing structured data. New technologies also analyze complex unstructured data like web logs. However, most business intelligence environments still do not unite reporting and analysis. This report proposes a unified architecture using various analytical tools to make more data accessible for both routine and complex analytics.
This report examines the rise of big data and analytics used to analyze large volumes of data. It is based on a survey of 302 BI professionals and interviews. Most organizations have implemented analytical platforms to help analyze structured data that is growing in size. New technologies also analyze semi-structured and unstructured data like web traffic and sensor data. The growth in data types and volumes is fueled by ability to store and analyze more data enabled by technology advances. Organizations seek to better understand customers by bringing more data sources together through analytics.
Big Data Tools PowerPoint Presentation SlidesSlideTeam
The document discusses big data analysis requirements and tools. It covers where big data comes from both internally and externally. It then discusses tools for analyzing big data such as BI tools, in-database analytics, Hadoop, decision management, and discovery tools. Techniques for analyzing big data like classification tree analysis, genetic algorithms, regression analysis, machine learning, and sentiment analysis are also covered. The key benefits and a successful implementation roadmap for big data in an organization are summarized.
Borys Pratsiuk is the Head of R&D at an unnamed company. He has over 15 years of experience in engineering roles related to Android development, embedded systems, and solid state electronics. He holds a PhD in Solid State Electronics from Kiev Polytechnic Institute and has worked in both academic and industry roles in South Korea and Ukraine. The presentation discusses big data, analytics, artificial intelligence and machine learning applications across various industries. It provides examples of deep learning solutions developed for clients in areas like computer vision, natural language processing, predictive analytics and process automation. The presentation emphasizes Ciklum's full-service approach to developing and deploying deep learning solutions from data collection and modeling to deployment and ongoing support.
Similar to Unit-I_Big data life cycle.pptx, sources of Big Data (20)
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsVictor Morales
K8sGPT is a tool that analyzes and diagnoses Kubernetes clusters. This presentation was used to share the requirements and dependencies to deploy K8sGPT in a local environment.
International Conference on NLP, Artificial Intelligence, Machine Learning an...gerogepatton
International Conference on NLP, Artificial Intelligence, Machine Learning and Applications (NLAIM 2024) offers a premier global platform for exchanging insights and findings in the theory, methodology, and applications of NLP, Artificial Intelligence, Machine Learning, and their applications. The conference seeks substantial contributions across all key domains of NLP, Artificial Intelligence, Machine Learning, and their practical applications, aiming to foster both theoretical advancements and real-world implementations. With a focus on facilitating collaboration between researchers and practitioners from academia and industry, the conference serves as a nexus for sharing the latest developments in the field.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTjpsjournal1
The rivalry between prominent international actors for dominance over Central Asia's hydrocarbon
reserves and the ancient silk trade route, along with China's diplomatic endeavours in the area, has been
referred to as the "New Great Game." This research centres on the power struggle, considering
geopolitical, geostrategic, and geoeconomic variables. Topics including trade, political hegemony, oil
politics, and conventional and nontraditional security are all explored and explained by the researcher.
Using Mackinder's Heartland, Spykman Rimland, and Hegemonic Stability theories, examines China's role
in Central Asia. This study adheres to the empirical epistemological method and has taken care of
objectivity. This study analyze primary and secondary research documents critically to elaborate role of
china’s geo economic outreach in central Asian countries and its future prospect. China is thriving in trade,
pipeline politics, and winning states, according to this study, thanks to important instruments like the
Shanghai Cooperation Organisation and the Belt and Road Economic Initiative. According to this study,
China is seeing significant success in commerce, pipeline politics, and gaining influence on other
governments. This success may be attributed to the effective utilisation of key tools such as the Shanghai
Cooperation Organisation and the Belt and Road Economic Initiative.
Using recycled concrete aggregates (RCA) for pavements is crucial to achieving sustainability. Implementing RCA for new pavement can minimize carbon footprint, conserve natural resources, reduce harmful emissions, and lower life cycle costs. Compared to natural aggregate (NA), RCA pavement has fewer comprehensive studies and sustainability assessments.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
Comparative analysis between traditional aquaponics and reconstructed aquapon...bijceesjournal
The aquaponic system of planting is a method that does not require soil usage. It is a method that only needs water, fish, lava rocks (a substitute for soil), and plants. Aquaponic systems are sustainable and environmentally friendly. Its use not only helps to plant in small spaces but also helps reduce artificial chemical use and minimizes excess water use, as aquaponics consumes 90% less water than soil-based gardening. The study applied a descriptive and experimental design to assess and compare conventional and reconstructed aquaponic methods for reproducing tomatoes. The researchers created an observation checklist to determine the significant factors of the study. The study aims to determine the significant difference between traditional aquaponics and reconstructed aquaponics systems propagating tomatoes in terms of height, weight, girth, and number of fruits. The reconstructed aquaponics system’s higher growth yield results in a much more nourished crop than the traditional aquaponics system. It is superior in its number of fruits, height, weight, and girth measurement. Moreover, the reconstructed aquaponics system is proven to eliminate all the hindrances present in the traditional aquaponics system, which are overcrowding of fish, algae growth, pest problems, contaminated water, and dead fish.
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Unit-I_Big data life cycle.pptx, sources of Big Data
1. Course – Big Data Analytics (Professional Elective-II)
Course code-IT314B
Unit-II- BIG DATA ANALYTICS LIFE CYCLE
Sanjivani Rural Education Society’s
Sanjivani College of Engineering, Kopargaon-423603
(An Autonomous Institute Affiliated to Savitribai Phule Pune University, Pune)
NAAC ‘A’ Grade Accredited, ISO 9001:2015 Certified
Department of Information Technology
(NBA Accredited)
Mr. Rajendra N Kankrale
Asst. Prof.
1
2. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Unit-I BIG DATA ANALYTICS LIFE CYCLE
• Syllabus
• Introduction to Big Data, sources of Big Data, Data Analytic Lifecycle:
Introduction, Phase 1: Discovery, Phase 2: Data Preparation, Phase 3: Model
Planning, Phase 4: Model Building, Phase 5: Communication results, Phase 6:
Operationalize.
2
3. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Unit-I BIG DATA ANALYTICS LIFE CYCLE
1. Why Big Data analytics?
2. What is Big Data analytics?
3. Lifecycle of Big Data analytics
4. Types of Big Data analytics
5. Tools used in Big Data analytics
6. Big Data application domains
3
4. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Why Big Data analytics?
Take the music streaming platform Spotify for example. The company has nearly 96
million users that generate a tremendous amount of data every day. Through this
information, the cloud-based platform automatically generates suggested songs—
through a smart recommendation engine—based on likes, shares, search history, and
more. What enables this is the techniques, tools, and frameworks that are a result of
Big Data analytics.
If you are a Spotify user, then you must have come across the top recommendation
section, which is based on your likes, past history, and other things. Utilizing a
recommendation engine that leverages data filtering tools that collect data and then
filter it using algorithms works. This is what Spotify does.
4
5. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Uses and Examples of Big Data Analytics
There are many different ways that Big Data analytics can be used in order to improve
businesses and organizations. Here are some examples:
• Using analytics to understand customer behavior in order to optimize the customer
experience
• Predicting future trends in order to make better business decisions
• Improving marketing campaigns by understanding what works and what doesn't
• Increasing operational efficiency by understanding where bottlenecks are and how
to fix them
• Detecting fraud and other forms of misuse sooner
These are just a few examples — the possibilities are really endless when it comes to
Big Data analytics. It all depends on how you want to use it in order to improve your
business.
5
6. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
What is Big Data analytics?
• What is Big Data?
• Big Data is a massive amount of data sets that cannot be stored, processed, or
analyzed using traditional tools.
• Big Data analytics is a process used to extract meaningful insights, such as
hidden patterns, unknown correlations, market trends, and customer
preferences. Big Data analytics provides various advantages—it can be used
for better decision making, preventing fraudulent activities, among other
things.
6
7. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Big Data sources
• Today, there are millions of data sources that generate data at a very rapid
rate. These data sources are present across the world. Some of the largest
sources of data are social media platforms and networks. Let’s use
Facebook as an example—it generates more than 500 terabytes of data
every day. This data includes pictures, videos, messages, and more.
• Data also exists in different formats, like structured data, semi-structured
data, and unstructured data. For example, in a regular Excel sheet, data is
classified as structured data—with a definite format. In contrast, emails fall
under semi-structured, and your pictures and videos fall under unstructured
data. All this data combined makes up Big Data.
7
8. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Types of Big Data analytics
The following are the four types of big data analytics:
1. Prescriptive Analytics- (What is the solution?)
2. Diagnostic Analytics- (why did happened?)
3. Predictive Analytics- (What will happen?)
4. Descriptive Analytics- (What has happened ?)
8
9. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Tools used in Big Data analytics
• Apache Spark: Spark is a framework for real-time data analytics, which
is a part of the Hadoop ecosystem.
• Python: Python is one of the most versatile programming languages that
is rapidly being deployed for various applications including machine
learning.
• SAS: SAS is an advanced analytical tool that is used for working with large
volumes of data and deriving valuable insights from it.
9
10. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Tools used in Big Data analytics
• Hadoop: Hadoop is the most popular big data framework that is
deployed by a wide range of organizations from around the world for
making sense of big data.
• SQL: SQL is used for working with relational database management
systems.
• Tableau: Tableau is the most popular business intelligence tool that is
deployed for the purpose of data visualization and business analytics.
10
11. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Tools used in Big Data analytics
• Splunk: Splunk is the tool of choice for parsing machine-generated data
and deriving valuable business insights out of it.
• R: R is the no. 1 programming language that is being used by data
scientists for statistical computing and graphical applications alike.
11
12. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Tools used in Big Data analytics
Cassandra
APACHE Cassandra is an open-source NoSQL distributed database that is used to
fetch large amounts of data. It’s one of the most popular tools for data analytics and
has been praised by many tech companies due to its high scalability and availability
without compromising speed and performance. It is capable of delivering thousands of
operations every second and can handle petabytes of resources with almost zero
downtime. It was created by Facebook back in 2008 and was published publicly.
12
13. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Tools used in Big Data analytics
Apache Storm
A storm is a robust, user-friendly tool used for data analytics, especially in
small companies. The best part about the storm is that it has no language
barrier (programming) in it and can support any of them. It was designed to
handle a pool of large data in fault-tolerance and horizontally scalable
methods. When we talk about real-time data processing, Storm leads the chart
because of its distributed real-time big data processing system, due to which
today many tech giants are using APACHE Storm in their system. Some of the
most notable names are Twitter, Zendesk, NaviSite, etc.
13
14. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Applications of Big Data Analytics
• Customer Acquisition and Retention: Customer information helps tremendously
in marketing trends, through data-driven actions, to increase customer satisfaction.
For example, personalization engines for Netflix, Amazon, and Spotify help with
improved customer experiences and gaining customer loyalty.
• Targeted Ads: Personalized data about interaction patterns, order history, and
product page viewing history can help immensely to create targeted ad campaigns
for customers on a larger scale and at the individual level.
14
15. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Applications of Big Data Analytics
• Product Development: It can generate insights on development decisions, product
viability, performance measurements, etc., and direct improvements that positively
serve the customers.
• Price Optimization: Pricing models can be modeled and used by retailers with the
help of diverse data sources to maximize revenues.
• Supply Chain and Channel Analytics: Predictive analytical models help with
B2B supplier networks, preemptive replenishment, route optimizations, inventory
management, and notification of potential delays in deliveries.
• Risk Management: It helps in the identification of new risks with the help of data
patterns for the purpose of developing effective risk management strategies.
• Improved Decision-making: The insights that are extracted from the data can help
enterprises make sound and quick decisions.
15
16. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Examples/Areas Using Big Data Analytics Tools
• Healthcare: Big data analytics technologies and tools are being used in healthcare
to predict patient outcomes, identify at-risk patients, and improve population health.
• Retail: Big data analytics tools are being used by retailers to improve customer
experience, target marketing campaigns, and prevent fraud.
• Manufacturing: Big data analytics tools are being used in manufacturing to
improve quality control, reduce downtime, and optimize production processes.
• Banking: Real time big data analytics tools are being used by banks to detect
fraudulent activities, prevent money laundering, and improve customer service.
• Government: Big data analytics tools are being used by government agencies to
improve public services, combat fraud and corruption, and better understand citizen
needs.
16
17. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• The Data analytics lifecycle was designed to address Big Data problems and data
science projects. The process is repeated to show the real projects. To address the
specific demands for conducting analysis on Big Data, the step-by-step
methodology is required to plan the various tasks associated with the acquisition,
processing, analysis, and recycling of data.
17
18. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
18
19. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 1: Discovery –
• The data science team is trained and researches the issue.
• Create context and gain understanding.
• Learn about the data sources that are needed and accessible to the project.
• The team comes up with an initial hypothesis, which can be later confirmed
with evidence.
19
20. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 1: Discovery –
• The first phase of the Data Analytics Lifecycle is the data discovery step. This
stage involves identifying potential data sources, both internal and external,
that are relevant to the business problem at hand. It is essential to define the
scope of the analysis and gather data from various databases, applications, and
online repositories. Data can come in different formats, including structured,
unstructured, and semi-structured data.
• The key to success in this phase is to ensure the data collected is accurate,
relevant, and comprehensive. Missing or flawed data can lead to misleading
insights and decisions down the line. Rigorous data quality checks and
validation procedures are necessary to maintain data integrity.
20
21. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 2: Data Preparation -
• Once the data is collected, it is crucial to clean and preprocess it before
analysis. Data preparation involves identifying and rectifying errors,
duplications, and inconsistencies in the dataset. This process ensures that the
data is of high quality and ready for further analysis.
• Data preprocessing tasks may include data transformation, normalisation, and
handling missing values. Cleaning and preprocessing are time-consuming but
vital steps that significantly impact the accuracy and reliability of the final
results. Proper data preprocessing can also help in dealing with noise and
irrelevant data, leading to better outcomes.
21
22. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 2: Data Preparation -
• Methods to investigate the possibilities of pre-processing, analysing, and
preparing data before analysis and modelling.
• It is required to have an analytic sandbox. The team performs, loads, and
transforms to bring information to the data sandbox.
• Data preparation tasks can be repeated and not in a predetermined sequence.
• Some of the tools used commonly for this process include - Hadoop, Alpine
Miner, Open Refine, etc.-
22
23. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 2: Data Preparation -
• Data preparation and processing involves gathering, sorting, processing and
purifying collected information to make sure it can be utilized by subsequent
steps of analysis.
• Data Collection: Draw information from external sources.
• Data Entry: Within an organization, data entry refers to creating new points of
information using either digital technologies or manual input procedures.
• Signal Reception: Accumulating data from digital devices like the Internet of
Things devices and control systems.
• An analytical sandbox is essential during the data preparation stage of data
analytics Life Cycle. This scalable platform is used by data analysts and
scientists alike for processing their data sets; once executed, loaded, or altered
it resides securely inside this sandbox for later examination and modification.
23
24. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 3: Model Planning -
• The team studies data to discover the connections between variables. Later, it
selects the most significant variables as well as the most effective models.
• In this phase, the data science teams create data sets that can be used for
training for testing, production, and training goals.
• The team builds and implements models based on the work completed in the
modelling planning phase.
• Some of the tools used commonly for this stage are MATLAB and
STASTICA.
24
25. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 3: Model Planning -
• The team studies data to discover the connections between variables. Later, it
selects the most significant variables as well as the most effective models.
• In this phase, the data science teams create data sets that can be used for
training for testing, production, and training goals.
• The team builds and implements models based on the work completed in the
modelling planning phase.
• Some of the tools used commonly for this stage are MATLAB and
STASTICA.
25
26. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 4: Model Building -
• The team creates datasets for training, testing as well as production use.
• The team is also evaluating whether its current tools are sufficient to run the
models or if they require an even more robust environment to run models.
• Tools that are free or open-source or free tools Rand PL/R, Octave, WEKA.
• Commercial tools - MATLAB, STASTICA.
26
27. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 5: Communication Results -
• Following the execution of the model, team members will need to evaluate the
outcomes of the model to establish criteria for the success or failure of the
model.
• The team is considering how best to present findings and outcomes to the
various members of the team and other stakeholders while taking into
consideration cautionary tales and assumptions.
• The team should determine the most important findings, quantify their value to
the business and create a narrative to present findings and summarize them to
all stakeholders.
27
28. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
• Life Cycle of Data Analytics
• Phase 6: Operationalize -
• The team distributes the benefits of the project to a wider audience. It sets up a
pilot project that will deploy the work in a controlled manner prior to
expanding the project to the entire enterprise of users.
• This technique allows the team to gain insight into the performance and
constraints related to the model within a production setting at a small scale and
then make necessary adjustments before full deployment.
• The team produces the last reports, presentations, and codes.
• Open source or free tools such as WEKA, SQL, MADlib, and Octave.
28
29. BDA- Unit-I BIG DATA ANALYTICS LIFE CYCLE Department of IT
Lifecycle of Big Data analytics
The Big Data Analytics Life cycle is divided into nine phases, named as :
1. Business Case/Problem Definition
2. Data Identification
3. Data Acquisition and filtration
4. Data Extraction
5. Data Munging(Validation and Cleaning)
6. Data Aggregation & Representation(Storage)
7. Exploratory Data Analysis
8. Data Visualization(Preparation for Modeling and Assessment)
9. Utilization of analysis results.
29