SEMINAR PRESENTATION FOR DATA SCIENCE

•Download as PPTX, PDF•

0 likes•20 views

It is a seminar project made by me . It is about the the data science and it's uses. As computer science student this is a important to manage the data and how to deal with easily and efficiently.

Education

TITLE:PRESENTATION ON hyper transport
Presented by:-
Name-Goutam Nayak
Section-ECE
Semester:-3rd
Regd no-2201298516
Guided by-

What Is Hyper Transport:
Hyper Transport is a scalable packet-based, high-
bandwidth, and low-latency point-to-point interconnect
technology intended to interconnect processors and also
link them to I/O peripheral devices. Hyper Transport was
initially devised as an efficient replacement for traditional
system buses for on-board communications.

History Of Hyper Transport:
• In earlier time there is no much more data of user in the internet so
all data are stores in excel form .But gradually number user increases
so to store the data
of user large no of ware house are made. This data is used to deal
with
real world problem like to improve the business strategy improve
user experience.

Data collection :
The first step of data science is data collection .This involves
gathering data from various sources and cleaning it to ensure
accuracy .It’s important to use reliable sources and appropriate
methods for collecting data.
Data is collected from different source like
1.web data, e-commerce
2.Financial transactions, bank/credit transactions
3.Online trading and purchasing
4.Social network

Data analysis:
Data analysis involves exploring and
interpreting data to identify patterns
and trends. This can be done using
statistical methods and visualization
tools .It’s important to have a
hypothesis and a plan analysis.
 According to a report Google process
20PB data daily.
 Facebook has 60TB of daily logs.
 eBay has 6.5PB of user data.

Machine learning:
Machine learning is a subset of
data science that involves
using algorithm to learn
patterns from data. It can be
used for predication and
classification tasks. It’s
important to choose the right
algorithm and evaluates its
performance.

Conclusion:
Data science is a powerful tools for extracting insights from data.
To be effective, it requires a combination of technical skills and
domain expertise . By following the techniques and strategies
discussed one can become a more effective data scientist.

This document summarizes a survey on data mining. It discusses how data mining helps extract useful business information from large databases and build predictive models. Commonly used data mining techniques are discussed, including artificial neural networks, decision trees, genetic algorithms, and nearest neighbor methods. An ideal data mining architecture is proposed that fully integrates data mining tools with a data warehouse and OLAP server. Examples of profitable data mining applications are provided in industries such as pharmaceuticals, credit cards, transportation, and consumer goods. The document concludes that while data mining is still developing, it has wide applications across domains to leverage knowledge in data warehouses and improve customer relationships.

Study on Positive and Negative Rule Based Mining Techniques for E-Commerce Ap...

Association of Scientists, Developers and Faculties

In the recent years the scope of data mining has evolved into an active area of research because of the previously unknown and interesting knowledge from very large database collection. The data mining is applied on a variety of applications in multiple domains like in business, IT and many more sectors. In Data Mining the major problem which receives great attention by the community is the classification of the data. The classification of data should be such that it could be they can be easily verified and should be easily interpreted by the humans. In this paper we would be studying various data mining techniques so that we can find few combinations for enhancing the hybrid technique which would be having multiple techniques involved so enhance the usability of the application. We would be studying CHARM Algorithm, CM-SPAM Algorithm, Apriori Algorithm, MOPNAR Algorithm and the Top K Rules.

SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...

ijdpsjournal

The paper aims at proposing a solution for designing and developing a seamless automation and integration of machine learning capabilities for Big Data with the following requirements: 1) the ability to seamlessly handle and scale very large amount of unstructured and structured data from diversified and heterogeneous sources; 2) the ability to systematically determine the steps and procedures needed for analyzing Big Data datasets based on data characteristics, domain expert inputs, and data pre-processing component; 3) the ability to automatically select the most appropriate libraries and tools to compute and accelerate the machine learning computations; and 4) the ability to perform Big Data analytics with high learning performance, but with minimal human intervention and supervision. The whole focus is to provide a seamless automated and integrated solution which can be effectively used to analyze Big Data with highfrequency and high-dimensional features from different types of data characteristics and different application problem domains, with high accuracy, robustness, and scalability. This paper highlights the research methodologies and research activities that we propose to be conducted by the Big Data researchers and practitioners in order to develop and support seamless automation and integration of machine learning capabilities for Big Data analytics.

SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...

ijdpsjournal

Data Mining @ Information Age

IIRindia

In the information age, data turns to be the vital. Hence it is important to understand the data in order to face the future information challenges. This paper deals with the importance of data mining while explaining the concepts and life cycle involved. It extracts the basic gist of the topic presented in a user-friendly way. Further, in developing different stages of data mining followed by its extended application usage in practical business platform.

IRJET- A Survey on Mining of Tweeter Data for Predicting User Behavior

IRJET Journal

This document discusses mining and analyzing social media data using big data techniques to predict user behavior. It proposes using tools like Hadoop and HDFS to capture trends in areas like drug abuse from large amounts of Twitter data. A framework is presented that involves gathering Twitter data using APIs, applying big data mining techniques, and using the results for more sophisticated analysis applications to help address issues like monitoring public health. Challenges around managing large social media datasets are also discussed.

06. 9534 14985-1-ed b edit dhyan

IAESIJEECS

Big data refers to huge set of data which is very common these days due to the increase of internet utilities. Data generated from social media is a very common example for the same. This paper depicts the summary on big data and ways in which it has been utilized in all aspects. Data mining is radically a mode of deriving the indispensable knowledge from extensively vast fractions of data which is quite challenging to be interpreted by conventional methods. The paper mainly focuses on the issues related to the clustering techniques in big data. For the classification purpose of the big data, the existing classification algorithms are concisely acknowledged and after that, k-nearest neighbour algorithm is discreetly chosen among them and described along with an example.

Fundamentals of data mining and its applications

Subrat Swain

Data mining involves applying intelligent methods to extract patterns from large data sets. It is used to discover useful knowledge from a variety of data sources. The overall goal is to extract human-understandable knowledge that can be used for decision-making. The document discusses the data mining process, which typically involves problem definition, data exploration, data preparation, modeling, evaluation, and deployment. It also covers data mining software tools and techniques for ensuring privacy, such as randomization and k-anonymity. Finally, it outlines several applications of data mining in fields like industry, science, music, and more.

This document discusses mining social media data to understand drug usage. It proposes using big data techniques like Hadoop and MapReduce to extract and analyze data from social networks about drug abuse. The methodology involves collecting data from platforms using crawlers, storing it in Hadoop, filtering it, then applying complex analysis using cloud computing. Prior work on extracting health information from social media and multi-scale community detection in networks is reviewed. The challenges of privacy preservation and scalability when anonymizing big healthcare datasets are also discussed.

An introduction to Data Mining

Shobhita Dayal

Borys Pratsiuk "How to be NVidia partner"

Lviv Startup Club

Borys Pratsiuk is the Head of R&D at an unnamed company. He has over 15 years of experience in engineering roles related to Android development, embedded systems, and solid state electronics. He holds a PhD in Solid State Electronics from Kiev Polytechnic Institute and has worked in both academic and industry roles in South Korea and Ukraine. The presentation discusses big data, analytics, artificial intelligence and machine learning applications across various industries. It provides examples of deep learning solutions developed for clients in areas like computer vision, natural language processing, predictive analytics and process automation. The presentation emphasizes Ciklum's full-service approach to developing and deploying deep learning solutions from data collection and modeling to deployment and ongoing support.

An introduction to Data Mining by Kurt Thearling

Pim Piepers

An Introduction to Data Mining Discovering hidden value in your data warehouse By Kurt Thearling Overview Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential to help companies focus on the most important information in their data warehouses. Data mining tools predict future trends and behaviors, allowing businesses to make proactive, knowledge-driven decisions. The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by retrospective tools typical of decision support systems. Data mining tools can answer business questions that traditionally were too time consuming to resolve. They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations. Most companies already collect and refine massive quantities of data. Data mining techniques can be implemented rapidly on existing software and hardware platforms to enhance the value of existing information resources, and can be integrated with new products and systems as they are brought on-line. When implemented on high performance client/server or parallel processing computers, data mining tools can analyze massive databases to deliver answers to questions such as, "Which clients are most likely to respond to my next promotional mailing, and why?" This white paper provides an introduction to the basic technologies of data mining. Examples of profitable applications illustrate its relevance to today’s business environment as well as a basic description of how data warehouse architectures can evolve to deliver the value of data mining to end users.

Data management plans – EUDAT Best practices and case study | www.eudat.eu

EUDAT

| www.eudat.eu | Presentation given by Stéphane Coutin during the PRACE 2017 Spring School joint training event with the EU H2020 VI-SEEM project (https://vi-seem.eu/) organised by CaSToRC at The Cyprus Institute. Science and more specifically projects using HPC is facing a digital data explosion. Instruments and simulations are producing more and more volume; data can be shared, mined, cited, preserved… They are a great asset, but they are facing risks: we can miss storage, we can lose them, they can be misused,… To start this session, we will review why it is important to manage research data and how to do this by maintaining a Data Management Plan. This will be based on the best practices from EUDAT H2020 project and European Commission recommendation. During the second part we will interactively draft a DMP for a given use case.

An introduction to data mining

Shiva Krishna Chandra Shekar

Data mining involves using algorithms to automatically find patterns in large datasets. It is used to make predictions about future trends and behaviors to help companies make proactive decisions. The document discusses the history and evolution of data mining, from early data collection and storage to today's powerful algorithms and massive databases. Common data mining techniques are also outlined.

Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...

Sahilakhurana

Banking and securities Challenges Early warning for securities fraud and trade visibilities Card fraud detection and audit trails Enterprise credit risk reporting Customer data transformation and analytics. The Security Exchange commission (SEC) is using big data to monitor financial market activity by using network analytics and natural language processing. This helps to catch illegal trading activity in the financial markets. The Data Analytics Lifecycle is designed specifically for Big Data problems and data science projects. The lifecycle has six phases, and project work can occur in several phases at once. For most phases in the lifecycle, the movement can be either forward or backward. This iterative depiction of the lifecycle is intended to more closely portray a real project, in which aspects of the project move forward and may return to earlier stages as new information is uncovered and team members learn more about various stages of the project. This enables participants to move iteratively through the process and drive toward operationalizing the project work. Phase 1—Discovery: In Phase 1, the team learns the business domain, including relevant history such as whether the organization or business unit has attempted similar projects in the past from which they can learn. The team assesses the resources available to support the project in terms of people, technology, time, and data. Important activities in this phase include framing the business problem as an analytics challenge that can be addressed in subsequent phases and formulating initial hypotheses (IHs) to test and begin learning the data. Phase 2—Data preparation: Phase 2 requires the presence of an analytic sandbox, in which the team can work with data and perform analytics for the duration of the project. The team needs to execute extract, load, and transform (ELT) or extract, transform and load (ETL) to get data into the sandbox. The ELT and ETL are sometimes abbreviated as ETLT. Data should be transformed in the ETLT process so the team can work with it and analyze it. In this phase, the team also needs to familiarize itself with the data thoroughly and take steps to condition the data.

UNIT - 1 : Part 1: Data Warehousing and Data Mining

Nandakumar P

The document provides an overview of data warehousing and data mining. It discusses how data warehousing transforms data into information to support decision making. It contrasts operational systems optimized for transactions with data warehouses designed for analysis. Data warehouses integrate data from multiple sources and support multidimensional analysis and ad-hoc queries. The document also introduces data mining as a way to extract intelligence from warehouse data.

QuickView #3 - Big Data

Sonovate

Forecast to contribute £216 billion to the UK economy via business creation, efficiency and innovation, and generate 360,000 new jobs by 2020, big data is a key area for recruiters. In this QuickView: - Big data in numbers - Top 10 industries hiring big data professionals - Top 10 qualifications sought by hirers - Top 10 database and BI skills sought by hirers - Getting started in big data: popular big data techniques and vendors

Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate

Oomph! Recruitment

IRJET- Survey of Big Data with Hadoop

IRJET Journal

This document discusses big data and Hadoop. It begins with an introduction to big data, noting the volume, variety and velocity of data. It then provides an overview of Hadoop, including its core components HDFS for storage and MapReduce for processing. The document also outlines some of the key challenges of big data including heterogeneity, scale, timeliness, privacy and the need for human collaboration in analysis. It concludes by discussing how Hadoop provides a solution for big data processing through its distributed architecture and use of HDFS and MapReduce.

Wearable Technology Orientation using Big Data Analytics for Improving Qualit...

IRJET Journal

This document discusses using big data analytics on data from wearable devices to improve personalized recommendations and quality of life. It proposes a framework that uses Hadoop and MapReduce to analyze large amounts of data from various wearables. The framework includes data acquisition, processing, and storing in HDFS. It then performs analytics to populate a personalized knowledge base and provide adaptive recommendations. This framework aims to better leverage and analyze the large and growing volumes of data from wearables.

Mis 2

Rohit Garg

This document discusses emerging information technologies that can improve organizational performance, including expert systems, neural networks, geographic information systems, groupware, and internet applications. It provides an overview of key topics such as the evolution of IT infrastructure, advantages and challenges of the internet and groupware, how expert systems and neural networks can be used, applications of geographic information systems, and contemporary trends in hardware/software platforms and management issues when adopting new technologies.

Real World Application of Big Data In Data Mining Tools

ijsrd.com

The main aim of this paper is to make a study on the notion Big data and its application in data mining tools like R, Weka, Rapidminer, Knime,Mahout and etc. We are awash in a flood of data today. In a broad range of application areas, data is being collected at unmatched scale. Decisions that previously were based on surmise, or on painstakingly constructed models of reality, can now be made based on the data itself. Such Big Data analysis now drives nearly every aspect of our modern society, including mobile services, retail, manufacturing, financial services, life sciences, and physical sciences. The paper mainly focuses different types of data mining tools and its usage in big data in knowledge discovery.

Introduction to Data Analytics and data analytics life cycle

Dr. Radhey Shyam

The document provides an overview of data analytics and big data concepts. It discusses the characteristics of big data, including the four V's of volume, velocity, variety and veracity. It also describes different types of data like structured, semi-structured and unstructured data. The document then introduces big data platforms and tools like Hadoop, Spark and Cassandra. Finally, it discusses the need for data analytics in business, including enabling better decision making and improving efficiency.

Complete-SRS.doc

jadhavpravin920

The software development process is complete for computer project analysis, and it is important to the evaluation of the random project. These practice guidelines are for those who manage big-data and big-data analytics projects or are responsible for the use of data analytics solutions. They are also intended for business leaders and program leaders that are responsible for developing agency capability in the area of big data and big data analytics . For those agencies currently not using big data or big data analytics, this document may assist strategic planners, business teams and data analysts to consider the value of big data to the current and future programs. This document is also of relevance to those in industry, research and academia who can work as partners with government on big data analytics projects. Technical APS personnel who manage big data and/or do big data analytics are invited to join the Data Analytics Centre of Excellence Community of Practice to share information of technical aspects of big data and big data analytics, including achieving best practice with modeling and related requirements. To join the community, send an email to the Data Analytics Centre of Excellence

L3 Big Data and Application.pptx

Shambhavi Vats

This document provides an introduction to big data and data science from Amity Institute of Information Technology. It defines big data and data science, highlighting that big data is a subset of data science. The key differences between big data and data science are described. Examples of applications of big data in various domains like social media, healthcare, finance, ecommerce and education are outlined. Finally, the skills required to become a data scientist or big data specialist are summarized.

ii mca juno

Ramya Sasi

Data mining involves using analytical techniques to discover patterns in large data sets. It is used to gain insights into business problems like predicting customer behavior or identifying fraud. The key steps in data mining include requirement analysis, data collection/preparation, exploration of techniques, implementation/evaluation, and visualization of results. Applications include prediction, relationship marketing, customer profiling, outlier detection, and customer segmentation.

Unit-1 -2-3- BDA PIET 6 AIDS.pptx

YashiBatra1

The document discusses the syllabus for a course on Big Data Analytics. The syllabus covers four units: (1) an introduction to big data concepts like distributed file systems, Hadoop, and MapReduce; (2) Hadoop architecture including HDFS, MapReduce, and YARN; (3) Hadoop ecosystem components like Hive, Pig, HBase, and Spark; and (4) new features of Hadoop 2.0 like high availability for NameNode and HDFS federation. The course aims to provide students with foundational knowledge of big data technologies and tools for processing and analyzing large datasets.

13 pv-do es-18-bigdata-v3

Aravindharamanan S

1. Determine if a Big Data approach is suitable based on factors like volume, variety and velocity of data as well as the need for iterative, exploratory analysis. 2. Use techniques like Hadoop, MapReduce and NoSQL databases that can analyze large, diverse, unstructured datasets in a distributed, parallel manner. 3. Follow data management best practices like data governance, quality checks, and master data management to ensure clean, well-organized data.

বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf

eBook.com.bd (প্রয়োজনীয় বাংলা বই)

বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত .. আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ... তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...। বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...

Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...

Diana Rendina

Librarians are leading the way in creating future-ready citizens – now we need to update our spaces to match. In this session, attendees will get inspiration for transforming their library spaces. You’ll learn how to survey students and patrons, create a focus group, and use design thinking to brainstorm ideas for your space. We’ll discuss budget friendly ways to change your space as well as how to find funding. No matter where you’re at, you’ll find ideas for reimagining your space in this session.

BBR 2024 Summer Sessions Interview Training

Katrina Pritchard

South African Journal of Science: Writing with integrity workshop (2024)

Academy of Science of South Africa

The History of Stoke Newington Street Names

History of Stoke Newington

Your Skill Boost Masterclass: Strategies for Effective Upskilling

Excellence Foundation for South Sudan

How to Manage Your Lost Opportunities in Odoo 17 CRM

Celine George

Recently uploaded (20)

বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf

Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...

UGC NET Exam Paper 1- Unit 1:Teaching Aptitude

PCOS corelations and management through Ayurveda.

A Independência da América Espanhola LAPBOOK.pdf

MARY JANE WILSON, A “BOA MÃE” .

The Diamonds of 2023-2024 in the IGRA collection

Cognitive Development Adolescence Psychology

Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...

ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf

How to deliver Powerpoint Presentations.pptx

Pengantar Penggunaan Flutter - Dart programming language1.pptx

BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...

คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1

How to Make a Field Mandatory in Odoo 17

BBR 2024 Summer Sessions Interview Training

South African Journal of Science: Writing with integrity workshop (2024)

The History of Stoke Newington Street Names

Your Skill Boost Masterclass: Strategies for Effective Upskilling

How to Manage Your Lost Opportunities in Odoo 17 CRM

SEMINAR PRESENTATION FOR DATA SCIENCE

1. TITLE:PRESENTATION ON hyper transport Presented by:- Name-Goutam Nayak Section-ECE Semester:-3rd Regd no-2201298516 Guided by-

2. What Is Hyper Transport: Hyper Transport is a scalable packet-based, high- bandwidth, and low-latency point-to-point interconnect technology intended to interconnect processors and also link them to I/O peripheral devices. Hyper Transport was initially devised as an efficient replacement for traditional system buses for on-board communications.

3. History Of Hyper Transport: • In earlier time there is no much more data of user in the internet so all data are stores in excel form .But gradually number user increases so to store the data of user large no of ware house are made. This data is used to deal with real world problem like to improve the business strategy improve user experience.

4. Data collection : The first step of data science is data collection .This involves gathering data from various sources and cleaning it to ensure accuracy .It’s important to use reliable sources and appropriate methods for collecting data. Data is collected from different source like 1.web data, e-commerce 2.Financial transactions, bank/credit transactions 3.Online trading and purchasing 4.Social network

5. Data analysis: Data analysis involves exploring and interpreting data to identify patterns and trends. This can be done using statistical methods and visualization tools .It’s important to have a hypothesis and a plan analysis.  According to a report Google process 20PB data daily.  Facebook has 60TB of daily logs.  eBay has 6.5PB of user data.

6. Machine learning: Machine learning is a subset of data science that involves using algorithm to learn patterns from data. It can be used for predication and classification tasks. It’s important to choose the right algorithm and evaluates its performance.

7. Feature of data science:

8. Conclusion: Data science is a powerful tools for extracting insights from data. To be effective, it requires a combination of technical skills and domain expertise . By following the techniques and strategies discussed one can become a more effective data scientist.

SEMINAR PRESENTATION FOR DATA SCIENCE

Recommended

Recommended

More Related Content

Similar to SEMINAR PRESENTATION FOR DATA SCIENCE

Similar to SEMINAR PRESENTATION FOR DATA SCIENCE (20)

Recently uploaded

Recently uploaded (20)

SEMINAR PRESENTATION FOR DATA SCIENCE