What is “Data Engineering”?
Data Engineering Lab.
Kim Yong Dam
DataPub 12/3
Data Engineering Lab. in Sogang Univ.
<Contents>
1. Introduction
2. What is Data Engineering?
3. Role of Data Engineer
4. What I’m doing?
5. Future Work
1. Introduction
Introduction
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Features / value
Optimization Price / Analysis
How?
2. What is Data Engineering?
Data Engineering
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Tons to do...
Build systems with respect to each data domain
“On Computer Architecture”
3. Role of Data Engineer
Role of Data Engineer
https://jobs.apple.com/us/search?job=86260820&openJobId=86260820#&ss=Data%20Engineer&t=0&so=&pN=0&openJobId=99607161
Role of Data Engineer
https://cloud.google.com/certification/data-engineer
Role of Data Engineer
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
“Should focus on something”
4. What I’m doing?
My Voyage
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
For what?
My Voyage
http://www.ibmbigdatahub.com/infographic/four-vs-big-data
My Voyage
http://www.jobs.ac.uk/enhanced/industry/lifesciences-london/
Make an implemented connection
As a TEAM!
5. Future Work
Future Work
1. Tree Optimization for Spatial data in Non-Volatile Memory
2. Keyword Clustering for SNS data analysis
3. Clustering technique as unsupervised learning
4. Spatial Web Querying using Spatial Database
Future Work
1. PB+ tree, R-tree for PCM
2. Ontology-based Keyword Clustering, Review on Semantic
Document Clustering
3. An efficient K-Means Algorithm integrated with Jaccard Distance
Measure for Document Clustering, A New Mallows Distance
Based Metric for Comparing Clusterings, Measuring Similarity
between Sets of Overlapping Clusters
4. Efficient Processing of Spatial Group Keyword Queries, Keyword
Search in Spatial Databases: Toward Searching by Document
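
Item 3 above names the Jaccard distance as a measure for document clustering. As a rough illustration only, and not the algorithm from the cited paper, the distance between two keyword sets could be sketched as below. The function name and the sample keyword sets are hypothetical.

```python
def jaccard_distance(a: set, b: set) -> float:
    """Return 1 - |a ∩ b| / |a ∪ b|.

    Two empty sets are treated as identical (distance 0.0); this is a
    convention chosen for the sketch, not something the slides specify.
    """
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Hypothetical keyword sets standing in for three documents.
doc_a = {"spatial", "index", "r-tree"}
doc_b = {"spatial", "keyword", "query", "r-tree"}
doc_c = {"clustering", "k-means"}

# doc_a and doc_b share 2 of 5 distinct keywords; doc_a and doc_c share none.
dist_ab = jaccard_distance(doc_a, doc_b)
dist_ac = jaccard_distance(doc_a, doc_c)
```

A clustering step would then group documents whose pairwise distance is small; how that is combined with K-Means is left to the cited work.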
Q & A
Thank you
