SlideShare a Scribd company logo
1 of 35
‫الضخمة‬ ‫البيانات‬ ‫تقنيات‬
‫شاملة‬ ‫نظرة‬
‫تاريخية‬ ‫نظرة‬
File
‫منظمة‬ ‫ر‬
‫غي‬ ‫بيانات‬
Object
‫منظمة‬ ‫ر‬
‫غي‬ ‫بيانات‬
Table
‫منظمة‬ ‫بيانات‬
XML
‫منظمة‬ ‫شبة‬ ‫بيانات‬
Metadata
Database
Data Warehouse
‫الضخمة‬ ‫البيانات‬
‫البيانات‬ ‫مستودع‬
‫أساسية‬ ‫مفاهيم‬
‫الضخمة‬ ‫البيانات‬ ‫تقنيات‬ ‫ي‬
‫ف‬
‫العمليات‬
Data Scientist,
Analyst, Engineer
‫محلل‬ ‫و‬ ‫مهندس‬
‫البيانات‬ ‫وعالم‬
Data Scientist
‫البيانات‬ ‫عالم‬ Data Engineer
‫البيانات‬ ‫مهندس‬
Data Type
‫البيانات‬ ‫نوع‬
Data Analyst
‫البيانات‬ ‫محلل‬
Visualization
‫البيانات‬‫تصوير‬
Data Mining
‫عن‬ ‫التنقيب‬
‫البيانات‬
Multidimensions -
Tabular
‫االبعاد‬ ‫متعدد‬
-
‫مجدول‬
Data Warehouse
‫البيانات‬ ‫مستودع‬
ETL
‫اج‬‫ر‬‫استخ‬
–
‫تحويل‬
-
‫تحمي‬
‫ل‬
Data
Warehouse
(Structured
data)
‫البيانات‬
‫مستودع‬
(
‫منظمة‬
‫بيانات‬
)
No SQL – Hadoop –
Object Storage
Aggregation Tables
‫المجاميع‬ ‫جداول‬
Streaming + Transform
‫جمع‬
+
‫تحويل‬
Big
Data
(Unstructured
Data)
‫الضخمة‬
‫البيانات‬
(
‫منظمة‬
‫ر‬
‫غي‬
‫بيانات‬
)
Stream Platform
‫وهندسة‬ ‫مفهوم‬
Stream Platform
Hadoop
‫وهندسة‬ ‫مفهوم‬
Hadoop
Hadoop
Streaming + Transform
‫جمع‬
+
‫تحويل‬ Visualization
‫البيانات‬‫تصوير‬
Archive
‫أرشفة‬
Sources ‫مصادر‬ Process ‫معالجة‬
Aggregation Database
‫المجاميع‬ ‫بيانات‬ ‫قاعدة‬
Hadoop
Cleaning
‫تنظيف‬
Quality
‫جودة‬
Filter
‫تنقيح‬
Databases
Files
Could ML
‫اآللة‬ ‫تعلم‬
Reports
‫التقارير‬
Hadoop
DEMO
Object Storage
‫الضخمة‬ ‫البيانات‬ ‫تقنيات‬
Object Storage
S3
HDFS
S3 vs HDFS
Elasticity ‫المرونة‬
-
Yes
No
S3 ‫مرونة‬ ‫اكثر‬
Cost/TB/month ‫التكلفة‬
-
$23
$206
10X
Availability ‫االتاحة‬
-
99.99%
99.9% (estimated)
10X
Durability ‫المتانة‬
-
99.999999999%
99.9999% (estimated)
10X+
Transactional writes
Yes with DBIO
Yes
‫مشابه‬
‫الحديثة‬ ‫البيانات‬‫اكز‬‫ر‬‫م‬
CPUs
Distributed Object Storage
• AWS S3 (Simple Storage
Service)
• Azure Blob Storage
• Google cloud storage
RAMs Hard Disks
NoSQL
‫شاملة‬ ‫نظرة‬
NoSQL
NoSQL
NoSQL Database:
NoSQL
NoSQL Database:
NoSQL
NoSQL Database:
NoSQL
NoSQL Database:
NoSQL
NoSQL Database:
NoSQL
Data Process
‫الضخمة‬ ‫البيانات‬ ‫تقنيات‬ ‫ي‬
‫ف‬
‫سبارك‬
Apache Spark
‫ماه‬
:
‫منصة‬
‫المصدر‬ ‫مفتوحة‬
‫تقنية‬ ‫تدعم‬
Cluster
‫ملفات‬ ‫إلدارة‬
‫هادوب‬
‫و‬
Object
‫وعرضها‬ ‫البيانات‬ ‫وتحليل‬
1
2
3
Database of Databases
Druid vs Presto: What are the differences?
What is Druid? Fast column-oriented distributed data store. Druid is
a distributed, column-oriented, real-time analytics data store that is
commonly used to power exploratory dashboards in multi-tenant
environments. Druid excels as a data warehousing solution for fast
aggregate queries on petabyte sized data sets. Druid supports a variety
of flexible filters, exact calculations, approximate algorithms, and other
useful calculations.
What is Presto? Distributed SQL Query Engine for Big Data. Presto
is an open source distributed SQL query engine for running interactive
analytic queries against data sources of all sizes ranging from gigabytes
to petabytes.
Druid and Presto can be categorized as "Big Data" tools.
Both Support OLAP (Cube) concept.
Database of Databases
Database of Databases
‫لكم‬ ‫ا‬‫ر‬‫شك‬
Fahad Albakri
fnbakri
Fahad Albakri
fnbakri.blogspot.com

More Related Content

Similar to تقنيات البيانات الضخمة.pptx

Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24
Martin Bém
 
DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]
vasanth kumar C
 
Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
ssuser290967
 

Similar to تقنيات البيانات الضخمة.pptx (20)

Testing Big Data: Automated Testing of Hadoop with QuerySurge
Testing Big Data: Automated  Testing of Hadoop with QuerySurgeTesting Big Data: Automated  Testing of Hadoop with QuerySurge
Testing Big Data: Automated Testing of Hadoop with QuerySurge
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
 
Hadoop Big data Solution Provider
Hadoop Big data Solution ProviderHadoop Big data Solution Provider
Hadoop Big data Solution Provider
 
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCampSteve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
QuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing WebinarQuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing Webinar
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
 
Datalake Architecture
Datalake ArchitectureDatalake Architecture
Datalake Architecture
 
Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24
 
"Building Data Warehouse with Google Cloud Platform", Artem Nikulchenko
"Building Data Warehouse with Google Cloud Platform",  Artem Nikulchenko"Building Data Warehouse with Google Cloud Platform",  Artem Nikulchenko
"Building Data Warehouse with Google Cloud Platform", Artem Nikulchenko
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014
 
DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]
 
Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
 
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
Modernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APSModernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APS
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dw
 

Recently uploaded

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 

Recently uploaded (20)

Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
How to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in PakistanHow to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in Pakistan
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 

تقنيات البيانات الضخمة.pptx