SlideShare a Scribd company logo
1 of 11
Photo credit: thetelecomblog.com
Photo credit: old-photos.com
Active Directory
Source: Big Data & CIO Adoption Quantitative Research, Sept 2012, Microsoft
Hadoop
native libraries
for Windows
Contributed
FileSystem
implementation
for Azure Storage
Hive
interactive
query
execution
10,000+
Code line
contributions
HDFS
permissions
model mapped
to Windows
Azure VM
donation
intended for
Jenkins Servers
supporting
Continuous
Integration efforts
6,000+
Engineering
hours
Native Task
Controller
for Hadoop
on Windows
+
Hortonworks
Data Platform
for Windows
HDInsight
Service
Transform Healthcare
Master Chief Meets Big Data
HDP
Reaching a Billion Users with Hadoop

More Related Content

What's hot

Data revolution by Doug Cutting
Data revolution by Doug CuttingData revolution by Doug Cutting
Data revolution by Doug CuttingFelicia Haggarty
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Imam Raza
 
An overview of BigQuery
An overview of BigQuery An overview of BigQuery
An overview of BigQuery GirdhareeSaran
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014James Chittenden
 
Hadoop Desktop Cluster
Hadoop Desktop ClusterHadoop Desktop Cluster
Hadoop Desktop ClusterPaul Morse
 
Alluxio - Virtual Unified File System
Alluxio - Virtual Unified File System Alluxio - Virtual Unified File System
Alluxio - Virtual Unified File System Alluxio, Inc.
 
(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data
(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data
(STG308) How EA, State Of Texas & H3 Biomedicine Protect DataAmazon Web Services
 
Single View of Well, Production and Assets
Single View of Well, Production and AssetsSingle View of Well, Production and Assets
Single View of Well, Production and AssetsJohn Archer
 
Cloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Chris Jang
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQueryWlodek Bielski
 
Quick Intro to Google Cloud Technologies
Quick Intro to Google Cloud TechnologiesQuick Intro to Google Cloud Technologies
Quick Intro to Google Cloud TechnologiesChris Schalk
 
StackEngine Demo - Docker Austin
StackEngine Demo - Docker AustinStackEngine Demo - Docker Austin
StackEngine Demo - Docker AustinBoyd Hemphill
 
Cloud Expo NYC 2017: Running Databases in Containers
Cloud Expo NYC 2017: Running Databases in Containers Cloud Expo NYC 2017: Running Databases in Containers
Cloud Expo NYC 2017: Running Databases in Containers Ocean9, Inc.
 
BigQuery for the Big Data win
BigQuery for the Big Data winBigQuery for the Big Data win
BigQuery for the Big Data winKen Taylor
 
Cloud Expo NYC 2017: Big Data in IoT
Cloud Expo NYC 2017: Big Data in IoTCloud Expo NYC 2017: Big Data in IoT
Cloud Expo NYC 2017: Big Data in IoTOcean9, Inc.
 

What's hot (20)

Data revolution by Doug Cutting
Data revolution by Doug CuttingData revolution by Doug Cutting
Data revolution by Doug Cutting
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
 
An overview of BigQuery
An overview of BigQuery An overview of BigQuery
An overview of BigQuery
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014
 
Hadoop Desktop Cluster
Hadoop Desktop ClusterHadoop Desktop Cluster
Hadoop Desktop Cluster
 
Alluxio - Virtual Unified File System
Alluxio - Virtual Unified File System Alluxio - Virtual Unified File System
Alluxio - Virtual Unified File System
 
(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data
(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data
(STG308) How EA, State Of Texas & H3 Biomedicine Protect Data
 
Single View of Well, Production and Assets
Single View of Well, Production and AssetsSingle View of Well, Production and Assets
Single View of Well, Production and Assets
 
Cloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's New
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQuery
 
Google Bigtable
Google BigtableGoogle Bigtable
Google Bigtable
 
Bigquery 101
Bigquery 101Bigquery 101
Bigquery 101
 
Quick Intro to Google Cloud Technologies
Quick Intro to Google Cloud TechnologiesQuick Intro to Google Cloud Technologies
Quick Intro to Google Cloud Technologies
 
StackEngine Demo - Docker Austin
StackEngine Demo - Docker AustinStackEngine Demo - Docker Austin
StackEngine Demo - Docker Austin
 
Cloud Expo NYC 2017: Running Databases in Containers
Cloud Expo NYC 2017: Running Databases in Containers Cloud Expo NYC 2017: Running Databases in Containers
Cloud Expo NYC 2017: Running Databases in Containers
 
BigQuery for the Big Data win
BigQuery for the Big Data winBigQuery for the Big Data win
BigQuery for the Big Data win
 
Make your data talk
Make your data talkMake your data talk
Make your data talk
 
SGI Big Data Launch
SGI Big Data LaunchSGI Big Data Launch
SGI Big Data Launch
 
Cloud Expo NYC 2017: Big Data in IoT
Cloud Expo NYC 2017: Big Data in IoTCloud Expo NYC 2017: Big Data in IoT
Cloud Expo NYC 2017: Big Data in IoT
 

Viewers also liked

Aplicación ERP Analytics (spanish)
Aplicación ERP Analytics (spanish)Aplicación ERP Analytics (spanish)
Aplicación ERP Analytics (spanish)Stratebi
 
From 400 bugs to 0 in 2 weeks Using FindBugs
From 400 bugs to 0 in 2 weeks Using FindBugsFrom 400 bugs to 0 in 2 weeks Using FindBugs
From 400 bugs to 0 in 2 weeks Using FindBugsYonatan Maman
 
UTTAM - Desktop_Administration around 3 years
UTTAM - Desktop_Administration around 3 yearsUTTAM - Desktop_Administration around 3 years
UTTAM - Desktop_Administration around 3 yearsUttam Reddy
 
Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel   Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel geektimecoil
 
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guan
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan GuanWorkshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guan
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guanux singapore
 
United states postal service
United states postal serviceUnited states postal service
United states postal servicestudent
 
Theories of Architecture & Urbanism (Project 1part 1 stage 1)
Theories of Architecture & Urbanism (Project 1part 1 stage 1)Theories of Architecture & Urbanism (Project 1part 1 stage 1)
Theories of Architecture & Urbanism (Project 1part 1 stage 1)Soh Shing
 
Special places powerpoint
Special places powerpointSpecial places powerpoint
Special places powerpointsweems1995
 
Building Hybrid Apps with AngularJS and Ionic
Building Hybrid Apps with AngularJS and IonicBuilding Hybrid Apps with AngularJS and Ionic
Building Hybrid Apps with AngularJS and IonicYounes Adounis
 
Cognitive Mapping of Brickfield
Cognitive Mapping of Brickfield Cognitive Mapping of Brickfield
Cognitive Mapping of Brickfield JoyeeLee0131
 
Web Analytics Wednesday - Session Replay Tools are Vital
Web Analytics Wednesday - Session Replay Tools are VitalWeb Analytics Wednesday - Session Replay Tools are Vital
Web Analytics Wednesday - Session Replay Tools are VitalCraig Sullivan
 
Workshop #14: Behaviour, government policy and me: applying behavioural insig...
Workshop #14: Behaviour, government policy and me: applying behavioural insig...Workshop #14: Behaviour, government policy and me: applying behavioural insig...
Workshop #14: Behaviour, government policy and me: applying behavioural insig...ux singapore
 
Workshop #13: Scenario Based Design_handoutsA
Workshop #13: Scenario Based Design_handoutsAWorkshop #13: Scenario Based Design_handoutsA
Workshop #13: Scenario Based Design_handoutsAux singapore
 
Building Technology 1
Building Technology 1Building Technology 1
Building Technology 1Gertrude Lee
 
53 Claves para conocer Machine Learning
53 Claves para conocer Machine Learning53 Claves para conocer Machine Learning
53 Claves para conocer Machine LearningStratebi
 

Viewers also liked (20)

Aplicación ERP Analytics (spanish)
Aplicación ERP Analytics (spanish)Aplicación ERP Analytics (spanish)
Aplicación ERP Analytics (spanish)
 
From 400 bugs to 0 in 2 weeks Using FindBugs
From 400 bugs to 0 in 2 weeks Using FindBugsFrom 400 bugs to 0 in 2 weeks Using FindBugs
From 400 bugs to 0 in 2 weeks Using FindBugs
 
UTTAM - Desktop_Administration around 3 years
UTTAM - Desktop_Administration around 3 yearsUTTAM - Desktop_Administration around 3 years
UTTAM - Desktop_Administration around 3 years
 
Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel   Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel
 
Gerty Theresa Radnitz Cori
Gerty Theresa Radnitz CoriGerty Theresa Radnitz Cori
Gerty Theresa Radnitz Cori
 
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guan
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan GuanWorkshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guan
Workshop #11: What is Right and Wrong Mindfulness by Venerable Chuan Guan
 
Phrasal verbs
Phrasal verbsPhrasal verbs
Phrasal verbs
 
Ionic by Example
Ionic by ExampleIonic by Example
Ionic by Example
 
United states postal service
United states postal serviceUnited states postal service
United states postal service
 
Theories of Architecture & Urbanism (Project 1part 1 stage 1)
Theories of Architecture & Urbanism (Project 1part 1 stage 1)Theories of Architecture & Urbanism (Project 1part 1 stage 1)
Theories of Architecture & Urbanism (Project 1part 1 stage 1)
 
Special places powerpoint
Special places powerpointSpecial places powerpoint
Special places powerpoint
 
Building Hybrid Apps with AngularJS and Ionic
Building Hybrid Apps with AngularJS and IonicBuilding Hybrid Apps with AngularJS and Ionic
Building Hybrid Apps with AngularJS and Ionic
 
Cognitive Mapping of Brickfield
Cognitive Mapping of Brickfield Cognitive Mapping of Brickfield
Cognitive Mapping of Brickfield
 
Web Analytics Wednesday - Session Replay Tools are Vital
Web Analytics Wednesday - Session Replay Tools are VitalWeb Analytics Wednesday - Session Replay Tools are Vital
Web Analytics Wednesday - Session Replay Tools are Vital
 
Report on Softener
Report on SoftenerReport on Softener
Report on Softener
 
Workshop #14: Behaviour, government policy and me: applying behavioural insig...
Workshop #14: Behaviour, government policy and me: applying behavioural insig...Workshop #14: Behaviour, government policy and me: applying behavioural insig...
Workshop #14: Behaviour, government policy and me: applying behavioural insig...
 
Ooh overview 1_p2016_12092016
Ooh overview 1_p2016_12092016Ooh overview 1_p2016_12092016
Ooh overview 1_p2016_12092016
 
Workshop #13: Scenario Based Design_handoutsA
Workshop #13: Scenario Based Design_handoutsAWorkshop #13: Scenario Based Design_handoutsA
Workshop #13: Scenario Based Design_handoutsA
 
Building Technology 1
Building Technology 1Building Technology 1
Building Technology 1
 
53 Claves para conocer Machine Learning
53 Claves para conocer Machine Learning53 Claves para conocer Machine Learning
53 Claves para conocer Machine Learning
 

More from DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Recently uploaded (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

Reaching a Billion Users with Hadoop

Editor's Notes

  1. 2 minsUser readinessReachTechnology enablers IntroductionWhy we are here- MSFT understands that Hadoop technology will transform business capabilities Hadoop in nascent stage, audience are early adopters MSFT’s role is to help make democratization of Hadoop happen
  2. 1.5 minsExample of democratization of technology throughout history: Cell Phone What was the trend that enabled this transformation? AMPS was a pioneering technology that helped drive mass market usage of cellular technology, but it had several serious issues by modern standards. It was unencrypted and easily vulnerable to eavesdropping via a scanner. These phones were eventually superseded by Digital AMPS (D-AMPS) in 1990, and AMPS service was shut down by most North American carriers by 2008. In the 1990s, the 'second generation' mobile phone systems emerged. Two systems competed for supremacy in the global market: GSM standard and CDMA standard. These differed from the previous generation by using digital instead of analog transmission, and also fast out-of-band phone-to-network signaling. When 2G emerged, mobile phone usage skyrocketed and has progressed from there with Mobile broadband data (3G) and eventually Native IP Networks (4G). What did we hit a billion cell phone users? In 2003 there were 1 billon cell phone users.  (A History of Popular Culture: More of Everything, Faster and Brighter’  Raymond F. Betts)1 Billions users in 2003
  3. 1.5minsExample of democratization of technology throughout history:TravelWhat was the trend that enabled this transformation? The introduction of the Commercial Jet Airliner in 1952 allowed safe, long distance, non-stop air travel across the Atlantic ocean. What was the point in time in which we hit a billion airline travelers?TBD
  4. 3 minsWhy Are We Here/Crossroads Ripeness for the democratization of Hadoop technology Adoption challengesMSFT solutionsEvery person in the business will benefit from the value the technology brings Adoption Blockers: Market Analysis Data Research Hadoop is perceived to deliver multiple benefits specifically around unstructured data, however, lack of in-house expertise stands out as the main impediment to its implementation.Top three concerns/blockers:Lack of in-house expertise 59%Lack of integration with existing BI tools 41%Lack of security & Complex to manage and administer 37% MSFT Solutions to blockers SQL Server- 46% share. Most widely deployed database on the planet. Office- Excel is the most widely used BI tool with over 1 billion people using Office Windows Azure Active Directory- Of all the organizations that have a directory, roughly 95% of them use Active DirectoryBlog: http://blogs.msdn.com/b/windowsazure/archive/2012/11/27/windows-azure-active-directory-processes-200-billion-authentications-connecting-people-data-and-devices-around-the-globe.aspx
  5. 2 minsMSFT investments and contributions to Hadoop We are starting at the bottom, in the source code. Making the source base work better not just for Windows, but for everyoneCommunity Contributions:10s of thousands of code line contributed (across all deliverables) 6000+ engineering hours contributed (since February 2012)Others:Apache Build/Verification Infrastructure:Working with Apache Infrastructure team & Hadoop Core PMC on donation of Azure VM’s to be used as Jenkins Servers for Continuous IntegrationInteractive Query: Contributing code and query processing experience to help with Hive query performance (Stinger, ORC & Tez projects)Hadoop on Windows (1.0 & 2.0):Contributed back our porting efforts for Hadoop on Windows including:Command-line scripts for the Hadoop surface areaMapping the HDFS permissions model to WindowsNative Task Controller for Windows Implementation of Hadoop native libraries for Windows (compression codecs, native I/O)ASV Driver:Contributed our FileSystem implementation for Azure Storage
  6. 2 minsThese contributions allow us to deliver great products to the market with HadoopHortonworks Data Platform on Windows (Highlight Hortonworks partnership)HDInight Service on Windows Azure
  7. 1 minMomentum: Here is what some of our customers are doing with BI Ascribe: http://www.microsoft.com/casestudies/Case_Study_Detail.aspx?CaseStudyID=710000002092 Ascribe created a BI solution that monitors infectious disease outbreaks on the national level, and also improves operations for local care providers.BI solution that helps healthcare providers detect, predict, and respond more quickly to outbreaks of infectious disease and other health threats.Delivering healthcare tools faster, speeding response, and providing actionable insight into large volumes of data.About: One of the UK’s leading suppliers of clinically focused IT solutions for the healthcare industry.Need: Provide rapid insight into large volumes of data from multiple sources to help clinicians improve services. Solution: Designed an end-to-end Big Data solution with BI tools based on Microsoft SQL Server 2012 and Windows Azure HDInsight Service.Benefits:Transforms healthcare with near-real-time access to information Speeds response to health threats Provides actionable insight into large volumes of structured and unstructured data
  8. 1 min343 Industries/Halo 4: http://www.microsoft.com/casestudies/Case_Study_Detail.aspx?CaseStudyID=710000002102 Helping to improve the gaming experience – we make weekly tweaks on the game itself based on heuristics gathered from player data and spot trends that even allow us to prevent cheaters.About: The 343 Industries development team hosts and manages Halo 4Need: The team needed to provide BI insight about the game to internal and external customers.Solution: The team implemented a solution that uses Windows Azure HDInsight Service, based on the Apache Hadoop data-processing framework, and Microsoft BI technologies.Benefits:Enhances user experience through increased agility and faster response times. Connects Halo 4 team directly to customers through weekly updates. Keeps playing field level by providing in-game analysis to detect cheaters. Facilitates customized campaigns aimed at retaining players. Opportunity to plug Mike Flasko’s “Master Chief Love Hive: Hadoop in the Cloud” session at 4:25-5:05pm.
  9. It's not enough to have great Hadoop runtime, incredibly scalable, easy access, usable and great TCO, etc. You have to get the value out of the end. Show a demo about how we are delivering end value from data managed by HadoopData Explorer & GeoFlowGeoFlow builds:Summit attendees time lapse Hadoop/Big data heat map Summit attendees + Hadoop/Big data together
  10. 1 min