Getting more out of your big data

554 views

Published on

A presentation we gave together with Microsoft at the latest inspirience days in Belgium.

Published in: Technology, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
554
On SlideShare
0
From Embeds
0
Number of Embeds
22
Actions
Shares
0
Downloads
17
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Getting more out of your big data

  1. 1. Cloud
  2. 2. Creating New Business OpportunitiesRevenueGrowthIncreases ad revenue byprocessing 3.5 billionevents per dayMassiveVolumesProcesses 464 billion rowsper quarter, with averagequery time under 10 secs.BusinessInnovation1Measures and ranks onlineuser influence by processing3 billion signals per dayCloudConnectivityConnects across 13 socialnetworks via the cloud fordata and API accessOperationalEfficienciesUses sentiment analysis andweb analytics for its internalcloudGEReal-TimeInsightImproves operationaldecision making for ITmanagers and users1. Klout Case Study: http://www.microsoft.com/casestudies/Microsoft-SQL-Server-2012-Enterprise/Klout/Data-Services-Firm-Uses-Microsoft-BI-and-Hadoop-to-Boost-Insight-into-Big-Data/710000000129
  3. 3. GartnerMcKinseyForresterResearch
  4. 4. Big Data Creating TransparencyEnablingExperimentation todiscoverneeds, exposevariability andimproveperformanceSegmenting populations tocustomize actionsReplacing/Supporting humandecision making withautomated algorithmsInnovating new businessmodels, products andservices with big data
  5. 5. HADOOP DataRepositoryInternal Data through APIsFetcher & Parser to enrich &validate with external dataData SilosHigh VelocitychangesCost of ChangesBusinessChallengesSolutionSolutionBenefitsFaster UpdatesFull dataset refresh possibleevery week instead of a fewtimes per yearCost ReductionSignificant reduction invalidation phone callsSolutionBenefits
  6. 6. HADOOP Real-TimeArchitectureScaleable architecture tosupport current and futurereal-time insight needsHigh Volume &High VelocityOld solution not able tohandle incoming volumes ofdata in timely mannerBusinessChallengesSolutionSolutionBenefitsFaster InsightsRealtime handling of thegrowing Volume & Velocity ofthe data. Adding at least 1TBper year.Grow with NeedsSolution scales with businessneeds without upfront costSolutionBenefits
  7. 7. HADOOP Data &Processing ClusterScalable Image LibraryProcessing cluster to processimagesHigh Variety &High VolumeAnalyses of 30.000+ giantimages of medical scans ofPancreasBusinessChallengesSolutionSolutionBenefitsFaster Research &Diagnostical InsightDiagnoses can be validatedagainst previous diagnosesNew research ideas can bechecked across full image setCost FriendlyReliabilityImprovementInexpensive data duplicationover HADOOP storage nodesprovides needed reliabilityimprovementsSolutionBenefits
  8. 8. Share your data with the world viaAzure MarketplaceEnrich with social media data via SocialAnalyticsAdvanced analytics with HadoopConnectingwith the World’s DataAnalyze Big Data with familiar toolsImmersive insights from any dataJavaScript based simple programmingImmersive Insight,Wherever you areSimplicity and manageability ofWindows to HadoopExtended data warehousing withHadoopScale & elasticity of cloudAny Data, Any SizeAnywhereHDInsight - Microsoft’s approach to Big Data
  9. 9. Hive Excel Plugin, ODBC Driver integratesHadoop to SQL Server AnalysisServices, PowerPivot, and Power ViewFamiliar BI tools with structured andunstructured dataBenefitsKeyFeatures
  10. 10. Integration with enterprise BIsolutionsMicrosoft SQL Server connectorfor Apache Hadoop with SQOOP(SQL to Hadoop)Integration withMicrosoft EnterpriseData WarehousesSQL Server Parallel Data Warehouseconnector for Apache Hadoop withSQOOPDeeper insights fromstructured andunstructured dataBenefitsKeyFeatures
  11. 11. Unlock rare patterns from bespoke datamining modelsSupport for open source predictiveanalytics tools such as R and MahoutNew business insights withpredictive analytics from MicrosoftHive ODBC Driver connects Hadoopto SQL Server Data Mining toolsBenefitsKeyFeatures
  12. 12. Mashing up of internal andpublic data sets via DataExplorerIntegration with third-partydata and servicesSharing of data andinsights through WindowsAzure MarketplaceIntegration with WindowsAzure MarketplaceBenefitsKeyFeatures
  13. 13. Integration of socialinformation withbusiness applicationsSocial AnalyticsStronger customerrelationshipsIntegration with socialmedia sitesModels augmented withpublicly available datafrom social media sitesBenefitsKeyFeatures
  14. 14. MicrosoftHDInsightEnterprise-classsecurityIntegration withMicrosoft SystemCenterIntegration withWindows Server®Active DirectorySimplifiedmanagement ofHadoop on WindowsSmart packaging ofHadoop on premisesFast deployment ofHadoop on Azure100% Microsoft SupportEasy setup on-premisesand in the cloudBenefitsKeyFeatures
  15. 15. MicrosoftHDInsightElastic peta-scaleanalytics on Microsoft’scloud platformHadoop-based Service onWindows Azure platformEnterprise-class Big Dataplatform on-premisesHadoop-based distributionon Windows ServerBenefitsKeyFeatures
  16. 16. 1. Take a large problem and divide it into sub-problems2. Perform the same function on all sub-problems3. Combine the output from all sub-problems……OutputMAPREDUCEDoWork() DoWork() DoWork()…
  17. 17. spanning relational and non-relational WorldsNON-RELATIONAL100111DATA MANAGEMENTSHAREAND GOVERNDISCOVERAND RECOMMENDTRANSFORMAND CLEANINSIGHTSDATA ENRICHMENTOPERATIONALSELF-SERVICE MOBILEPREDICTIVEREAL-TIMECOLLABORATIVEMARKETPLACEExternalDataandServicesRELATIONAL MULTIDIMENSIONAL STREAMING
  18. 18. ObjectivesStarter OfferStructured (+- 4/5weeks)engagement thatdemonstrates thecapabilities of theMicrosoft Big Dataplatform with a prototypeusing real customer dataWho DeliversMicrosoftConsultingServices &IndustryExpertsExpected OutcomeDefine Big DataCompany StrategyImplement Big DataPrototype solutionCustomerMeeting todiscuss BigData Needs& Scopingfor StarterOfferScoping
  19. 19. At your service

×