PARC host safety announcements
Stay in the designated event areas only; you are not allowed to go elsewhere in the
buildin...
Unlocking Data ROI
Fast
CXO Big Data Seminar
September 26, 2012
By Jim Kaskade
CEO, Infochimps
Infochimps
• Managed Big Data Services
• Elastic & Secure Private &
Public Clouds
• Across a Global Network of
Trusted Dat...
“ ”
8/17/2013 Infochimps Confidential 5
Information is powerful.
But it is how we use is it that will define us.
1.
8/17/2013 Infochimps Confidential 6
What is Big Data?
“data sets so large and complex that it
becomes difficult to proc...
2.
8/17/2013 Infochimps Confidential 7
Is there a reference design?
“It depends on your application.”
Reference Design
8
UI Framework (Hue)
Other App Services: Search, Reporting,
Visualization, Auto Response
Analytics (Mahou...
3.
8/17/2013 Infochimps Confidential 9
What are typical Big Data Applications?
“It depends on what data you have.”
Big Data Applications
10
ADVANCEDANALYTICS
1 2
Core Use Cases (1 & 2)
Applied Across Verticals
DATAPROCESSING
Social Netwo...
4.
8/17/2013 Infochimps Confidential 11
What is the Big Data Process?
“It depends on how much you’re will to pay.”
Process
8/17/2013 12Infochimps Confidential
BUSINESS
DISCOVERY
INFORMATION
DISCOVERY
LOGICAL
DATAMODEL
PHYSICAL
DATAMODEL
...
5.
8/17/2013 Infochimps Confidential 13
What are my deployment options?
“Well, that also depends on…”
? Time to Market
? Cost
? Expertise
? Culture
? Data Governance / Security
? Data Volume & Velocity
? Performance
? Legacy...
Deployment Options
8/17/2013 Infochimps Confidential 15
Private Cloud
(On-Premise)
Virtual Private Cloud
(3rd Party Truste...
8/17/2013 Infochimps Confidential 16
Private Big Data
Cloud (You
Manage)
Virtual Private Big
Data Cloud (You
Manage)
Publi...
Unlocking Data ROI
Fast
CXO Big Data Seminar
September 26, 2012
By Jim Kaskade
CEO, Infochimps
Poll: Please Describe Your Position At Your Or...
To view the poll live, enter slideshow
This object is the poll's placeho...
Poll: Do you currently have a big data project...
To view the poll live, enter slideshow
This object is the poll's placeho...
Poll: Which deployment approach could benefit ...
To view the poll live, enter slideshow
This object is the poll's placeho...
Poll: What mix best represents your current IT...
To view the poll live, enter slideshow
This object is the poll's placeho...
• Surendra Reddy, Big Data CTO, PARC
• Ron Bodin, CEO, Think Big
• Charles Fan,SVP Strategic R&D, VMware
• Eddie Sattery, ...
Infochimps CxO Seminar @ PARC
Upcoming SlideShare
Loading in …5
×

Infochimps CxO Seminar @ PARC

943 views

Published on

A presentation on Big DAta Analytics at PARC, a Xerox company

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
943
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
12
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • “A Data Intelligence Delivery Network For Data-Driven Enterprises”Tell a good storyMost of the articles on pitching are generally right about the topics, even if they miss the nuance (sell, don’t explain). But don’t take any template as graven in stone. Your story may require a moderate or even a dramatic variation on the list of slides below. You may need to explain the solution before you can explain the market; or if you are in a crowded space you may need to explain why you are different than everyone else early on in the conversation; or you may want to drop some very impressive brand-name customers before you explain your product or your market. The one thing you may not do is expand the number of slides to 20 (or 30 or 50)! Other than that, let the specifics of your situation dictate the flow of your slides.Nevertheless, it is useful to have a guide. With the caveats above in mind, here is a basic outline for your pitch:Cover Slide: Company name, location, tagline, presenter’s name and title.If there are multiple team members participating in the pitch, put names on the next slide instead. Key objective: Everyone in the room should know the basic idea and value proposition of the company, including the target market, before the next slide is shown. All the words should not be on this slide, but with one or two sentences orally, reinforcing and extending the tagline, everyone should have a foundation for what is to come. Cardinal sin: Launching into your presentation with an investor at the table thinking, “I wonder what these guys do?”
  • Slide 1: Company Overview.The best way to give an overview of your company is to state concisely your core value proposition: What unique benefit will you provide to what set of customers to address what particular need? Then you can add three or four additional dot points to clarify your target markets, your unique technology/solution, and your status (launch date, current customers, revenue rate, pipeline, funding needed). Key objective: Flesh out the foundation you established at the beginning. At this point, no one should have any question about what it is that your company does, or plans to do. The only questions that should remain are the details of how you are going to do it. Another key objective you should have achieved by this point in your presentation is to make sure that if there are some compelling brand names associated with your company (customers, partners, investors, advisors), your audience knows about them. Feel free to drop names early and often—starting with your first email introduction to the investor. Brand name relationships build your credibility, but do not overstate them if they are tenuous.Use-cases:RunaAutomated real-time online offers - monitors and analyzes shopper behavior on web, and then makes each shopper a personalized offerInfochimps helps Runa configure and manage their entire production system, including Hadoop, HBase, messaging, monitoring, and more. (using Ironfan – Robert Berger)SpringSenseintelligence enterprise document searchSpringSense uses Infochimps to scale its award-winning technology to process the full Wikipedia corpus - over 4 million articles - for rapid meaning-based search. (using Ironfan)Black LocusCompetitive pricing analytics platform for enterprisesIngesting millions of product pricing data points from the web, analyzing historical and current data, presenting analytic results in real-time.Koupon MediaMobile coupon platformFor every user who enters into the mobile coupon system, more demographic information is needed to help target the right coupon to the right customer and in real-time.BlueCavaBehavioral target marketing platform - joins customers across any/all devices & augments w/ demograph / behavioral for targeted advertisingFor every user who enters into the mobile coupon system, more demographic information is needed to help target the right coupon to the right customer and in real-time.A new Attribution data product (using Hadoop) which determines correlations between customer purchases / conversions to advertising impressions and website behavior.InfoMartLargest media company in Canada transforming business from print to digital – focus is on engaging and better understanding their audiencesSocial media listening platform which consists of both real-time social feed search / analytics / reporting for InfoMart and their customers + historic analysis / trending research.
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March….and quoted an Kenyan Farmer.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • Example components of the platformSASL / Kerberos
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • Slide 2: Problem/Opportunity.You need to make it clear that there is a big, important problem (current or emerging) that you are going to solve, or opportunity you are going to exploit, and that you understand the market dynamics surrounding the opportunity—why does this situation exist and persist, and why is it only now that it can be addressed? Show that you really understand the very particular market segment you are targeting, and frame your market analysis according to the specific problem and solution you are laying out. In some cases, however, the problem you are attacking is so obvious and clear that you can drop this slide altogether. You do not have to tell investors that there are a lot of cell phones out there, or that teenagers like to socialize. Save yourself, and the investors, the pain of restating the obvious.
  • AvinashKaushik gave a talk at Strata 2012 in Santa Clara in March.If you listen to all the hype of Big Data, it solves for the first problem.If you listen to all the vendors, there is a lot of emphasis on the first part (perhaps Infochimps included), and very little on the second.I think that’s because we don’t exactly know how to truly empower the organization to interact directly with any/all data available.It’s too expensive, risky, complex.
  • “A Data Intelligence Delivery Network For Data-Driven Enterprises”Tell a good storyMost of the articles on pitching are generally right about the topics, even if they miss the nuance (sell, don’t explain). But don’t take any template as graven in stone. Your story may require a moderate or even a dramatic variation on the list of slides below. You may need to explain the solution before you can explain the market; or if you are in a crowded space you may need to explain why you are different than everyone else early on in the conversation; or you may want to drop some very impressive brand-name customers before you explain your product or your market. The one thing you may not do is expand the number of slides to 20 (or 30 or 50)! Other than that, let the specifics of your situation dictate the flow of your slides.Nevertheless, it is useful to have a guide. With the caveats above in mind, here is a basic outline for your pitch:Cover Slide: Company name, location, tagline, presenter’s name and title.If there are multiple team members participating in the pitch, put names on the next slide instead. Key objective: Everyone in the room should know the basic idea and value proposition of the company, including the target market, before the next slide is shown. All the words should not be on this slide, but with one or two sentences orally, reinforcing and extending the tagline, everyone should have a foundation for what is to come. Cardinal sin: Launching into your presentation with an investor at the table thinking, “I wonder what these guys do?”
  • Poll: Please Describe Your Position At Your Or...



    Press F5 or enter presentation mode to view the poll

    In an emergency during your presentation, if the poll isn't showing, navigate to this link in your web browser:

    http://www.polleverywhere.com/multiple_choice_polls/MjExMTU4NTMzNgIf you like, you can use this slide as a template for your own voting slides. You might use a slide like this if you feel your audience would benefit from the picture showing a text message on a phone.
  • Poll: Do you currently have a big data project...



    Press F5 or enter presentation mode to view the poll

    In an emergency during your presentation, if the poll isn't showing, navigate to this link in your web browser:

    http://www.polleverywhere.com/multiple_choice_polls/MTgzMTA2NDg0OQIf you like, you can use this slide as a template for your own voting slides. You might use a slide like this if you feel your audience would benefit from the picture showing a text message on a phone.
  • Poll: Which deployment approach could benefit ...



    Press F5 or enter presentation mode to view the poll

    In an emergency during your presentation, if the poll isn't showing, navigate to this link in your web browser:

    http://www.polleverywhere.com/multiple_choice_polls/MTY2NTQ3NzczNgIf you like, you can use this slide as a template for your own voting slides. You might use a slide like this if you feel your audience would benefit from the picture showing a text message on a phone.
  • Poll: What mix best represents your current IT...



    Press F5 or enter presentation mode to view the poll

    In an emergency during your presentation, if the poll isn't showing, navigate to this link in your web browser:

    http://www.polleverywhere.com/multiple_choice_polls/MTQ4NDE5NTk2NwIf you like, you can use this slide as a template for your own voting slides. You might use a slide like this if you feel your audience would benefit from the picture showing a text message on a phone.
  • Infochimps CxO Seminar @ PARC

    1. 1. PARC host safety announcements Stay in the designated event areas only; you are not allowed to go elsewhere in the building without a PARC escort. No smoking in the building or within 20 feet of the entrances; restrict smoking to the outdoor area near the designated ash cans. In case of an emergency (evacuation as signaled by alarm horns and flashing lights or by an earthquake), please: – Walk quickly but don’t run through the foyer and up the outside stairs to the upper parking lot. (If the foyer doors are blocked, take the nearest stairway up to the 3rd floor, then out to the upper parking lot.) – Do not leave the site before checking in with your group! Gather with your group and notify the Emergency Response Team (in orange vests) about missing members after checking lists or roll call. – Do not go back into the building until the Emergency Response Team gives the “All Clear”. If anyone in your group needs medical attention please: – Call Security at x2222 from a wall phone or 650-812-4000 from a cell phone. – Security will contact the PARC Emergency Response Team and will call 911 for paramedics. This is the fastest way for you to get trained medical assistance.
    2. 2. Unlocking Data ROI Fast CXO Big Data Seminar September 26, 2012 By Jim Kaskade CEO, Infochimps
    3. 3. Infochimps • Managed Big Data Services • Elastic & Secure Private & Public Clouds • Across a Global Network of Trusted Data Center & Cloud Service Providers • With Batch & Real-time Analytic Framework • Supporting Structured & Unstructured Data Big Data Intelligence Delivery Network Global Network Of Data Center Infrastructure Providers Enterprise Customers Hadoop NoSQL Data Delivery Data Lang App BI Analytics Sys BI Infra Delivery 8/17/2013 4Infochimps Confidential
    4. 4. “ ” 8/17/2013 Infochimps Confidential 5 Information is powerful. But it is how we use is it that will define us.
    5. 5. 1. 8/17/2013 Infochimps Confidential 6 What is Big Data? “data sets so large and complex that it becomes difficult to process using on-hand database management tools.”
    6. 6. 2. 8/17/2013 Infochimps Confidential 7 Is there a reference design? “It depends on your application.”
    7. 7. Reference Design 8 UI Framework (Hue) Other App Services: Search, Reporting, Visualization, Auto Response Analytics (Mahout, Revolution R, Custom Algorithms) Big Data Applications Oozie (Workflow) Oozie (Scheduling) Hive (Metadata) Languages (Pig, Hive) Data Processing (Map Reduce) Distributed Storage (HDFS) Read / Write NoSQL (Hbase Cassandra) Data Integration (Flume, Sqoop, Storm, Talend) Infrastructure As A Service (IaaS) – Cloud or Commodity HW App-Dev API Coordination (Zookeeper) ApplicationLifecycleManagement Security(Kerberos) SystemsMonitoring&Management(Gangila,Nagios,Ambari) Complex Event Processing (Espers) Metadata Services (Hcatalog)
    8. 8. 3. 8/17/2013 Infochimps Confidential 9 What are typical Big Data Applications? “It depends on what data you have.”
    9. 9. Big Data Applications 10 ADVANCEDANALYTICS 1 2 Core Use Cases (1 & 2) Applied Across Verticals DATAPROCESSING Social Network Analysis Content Optimization Network Analytics Loyalty & Promotions Analysis Fraud Analysis Entity Analysis Clickstream Sessionization Engagement Mediation Data Factory Trade Reconciliation Signal Intelligence VERTICAL Web Media Telco Retail Financial Federal Bioinformatics Genome MappingSequencing Analysis
    10. 10. 4. 8/17/2013 Infochimps Confidential 11 What is the Big Data Process? “It depends on how much you’re will to pay.”
    11. 11. Process 8/17/2013 12Infochimps Confidential BUSINESS DISCOVERY INFORMATION DISCOVERY LOGICAL DATAMODEL PHYSICAL DATAMODEL SYSTEM STAGING DATA INGESTION & TRANSFORMATION APPLICATION DEVELOPMENT Data Warehouse Project = 12-24 months BUSINESS DISCOVERY INFORMATION DISCOVERY SYSTEM STAGING DATA INGESTION & TRANSFORMAT ION APP DEV On-Premise Big Data Project = 6-12 months PRODUCTION TUNING PRODUCTION TUNING ANALYTICS ANALYTICS
    12. 12. 5. 8/17/2013 Infochimps Confidential 13 What are my deployment options? “Well, that also depends on…”
    13. 13. ? Time to Market ? Cost ? Expertise ? Culture ? Data Governance / Security ? Data Volume & Velocity ? Performance ? Legacy Infrastructure Integration 8/17/2013 Infochimps Confidential 14
    14. 14. Deployment Options 8/17/2013 Infochimps Confidential 15 Private Cloud (On-Premise) Virtual Private Cloud (3rd Party Trusted Data Center) Public Cloud Provider
    15. 15. 8/17/2013 Infochimps Confidential 16 Private Big Data Cloud (You Manage) Virtual Private Big Data Cloud (You Manage) Public Big Data Cloud (You Manage) Virtual Private Big Data Cloud (Managed Service) Public Big Data Cloud (Managed Service) $Cost Security Risk Time To Market You Manage “They” Manage $ $ $ $ $
    16. 16. Unlocking Data ROI Fast CXO Big Data Seminar September 26, 2012 By Jim Kaskade CEO, Infochimps
    17. 17. Poll: Please Describe Your Position At Your Or... To view the poll live, enter slideshow This object is the poll's placeholder mode by pressing F5 Embedded polls only work in PowerPoint for Windows
    18. 18. Poll: Do you currently have a big data project... To view the poll live, enter slideshow This object is the poll's placeholder mode by pressing F5 Embedded polls only work in PowerPoint for Windows
    19. 19. Poll: Which deployment approach could benefit ... To view the poll live, enter slideshow This object is the poll's placeholder mode by pressing F5 Embedded polls only work in PowerPoint for Windows
    20. 20. Poll: What mix best represents your current IT... To view the poll live, enter slideshow This object is the poll's placeholder mode by pressing F5 Embedded polls only work in PowerPoint for Windows
    21. 21. • Surendra Reddy, Big Data CTO, PARC • Ron Bodin, CEO, Think Big • Charles Fan,SVP Strategic R&D, VMware • Eddie Sattery, Chief Big Data Evangelist, Splunk • Jim Kaskade, CEO, Infochimps

    ×