SlideShare a Scribd company logo
1 of 19
Download to read offline
THIN SLICING A BLACK
      SWAN: A SEARCH FOR
      THE UNKNOWNS



      Michele Chubirka
      Transaction Network Services/
      Packetpushers.net




Session ID: MASH-­‐F41A	
  
Session Classification: Intermediate	
  
Something’s Broken

In Verizon’s 2012 Data
Breach Investigations
Report, it was found that
across organizations, an
external party discovers
92% of breaches.
From Compromise To Discovery




►  We believe we can solve the issue of the unknowns,
   intrusions, with more data.
►  The more information we have, the less we know.
►  This makes us no better than security archeologists.
The Black Swan Event

►  An unknown unknown.
►  Can’t be predicted by
   probability theories.
►  Rationalized after the fact.
►  How often do we try to
   predict the Black Swan
   Event in security and fail?
Information Gluttony?

“Military drone operators amass untold amounts of data
that never is fully analyzed because it is simply too much.”

Michael W. Isherwood, defense analyst and former Air
Force fighter pilot.
Digital Kudzu

•  From beginning of recorded time to 2003 - five exabytes
   of information.

•  2011 - that much created every two days.

•  2012 - prediction is every 10 minutes.
Current Solutions

►  SIEMs: never gets fully implemented.
►  Predictions using Logistic Regression/Bayesian
   Probability.
►  Huge amounts of data, not enough time.
►  “Open world” problem using “closed world” assumptions.
►  More staff, more money.
Alternative Model: Thin Slicing

“…the ability of our unconscious to find patterns in
situations and behavior based on very narrow slices of
experience.”

Malcolm Gladwell, Blink
Case Study: A Hospital in Trouble

►  Cook County Hospital struggled with identifying patients
   in danger of an imminent heart attack.
►  Coronary care unit was overwhelmed.
►  Public hospital, limited resources.
Applied Thin-Slicing

►  Lee Goldman, a cardiologist, created a protocol based
   upon an algorithm developed in partnership with
   mathematicians.
►  After two years of using a decision tree, hospital staff
   were 70% more effective at recognizing patients at risk.
►  Less information led to greater success.
►  Technique used by first-responders every day.
Fast and Frugal Trees
320                                                              LUAN, SCHOOLER, AND GIGERENZER


a                                            ST segment                           b
                                              change?                                                                      Did prosecution request
                                                                                                                        conditional bail or oppose bail?

                               No                                     Yes                                                                                   Yes
                                                                                                                   No or N.A.
                                                                      Coronary                                                                             Punitive
                  Chief complaint of                                  Care Unit
                     chest pain?                                                                            Did previous court impose
                                                                                                         conditions or remand in custody?
             No
                                       Yes                                                                                                   Yes
                                                                                                    No or N.A.
     Regular
                                                                                                                                            Punitive
    Nursing Bed
                                    Any other factor?                                       Did police impose conditions or
                                (NTG, MI, ST↔, ST↑↓, T↑↓)                                        remand in custody?

                               No                          Yes                    No or N.A.                           Yes

                       Regular                            Coronary
                      Nursing Bed                         Care Unit                   Nonpunitive                     Punitive


                       Figure 4. Two examples of fast-and-frugal trees (FFTs) applied to large world problems. The left tree (a) is
                       designed to help emergency room doctors decide whether to send a patient with severe chest pain to the Coronary
                       Care Unit (CCU) or a regular nursing bed (Green & Mehr, 1997). The right tree (b) is a model of how British
                       judges decide whether to make a punitive bail decision (Dhami, 2003).



(1997) found that, compared with a logistic regression model that                          Tree models of categorization and decision making have been
uses eight cues simultaneously to make a decision, this FFT had a                       studied in a variety of disciplines, such as medicine, applied
higher overall predictive accuracy, in addition to its advantages in                    statistics, computer science, and psychology (e.g., Breiman, Fried-
Method: Resource Description
 Framework (RDF)



►  Semantic Web technology.
►  Queries based on relationships or mental associations.
►  Graphs treat each packet from capture file as a discrete
   event with properties.
►  TCP header info in a metadata model.
►  Model replicates human cognitive economy.
Thin-Slicing with SPARQL

►  SPARQL query language uses a concise approach for
   quickly traversing large data sets while capturing
   similarities between packets as generalizations.
►  RDF statement contains a subject, predicate and an
   object.
   ►  Subject defines the event.
   ►  Predicate defines a characteristic or property.
   ►  Object contains the value for the predicate.
Example: Building A Query
sparql select * {
?s
?p
?o.};

sparql select *{
?e1
<http://www.rrecktek.com/demo/src>
?ip1.};
Example

•  All source IPs and their destination IPs.
•  For each source, count how many times it went to a
   destination.
•  Report source destination and count.

sparql SELECT ?src ?dst (count (?dst) as ?count) {
?e1 <http://www.rrecktek.com/demo/src> ?src.
?e1 <http://www.rrecktek.com/demo/dst> ?dst.
 } ORDER BY DESC (?count);
SPARQL web
interface
We Can’t Fight All Unknowns

►  What we can do
   ►  Build strong infrastructures minimizing technical debt.
   ►  Add the equivalent of air bags to the architecture for when
      intrusions occur.
   ►  Recognize signature limitations.
   ►  Investigate the creation of real-time fast and frugal trees.

   Our patient is dying on the table. It’s up to us to change the
   outcome.
Thanks!

►  Michele Chubirka
  Twitter @MrsYisWhy
  networksecurityprincess@gmail.com


►  RDF/SPARQL contribution courtesy of Ronald P. Reck
  rreck@rrecktek.com
References
"Eclectic Tech." Semantic Web Introduction. N.p., n.d. Web. 20 Dec. 2012.
Erwin, Sandra I. "Too Much Information, Not Enough Intelligence." National Defense Magazine. N.p.,
May 2012. Web. <http://www.nationaldefense.org>.
Gigerenzer, Gerd. Gut Feelings: The Intelligence of the Unconscious. New York: Viking, 2007. Print.
Gladwell, Malcolm. Blink: The Power of Thinking without Thinking. New York: Little, Brown and, 2005.
Print.
Luan, Shenghua, Lael J. Schooler, and Gerd Gigerenzer. "A Signal-detection Analysis of Fast-and-
frugal Trees." Psychological Review 118.2 (2011): 316-38. Print.
Marewski, Julian N., PhD, and Gerd Gigerenzer, PhD. "Heuristic Decision Making in Medicine."
Dialogues in Clinical Neuroscience 14.1 (2012): 77-89. Print.
Messmer, Ellen. "SANS Warns IT Groups Fail to Focus on Logs for Security Clues." TechWorld. IDG,
May 2012. Web.
"RDF." -Semantic Web Standards. W3C, n.d. Web. 02 Jan. 2013.
"Resource Description Framework (RDF)Model and Syntax." RDF Model and Syntax. W3C, n.d. Web.
02 Jan. 2013.
Rieland, Randy. "Big Data or Too Much Information?" Innovations. Smithsonian, 7 May 2012. Web.
"Semantic Web Standards." W3C. W3C, n.d. Web. 02 Jan. 2013.
Taleb, Nassim. The Black Swan: The Impact of the Highly Improbable. New York: Random House,
2007. Print.
Turek, Dave. "The Case Against Digital Sprawl." The Management Blog. Bloomberg Businessweek, 2
May 2012. Web.
Verizon 2012 Data Breach Investigation Report. Rep. N.p.: Verizon, n.d. Print.

More Related Content

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

RSA Security Conference 2013: Thin Slicing a Black Swan

  • 1. THIN SLICING A BLACK SWAN: A SEARCH FOR THE UNKNOWNS Michele Chubirka Transaction Network Services/ Packetpushers.net Session ID: MASH-­‐F41A   Session Classification: Intermediate  
  • 2. Something’s Broken In Verizon’s 2012 Data Breach Investigations Report, it was found that across organizations, an external party discovers 92% of breaches.
  • 3. From Compromise To Discovery ►  We believe we can solve the issue of the unknowns, intrusions, with more data. ►  The more information we have, the less we know. ►  This makes us no better than security archeologists.
  • 4. The Black Swan Event ►  An unknown unknown. ►  Can’t be predicted by probability theories. ►  Rationalized after the fact. ►  How often do we try to predict the Black Swan Event in security and fail?
  • 5. Information Gluttony? “Military drone operators amass untold amounts of data that never is fully analyzed because it is simply too much.” Michael W. Isherwood, defense analyst and former Air Force fighter pilot.
  • 6. Digital Kudzu •  From beginning of recorded time to 2003 - five exabytes of information. •  2011 - that much created every two days. •  2012 - prediction is every 10 minutes.
  • 7. Current Solutions ►  SIEMs: never gets fully implemented. ►  Predictions using Logistic Regression/Bayesian Probability. ►  Huge amounts of data, not enough time. ►  “Open world” problem using “closed world” assumptions. ►  More staff, more money.
  • 8. Alternative Model: Thin Slicing “…the ability of our unconscious to find patterns in situations and behavior based on very narrow slices of experience.” Malcolm Gladwell, Blink
  • 9. Case Study: A Hospital in Trouble ►  Cook County Hospital struggled with identifying patients in danger of an imminent heart attack. ►  Coronary care unit was overwhelmed. ►  Public hospital, limited resources.
  • 10. Applied Thin-Slicing ►  Lee Goldman, a cardiologist, created a protocol based upon an algorithm developed in partnership with mathematicians. ►  After two years of using a decision tree, hospital staff were 70% more effective at recognizing patients at risk. ►  Less information led to greater success. ►  Technique used by first-responders every day.
  • 11. Fast and Frugal Trees 320 LUAN, SCHOOLER, AND GIGERENZER a ST segment b change? Did prosecution request conditional bail or oppose bail? No Yes Yes No or N.A. Coronary Punitive Chief complaint of Care Unit chest pain? Did previous court impose conditions or remand in custody? No Yes Yes No or N.A. Regular Punitive Nursing Bed Any other factor? Did police impose conditions or (NTG, MI, ST↔, ST↑↓, T↑↓) remand in custody? No Yes No or N.A. Yes Regular Coronary Nursing Bed Care Unit Nonpunitive Punitive Figure 4. Two examples of fast-and-frugal trees (FFTs) applied to large world problems. The left tree (a) is designed to help emergency room doctors decide whether to send a patient with severe chest pain to the Coronary Care Unit (CCU) or a regular nursing bed (Green & Mehr, 1997). The right tree (b) is a model of how British judges decide whether to make a punitive bail decision (Dhami, 2003). (1997) found that, compared with a logistic regression model that Tree models of categorization and decision making have been uses eight cues simultaneously to make a decision, this FFT had a studied in a variety of disciplines, such as medicine, applied higher overall predictive accuracy, in addition to its advantages in statistics, computer science, and psychology (e.g., Breiman, Fried-
  • 12. Method: Resource Description Framework (RDF) ►  Semantic Web technology. ►  Queries based on relationships or mental associations. ►  Graphs treat each packet from capture file as a discrete event with properties. ►  TCP header info in a metadata model. ►  Model replicates human cognitive economy.
  • 13. Thin-Slicing with SPARQL ►  SPARQL query language uses a concise approach for quickly traversing large data sets while capturing similarities between packets as generalizations. ►  RDF statement contains a subject, predicate and an object. ►  Subject defines the event. ►  Predicate defines a characteristic or property. ►  Object contains the value for the predicate.
  • 14. Example: Building A Query sparql select * { ?s ?p ?o.}; sparql select *{ ?e1 <http://www.rrecktek.com/demo/src> ?ip1.};
  • 15. Example •  All source IPs and their destination IPs. •  For each source, count how many times it went to a destination. •  Report source destination and count. sparql SELECT ?src ?dst (count (?dst) as ?count) { ?e1 <http://www.rrecktek.com/demo/src> ?src. ?e1 <http://www.rrecktek.com/demo/dst> ?dst. } ORDER BY DESC (?count);
  • 17. We Can’t Fight All Unknowns ►  What we can do ►  Build strong infrastructures minimizing technical debt. ►  Add the equivalent of air bags to the architecture for when intrusions occur. ►  Recognize signature limitations. ►  Investigate the creation of real-time fast and frugal trees. Our patient is dying on the table. It’s up to us to change the outcome.
  • 18. Thanks! ►  Michele Chubirka Twitter @MrsYisWhy networksecurityprincess@gmail.com ►  RDF/SPARQL contribution courtesy of Ronald P. Reck rreck@rrecktek.com
  • 19. References "Eclectic Tech." Semantic Web Introduction. N.p., n.d. Web. 20 Dec. 2012. Erwin, Sandra I. "Too Much Information, Not Enough Intelligence." National Defense Magazine. N.p., May 2012. Web. <http://www.nationaldefense.org>. Gigerenzer, Gerd. Gut Feelings: The Intelligence of the Unconscious. New York: Viking, 2007. Print. Gladwell, Malcolm. Blink: The Power of Thinking without Thinking. New York: Little, Brown and, 2005. Print. Luan, Shenghua, Lael J. Schooler, and Gerd Gigerenzer. "A Signal-detection Analysis of Fast-and- frugal Trees." Psychological Review 118.2 (2011): 316-38. Print. Marewski, Julian N., PhD, and Gerd Gigerenzer, PhD. "Heuristic Decision Making in Medicine." Dialogues in Clinical Neuroscience 14.1 (2012): 77-89. Print. Messmer, Ellen. "SANS Warns IT Groups Fail to Focus on Logs for Security Clues." TechWorld. IDG, May 2012. Web. "RDF." -Semantic Web Standards. W3C, n.d. Web. 02 Jan. 2013. "Resource Description Framework (RDF)Model and Syntax." RDF Model and Syntax. W3C, n.d. Web. 02 Jan. 2013. Rieland, Randy. "Big Data or Too Much Information?" Innovations. Smithsonian, 7 May 2012. Web. "Semantic Web Standards." W3C. W3C, n.d. Web. 02 Jan. 2013. Taleb, Nassim. The Black Swan: The Impact of the Highly Improbable. New York: Random House, 2007. Print. Turek, Dave. "The Case Against Digital Sprawl." The Management Blog. Bloomberg Businessweek, 2 May 2012. Web. Verizon 2012 Data Breach Investigation Report. Rep. N.p.: Verizon, n.d. Print.