SlideShare a Scribd company logo
1 of 21
Download to read offline
sqrrl  
Secure.	
  Scale.	
  Adapt	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
sqrrl  
Secure.	
  Scale.	
  Adapt.	
  
Adam	
  Fuchs,	
  CTO	
  
11	
  April,	
  2013	
  
2	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
Management
Ely Kahn
sqrrl VP BizDev,
White House
Investors
Adam
Fuchs
sqrrl CTO, NSA
Who  We  Are  
20+	
  years	
  of	
  combined	
  
Apache	
  Accumulo	
  
engineering	
  exper9se	
  
Mark
Terenzoni
sqrrl CEO, F5
•  Founded	
  July	
  2012	
  
•  Funded	
  August	
  2012	
  
•  Team	
  includes	
  former	
  Tech	
  
Director	
  of	
  Accumulo	
  at	
  NSA	
  and	
  
6	
  commiDers/contributors	
  	
  
3	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
3	
  
Our  Mission  
Security	
  
AdapGvity	
  Scalability	
  
4	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
4	
  
Apache  Accumulo  
"   Sorted, Distributed Key/Value Store
"   Based on Google’s Big Table Design
"   Built on Top of Apache Hadoop and Apache Zookeeper
"   Augments and Integrates With the Hadoop ecosystem
"   Originally developed at the National Security Agency, now
an Apache Software Foundation project
5	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
5	
  
Applica9ons	
  
Analy9cs	
  APIs	
  
Security	
  &	
  Access	
  
Controls	
  
Data	
  Integra9on	
  
Search,	
  Sta*s*cs,	
  Graph,	
  
Lucene,	
  SQL,	
  Custom	
  
Extensions	
  
IAM,	
  Encryp*on,	
  
DAM,	
  Secure	
  Code	
  
ETL,	
  Hadoop	
  
Accumulo	
  
Sqrrl  Enterprise  Architecture  
6	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
"   Start	
  small,	
  but	
  design	
  for	
  scalability	
  
–  One	
  applicaGon	
  first,	
  then	
  grow	
  to	
  hundreds	
  
–  One	
  gigabyte	
  first,	
  then	
  grow	
  to	
  petabytes	
  
"   Itera*ve	
  schema	
  refinement	
  
–  IniGally,	
  let	
  the	
  data	
  define	
  the	
  schema	
  
–  Refine	
  the	
  schema	
  in	
  bulk	
  as	
  you	
  beDer	
  understand	
  the	
  data	
  
–  Middle	
  ground	
  between	
  flat	
  files	
  and	
  complete	
  ontologies	
  
"   Discovery	
  analy*cs	
  as	
  applica*on	
  building	
  blocks	
  
–  Universal	
  search:	
  structured	
  and	
  unstructured	
  data,	
  across	
  data	
  sets,	
  low	
  latency	
  
–  Basic	
  staGsGcs:	
  aggregaGons	
  of	
  query	
  results,	
  parallelized,	
  low	
  latency,	
  to	
  support	
  big	
  
picture	
  analysis	
  
–  Graphs:	
  scalable	
  graph	
  analyGcs	
  for	
  analyzing	
  how	
  everything	
  is	
  connected	
  
"   Data-­‐centric	
  security	
  
–  Separate	
  modeling	
  of	
  security	
  and	
  analysis	
  
–  Simplifies	
  mulG-­‐tenancy	
  and	
  applicaGon	
  accreditaGon	
  
Big  Data  Lessons  Learned  
7	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
7	
  
Schema  Discovery  
8	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
The	
  future	
  of	
  Big	
  Data	
  
innovaGon	
  is	
  Apps,	
  built	
  on:	
  
•  Universal	
  Search	
  
•  Schema-­‐less	
  StaGsGcs	
  
•  Graphs	
  
•  IntuiGve	
  Languages	
  
•  Secure,	
  Scalable,	
  and	
  
Adaptable	
  plaorms	
  
Lightweight  Apps  
9	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
9	
  
Targeted  Analysis  
10	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
10	
  
Big-Picture  Analytics  
11	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
DefiniGon:	
  A	
  form	
  of	
  security	
  in	
  which	
  data	
  carries	
  
with	
  it	
  the	
  elements	
  of	
  provenance	
  that	
  are	
  
required	
  to	
  make	
  policy	
  decisions	
  on	
  its	
  
releasability.	
  
•  Separate	
  data	
  modeling	
  for	
  Security	
  and	
  Analysis	
  
•  Reusability	
  of	
  applicaGons	
  across	
  security	
  
domains	
  
•  Distributed	
  development	
  of	
  ingest	
  and	
  query	
  
applicaGons	
  
•  Supported	
  by	
  Accumulo’s	
  cell-­‐level	
  security	
  
Data-Centric  Security  
12	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
12	
  
Cell-Level  Security  
13	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
13	
  
Scalable  Data-Centric  Security  
Data	
   Labeler	
   Accumulo	
   Apps	
  
User	
  
ACributes	
  
Audits	
  
Policies	
  
HDFS,	
  
Zookeeper	
  
End	
  Users	
  
Auth.	
  
Service	
  
Policy	
  
Engine	
  
14	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
14	
  
Accumulo’s  Strengths  
"   Security	
  
–  Cell-­‐level	
  security	
  reduces	
  the	
  cost	
  of	
  applicaGon	
  development	
  in	
  the	
  
presence	
  of	
  complex	
  legal	
  or	
  policy	
  restricGons	
  on	
  data	
  use	
  
–  IAM	
  and	
  encrypGon	
  Ges	
  into	
  enterprise	
  security	
  standards	
  
	
  
"   Scalability	
  
–  Proven	
  reliability	
  and	
  performance	
  at	
  the	
  mulG-­‐petabyte	
  scale	
  
–  High-­‐performance	
  parallel	
  I/O	
  library	
  
	
  
"   Adap9vity	
  
–  Flexible	
  schema	
  support	
  to	
  quickly	
  ingest	
  new	
  data	
  sources	
  
–  Sorted	
  key/value	
  paradigm	
  supports	
  a	
  mulGtude	
  of	
  search	
  and	
  
analysis	
  applicaGons	
  
–  Server-­‐side	
  programming	
  framework	
  “iterator	
  trees”	
  support	
  best-­‐in-­‐
class	
  aggregaGon,	
  filtering,	
  and	
  complex	
  query	
  semanGcs	
  
15	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
15	
  
An	
  Accumulo	
  key	
  is	
  a	
  5-­‐tuple,	
  consis9ng	
  of:	
  	
  
	
  
"   Row:	
  Controls	
  Atomicity	
  
"   Column	
  Family:	
  Controls	
  Locality	
  	
  
"   Column	
  Qualifier:	
  	
  Controls	
  Uniqueness	
  
"   Visibility	
  Label:	
  	
  Controls	
  Access	
  
"   Timestamp:	
  	
  Controls	
  Versioning	
  
Row	
   Col.	
  Fam.	
   Col.	
  Qual.	
   Visibility	
   Timestamp	
   Value	
  
John	
  Doe	
   Notes	
   PCP	
   PCP_JD	
   20120912	
  
PaGent	
  suffers	
  
from	
  an	
  acute	
  …	
  
John	
  Doe	
   Test	
  Results	
   Cholesterol	
   JD|PCP_JD	
   20120912	
   183	
  
John	
  Doe	
   Test	
  Results	
   Mental	
  Health	
   JD|PSYCH_JD	
   20120801	
   Pass	
  
John	
  Doe	
   Test	
  Results	
   X-­‐Ray	
   JD|PHYS_JD	
   20120513	
   1010110110100…	
  
Accumulo	
  Key/Value	
  Example	
  
Accumulo  Key  Structure  
16	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
16	
  
Accumulo  Architecture  
Tablet	
  Server	
  
Tablet	
  
Tablet	
  Server	
  
Tablet	
  
Tablet	
  Server	
  
Tablet	
  
ApplicaGon	
  
Zookeeper	
  
Zookeeper	
  
Zookeeper	
  
Master	
  
HDFS	
  
Read/Write	
  
Store/Replicate	
  
Assign/Balance	
  
Delegate	
  
Authority	
  
Delegate	
  
Authority	
  
ApplicaGon	
  
ApplicaGon	
  
17	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
17	
  
Tablet  Data  Flow  
In-­‐Memory	
  
Map	
  
Write	
  Ahead	
  
Log	
  
(For	
  Recovery)	
  
Sorted,	
  
Indexed	
  
File	
  
Sorted,	
  
Indexed	
  
File	
  
Sorted,	
  
Indexed	
  
File	
  
Tablet	
  
Reads	
  
Iterator	
  
Tree	
  
Minor	
  
Compac<on	
  
Merging	
  /	
  Major	
  
Compac<on	
  
Iterator	
  
Tree	
  
Writes	
   Iterator	
  
Tree	
  
Scan	
  
Iterator  Framework  
18	
  
Secure.	
  	
  	
  	
  Scale.	
  	
  	
  	
  Adapt.	
  
Iterator	
  Opera9ons:	
  
	
  
"   File	
  Reads	
  
"   Block	
  Caching	
  
"   Merging	
  
"   DeleGon	
  
"   IsolaGon	
  
"   Locality	
  Groups	
  
"   Range	
  SelecGon	
  
"   Column	
  SelecGon	
  
"   Cell-­‐level	
  Security	
  
"   Versioning	
  
"   Filtering	
  
"   AggregaGon	
  
"   ParGGoned	
  Joins	
  
info@sqrrl.com  |  @sqrrl_inc  |  617.520.4375                          sqrrl  data,  INC.,    All  Rights  Reserved  
19	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
•  No	
  built-­‐in	
  secondary	
  
indices	
  
•  Sort	
  Order	
  ó	
  Index	
  
•  Balance	
  between	
  
ingest	
  and	
  query	
  
•  Avoid	
  introducing	
  
boDlenecks	
  
•  Preserve	
  cell-­‐level	
  
security	
  and	
  scalability	
  
Table  Design  
Table:	
  
Row:	
  
Column	
  
Family:	
  
Column	
  
Qualifier:	
  
Value:	
  
Forward	
  Index	
  
<UUID>	
  
<Type>	
  
<Field>	
  
<Term>	
  
Inverted	
  Index	
  
<Term>	
  
<Type>	
  +	
  
<Field>	
  
<UUID>	
  
<Digest	
  of	
  
Event>	
  
20	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
20	
  
Ecosystem  Architecture  
Apache	
  HDFS	
  
Apache	
  Accumulo	
  
Sqrrl	
  Enterprise	
  
Custom	
  Ingester	
  Web	
  Server	
  	
  Custom	
  AnalyGc	
  Map/Reduce	
  Task	
  
Sqrrl	
  API	
  over	
  Apache	
  Thrip	
  RPC	
  :	
  	
  
Hierarchical	
  Documents	
  +	
  Graphs,	
  Lucene	
  +	
  SQL	
  +	
  more	
  
Accumulo	
  RPC	
  :	
  Sorted	
  Key/Value	
  I/O	
  
Hadoop	
  RPC	
  :	
  File	
  I/O	
  	
  
21	
  
Sqrrl  Data,  Inc.    All  Rights  Reserved  
21	
  
sqrrl	
  data,	
  inc.	
  
275	
  Third	
  St.	
  
Cambridge,	
  MA	
  02142	
  
	
  
617-­‐902-­‐0784	
  
www.sqrrl.com	
  
@sqrrl_inc	
  
info@sqrrl.com	
  
Contact  

More Related Content

What's hot

Achieving HIPAA on GCP
Achieving HIPAA on GCPAchieving HIPAA on GCP
Achieving HIPAA on GCPIdan Tohami
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionSplunk
 
Getting Started with Splunk Enterprises
Getting Started with Splunk EnterprisesGetting Started with Splunk Enterprises
Getting Started with Splunk EnterprisesSplunk
 
Analyzing 1.2 Million Network Packets per Second in Real-time
Analyzing 1.2 Million Network Packets per Second in Real-timeAnalyzing 1.2 Million Network Packets per Second in Real-time
Analyzing 1.2 Million Network Packets per Second in Real-timeDataWorks Summit
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldDataWorks Summit
 
Oracle Database Appliance - Introduction in Cyprus
Oracle Database Appliance - Introduction in CyprusOracle Database Appliance - Introduction in Cyprus
Oracle Database Appliance - Introduction in CyprusAndy Panayiotou
 
Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_dataxband
 
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...DataWorks Summit
 
Deep Learning with Cloudera
Deep Learning with ClouderaDeep Learning with Cloudera
Deep Learning with ClouderaCloudera, Inc.
 
Reduccion TCO sistemas Integrados
Reduccion TCO sistemas IntegradosReduccion TCO sistemas Integrados
Reduccion TCO sistemas IntegradosFran Navarro
 
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a Service
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a ServiceAWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a Service
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a ServiceAmazon Web Services
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelDataWorks Summit
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera, Inc.
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsDataWorks Summit
 
Searching Encrypted Cloud Data: Academia and Industry Done Right
Searching Encrypted Cloud Data: Academia and Industry Done RightSearching Encrypted Cloud Data: Academia and Industry Done Right
Searching Encrypted Cloud Data: Academia and Industry Done RightSkyhigh Networks
 
Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSematext Group, Inc.
 
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIntel IT Center
 
Cloudera showcase c5.4
Cloudera showcase c5.4Cloudera showcase c5.4
Cloudera showcase c5.4Cloudera, Inc.
 

What's hot (20)

Using Graphs for Data Analysis
Using Graphs for Data AnalysisUsing Graphs for Data Analysis
Using Graphs for Data Analysis
 
Achieving HIPAA on GCP
Achieving HIPAA on GCPAchieving HIPAA on GCP
Achieving HIPAA on GCP
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Getting Started with Splunk Enterprises
Getting Started with Splunk EnterprisesGetting Started with Splunk Enterprises
Getting Started with Splunk Enterprises
 
Analyzing 1.2 Million Network Packets per Second in Real-time
Analyzing 1.2 Million Network Packets per Second in Real-timeAnalyzing 1.2 Million Network Packets per Second in Real-time
Analyzing 1.2 Million Network Packets per Second in Real-time
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected World
 
Oracle Database Appliance - Introduction in Cyprus
Oracle Database Appliance - Introduction in CyprusOracle Database Appliance - Introduction in Cyprus
Oracle Database Appliance - Introduction in Cyprus
 
Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_data
 
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...
Beyond Kerberos and Ranger - Tips to discover, track and manage risks in hybr...
 
Deep Learning with Cloudera
Deep Learning with ClouderaDeep Learning with Cloudera
Deep Learning with Cloudera
 
Reduccion TCO sistemas Integrados
Reduccion TCO sistemas IntegradosReduccion TCO sistemas Integrados
Reduccion TCO sistemas Integrados
 
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a Service
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a ServiceAWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a Service
AWS Public Sector Symposium 2014 Canberra | Secure Hadoop as a Service
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the experts
 
Searching Encrypted Cloud Data: Academia and Industry Done Right
Searching Encrypted Cloud Data: Academia and Industry Done RightSearching Encrypted Cloud Data: Academia and Industry Done Right
Searching Encrypted Cloud Data: Academia and Industry Done Right
 
Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL Backend
 
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
 
Cloudera showcase c5.4
Cloudera showcase c5.4Cloudera showcase c5.4
Cloudera showcase c5.4
 

Viewers also liked

Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo OverviewBill Havanki
 
Accumulo design
Accumulo designAccumulo design
Accumulo designscsorensen
 
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...Accumulo Summit
 
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]Accumulo Summit
 
Accumulo Summit 2016: Accumulo in the Enterprise
Accumulo Summit 2016: Accumulo in the EnterpriseAccumulo Summit 2016: Accumulo in the Enterprise
Accumulo Summit 2016: Accumulo in the EnterpriseAccumulo Summit
 
Apache Accumulo and the Data Lake
Apache Accumulo and the Data LakeApache Accumulo and the Data Lake
Apache Accumulo and the Data LakeAaron Cordova
 
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?Accumulo Summit
 
Large Scale Accumulo Clusters
Large Scale Accumulo ClustersLarge Scale Accumulo Clusters
Large Scale Accumulo ClustersAaron Cordova
 
Accumulo: A Quick Introduction
Accumulo: A Quick IntroductionAccumulo: A Quick Introduction
Accumulo: A Quick IntroductionJames Salter
 
Accumulo Summit 2016: Embedding Authenticated Data Structures in Accumulo
Accumulo Summit 2016: Embedding Authenticated Data Structures in AccumuloAccumulo Summit 2016: Embedding Authenticated Data Structures in Accumulo
Accumulo Summit 2016: Embedding Authenticated Data Structures in AccumuloAccumulo Summit
 
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]Accumulo Summit
 
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...Accumulo Summit
 
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...Accumulo Summit
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit
 
GeoMesa LocationTech DC
GeoMesa LocationTech DCGeoMesa LocationTech DC
GeoMesa LocationTech DCCCRinc
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech ProjectsJody Garnett
 
Apache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosApache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosJoe Stein
 
Introduction to Apache Accumulo
Introduction to Apache AccumuloIntroduction to Apache Accumulo
Introduction to Apache AccumuloAaron Cordova
 
An Introduction to Accumulo
An Introduction to AccumuloAn Introduction to Accumulo
An Introduction to AccumuloDonald Miner
 
Big Data in The Cloud: Architecting a Better Platform
Big Data in The Cloud: Architecting a Better PlatformBig Data in The Cloud: Architecting a Better Platform
Big Data in The Cloud: Architecting a Better PlatformAmazon Web Services
 

Viewers also liked (20)

Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo Overview
 
Accumulo design
Accumulo designAccumulo design
Accumulo design
 
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...
Accumulo Summit 2014: Four Orders of Magnitude: Running Large Scale Accumulo ...
 
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]
Accumulo Summit 2015: Tracing in Accumulo and HDFS [Internals]
 
Accumulo Summit 2016: Accumulo in the Enterprise
Accumulo Summit 2016: Accumulo in the EnterpriseAccumulo Summit 2016: Accumulo in the Enterprise
Accumulo Summit 2016: Accumulo in the Enterprise
 
Apache Accumulo and the Data Lake
Apache Accumulo and the Data LakeApache Accumulo and the Data Lake
Apache Accumulo and the Data Lake
 
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
Accumulo Summit 2014: Benchmarking Accumulo: How Fast Is Fast?
 
Large Scale Accumulo Clusters
Large Scale Accumulo ClustersLarge Scale Accumulo Clusters
Large Scale Accumulo Clusters
 
Accumulo: A Quick Introduction
Accumulo: A Quick IntroductionAccumulo: A Quick Introduction
Accumulo: A Quick Introduction
 
Accumulo Summit 2016: Embedding Authenticated Data Structures in Accumulo
Accumulo Summit 2016: Embedding Authenticated Data Structures in AccumuloAccumulo Summit 2016: Embedding Authenticated Data Structures in Accumulo
Accumulo Summit 2016: Embedding Authenticated Data Structures in Accumulo
 
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]
Accumulo Summit 2015: Accumulo In-Depth: Building Bulk Ingest [Sponsored]
 
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
 
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...
Accumulo Summit 2015: Real-Time Distributed and Reactive Systems with Apache ...
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
 
GeoMesa LocationTech DC
GeoMesa LocationTech DCGeoMesa LocationTech DC
GeoMesa LocationTech DC
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech Projects
 
Apache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosApache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on Mesos
 
Introduction to Apache Accumulo
Introduction to Apache AccumuloIntroduction to Apache Accumulo
Introduction to Apache Accumulo
 
An Introduction to Accumulo
An Introduction to AccumuloAn Introduction to Accumulo
An Introduction to Accumulo
 
Big Data in The Cloud: Architecting a Better Platform
Big Data in The Cloud: Architecting a Better PlatformBig Data in The Cloud: Architecting a Better Platform
Big Data in The Cloud: Architecting a Better Platform
 

Similar to Sqrrl real time_big_data_20130411

Hugaccumulo 121018192044-phpapp02
Hugaccumulo 121018192044-phpapp02Hugaccumulo 121018192044-phpapp02
Hugaccumulo 121018192044-phpapp02Sqrrl
 
Meetup presenation 06192013
Meetup presenation 06192013 Meetup presenation 06192013
Meetup presenation 06192013 Sqrrl
 
What_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cWhat_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cMaria Colgan
 
Getting Started with Splunk Enterprise Hands-On Breakout Session
Getting Started with Splunk Enterprise Hands-On Breakout SessionGetting Started with Splunk Enterprise Hands-On Breakout Session
Getting Started with Splunk Enterprise Hands-On Breakout SessionSplunk
 
Gp Introduction 200811
Gp Introduction 200811Gp Introduction 200811
Gp Introduction 200811iswaha
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionSplunk
 
Cloudera's Original Pitch Deck from 2008
Cloudera's Original Pitch Deck from 2008Cloudera's Original Pitch Deck from 2008
Cloudera's Original Pitch Deck from 2008Accel
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Leaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real WorldLeaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real WorldArmonDadgar
 
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsMySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsOlivier DASINI
 
Sqrrl Enterprise: Big Data Security Analytics Use Case
Sqrrl Enterprise: Big Data Security Analytics Use CaseSqrrl Enterprise: Big Data Security Analytics Use Case
Sqrrl Enterprise: Big Data Security Analytics Use CaseSqrrl
 
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Charlie Berger
 
stackArmor - FedRAMP and 800-171 compliant cloud solutions
stackArmor - FedRAMP and 800-171 compliant cloud solutionsstackArmor - FedRAMP and 800-171 compliant cloud solutions
stackArmor - FedRAMP and 800-171 compliant cloud solutionsGaurav "GP" Pal
 
Essentials of PCI Assessment
Essentials of PCI AssessmentEssentials of PCI Assessment
Essentials of PCI Assessment Gazzang
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Cloudera, Inc.
 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioAlluxio, Inc.
 
Splunk FISMA for Continuous Monitoring
Splunk FISMA for Continuous Monitoring Splunk FISMA for Continuous Monitoring
Splunk FISMA for Continuous Monitoring Greg Hanchin
 
Best Practices for implementing Database Security Comprehensive Database Secu...
Best Practices for implementing Database Security Comprehensive Database Secu...Best Practices for implementing Database Security Comprehensive Database Secu...
Best Practices for implementing Database Security Comprehensive Database Secu...Kal BO
 

Similar to Sqrrl real time_big_data_20130411 (20)

Hugaccumulo 121018192044-phpapp02
Hugaccumulo 121018192044-phpapp02Hugaccumulo 121018192044-phpapp02
Hugaccumulo 121018192044-phpapp02
 
Meetup presenation 06192013
Meetup presenation 06192013 Meetup presenation 06192013
Meetup presenation 06192013
 
What_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cWhat_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12c
 
Getting Started with Splunk Enterprise Hands-On Breakout Session
Getting Started with Splunk Enterprise Hands-On Breakout SessionGetting Started with Splunk Enterprise Hands-On Breakout Session
Getting Started with Splunk Enterprise Hands-On Breakout Session
 
Gp Introduction 200811
Gp Introduction 200811Gp Introduction 200811
Gp Introduction 200811
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Cloudera's Original Pitch Deck from 2008
Cloudera's Original Pitch Deck from 2008Cloudera's Original Pitch Deck from 2008
Cloudera's Original Pitch Deck from 2008
 
Sqrrl and Accumulo
Sqrrl and AccumuloSqrrl and Accumulo
Sqrrl and Accumulo
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Leaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real WorldLeaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real World
 
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsMySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
 
Sqrrl Enterprise: Big Data Security Analytics Use Case
Sqrrl Enterprise: Big Data Security Analytics Use CaseSqrrl Enterprise: Big Data Security Analytics Use Case
Sqrrl Enterprise: Big Data Security Analytics Use Case
 
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
 
stackArmor - FedRAMP and 800-171 compliant cloud solutions
stackArmor - FedRAMP and 800-171 compliant cloud solutionsstackArmor - FedRAMP and 800-171 compliant cloud solutions
stackArmor - FedRAMP and 800-171 compliant cloud solutions
 
Essentials of PCI Assessment
Essentials of PCI AssessmentEssentials of PCI Assessment
Essentials of PCI Assessment
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
 
Splunk FISMA for Continuous Monitoring
Splunk FISMA for Continuous Monitoring Splunk FISMA for Continuous Monitoring
Splunk FISMA for Continuous Monitoring
 
Encrypted Databases for Untrusted Cloud
Encrypted Databases for Untrusted CloudEncrypted Databases for Untrusted Cloud
Encrypted Databases for Untrusted Cloud
 
Best Practices for implementing Database Security Comprehensive Database Secu...
Best Practices for implementing Database Security Comprehensive Database Secu...Best Practices for implementing Database Security Comprehensive Database Secu...
Best Practices for implementing Database Security Comprehensive Database Secu...
 

More from Sqrrl

Transitioning Government Technology
Transitioning Government TechnologyTransitioning Government Technology
Transitioning Government TechnologySqrrl
 
Leveraging Threat Intelligence to Guide Your Hunts
Leveraging Threat Intelligence to Guide Your HuntsLeveraging Threat Intelligence to Guide Your Hunts
Leveraging Threat Intelligence to Guide Your HuntsSqrrl
 
How to Hunt for Lateral Movement on Your Network
How to Hunt for Lateral Movement on Your NetworkHow to Hunt for Lateral Movement on Your Network
How to Hunt for Lateral Movement on Your NetworkSqrrl
 
Machine Learning for Incident Detection: Getting Started
Machine Learning for Incident Detection: Getting StartedMachine Learning for Incident Detection: Getting Started
Machine Learning for Incident Detection: Getting StartedSqrrl
 
Building a Next-Generation Security Operations Center (SOC)
Building a Next-Generation Security Operations Center (SOC)Building a Next-Generation Security Operations Center (SOC)
Building a Next-Generation Security Operations Center (SOC)Sqrrl
 
User and Entity Behavior Analytics using the Sqrrl Behavior Graph
User and Entity Behavior Analytics using the Sqrrl Behavior GraphUser and Entity Behavior Analytics using the Sqrrl Behavior Graph
User and Entity Behavior Analytics using the Sqrrl Behavior GraphSqrrl
 
Threat Hunting Platforms (Collaboration with SANS Institute)
Threat Hunting Platforms (Collaboration with SANS Institute)Threat Hunting Platforms (Collaboration with SANS Institute)
Threat Hunting Platforms (Collaboration with SANS Institute)Sqrrl
 
Sqrrl and IBM: Threat Hunting for QRadar Users
Sqrrl and IBM: Threat Hunting for QRadar UsersSqrrl and IBM: Threat Hunting for QRadar Users
Sqrrl and IBM: Threat Hunting for QRadar UsersSqrrl
 
Threat Hunting for Command and Control Activity
Threat Hunting for Command and Control ActivityThreat Hunting for Command and Control Activity
Threat Hunting for Command and Control ActivitySqrrl
 
Modernizing Your SOC: A CISO-led Training
Modernizing Your SOC: A CISO-led TrainingModernizing Your SOC: A CISO-led Training
Modernizing Your SOC: A CISO-led TrainingSqrrl
 
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together Sqrrl
 
Leveraging DNS to Surface Attacker Activity
Leveraging DNS to Surface Attacker ActivityLeveraging DNS to Surface Attacker Activity
Leveraging DNS to Surface Attacker ActivitySqrrl
 
The Art and Science of Alert Triage
The Art and Science of Alert TriageThe Art and Science of Alert Triage
The Art and Science of Alert TriageSqrrl
 
Reducing Mean Time to Know
Reducing Mean Time to KnowReducing Mean Time to Know
Reducing Mean Time to KnowSqrrl
 
The Linked Data Advantage
The Linked Data AdvantageThe Linked Data Advantage
The Linked Data AdvantageSqrrl
 
Sqrrl Enterprise: Integrate, Explore, Analyze
Sqrrl Enterprise: Integrate, Explore, AnalyzeSqrrl Enterprise: Integrate, Explore, Analyze
Sqrrl Enterprise: Integrate, Explore, AnalyzeSqrrl
 
Sqrrl Datasheet: Cyber Hunting
Sqrrl Datasheet: Cyber HuntingSqrrl Datasheet: Cyber Hunting
Sqrrl Datasheet: Cyber HuntingSqrrl
 
Benchmarking The Apache Accumulo Distributed Key–Value Store
Benchmarking The Apache Accumulo Distributed Key–Value StoreBenchmarking The Apache Accumulo Distributed Key–Value Store
Benchmarking The Apache Accumulo Distributed Key–Value StoreSqrrl
 
Scalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelScalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelSqrrl
 
What's Next for Google's BigTable
What's Next for Google's BigTableWhat's Next for Google's BigTable
What's Next for Google's BigTableSqrrl
 

More from Sqrrl (20)

Transitioning Government Technology
Transitioning Government TechnologyTransitioning Government Technology
Transitioning Government Technology
 
Leveraging Threat Intelligence to Guide Your Hunts
Leveraging Threat Intelligence to Guide Your HuntsLeveraging Threat Intelligence to Guide Your Hunts
Leveraging Threat Intelligence to Guide Your Hunts
 
How to Hunt for Lateral Movement on Your Network
How to Hunt for Lateral Movement on Your NetworkHow to Hunt for Lateral Movement on Your Network
How to Hunt for Lateral Movement on Your Network
 
Machine Learning for Incident Detection: Getting Started
Machine Learning for Incident Detection: Getting StartedMachine Learning for Incident Detection: Getting Started
Machine Learning for Incident Detection: Getting Started
 
Building a Next-Generation Security Operations Center (SOC)
Building a Next-Generation Security Operations Center (SOC)Building a Next-Generation Security Operations Center (SOC)
Building a Next-Generation Security Operations Center (SOC)
 
User and Entity Behavior Analytics using the Sqrrl Behavior Graph
User and Entity Behavior Analytics using the Sqrrl Behavior GraphUser and Entity Behavior Analytics using the Sqrrl Behavior Graph
User and Entity Behavior Analytics using the Sqrrl Behavior Graph
 
Threat Hunting Platforms (Collaboration with SANS Institute)
Threat Hunting Platforms (Collaboration with SANS Institute)Threat Hunting Platforms (Collaboration with SANS Institute)
Threat Hunting Platforms (Collaboration with SANS Institute)
 
Sqrrl and IBM: Threat Hunting for QRadar Users
Sqrrl and IBM: Threat Hunting for QRadar UsersSqrrl and IBM: Threat Hunting for QRadar Users
Sqrrl and IBM: Threat Hunting for QRadar Users
 
Threat Hunting for Command and Control Activity
Threat Hunting for Command and Control ActivityThreat Hunting for Command and Control Activity
Threat Hunting for Command and Control Activity
 
Modernizing Your SOC: A CISO-led Training
Modernizing Your SOC: A CISO-led TrainingModernizing Your SOC: A CISO-led Training
Modernizing Your SOC: A CISO-led Training
 
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together
Threat Hunting vs. UEBA: Similarities, Differences, and How They Work Together
 
Leveraging DNS to Surface Attacker Activity
Leveraging DNS to Surface Attacker ActivityLeveraging DNS to Surface Attacker Activity
Leveraging DNS to Surface Attacker Activity
 
The Art and Science of Alert Triage
The Art and Science of Alert TriageThe Art and Science of Alert Triage
The Art and Science of Alert Triage
 
Reducing Mean Time to Know
Reducing Mean Time to KnowReducing Mean Time to Know
Reducing Mean Time to Know
 
The Linked Data Advantage
The Linked Data AdvantageThe Linked Data Advantage
The Linked Data Advantage
 
Sqrrl Enterprise: Integrate, Explore, Analyze
Sqrrl Enterprise: Integrate, Explore, AnalyzeSqrrl Enterprise: Integrate, Explore, Analyze
Sqrrl Enterprise: Integrate, Explore, Analyze
 
Sqrrl Datasheet: Cyber Hunting
Sqrrl Datasheet: Cyber HuntingSqrrl Datasheet: Cyber Hunting
Sqrrl Datasheet: Cyber Hunting
 
Benchmarking The Apache Accumulo Distributed Key–Value Store
Benchmarking The Apache Accumulo Distributed Key–Value StoreBenchmarking The Apache Accumulo Distributed Key–Value Store
Benchmarking The Apache Accumulo Distributed Key–Value Store
 
Scalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelScalable Graph Clustering with Pregel
Scalable Graph Clustering with Pregel
 
What's Next for Google's BigTable
What's Next for Google's BigTableWhat's Next for Google's BigTable
What's Next for Google's BigTable
 

Recently uploaded

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Recently uploaded (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Sqrrl real time_big_data_20130411

  • 1. sqrrl   Secure.  Scale.  Adapt   Sqrrl  Data,  Inc.    All  Rights  Reserved   sqrrl   Secure.  Scale.  Adapt.   Adam  Fuchs,  CTO   11  April,  2013  
  • 2. 2   Sqrrl  Data,  Inc.    All  Rights  Reserved   Management Ely Kahn sqrrl VP BizDev, White House Investors Adam Fuchs sqrrl CTO, NSA Who  We  Are   20+  years  of  combined   Apache  Accumulo   engineering  exper9se   Mark Terenzoni sqrrl CEO, F5 •  Founded  July  2012   •  Funded  August  2012   •  Team  includes  former  Tech   Director  of  Accumulo  at  NSA  and   6  commiDers/contributors    
  • 3. 3   Sqrrl  Data,  Inc.    All  Rights  Reserved   3   Our  Mission   Security   AdapGvity  Scalability  
  • 4. 4   Sqrrl  Data,  Inc.    All  Rights  Reserved   4   Apache  Accumulo   "   Sorted, Distributed Key/Value Store "   Based on Google’s Big Table Design "   Built on Top of Apache Hadoop and Apache Zookeeper "   Augments and Integrates With the Hadoop ecosystem "   Originally developed at the National Security Agency, now an Apache Software Foundation project
  • 5. 5   Sqrrl  Data,  Inc.    All  Rights  Reserved   5   Applica9ons   Analy9cs  APIs   Security  &  Access   Controls   Data  Integra9on   Search,  Sta*s*cs,  Graph,   Lucene,  SQL,  Custom   Extensions   IAM,  Encryp*on,   DAM,  Secure  Code   ETL,  Hadoop   Accumulo   Sqrrl  Enterprise  Architecture  
  • 6. 6   Sqrrl  Data,  Inc.    All  Rights  Reserved   "   Start  small,  but  design  for  scalability   –  One  applicaGon  first,  then  grow  to  hundreds   –  One  gigabyte  first,  then  grow  to  petabytes   "   Itera*ve  schema  refinement   –  IniGally,  let  the  data  define  the  schema   –  Refine  the  schema  in  bulk  as  you  beDer  understand  the  data   –  Middle  ground  between  flat  files  and  complete  ontologies   "   Discovery  analy*cs  as  applica*on  building  blocks   –  Universal  search:  structured  and  unstructured  data,  across  data  sets,  low  latency   –  Basic  staGsGcs:  aggregaGons  of  query  results,  parallelized,  low  latency,  to  support  big   picture  analysis   –  Graphs:  scalable  graph  analyGcs  for  analyzing  how  everything  is  connected   "   Data-­‐centric  security   –  Separate  modeling  of  security  and  analysis   –  Simplifies  mulG-­‐tenancy  and  applicaGon  accreditaGon   Big  Data  Lessons  Learned  
  • 7. 7   Sqrrl  Data,  Inc.    All  Rights  Reserved   7   Schema  Discovery  
  • 8. 8   Sqrrl  Data,  Inc.    All  Rights  Reserved   The  future  of  Big  Data   innovaGon  is  Apps,  built  on:   •  Universal  Search   •  Schema-­‐less  StaGsGcs   •  Graphs   •  IntuiGve  Languages   •  Secure,  Scalable,  and   Adaptable  plaorms   Lightweight  Apps  
  • 9. 9   Sqrrl  Data,  Inc.    All  Rights  Reserved   9   Targeted  Analysis  
  • 10. 10   Sqrrl  Data,  Inc.    All  Rights  Reserved   10   Big-Picture  Analytics  
  • 11. 11   Sqrrl  Data,  Inc.    All  Rights  Reserved   DefiniGon:  A  form  of  security  in  which  data  carries   with  it  the  elements  of  provenance  that  are   required  to  make  policy  decisions  on  its   releasability.   •  Separate  data  modeling  for  Security  and  Analysis   •  Reusability  of  applicaGons  across  security   domains   •  Distributed  development  of  ingest  and  query   applicaGons   •  Supported  by  Accumulo’s  cell-­‐level  security   Data-Centric  Security  
  • 12. 12   Sqrrl  Data,  Inc.    All  Rights  Reserved   12   Cell-Level  Security  
  • 13. 13   Sqrrl  Data,  Inc.    All  Rights  Reserved   13   Scalable  Data-Centric  Security   Data   Labeler   Accumulo   Apps   User   ACributes   Audits   Policies   HDFS,   Zookeeper   End  Users   Auth.   Service   Policy   Engine  
  • 14. 14   Sqrrl  Data,  Inc.    All  Rights  Reserved   14   Accumulo’s  Strengths   "   Security   –  Cell-­‐level  security  reduces  the  cost  of  applicaGon  development  in  the   presence  of  complex  legal  or  policy  restricGons  on  data  use   –  IAM  and  encrypGon  Ges  into  enterprise  security  standards     "   Scalability   –  Proven  reliability  and  performance  at  the  mulG-­‐petabyte  scale   –  High-­‐performance  parallel  I/O  library     "   Adap9vity   –  Flexible  schema  support  to  quickly  ingest  new  data  sources   –  Sorted  key/value  paradigm  supports  a  mulGtude  of  search  and   analysis  applicaGons   –  Server-­‐side  programming  framework  “iterator  trees”  support  best-­‐in-­‐ class  aggregaGon,  filtering,  and  complex  query  semanGcs  
  • 15. 15   Sqrrl  Data,  Inc.    All  Rights  Reserved   15   An  Accumulo  key  is  a  5-­‐tuple,  consis9ng  of:       "   Row:  Controls  Atomicity   "   Column  Family:  Controls  Locality     "   Column  Qualifier:    Controls  Uniqueness   "   Visibility  Label:    Controls  Access   "   Timestamp:    Controls  Versioning   Row   Col.  Fam.   Col.  Qual.   Visibility   Timestamp   Value   John  Doe   Notes   PCP   PCP_JD   20120912   PaGent  suffers   from  an  acute  …   John  Doe   Test  Results   Cholesterol   JD|PCP_JD   20120912   183   John  Doe   Test  Results   Mental  Health   JD|PSYCH_JD   20120801   Pass   John  Doe   Test  Results   X-­‐Ray   JD|PHYS_JD   20120513   1010110110100…   Accumulo  Key/Value  Example   Accumulo  Key  Structure  
  • 16. 16   Sqrrl  Data,  Inc.    All  Rights  Reserved   16   Accumulo  Architecture   Tablet  Server   Tablet   Tablet  Server   Tablet   Tablet  Server   Tablet   ApplicaGon   Zookeeper   Zookeeper   Zookeeper   Master   HDFS   Read/Write   Store/Replicate   Assign/Balance   Delegate   Authority   Delegate   Authority   ApplicaGon   ApplicaGon  
  • 17. 17   Sqrrl  Data,  Inc.    All  Rights  Reserved   17   Tablet  Data  Flow   In-­‐Memory   Map   Write  Ahead   Log   (For  Recovery)   Sorted,   Indexed   File   Sorted,   Indexed   File   Sorted,   Indexed   File   Tablet   Reads   Iterator   Tree   Minor   Compac<on   Merging  /  Major   Compac<on   Iterator   Tree   Writes   Iterator   Tree   Scan  
  • 18. Iterator  Framework   18   Secure.        Scale.        Adapt.   Iterator  Opera9ons:     "   File  Reads   "   Block  Caching   "   Merging   "   DeleGon   "   IsolaGon   "   Locality  Groups   "   Range  SelecGon   "   Column  SelecGon   "   Cell-­‐level  Security   "   Versioning   "   Filtering   "   AggregaGon   "   ParGGoned  Joins   info@sqrrl.com  |  @sqrrl_inc  |  617.520.4375                          sqrrl  data,  INC.,    All  Rights  Reserved  
  • 19. 19   Sqrrl  Data,  Inc.    All  Rights  Reserved   •  No  built-­‐in  secondary   indices   •  Sort  Order  ó  Index   •  Balance  between   ingest  and  query   •  Avoid  introducing   boDlenecks   •  Preserve  cell-­‐level   security  and  scalability   Table  Design   Table:   Row:   Column   Family:   Column   Qualifier:   Value:   Forward  Index   <UUID>   <Type>   <Field>   <Term>   Inverted  Index   <Term>   <Type>  +   <Field>   <UUID>   <Digest  of   Event>  
  • 20. 20   Sqrrl  Data,  Inc.    All  Rights  Reserved   20   Ecosystem  Architecture   Apache  HDFS   Apache  Accumulo   Sqrrl  Enterprise   Custom  Ingester  Web  Server    Custom  AnalyGc  Map/Reduce  Task   Sqrrl  API  over  Apache  Thrip  RPC  :     Hierarchical  Documents  +  Graphs,  Lucene  +  SQL  +  more   Accumulo  RPC  :  Sorted  Key/Value  I/O   Hadoop  RPC  :  File  I/O    
  • 21. 21   Sqrrl  Data,  Inc.    All  Rights  Reserved   21   sqrrl  data,  inc.   275  Third  St.   Cambridge,  MA  02142     617-­‐902-­‐0784   www.sqrrl.com   @sqrrl_inc   info@sqrrl.com   Contact