AKUDA Labs: Pulsar

AKUDA LABS PROPRIETARY AND CONFIDENTIAL

Our Superpower
100000#
10000#
250#
0# 10000# 20000# 30000# 40000# 50000# 60000# 70000# 80000# 90000# 100000#
Bananas#
Spark#Streaming#
(latency#<#3s)#
Spark#Streaming#
(op@mized#for#latency)#
Throughput)(packets/s))
10x
Throughput!
24,000x
Lower Latency!
400x
Throughput !
when Spark Streaming
optimized for latency!

The Benchmark:
Pattern Detection in Unstructured Streaming Text Data
Spark Streaming Setup#
Bananas Setup#
Text Stream
Generator!
Throughput Regulator#
Throughput
Regulator#

The Benchmark:
Platform and Setup
•  Dell 815 Servers!
!
•  48 Text Classiﬁcation
Pipelines!
!
•  10 Gbit Connection!
!
Spark Streaming Conﬁgurations:!
•  Receiver-Based Model!
•  12 Kafka Topic
Partitions!
•  Block Size: 200 ms!
•  Batch Size: 1.5 – 20 s!

Why does it matter?
•  Reliability – May add 2 Nines!
•  Hardware Cost – Potentially 100x Less Cost In Hardware!
•  Energy – Potentially 100x Less Energy !
•  Data Center Footprint – Potentially 100x Less Racks !
•  Manageability – 10 machines versus 1000 machines!!
•  Network BW – Potentially 100x less network BW!
•  Total Cost of Ownership – Potentially < 1000x !!!!
•  Greater Peace of Mind!

Who will pay for Real-Time Solutions?
Real-Time: Expected Latency < 1ms
•  Online Marketers!
–  Process over 100k events per second for thousands
of social media websites!
–  Expected revenue > $2.1 Trillion!
•  IoT Businesses!
–  Process thousands of events per second from
millions of connected devices!
–  Expected revenue > $100 Billion!
•  Spam and Fraud Detection!
–  Detect multiple complex patterns in millions of
transactions and documents per second!
–  Expected revenue > $40 Billion!

The Akuda Quest
•  To enable truly-real time classification of
extremely high rate data streams !
•  To enable subject matter experts who
possess extensive knowledge of the domain
the data belongs to, and who are often non-
programmers, to directly create classifiers!
•  To enable the fast development and
refinement of data classifiers!

The Real-Time Classification Challenge
Latency < 1ms
ocuments/second
ytes/packet
Ultra Fast Classification & Correlation
0.001 seconds (max latency)
1,000,000
Distinct Possible
Events/Trigger/Results
K1
K5
K4
K3
K2
K6
K7 K8
K9
K10
Actionable
Information
Previous Knowledge Previous Knowledge Previous Knowledge
100 !
events/s #
10,000 !
Devices#
1,000,000 !
packets/s #
10,000!
Classifiers#
10 Billion
Classification
Operations/s#

Pulsar Analyst Workbench
Quick, Intuitive Classiﬁer Development Sandbox

Quick Model Optimization
Specialized Compiler, Data Analysis Tools
RESOLVED FILTERING NETWORK
Optimizing Parallelizing Compiler
Cycle
Detection
Reordering
DFA
Pruning
Platform
Targeting
TARGET
PLATFORM
TOPOLOGY
Execution
Engine

AKUDA Technology Delivery
•  SaaS turn-key solution, with a model
development system that allows for
deployment of complete solutions in hours,
without any coding requirements.!
•  Privately deployable enterprise solution on
a Cloud Infrastructure. !
•  Software Development Infrastructure for
developing highly speciﬁc and targeted
solutions.!

The SaaS Platform: Pulsar
High Level View
INBOUND
DATA
HUB
DATA
AUGMENTATION
&
CORRELATION
CLASSIFICATION
INDEXING
CLUSTER
ANALYSIS
OUTBOUND
DATA
HUB

Pulsar
System View
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
AuthorAtributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

Pulsar
Inbound Data Hub
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
(LGM)
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

Pulsar
LGM: Data Augmentation and Correlation
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
(LGM)
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

Pulsar
Bananas: Data Classification
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
(LGM)
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

Pulsar
Corona: Cluster Analysis
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
(LGM)
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

Pulsar
Outbound Data Hub
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Image Universal
Searchable
Index
Image Store
AKUDA
Broadcaster
(LGM)
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator

THE AKUDA CORE!
MASSIVELY PARALLEL STREAMING
CLASSIFICATION INFRASTRUCTURE!

Possible Solution 1
NOT THIS - GTS: Scalability & Latency Problems
Feed BC
Rx
Rx
Rx
Rx
Indexer
Broadcaster
GTS
Indexing
System
Query With Frequency 2 q/s
Indexer
Indexer
Indexer
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Index Storage

Possible Solution 2
NOT THIS - HADOOP: Latency Problems
Feed BC
Brodcaster
HADOOP
Broadcaster#

Possible Solution 3
Not Quite There: Spark Streaming Pipeline of RDDs
Source
1,000,000 documents/second
1,024 bytes/packet
MicroBatcher
1,000,000 Sequential Stages
Doc 01
Doc 02
Doc 03
Doc 04
Doc 05
Doc 06
Doc 07
Doc 08
Doc 09
Doc 10
Doc 11
Doc 12
Doc 13
Doc 14
Doc 15
Doc 16
Latency of minutes, hours??
Network Transfers and/or Data
Copying Across Host Nodes or
Pipeline Stages

Possible Solution 4
Almost There: Data Flow Pipelines, Data Replication
Source
1,000 bytes/packet
Broadcaster
Bisection Bandwidth
1,000,000,000,000,000 bytes/second
~10,000,000 GBits/second
~10,000 TBits/second !!!
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
1,000,000
Stages Running
Simultaneously
Low Lat
Broadcasting Doc Replicas
becomes extreme bottleneck
PCIe 3.0 lane BW: ~ 1GByte/second
10Gbps Ethernet: ~ 1GB/second
Infiniband: Mellanox 56Gb/s FDR IB:
6.8GB/s
Cisco Catalyst 2960G-49TC-L Switching
Fabric: 40mpps. At 1000 bytes/
packet: 40,000 MBytes/second
==> 40 GBytes/second
Intel-Xeon-Processor-E7-8890 (15 cores)
Max Mem BW:85GBytes/second

Possible Solution 5
Cost & Latency Issues: Data Broadcasting Tree
Feed BC
Rx
Rx
Rx
Rx
Broadcaster
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
1
10
10
10
R
x
R
x
R
x
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
R
x
R
x
R
x
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Rx
Rx
Rx
Rx
Rx
Rx
10
10
10
1 + 10 x 10 x 10 x 10 x 10 x 10 = 1,000,001 Nodes
Worst Case Cost = 1,000,001 * $1000/month:
~ $ 1 Billion / Month !!!!
Latency Goes back to hours or days!

Honey, I Shrunk the Trees!
AKUDA Core Topology
Indexing / Analytics
Feed
Rx Tx
Visualization
Broadcaster
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx
Model Pipelines
Tx
Visualization
1,000 bytes/packet
100,000 Short Pipelines * 10 Stages each= 1,000,000 Stages
Akuda Queue Technology
using on-chip inter-core
networks
Akuda Buﬀer Technology
networks
Akuda Correlator Technology
networks
0.001 seconds typical latency

The Solution
Utilize Inter-Core Communication Channel
Data Communication Hardware! Typical Bandwidth! Typical Cost!
10 Gbps Ethernet! 1 GB/s! $ 1,000!
PCIe 3.0 Lane! 1 GB/s! $10,000!
Inﬁniband, Mellanox 56Gb/s FDR IB! 6.8 GB/s! $1,000,000!
Cisco Catalyst Switching Fabric! 40 GB/s! $10,000,000!
Inter-core/Inter-processor Fabric
Bisection Bandwidth!
1000 GB/s !
(for IA64 Chips)!
$500!

Data Broadcasting
Use The Best Broadcasting Network
340GB/s
> 1000GB/s

The Solution
AKUDA Core Differentiating Factors
Lockfree Queue, Pipeline
Control!
Lockfree Correlator!
Lockfree Multithreaded
Processing!
Feed BC
Broadcaster
Indexing /
AnalyticsRx Tx
Visualization
1
10
1000
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Akuda
Core
Akuda
Core
Zero-replication
Data Broadcasting!
On-chip-
network
Communication
Control!

Adaptive Topology
Continuous Optimization of Data Comm & Pipeline Execution

Akuda Core Scalability
Bisection Bandwidth
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
BBW
Processors
Akuda Lock-free
Algorithms
Standard Algorithms
Processing Latency
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
Time
Processors
Akuda Lock-free
Algorithms
Standard Algorithms
Processing Cost
200 400 600 800 1000 1200 1400 1600
800
700
600
500
400
300
200
100
1000 $ / Month
MILLION [Stream Rate * Pipelines * Patterns]
Akuda Lock-free
Algorithms
Standard
Algorithms
Parallelization Speedup
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
Speedup
Processors
Akuda Lock-free
Algorithms
Standard Algorithms

AKUDA Core in Action
Election2016.io: Real-Time Online Polls
“The problem is that when polls are wrong, they tend to
be wrong in the same direction. If they miss in New
Hampshire, for instance, they all miss on the same
mistake.” -- Nate Silver!

Akuda Core in Action
Election2016.io Backend
Feed
Rx
Model Pipelines
Tx
Visualization
50,000 documents/second (peak)
1,000 bytes/document
3000 Models (Author Classification + App Classification)
Akuda Broadcasting
Technology
using on-chip inter-core fabric
fabric
Akuda Correlator Technology
Sub-second Latency
150 GigaBytes Bisection Bandwidth
(Over 1 TERAbit/second)
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Rx Tx
Visualization
Learned People
Attributes
Akuda Classification
Technology
Correlator
Akuda Data Analysis
Technology
100 Patterns/Model
15 BILLION
Patterns/second

STANDALONE#USE#OF#
BANANAS##

General Statistical Classiﬁcation
K-MEANS, LDA, NN
Feed
Rx Tx
Broadcaster
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx
k-means Model SubMatrix
Tx
1,000 bytes/packet
10,000 Nodes * 100 k-means Centroid Vectors
networks
networks
Aggregator
networks
k-means
cluster
label
for data
item
Akuda Lockless
Matrix Ops
Akuda Lockless
Correlator

IOT Classification POC
K-MEANS
Feed
Rx Tx
Broadcaster
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
1,000,000 sensor-vectors/second
1,000 bytes/vector
Classification Using DFA, K-MEANS, LDA, or NN Models
networks
networks
Aggregator
networks
Sensor
Warnings
Akuda Lockless
Matrix Ops
Sensor State
Classification
Akuda Lockless
Correlator

IOT Classification POC
K-MEANS
LINEAR
ALGEBRA
ENGINE - 1
LINEAR
ALGEBRA
ENGINE - 2
LINEAR
ALGEBRA
ENGINE - N
LINEAR
ALGEBRA
ENGINE - 100
DATA
RECEIVER
INPUT DATA
CHANNEL
L2 NORM
CHANNEL
AGGREGATOR
LOCKLESS HASH
UNSORTED
CHANNEL
MIN FINDER
INPUT DATA
STREAM
OUTPUT DATA
STREAM
Packet ID
Input Packet: D
Packet ID
Transformed Packet: D’
Packet ID
Minimum Elements Vector
Minimum Distance
from Classifier: Pn
Packet ID
Classified Packet
For, K = 100,000 (number of clusters)
N = 100 (number of processors)
P = 1000 (cardinality of feature set)
D : Input Vector to be classified
A : Model matrix representing trained
values for classification centroids

AKUDA#LABS#PATENTS#
Pending#&#Provisional#

PATENT LIST (1/3)#
1 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR
CREATION OF SAME
2 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL-TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR
CREATION OF SAME
3
MASSIVELY-PARALLEL SYSTEM ARCHITECTURE AND METHOD FOR REAL-TIME EXTRACTION OF HIGH-VALUE INFORMATION FROM DATA STREAMS
4
OPTIMIZATION FOR REAL-TIME, PARALLEL EXECUTION OF MODELS FOR EXTRACTING HIGH-VALUE INFORMATION FROM DATA STREAMS
5
EXTRACTION OF HIGH VALUE INFORMATION FROM UNSTRUCTURED IMAGES IN MASSIVELY PARALLEL PROCESSING SYSTEM
6
REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM
7
ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC ASPECTS/IMPROVEMENTS OF REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM
8
AUTOMATIC TOPIC DISCOVERY IN STREAMS OF SOCIAL MEDIA POSTS
9
TOPIC AND TREND DISCOVERY WITHIN REAL-TIME ONLINE CONTENT STREAMS
10
SYSTEM AND METHOD FOR IMPLEMENTING ENTERPRISE RISK MODELS BASED ON INFORMATION POSTS
11
ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC MODELS OTHER THAN RISK MODELS
12
LAZY PARSER FOR INFERENCE IN UNSTRUCTURED DATA STREAMS
13
REALTIME DATA STREAM CLUSTER SUMMARIZATION AND LABELING SYSTEM
14
DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED DATA
15
REAL-TIME STREAM CORRELATION WITH PRE-EXISTING KNOWLEDGE (STATE)
16
LOCKLESS KEY-VALUE STORE AND MEMORY CACHING SYSTEM
17
DYNAMIC RESOURCE ALLOCATOR FOR REAL-TIME PARALLEL PIPELINE PROCESSING SYSTEM

PATENT LIST (2/3)#
18
REALTIME LOW LATENCY DATA STREAM DFA CLASSIFICATION ENGINE
19 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR SOCIAL MEDIA AUTHOR CLASSIFICATION AND
ANALYSIS STREAM
20
ATTRIBUTE VECTOR COMPRESSION FOR STREAM PROCESSING
21
REATIME IOT PARALLEL VECTOR CLASSIFICATION
22
REALTIME IMAGE HARVESTING AND STORAGE SYSTEM
23
DATA STREAM HISTORIC REPLAY VERSIONING (SKYLINE)
24
DATA STREAM HISTORIC REPLAY SYSTEM AND STORAGE
25
EXTRACTION OF AUTHOR(PEOPLE) ATTRIBUTES THROUGH COMPLEX DFA MODELS
26
REALTIME IMAGE HARVESTING AND STORAGE SYSTEM
27
NEURAL NETWORK-BASED SYSTEM FOR EXTRACTION OF DEMOGRAPHICS FROM SOCIAL MEDIA IMAGES
28
METHODFORSOCIALMEDIAEVENTDETECTIONANDCAUSEANALYSIS
29
METHOD FOR REAL-TIME TAGGING OF DATA STREAM DOCUMENTS
30
PEOPLE ATTRIBUTE QUERY AND VISUALIZATION TOOL
31
WORD SET VISUAL NORMALIZED WEIGHT DAMPENING
32 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED
ELECTION DATA
33 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED
RETAIL DATA

PATENT LIST (3/3)#
34
SYSTEMS AND METHODS FOR ANALYZING UNSOLICITED PRODUCT/SERVICE CUSTOMER REVIEWS
35
SYSTEM FOR CREDIT/INSURANCE PROCESSING USING UNSTRUCTURED DATA
36
SYSTEM AND METHOD FOR CORRELATING SOCIAL MEDIA DATA AND COMPANY FINANCIAL DATA
37
SYSTEMS AND METHODS FOR IDENTIFYING AN ILLNESS AND COURSE OF TREATMENT FOR A PATIENT
38
SYSTEM AND METHOD FOR IDENTIFYING FACIAL EXPRESSIONS FROM SOCIAL MEDIA IMAGES
39
SYSTEM AND METHOD FOR DETECTING HEALTH MALADIES IN A PATIENT USING UNSTRUCTURED IMAGES
40 SYSTEM AND METHOD FOR DETECTING POLITICAL DESTABILIZATION AT A SPECIFIC GEOGRAPHIC LOCATION BASED ON SOCIAL
MEDIA DATA
41
SYSTEM AND METHOD FOR IDENTIFYING CORRELATIONS BETWEEN SOCIAL MEDIA IMAGES USING NEURAL NETWORKS
42
SYSTEM AND METHOD FOR SCALABLE PROCESSING OF DATA PIPELINES USING A LOCKLESS SHARED MEMORY SYSTEM
43
ASYNCHRONOUS WEB PAGE DATA AGGREGATOR
44
APPLICATIONS OF DISTIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY TO REAL TIME NEWS SERVICE
45
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME THREAT ANALYSIS
46
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME EMERGENCY RESPONSE
47
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR CLIMATE ANALYTICS
48
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR INSURANCE RISK ASSESSMENT
49
DISTRIBUTED PARALLEL ARCHITECTURES FOR REAL TIME PROCESSING OF STREAMS OF STRUCTURED AND UNSTRUCTURED DATA

THE#AKUDA#SYSTEM#
Addi@onal#Informa@on#

The Solution
Akuda Core Topology with Kafka
UU#OnUchipUnetwork#Comm#Control#
UU#ZeroUcopy#Data#Broadcas@ng#
UU#Lockfree#queue,#pipeline#control#
UU#Lockfree#correlator#
UU#Lockfree#Mul@threaded#Processing#
Feed BC
Kafka
Indexing /
AnalyticsRx Tx
Visualization
1
10
1000
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Akuda
Core
Akuda
Core

Pulsar
Functional View
Unstructured
Data Source
Streams
Unstructured
Data Source
Batch
Unstructured
Data Source
Images
MILLIONS OF DOCUMENTS
PER SECOND
LDA
CONTROL
AKUDA
DEEP INSPECTION
THIRD-PARTY
DATA ANALYTICS
HADOOP
BASED ANALYTICS
THIRD-PARTY
VISUALIZATION
AKUDA
DASHBOARD
RT
Content
Classiﬁcation
(DFA/LDA/VEC)
RT
Author
Classiﬁcation
(DFA/LDA)
Optimizing Parallelizing
Compiler
Normalization
RT
Author
Image Analysis
(NEURAL NETS)
Universal
Indexing
P-GRAM GEN
Indexer
STATS /
ANALYTICS
Author ATTR
Author GEO
Author DEM
LDA PROC
P-GRAM GEN LDA PROC
10+ BILLIONS OF CLASSIFICATIONS
PER SECOND
MISSION
EDITOR

Automatic Cluster Discovery
P-GRAMS, LDA, CONVERGENCE
Mission Deep
Inspection Store
Summarizer
p-GRAM
Generator
Mission Stream
Concept
Extractor
LDA
Solver
Convergence
Monitor
p-GRAMS
Corpus
Summary
Corpus
Concept
Cloud
Labeled
Corpus
Clusters
Classification
Model
Library
LDA
Cluster Generation & Labeling
LDA
Cluster Refinement
DFA
Classifier Refinement
LDA
Classifier Generator

Author Attribute Discovery
Neural Networks, Bayesian Models, DFAs
Ethnicity
Image Analyzer
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
AKU
DA
Broad
caster
(LGM)
Unstructured
Data Source
A
Unstructured
Data Source
B
Unstructured
Data Source
C
Normalization
Age
Image Analyzer
Gender
Image Analyzer
Labeled
Image
Generator
Neural Network
Trainer
Author Bayesian
Classiﬁcation
Model Trainer

Generalized Image Classification
Neural Networks, Bayesian Models, DFAs
Ethnicity
Image Analyzer
Age
Image Analyzer
Gender
Image Analyzer
Labeled
Image
Generator
Neural Network
Trainer
Image
Data
Sources
Image
Harvester
Logo
Identification
Face
Detector Glasses
Image Analyzer
Weight
Image Analyzer
Hair-style
Image Analyzer
Shape
Identification
Emotion
Image Analyzer
Image
Label
Classifier
Image DB

Pipeline Editor
Automatic LDA Models, User-specified DFAs
RT
Content
Classification
(DFA/LDA/VEC)
Optimizing Parallelizing
Compiler
PIPELINE EDITOR
Filtering, Analysis And Action Network
LDA
Classifier
Vector
String
CMP
Vector
INT/
FP
CMP
DFA
Counter
Tap
Action
Block
DFA
Counter
Tap
Counter
Tap
DFA
Action
Block
Outp
utInou
t
LDA
Classifier
Vector
String
CMP
Vector
INT/FP
CMP
DFA Action
Block
Counter
Tap
Model Library
Airlines
Auto
Auto Insurance
Cable
Beverages
Fast Food
Finance
Housing
Legal
Pharma/Health
Most Used Detectors
Tech
Advertisement
Inquiry
Customer Service
Irate Customers
Thankful Customers
Consumers
STATE
MANAGEMENT
P-GRAM GEN
Indexer
LDA PROC

AKUDA Labs: Pulsar

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to AKUDA Labs: Pulsar

Similar to AKUDA Labs: Pulsar (20)

Recently uploaded

Recently uploaded (20)

AKUDA Labs: Pulsar