Leverage heterogeneous compute to gain real-time insights from Elasticsearch

•Download as PPTX, PDF•

0 likes•219 views

Expanding Elastic: Learn how anyone can leverage heterogeneous compute to extend and accelerate Elasticsearch The document discusses how Ryft provides solutions to leverage powerful FPGA- and x86-based heterogeneous compute resources to gain immediate insight from data in Elasticsearch without indexing or data preparation latency. Ryft can accelerate workflows, speed search and analysis across unstructured data with no transformation needed, and increase the power of edit distance searches for fuzzy matching beyond a distance of 2. Ryft offers flexible deployment on-premise, in the cloud, or in hybrid environments to extend the capabilities of Elasticsearch.

Technology

Expanding Elastic:
Learn how anyone can leverage heterogeneous compute
to extend and accelerate Elasticsearch
Pat McGarry, Vice President of Engineering
Al Leyva, Director of Product Management for Analytics Integration
Ryft
March 7, 2017
@Ryft

Analytics at the Speed of Your Business
 Challenges:
 Exponential data growth
 Traditional text searching is
hindered with complex
indexing and transformation
requirements
 Answers are delayed as
legacy architectures require
hours or days to find insights
Solutions:
 Deploy heterogeneous
compute on-premise, hybrid
or in the cloud
 Abstract away the
complexities of powerful
FPGA-based compute
technology
 Real-time insights with no
indexing or data preparation
latency
Fast data growth is creating the largest business threats & opportunities since the
Internet

Accelerate and Extend Elasticsearch with Ryft
Leverage powerful FPGA- and x86-based heterogeneous compute resources to
gain immediate insight without indexing
Reduce need for data indexing and transformation, accelerate searches and extend
Elasticsearch capabilities to:
 Accelerate workflows with the ability to deploy pre-index and post-index
searches
 Speed search and analysis across unstructured data and JSON, XML, LOGs,
CSV, TSV and other files with no transformation
 Increase the power of edit distance with user selectable changes to large (>2)
distance requests for Fuzzy Hamming or Levenshtein searches.
 Enhance wildcard searches to include leading wildcard characters

Flexible Deployment On-Premise, in the Cloud or in
Hybrid Environments
Kibana Layer
Elasticsearch Layer
Lucene Layer
ES to Lucene
Ryft ONE / AWS F1 Instance
ES Plugin mechanism routes requests
(fuzziness & metric)
The Elastic Stack implements a distributed,
JSON-based search and analytics engine:
ES to Ryft
Primitive
Elasticsearch on Ryft can be deployed in your
environment, on-premise, hybrid or in the
cloud:
Deploy via the new Amazon
F1 platform.
Deploy on-premise or in
hybrid environments with the
Ryft ONE accelerator.

Speed search and analysis across unstructured data
and JSON, XML, LOGs, CSV, TSV files with no ETL
• Using
Elasticsearch
command
• Un-indexed
human genome
data
• Match query
with Levenshtein
search
• Edit distance of
4

Increase edit distance values beyond 2 using Fuzzy
Hamming or Levenshtein searches powered by Ryft
• Using
Elasticsearch
command
• Match phrase
query with
Levenshtein
search
• With edit
distance of 6
• No re-indexing
necessary
Pharmaceutical Research
Pharmaceutical Scien ce
r deleted

Enhance wildcard searches to include leading
wildcard characters powered by Ryft
• Using
Elasticsearch
command
• Using XML
pcap file
• Looking for IP
addresses
xx.0.90.xx
• No ETL or
indexes
necessary

Thank you!
Contact us at info@ryft.com or www.ryft.com for a demo.

What's hot

mcubed london - data science at the edgeSimon Elliston Ball

Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli Spark Summit

R Then and NowRevolution Analytics

Accumulo Summit 2015: From Big Data to Linked Data: Making Sense of Massive, ...Accumulo Summit

Data munging and analysisRaminder Singh

Managed Cluster ServicesAdam Doyle

American Ancestors Use Case - Scalability & Support Using the Elasticsearch S...Elasticsearch

Data-Driven @ NetflixMichelle Ufford

Big Data Day LA 2016 Keynote - Reynold Xin/ DatabricksData Con LA

Elastic Stack roadmap deep diveElasticsearch

R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...Spark Summit

The Power of DataDataWorks Summit

Shifting Data Science into High GearSpark Summit

Webinar: Fusion for Data ScienceLucidworks

Optimizing Elastic for Search at McQueen SolutionsElasticsearch

Data saturday malta - ADX Azure Data Explorer overviewRiccardo Zamana

Data streaming at VRTMatthias De Vriendt

Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkSingleStore

Cascalog at May Bay Area Hadoop User Groupnathanmarz

InfoTrack: Creating a single source of truth with the Elastic StackElasticsearch

What's hot (20)

mcubed london - data science at the edge

Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli

R Then and Now

Accumulo Summit 2015: From Big Data to Linked Data: Making Sense of Massive, ...

Data munging and analysis

Managed Cluster Services

American Ancestors Use Case - Scalability & Support Using the Elasticsearch S...

Data-Driven @ Netflix

Big Data Day LA 2016 Keynote - Reynold Xin/ Databricks

Elastic Stack roadmap deep dive

R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...

The Power of Data

Shifting Data Science into High Gear

Webinar: Fusion for Data Science

Optimizing Elastic for Search at McQueen Solutions

Data saturday malta - ADX Azure Data Explorer overview

Data streaming at VRT

Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark

Cascalog at May Bay Area Hadoop User Group

InfoTrack: Creating a single source of truth with the Elastic Stack

Viewers also liked

ElasticSearch: Distributed Multitenant NoSQL Datastore and Search EngineDaniel N

Elasticsearch in ZalandoAlaa Elhadba

Diario Resumen 20170315Diario Resumen

Munit_in_mule_naveenVenkataNaveen Kumar

James elastic searchLearningTech

$IEEE CLOUD \'11$ $IEEE CLOUD \'11$

IEEE CLOUD \'11David Ribeiro Alves

Introduction to ElasticsearchSperasoft

eBay Experimentation Platform on HadoopTony Ng

Grokking Grok: Monitorama PDX 2015GregMefford

2014 devops conferencesDavid Lutz

Monitorama: How monitoring can improve the rest of the companyJeff Weinstein

V de gonwinelias barrios

How to make innovation happen.Dave Zamora

Elastic Search Indexing InternalsGaurav Kukal

Monitorama PDX 2016 - Vizceral: Traffic IntuitionJustin Reynolds

quick intro to elastic search medcl

Stream Processing Inside Librato [Monitorama PDX 2015]Librato, Inc.

Improving Schools With Social MediaEric Sheninger

Diagrama de la neurolinguisticaDaniel Alejandro Ramirez Chacon

Elastic search WalkthroughSuhel Meman

Viewers also liked (20)

ElasticSearch: Distributed Multitenant NoSQL Datastore and Search Engine

Elasticsearch in Zalando

Diario Resumen 20170315

Munit_in_mule_naveen

James elastic search

$IEEE CLOUD \'11$ $IEEE CLOUD \'11$

IEEE CLOUD \'11

Introduction to Elasticsearch

eBay Experimentation Platform on Hadoop

Grokking Grok: Monitorama PDX 2015

2014 devops conferences

Monitorama: How monitoring can improve the rest of the company

V de gonwin

How to make innovation happen.

Elastic Search Indexing Internals

Monitorama PDX 2016 - Vizceral: Traffic Intuition

quick intro to elastic search

Stream Processing Inside Librato [Monitorama PDX 2015]

Improving Schools With Social Media

Diagrama de la neurolinguistica

Elastic search Walkthrough

Similar to Leverage heterogeneous compute to gain real-time insights from Elasticsearch

Explore Elasticsearch and Why It’s Worth UsingInexture Solutions

Elastic search overviewABC Talks

Filebeat Elastic Search Presentation.pptxKnoldus Inc.

Elastic searchMahmoud91Tx

Spark Infrastructure Made EasyBlueData, Inc.

ElasticsearchOto Brglez

Testing Big Data: Automated Testing of Hadoop with QuerySurgeRTTS

The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive

Big Data , Big Problem?Mohammadhasan Farazmand

Using Data Lakes Amazon Web Services

Wasp2 - IoT and Streaming PlatformPaolo Platter

Teradata Technology Leadership and InnovationTeradata

IEEE 2014 JAVA DATA MINING PROJECTS Scalable keyword search on large rdf dataIEEEFINALYEARSTUDENTPROJECTS

Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Perficient, Inc.

Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS Amazon Web Services LATAM

Elastic Search Capability Presentation.pptxKnoldus Inc.

Python + MPP Database = Large Scale AI/ML Projects in Production FasterPaige_Roberts

4AA6-4492ENWMichecarly Osirus

Actian Matrix DatasheetEdgar Alejandro Villegas

QuerySurge Slide Deck for Big Data Testing WebinarRTTS

Similar to Leverage heterogeneous compute to gain real-time insights from Elasticsearch (20)

Explore Elasticsearch and Why It’s Worth Using

Elastic search overview

Filebeat Elastic Search Presentation.pptx

Elastic search

Spark Infrastructure Made Easy

Elasticsearch

Testing Big Data: Automated Testing of Hadoop with QuerySurge

The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...

Big Data , Big Problem?

Using Data Lakes

Wasp2 - IoT and Streaming Platform

Teradata Technology Leadership and Innovation

IEEE 2014 JAVA DATA MINING PROJECTS Scalable keyword search on large rdf data

Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...

Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS

Elastic Search Capability Presentation.pptx

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

4AA6-4492ENW

Actian Matrix Datasheet

QuerySurge Slide Deck for Big Data Testing Webinar

Recently uploaded

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Key Features Of Token Development (1).pptxLBM Solutions

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada

Recently uploaded (20)

Are Multi-Cloud and Serverless Good or Bad?

Scanning the Internet for External Cloud Exposures via SSL Certs

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Advanced Test Driven-Development @ php[tek] 2024

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn

Unblocking The Main Thread Solving ANRs and Frozen Frames

Key Features Of Token Development (1).pptx

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...

Human Factors of XR: Using Human Factors to Design XR Systems

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

DMCC Future of Trade Web3 - Special Edition

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Benefits Of Flutter Compared To Other Frameworks

Understanding the Laravel MVC Architecture

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024

Leverage heterogeneous compute to gain real-time insights from Elasticsearch

1. Expanding Elastic: Learn how anyone can leverage heterogeneous compute to extend and accelerate Elasticsearch Pat McGarry, Vice President of Engineering Al Leyva, Director of Product Management for Analytics Integration Ryft March 7, 2017 @Ryft

2. Analytics at the Speed of Your Business  Challenges:  Exponential data growth  Traditional text searching is hindered with complex indexing and transformation requirements  Answers are delayed as legacy architectures require hours or days to find insights Solutions:  Deploy heterogeneous compute on-premise, hybrid or in the cloud  Abstract away the complexities of powerful FPGA-based compute technology  Real-time insights with no indexing or data preparation latency Fast data growth is creating the largest business threats & opportunities since the Internet

3. Accelerate and Extend Elasticsearch with Ryft Leverage powerful FPGA- and x86-based heterogeneous compute resources to gain immediate insight without indexing Reduce need for data indexing and transformation, accelerate searches and extend Elasticsearch capabilities to:  Accelerate workflows with the ability to deploy pre-index and post-index searches  Speed search and analysis across unstructured data and JSON, XML, LOGs, CSV, TSV and other files with no transformation  Increase the power of edit distance with user selectable changes to large (>2) distance requests for Fuzzy Hamming or Levenshtein searches.  Enhance wildcard searches to include leading wildcard characters

4. Flexible Deployment On-Premise, in the Cloud or in Hybrid Environments Kibana Layer Elasticsearch Layer Lucene Layer ES to Lucene Ryft ONE / AWS F1 Instance ES Plugin mechanism routes requests (fuzziness & metric) The Elastic Stack implements a distributed, JSON-based search and analytics engine: ES to Ryft Primitive Elasticsearch on Ryft can be deployed in your environment, on-premise, hybrid or in the cloud: Deploy via the new Amazon F1 platform. Deploy on-premise or in hybrid environments with the Ryft ONE accelerator.

5. Speed search and analysis across unstructured data and JSON, XML, LOGs, CSV, TSV files with no ETL • Using Elasticsearch command • Un-indexed human genome data • Match query with Levenshtein search • Edit distance of 4

6. Increase edit distance values beyond 2 using Fuzzy Hamming or Levenshtein searches powered by Ryft • Using Elasticsearch command • Match phrase query with Levenshtein search • With edit distance of 6 • No re-indexing necessary Pharmaceutical Research Pharmaceutical Scien ce r deleted

7. Enhance wildcard searches to include leading wildcard characters powered by Ryft • Using Elasticsearch command • Using XML pcap file • Looking for IP addresses xx.0.90.xx • No ETL or indexes necessary

8. Thank you! Contact us at info@ryft.com or www.ryft.com for a demo.

Editor's Notes

Threat: Reliance upon legacy network & compute architectures that were never designed to organize, store & process data at the rate required today. Opportunity: There is an enormous growth market for organizations capable of gaining instant analysis on data in any analytics ecosystem and regardless of data type, format or structure.
Ryft leverages FPGA/x86 heterogeneous compute technology to eliminate indexing and transformation, in addition to providing expanded Elasticsearch functionality. For instance, Elasticsearch supports complex Levenshtein distance searches up to a distance of two. However, many of our customers often require more than a distance of two in cases when there are misspellings or abbreviations, there are many permutations of the same words, or when lots of transcriptions exist in the data. By leveraging FPGA and x86-based heterogeneous architectures either on-premise or in the cloud via the Amazon F1 platform, companies can eliminate data indexing and transformation, accelerate searches and extend Elasticsearch capabilities to: Accelerate workflows with the ability to deploy pre-index and post-index searches Speed search and analysis across unstructured data and JSON, XML, LOGs, CSV, TSV and other files with no ETL Increase edit distance values up to 10 or more using Fuzzy Hamming or Levenshtein searches Enhance wildcard searches to include leading wildcard characters With Ryft: Adds both edit distance & hamming fuzzy search capabilities to Elastic Search for phrase matching and word matches Extends the fuzziness distance beyond 2 Reuses existing ES “metric” to request search type No restrictions on Elasticsearch operations No reliance on Elasticsearch indexes Adds support non-JSON elements via ES syntax without indexes Wildcard support (limited in ES), PCRE2-compliant regular expressions, unstructured (non-indexed) file support, indexing performance improvements
All you see is the Elasticsearch application you know and love. Behind the scenes, Ryft abstracts away the complexities of using FPGA-based heterogeneous technology to execute multiple complex algorithms, and return data in native Elasticsearch JSON formats for visualization through the Elasticsearch application, Kibana, and/or other applications using the Elasticsearch API.
Genomics use case

Leverage heterogeneous compute to gain real-time insights from Elasticsearch

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Leverage heterogeneous compute to gain real-time insights from Elasticsearch

Similar to Leverage heterogeneous compute to gain real-time insights from Elasticsearch (20)

Recently uploaded

Recently uploaded (20)

Leverage heterogeneous compute to gain real-time insights from Elasticsearch

Editor's Notes