Elastic Stack Roadmap Deep Dive

•

1 like•3,278 views

Learn about features with demos and announcements, from cross-cluster replication and frozen indices in Elasticsearch to Kibana Spaces and the ever-growing set of data integrations in Beats and Logstash.

Technology

Steve Kearns
Product Lead, Elastic Stack
The Elastic Stack Multiplier

Enterprise Search Observability Security
Elasticsearch Kibana

Data Management
Data Analysis
Actions & Alerting

Hot Warm Cold
Snapshot Lifecycle Management
Index Lifecycle Management

myindex-03
myindex
myindex-01
Alias
myindex-02

myindex-03
myindex
myindex-01
Datastream
myindex-02
• Auto-create from templates
• Smarter rollups
• Smarter query routing
• Smarter index management

Improved Memory Usage
Improved Disk Usage

Doc Values
Stored Fields
Term Dictionary
Term Proximity
Normalization Factors
Point Values
Meta Lookup

SELECT *
FROM logs-*
WHERE region = “us-east”
AND service = “mysql”
AND @timestamp > “2020-01-01”
ORDER BY @timestamp DESC
LIMIT 20

SELECT *
FROM logs-*
WHERE region = “us-east”
AND service = “mysql”
AND @timestamp > “2020-01-01”
ORDER BY @timestamp DESC
LIMIT 20
WHERE region = “us-east”
AND service = “mysql”
const_keyword fields

WHERE region = “us-east”
AND service = “mysql”
AND @timestamp > “2020-01-01” AND @timestamp > “2020-01-01”
Range normalization

AND @timestamp > “2020-01-01”
ORDER BY @timestamp DESC
LIMIT 20
Sorted execution

Elasticsearch is built for speed
• Every field is indexed
• Indexes built at ingest
• Denormalized data, no joins
• Distributed execution
But needs disk, cpu, memory!

Warm
Cold
More data, for longer
Cheap nodes
Remote file store
Frozen

Possibilities…
(How I learned to love the bomb)

Find users who:
• in the previous 12 months
• have used an application
• on today’s Malicious Apps list
• with param “powershell.exe”

Find users who:
• in the previous 12 months
• have used an application
• on today’s Malicious Apps list
• with param “powershell.exe”
Limited Join
}

Find users who:
• in the previous 12 months
• have used an application
• on today’s Malicious Apps list
• with param “powershell.exe”
Schema on Read}

$GET logs-*/_search { “query”: { “bool”: { “filter”: { “range”: { “@timestamp”: { “gte”: “2020-01-01”, “lt”: “2020-02-01” } } } } } }$

SELECT *
FROM logs-*
WHERE @timestamp
BETWEEN “2020-01-01”
AND “2020-02-01”
SQL

@timestamp >= "2020-01-01" AND @timestamp < "2020-02-01"
KQL

$GET /api/v1/query_range? start=2020-01-01T00:00:00Z &end=2020-02-01T00:00:00Z &query=node_network_receive_bytes_total{device="eth1"} PromQL$

sequence by unique_pid
[process where process_name in
("mshta.exe", "rundll32.exe")]
[image_load where image_name in
("jscript9.dll", "winhttp.dll")]
EQL

This document summarizes the key components and collaborations in Apache Druid. It describes Zookeeper's role in coordination, the Overlord's role in task management, the Broker's role in query routing, and the Middle Manager's role in ingestion and indexing. It provides diagrams illustrating how these components work together to ingest and store distributed data, and answer queries in a scalable way.

Move your on prem data to a lake in a Lake in Cloud

CAMMS

With the boom in data; the volume and its complexity, the trend is to move data to the cloud. Where and How do we do this? Azure gives you the answer. In this session, I will give you an introduction to Azure Data Lake and Azure Data Factory, and why they are good for the type of problem we are talking about. You will learn how large datasets can be stored on the cloud, and how you could transport your data to this store. The session will briefly cover Azure Data Lake as the modern warehouse for data on the cloud,

Hermes: Free the Data! Distributed Computing with MongoDB

MongoDB

Moving data throughout an organization is an art form. Whether mastering the art of ETL or building micro services, we are often left with either business logic embedded where it doesn't belong or monolithic apps that do too much. In this talk, we will show you how we built a persisted messaging bus to ‘Free the Data’ from the apps, making it available across the organization without having to write custom ETL code. This in turn makes it possible for business apps to be standalone, testable and more reliable. We will discuss the basic architecture and how it works, go through some code samples (server side and client side), and present some statistics and visualizations.

MoPub, a Twitter company, provides monetization solutions for mobile app publishers and developers around the globe. MoPub receives over 33 Billion ad requests per day generating over 200TB of raw logs every day. We built MoPub Analytics as the analytics platform, using Druid + Imply for our end users who are Publishers, Demand side partners and Internal users. We will talk about the architecture of the analytics platform, our Druid cluster setup, hardware choices, monitoring, use cases, limiting factors, challenges with lookups and solutions we used. Watch video:https://imply.io/virtual-druid-summit/analytics-over-terabytes-of-data-at-twitter-apache-druid

Open source log analytics

Vinod Nayal

This document summarizes an open source scalable log analytics solution. The solution uses Lumberjack for log collection, Logstash for indexing and filtering logs, Redis for buffering, Elasticsearch for indexing and searching logs, MongoDB for document storage, and Kibana with D3.js for visualization. Logs are collected from servers by Lumberjack, sent to Logstash for processing, indexed by Elasticsearch for searching, stored in MongoDB for retrieval, and visualized through dashboards and reports in Kibana. The solution allows for real-time log analysis, flexible searching and filtering, and scales horizontally as needs grow.

Dremio introduction

Alexis Gendronneau

This document discusses Apache Dremio, an open source data virtualization platform that provides self-service SQL access to data sources like Elasticsearch, MongoDB, HDFS, and relational databases. It aims to make data analytics faster by avoiding the need for data staging, warehouses, cubes, and extracts. Dremio uses techniques like reflections, pushdowns, and a universal relational algebra to optimize queries and leverage caches. It is based on projects like Apache Drill, Calcite, Arrow, and Parquet and can be deployed on Hadoop or the cloud. The presentation includes a demo of using Dremio to create datasets, curate/prepare data, accelerate queries with reflections, and manage resources.

Building Data Applications with Apache Druid

Imply

One of the most popular use cases for Apache Druid is building data applications. Data applications exist to deliver data into the hands of everyone on a team in a business, and are used by these teams to make faster, better decisions. To fulfill this role, they need to support granular drill down, because the devil is in the details, but also be extremely fast, because otherwise people won't use them! In this talk, Gian Merlino will cover: *The unique technical challenges of powering data-driven applications *What attributes of Druid make it a good platform for data applications *Some real-world data applications powered by Druid

Building a Real-Time Gaming Analytics Service with Apache Druid

Imply

At GameAnalytics we receive and process real time behavioural data from more than 100 million daily active users, helping thousands of game studios and developers understand user behaviour and improve their games. In this talk, you will learn how we managed to migrate our legacy backend system from using an in-house built streaming analytics service to Apache Druid, and the lessons learned along the way. By adopting Druid, we have been able to reduce development costs, increase reliability of our systems and implement new features that would have not been possible with our old stack. We will provide an overview of our approach to schema design, segments optimization, creation of our query layer, caching and datasources optimisation, which can help you better understand how you can successfully use Druid as a key component on your data processing and reporting infrastructure.

Archmage, Pinterest’s Real-time Analytics Platform on Druid

Imply

In this talk, we will talk about: 1) the motivation of switching from Hbase backed analytics system to Druid 2) the architecture design of Druid as a platform in Pinterest (Archmage, Hadoop, Kafka) including a query interface, Archmage, a thrift service in front of Druid which exposes a thrift api to company-wise clients, handles Druid broker hosts discovery, serves as a relay to broker hosts to abstract the async HTTP connection and provides query optimizations transparent to clients including directly translating fixed pattern SQL to Druid native JSON queries to save planning time. In addition, we’ll cover the production Hadoop batch and Kafka real time ingestion pipeline setup and the reason we picked a pull-based solution instead of a push-based solution for real time ingestion. 3) We will also talk about the use cases currently running in production on this platform including their data volume, QPS, Druid cluster setup, the unique challenges we met while onboarding and how we addressed them with extensive tunings to meet SLA and lessons learned for use cases including: partner insights, which provides partners with stats on organic pins; realtime spam detection, which detects user login related anomaly events and pin related spamming events like pin creation and repin; and migrating the backend from Presto to Druid for Ads related experiments data analysis.

Apache Druid Vision and Roadmap

Imply

Real-Time Video Analytics Using Hadoop and HBase (HBaseCon 2013)

Suman Srinivasan

Video Analysis in Hadoop

DataWorks Summit

Our secure remote connectivity tool provides full video recording of all work our engineers perform on client systems. We have requirements to analyze the video log to detect suspicious activity, provide forensic and root cause analysis capabilities. Some of the obvious use cases include detection of credit card patterns or personally identifiable information (PII) as well as malicious activity like dropping database objects. We need to process hundreds of gigabytes per day representing thousands of hours of video. Our solution leverages a variety of Hadoop components to perform optical text recognition and indexing, keyboard and mouse movement analysis as well as integration with variety of other data sources such as our monitoring, documentation, ticketing and communication systems. We will present our complete architecture starting from multi-source data ingestion through data processing and analysis up to the end user interface, reporting and integration layer.

The of Operational Analytics Data Store

Rommel Garcia

Druid Adoption Tips and Tricks

Imply

Peter Marshall, Technology Evangelist at Imply Abstract: Apache Druid® can revolutionise business decision-making with a view of the freshest of fresh data in web, mobile, desktop, and data science notebooks. In this talk, we look at key activities to integrate into Apache Druid POCs, discussing common hurdles and signposting to important information. Bio: Peter Marshall (https://petermarshall.io) is an Apache Druid Technology Evangelist at Imply (http://imply.io/), a company founded by original developers of Apache Druid. He has 20 years architecture experience in CRM, EDRM, ERP, EIP, Digital Services, Security, BI, Analytics, and MDM. He is TOGAF certified and has a BA degree in Theology and Computer Studies from the University of Birmingham in the United Kingdom.

Sparkler Presentation for Spark Summit East 2017

Karanjeet Singh

Big data in Azure

Venkatesh Narayanan

This document discusses Big Data solutions in Microsoft Azure. It introduces Azure cloud services and provides an overview of Big Data and how it differs from traditional databases. It then outlines Microsoft's Big Data solutions built on Hortonworks Data Platform, including HDInsight which allows running Hadoop on Azure. HDInsight supports various data storage options and processing tools like Hive, Pig, and Storm. The document also covers designing HDInsight clusters and Azure Data Lake for unlimited storage of structured and unstructured data.

MongoDB and the Internet of Things

MongoDB

Hype, buzzword, threat; however you want to characterize it, the Internet of Things (IoT) is here. IoT scenarios that were hypothetical only a few years ago are real today. Still thinking along the line of fleet management and temperature measurements? You’re out. Endless possibilities of IoT applications are surfacing every day, from the connected cow (huh?) to things that monitor and analyze your daily life (really?). In this webinar, we will discuss architecture of IoT data management solutions and the challenges that arise. We will explore how MongoDB features provide solutions to those problems. Time permitting, we will demonstrate an IoT Cloud service built on top of MongoDB.

Apache Arrow: In Theory, In Practice

Dremio Corporation

This document discusses Apache Arrow, an open source cross-language development platform for in-memory analytics. It provides an overview of Arrow's goals of being cross-language compatible, optimized for modern CPUs, and enabling interoperability between systems. Key components include core C++/Java libraries, integrations with projects like Pandas and Spark, and common message patterns for sharing data. The document also describes how Arrow is implemented in practice in systems like Dremio's Sabot query engine.

Data Analytics and Processing at Snap - Druid Meetup LA - September 2018

Charles Allen

MongoDB in the Middle of a Hybrid Cloud and Polyglot Persistence Architecture

MongoDB

How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experi...

Imply

A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics

DataWorks Summit

1. The Hadoop Image Processing (HIP) pipeline acquires vehicle images, identifies updates, generates URLs, crops and resizes images, copies them to asset servers, and removes duplicates. 2. It uses HBase for image storage and archiving, MapReduce for image processing, Kafka for publishing to asset servers, OpenCV for image processing, and Avro for data serialization. 3. Performance testing showed HIP scales linearly and is at least 10x faster than the previous system, and using cascading downloads provided a 20% performance gain.

Análisis del roadmap del Elastic Stack

Elasticsearch

What’s Evolving in the Elastic Stack

Elasticsearch

What's hot

August meetup - All about Apache Druid

Imply

Analytics over Terabytes of Data at Twitter

Imply

Open source log analytics

Vinod Nayal

Dremio introduction

Alexis Gendronneau

Building Data Applications with Apache Druid

Imply

Building a Real-Time Gaming Analytics Service with Apache Druid

Imply

Archmage, Pinterest’s Real-time Analytics Platform on Druid

Imply

Apache Druid Vision and Roadmap

Imply

Real-Time Video Analytics Using Hadoop and HBase (HBaseCon 2013)

Suman Srinivasan

Video Analysis in Hadoop

DataWorks Summit

The of Operational Analytics Data Store

Rommel Garcia

Druid Adoption Tips and Tricks

Imply

Sparkler Presentation for Spark Summit East 2017

Karanjeet Singh

Big data in Azure

Venkatesh Narayanan

MongoDB and the Internet of Things

MongoDB

Apache Arrow: In Theory, In Practice

Dremio Corporation

Data Analytics and Processing at Snap - Druid Meetup LA - September 2018

Charles Allen

MongoDB in the Middle of a Hybrid Cloud and Polyglot Persistence Architecture

MongoDB

How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experi...

Imply

A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics

DataWorks Summit

What's hot (20)

August meetup - All about Apache Druid

Analytics over Terabytes of Data at Twitter

Open source log analytics

Dremio introduction

Building Data Applications with Apache Druid

Building a Real-Time Gaming Analytics Service with Apache Druid

Archmage, Pinterest’s Real-time Analytics Platform on Druid

Apache Druid Vision and Roadmap

Real-Time Video Analytics Using Hadoop and HBase (HBaseCon 2013)

Video Analysis in Hadoop

The of Operational Analytics Data Store

Druid Adoption Tips and Tricks

Sparkler Presentation for Spark Summit East 2017

Big data in Azure

MongoDB and the Internet of Things

Apache Arrow: In Theory, In Practice

Data Analytics and Processing at Snap - Druid Meetup LA - September 2018

MongoDB in the Middle of a Hybrid Cloud and Polyglot Persistence Architecture

How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experi...

A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics

Similar to Elastic Stack Roadmap Deep Dive

Análisis del roadmap del Elastic Stack

Elasticsearch

What’s Evolving in the Elastic Stack

Elasticsearch

How to Reduce Public Cloud Storage Costs

Buurst

Choosing the right data storage in the Cloud.

Amazon Web Services

Data is gravity. Your workloads and processing is dependent on where your data is and how it is stored. With AWS, you have a host of storage options and the key to successfully leverage them is to know when to use which option. This session will explain in details about each of the AWS Storage offerings along with data ingestion optins into the Cloud using Snowball and Snowmobile Marc Trimuschat, Head - Business Developement, AWS Storage, AWS APAC

AWS Tech Talks - Data Lake Analytics

Amazon Web Services LATAM

This document discusses a webinar on data lakes and analytics hosted by Karlos Correia and Claudio Chiba, AWS solutions architects for the public sector. The agenda covers what a data lake is, why organizations use data lakes, how data lakes expand traditional analytics approaches, and the benefits of data lakes such as centralized data storage and schema-on-read capabilities. Amazon S3 and AWS analytics services are positioned as enabling technologies for building data lakes.

Elasticsearch in Production (London version)

foundsearch

Realtime Analytics on AWS

Sungmin Kim

This document discusses real-time analytics on streaming data. It describes why real-time data streaming and analytics are important due to the perishable nature of data value over time. It then covers key components of real-time analytics systems including data sources, stream storage, stream ingestion, stream processing, and stream delivery. Finally, it discusses streaming data processing techniques like filtering, enriching, and converting streaming data.

Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks

Amazon Web Services

(ARC311) Decoding The Genetic Blueprint Of Life On A Cloud Ecosystem

Amazon Web Services

Thermo Fisher Scientific, a world leader in biotechnology, has built a new polymerase chain reaction (PCR) system for DNA sequencing. Designed for low- to midlevel throughput laboratories that conduct real time PCR experiments, the system runs on individual QuantStudio devices. These devices are connected to Thermo Fisher’s cloud computing platform, which is built on AWS using Amazon EC2, Amazon DynamoDB, and Amazon S3. With this single platform, applied and clinical researchers can learn, analyze, share, collaborate, and obtain support. Researchers worldwide can now collaborate online in real time and access their data wherever and whenever necessary. Laboratories can also share experimental conditions and results with their partners while providing a uniform experience for every user and helping to minimize training and errors. The net result is increased collaboration, faster time to market, fewer errors, and lower cost. We have architected a solution that uses Amazon EMR, DynamoDB, Amazon Elasticache, and S3. In this presentation, we share our architecture, lessons learned, best design patterns for NoSQL, strategies for leveraging EMR with DynamoDB, and a flexible solution that our scientist use. We also share our next step in architecture evolution.

Real-time Analytics with Redis

Cihan Biyikoglu

AWS Analytics Immersion Day - Build BI System from Scratch (Day1, Day2 Full V...

Sungmin Kim

How to build Business Intelligence System from scratch on AWS (Day1, Day2) ------------------------------------------------------------------------------------------ 2020-03-18(수)~19(목) 2일 동안 온라인으로 진행한 Online AWS Analytics Immersion Day 전체 발표 자료 입니다. BI(Business Intelligence) 시스템을 설계하는 과정에서 AWS Analytics 서비스들을 어떻게 활용할 수 있는지 설명 드리고자 만든 자료 입니다. Target Audience ------------------- Online Analytics Immersion Day는 다음과 같은 고객을 대상으로 진행됩니다. - AWS Analytics Services (ex. Kinesis, Athena, Redshift, EMR, etc)의 기본 개념을 알고 있지만, 이러한 서비스 활용 방법 및 데이터 분석 시스템 구축 과정이 궁금하신 분 - 데이터 분석 시스템을 구축한 경험은 있지만, 자신이 만든 시스템을 아키텍처 관점에서 어떻게 평가하고 확인할 수 있는지 궁금하신 분

Implementation of Dense Storage Utilizing HDDs with SSDs and PCIe Flash Acc...

Red_Hat_Storage

Getting Started with Amazon Redshift

Amazon Web Services

Amazon Redshift is a fast, fully managed data warehousing service that allows customers to analyze petabytes of structured data, at one-tenth the cost of traditional data warehousing solutions. It provides massively parallel processing across multiple nodes, columnar data storage for efficient queries, and automatic backups and recovery. Customers have seen up to 100x performance improvements over legacy systems when using Redshift for applications like log and clickstream analytics, business intelligence reporting, and real-time analytics.

Getting Started with Amazon Redshift

Amazon Web Services

Traditional data warehouses become expensive and slow down as the volume of your data grows. Amazon Redshift is a fast, petabyte-scale data warehouse that makes it easy to analyze all of your data using existing business intelligence tools for 1/10th the traditional cost. This session will provide an introduction to Amazon Redshift and cover the essentials you need to deploy your data warehouse in the cloud so that you can achieve faster analytics and save costs.

Elasticsearch in Production

foundsearch

Black friday logs - Scaling Elasticsearch

Sylvain Wallez

Introduction to Storage on AWS - AWS Summit Cape Town 2017

Amazon Web Services

Elasticsearch in production New York Meetup at Twitter October 2014

beiske

Elasticsearch easily lets you develop amazing things, and it has gone to great lengths to make Lucene's features readily available in a distributed setting. However, when it comes to running Elasticsearch in production, you still have a fairly complicated system on your hands: a system with high demands on network stability, a huge appetite for memory, and a system that assumes all users are trustworthy. This talk will cover some of the lessons we've learned from securing and herding hundreds of Elasticsearch clusters.

Elasticsearch in production Boston Meetup October 2014

beiske

Three Steps to Modern Media Asset Management with Active Archive

Avere Systems

This document discusses a three step approach to modern media asset management with an active archive: 1) Using object storage like Cleversafe for scalable, low-cost archive storage that is geo-dispersed for resilience. 2) Making the archive easily accessible using tools like Avere to provide NAS simplicity and performance. 3) Managing large quantities of media assets using asset management tools like CatDV for ingest, metadata, search, collaboration and workflows.

Similar to Elastic Stack Roadmap Deep Dive (20)

Análisis del roadmap del Elastic Stack

What’s Evolving in the Elastic Stack

How to Reduce Public Cloud Storage Costs

Choosing the right data storage in the Cloud.

AWS Tech Talks - Data Lake Analytics

Elasticsearch in Production (London version)

Realtime Analytics on AWS

Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks

(ARC311) Decoding The Genetic Blueprint Of Life On A Cloud Ecosystem

Real-time Analytics with Redis

AWS Analytics Immersion Day - Build BI System from Scratch (Day1, Day2 Full V...

Implementation of Dense Storage Utilizing HDDs with SSDs and PCIe Flash Acc...

Getting Started with Amazon Redshift

Elasticsearch in Production

Black friday logs - Scaling Elasticsearch

Introduction to Storage on AWS - AWS Summit Cape Town 2017

Elasticsearch in production New York Meetup at Twitter October 2014

Elasticsearch in production Boston Meetup October 2014

Three Steps to Modern Media Asset Management with Active Archive

More from Elasticsearch

An introduction to Elasticsearch's advanced relevance ranking toolbox

Elasticsearch

The hallmark of a great search experience is always delivering the most relevant results, quickly, to every user. The difficulty lies behind the scenes in making that happen elegantly and at a scale. From App Search’s intuitive drag and drop interface to the advanced relevance capabilities built into the core of Elasticsearch — Elastic offers a range of tools for developers to tune relevance ranking and create incredible search experiences. In this session, we’ll explore some of Elasticsearch’s advanced relevance ranking features, such as dense vector fields, BM25F, ranking evaluation, and more. Plus we’ll give you some ideas for how these features are being used by other Elastic users to create world-class, category defining search experiences.

From MSP to MSSP using Elastic

Elasticsearch

Eze Castle Integration is a managed service provider (MSP), cloud service provider (CSP), and internet service provider (ISP) that delivers services to more than 1,000 clients around the world. Different departments within Eze Castle have devised their own log aggregation solutions in order to provide visibility, meet regulatory compliance requirements, conduct cybersecurity investigations, and help engineers with troubleshooting infrastructure issues. In 2019, they partnered with Elastic to consolidate the data generated from different systems into a single pane of glass. And thanks to the ease of deployment on Elastic Cloud, professional consultation services from Elastic engineers, and on-demand training courses available on Elastic Learning, Eze Castle was able to go from proof-of-concept to a fully functioning ""Eze Managed SIEM"" product within a month! Learn about Eze Castle's journey with Elastic and how they grew Eze Managed SIEM from zero to 100 customers In less than 14 months.

Cómo crear excelentes experiencias de búsqueda en sitios web

Elasticsearch

Descubre lo fácil que es crear búsquedas relevantes y enriquecidas en sitios web de cara al público para impulsar las conversiones, incrementar el consumo de contenido y ayudar a los visitantes a encontrar lo que necesitan. Realiza un recorrido por las herramientas de Elastic a las que puedes sacar partido para transformar con facilidad tu sitio web, lo que incluye nuestro nuevo y potente rastreador web.

Te damos la bienvenida a una nueva forma de realizar búsquedas

Elasticsearch

1) The document introduces ElasticON Solution Series, which provides out-of-the-box personalized, centralized, and secure organizational search across internal and external sources. 2) It discusses how Elastic Enterprise Search can improve productivity, satisfaction, collaboration, and decision making by connecting all applications and content with a single scalable search platform. 3) The solution achieves this through intuitive search features, powerful analytics and visualization tools, simplified administration, and security certifications to ensure data protection.

Tirez pleinement parti d'Elastic grâce à Elastic Cloud

Elasticsearch

Découvrez pourquoi Elastic Cloud est la solution idéale pour exploiter toutes les offres d'Elastic. Bénéficiez d'une flexibilité d'achat et de déploiement au sein de Google Cloud, de Microsoft Azure, d'Amazon Web Services ou des trois à la fois. Apprenez quels avantages vous apporte une offre de service géré et déterminez la solution qui vous permet de la gérer par vous-même grâce à des outils intégrés d'automatisation et d'orchestration. Et ce n'est pas tout ! Familiarisez-vous avec les fonctionnalités qui peuvent vous aider à scaler vos opérations au fur et à mesure de l'évolution de votre déploiement, à stocker vos données d'une manière rentable et à optimiser vos recherches. Ainsi, vous n'aurez plus à abandonner de données et obtiendrez les informations exploitables dont vous avez besoin pour assurer le fonctionnement de votre entreprise.

Comment transformer vos données en informations exploitables

Elasticsearch

Plongez au cœur de la recherche dans tous ses états.

Elasticsearch

À l'instar de la plupart des entreprises modernes, vos équipes utilisent probablement plus de 10 applications hébergées dans le cloud chaque jour, mais passent aussi bien trop de temps à chercher les informations dont elles ont besoin dans ces outils. Grâce aux fonctionnalités prêtes à l'emploi d'Elastic Workplace Search, découvrez combien il est facile de mettre le contenu pertinent à portée de la main de vos équipes grâce à une recherche unifiée sur l'ensemble des applications qu'elles utilisent pour faire leur travail.

Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]

Elasticsearch

An introduction to Elasticsearch's advanced relevance ranking toolbox

Elasticsearch

Welcome to a new state of find

Elasticsearch

Like most modern organizations, your teams are likely using upwards of 10 cloud-based applications on a daily basis, but spending far too many hours a day searching for the information they need across all of them. With the out-of-the-box capabilities of Elastic Workplace Search, see how easy it is to put relevant content right at your teams’ fingertips with unified search across all the apps they rely on to get work done.

Building great website search experiences

Elasticsearch

Keynote: Harnessing the power of Elasticsearch for simplified search

Elasticsearch

Cómo transformar los datos en análisis con los que tomar decisiones

Elasticsearch

Explore relève les défis Big Data avec Elastic Cloud

Elasticsearch

Spécialisée dans le développement et la gestion de solutions de veille documentaire et commerciale, Explore offre à ses clients une lecture précise et organisée de l’actualités des marchés et projets sur leurs territoires d'intervention. Afin de rendre leur offre plus agile et performante, Explore a choisi l’offre Elastic Cloud hébergée sur Microsoft Azure. Découvrez comment les équipes de production et de développement sont désormais en mesure de mieux exploiter les données pour les clients d’Explore et gagnent du temps sur la gestion de leur infrastructure.

Comment transformer vos données en informations exploitables

Elasticsearch

Transforming data into actionable insights

Elasticsearch

Opening Keynote: Why Elastic?

Elasticsearch

"Elastic enables the world’s leading organization to exceed their business objectives and power their mission-critical systems by eliminating data silos, connecting the dots, and transforming data of all types into actionable insights. Come learn how the power of search can help you quickly surface relevant insights at scale. Whether you are an executive looking to reduce operational costs, a department head striving to do more with fewer tools, or engineer monitoring and protecting your IT environment, this session is for you. "

Empowering agencies using Elastic as a Service inside Government

Elasticsearch

It has now been four years since the beta release of Elastic Cloud Enterprise which kicked off a wave of the Elastic public sector community running Elastic as a service within Government rather than utilizing purely hosted solutions. Fast forward to 2021 and we have multiple options for multiple mission needs. Learn top tips from Elastic architects and their experience enabling their teams with the automation and provisioning of Elastic tech to change the game in how government delivers solutions.

The opportunities and challenges of data for public good

Elasticsearch

The document discusses data for public good and the opportunities and challenges involved. It notes that data infrastructure is needed to deliver public good through data. There are almost endless opportunities to use data for public services, policy, and citizen benefits. However, challenges include legacy systems, data silos, unclear governance, and risk aversion. As a case study, it outlines how the UK Census 2021 addressed index faced challenges but showed progress on using data better, with lessons for continued public sector transformation.

Enterprise search and unstructured data with CGI and Elastic

Elasticsearch

More from Elasticsearch (20)

An introduction to Elasticsearch's advanced relevance ranking toolbox

From MSP to MSSP using Elastic

Cómo crear excelentes experiencias de búsqueda en sitios web

Te damos la bienvenida a una nueva forma de realizar búsquedas

Tirez pleinement parti d'Elastic grâce à Elastic Cloud

Comment transformer vos données en informations exploitables

Plongez au cœur de la recherche dans tous ses états.

Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]

An introduction to Elasticsearch's advanced relevance ranking toolbox

Welcome to a new state of find

Building great website search experiences

Keynote: Harnessing the power of Elasticsearch for simplified search

Cómo transformar los datos en análisis con los que tomar decisiones

Explore relève les défis Big Data avec Elastic Cloud

Comment transformer vos données en informations exploitables

Transforming data into actionable insights

Opening Keynote: Why Elastic?

Empowering agencies using Elastic as a Service inside Government

The opportunities and challenges of data for public good

Enterprise search and unstructured data with CGI and Elastic

Recently uploaded

Skybuffer SAM4U tool for SAP license adoption

Tatiana Kojar

Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool. SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.

Y-Combinator seed pitch deck template PP

c5vrf27qcz

What is an RPA CoE? Session 1 – CoE Vision

DianaGray10

Fueling AI with Great Data with Airbyte Webinar

Zilliz

"Choosing proper type of scaling", Olena Syrota

Fwdays

Northern Engraving | Nameplate Manufacturing Process - 2024

Northern Engraving

Manufacturing custom quality metal nameplates and badges involves several standard operations. Processes include sheet prep, lithography, screening, coating, punch press and inspection. All decoration is completed in the flat sheet with adhesive and tooling operations following. The possibilities for creating unique durable nameplates are endless. How will you create your brand identity? We can help!

Christine's Product Research Presentation.pptx

christinelarrosa

Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels

Northern Engraving

Apps Break Data

Ivo Velitchkov

How information systems are built or acquired puts information, which is what they should be about, in a secondary place. Our language adapted accordingly, and we no longer talk about information systems but applications. Applications evolved in a way to break data into diverse fragments, tightly coupled with applications and expensive to integrate. The result is technical debt, which is re-paid by taking even bigger "loans", resulting in an ever-increasing technical debt. Software engineering and procurement practices work in sync with market forces to maintain this trend. This talk demonstrates how natural this situation is. The question is: can something be done to reverse the trend?

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency

ScyllaDB

Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...

saastr

[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...

Jason Yip

The typical problem in product engineering is not bad strategy, so much as “no strategy”. This leads to confusion, lack of motivation, and incoherent action. The next time you look for a strategy and find an empty space, instead of waiting for it to be filled, I will show you how to fill it in yourself. If you’re wrong, it forces a correction. If you’re right, it helps create focus. I’ll share how I’ve approached this in the past, both what works and lessons for what didn’t work so well.

Dandelion Hashtable: beyond billion requests per second on a commodity server

Antonios Katsarakis

This slide deck presents DLHT, a concurrent in-memory hashtable. Despite efforts to optimize hashtables, that go as far as sacrificing core functionality, state-of-the-art designs still incur multiple memory accesses per request and block request processing in three cases. First, most hashtables block while waiting for data to be retrieved from memory. Second, open-addressing designs, which represent the current state-of-the-art, either cannot free index slots on deletes or must block all requests to do so. Third, index resizes block every request until all objects are copied to the new index. Defying folklore wisdom, DLHT forgoes open-addressing and adopts a fully-featured and memory-aware closed-addressing design based on bounded cache-line-chaining. This design offers lock-free index operations and deletes that free slots instantly, (2) completes most requests with a single memory access, (3) utilizes software prefetching to hide memory latencies, and (4) employs a novel non-blocking and parallel resizing. In a commodity server and a memory-resident workload, DLHT surpasses 1.6B requests per second and provides 3.5x (12x) the throughput of the state-of-the-art closed-addressing (open-addressing) resizable hashtable on Gets (Deletes).

High performance Serverless Java on AWS- GoTo Amsterdam 2024

Vadym Kazulkin

Java is for many years one of the most popular programming languages, but it used to have hard times in the Serverless community. Java is known for its high cold start times and high memory footprint, comparing to other programming languages like Node.js and Python. In this talk I'll look at the general best practices and techniques we can use to decrease memory consumption, cold start times for Java Serverless development on AWS including GraalVM (Native Image) and AWS own offering SnapStart based on Firecracker microVM snapshot and restore and CRaC (Coordinated Restore at Checkpoint) runtime hooks. I'll also provide a lot of benchmarking on Lambda functions trying out various deployment package sizes, Lambda memory settings, Java compilation options and HTTP (a)synchronous clients and measure their impact on cold and warm start times.

Must Know Postgres Extension for DBA and Developer during Migration

Mydbops

Mydbops Opensource Database Meetup 16 Topic: Must-Know PostgreSQL Extensions for Developers and DBAs During Migration Speaker: Deepak Mahto, Founder of DataCloudGaze Consulting Date & Time: 8th June | 10 AM - 1 PM IST Venue: Bangalore International Centre, Bangalore Abstract: Discover how PostgreSQL extensions can be your secret weapon! This talk explores how key extensions enhance database capabilities and streamline the migration process for users moving from other relational databases like Oracle. Key Takeaways: * Learn about crucial extensions like oracle_fdw, pgtt, and pg_audit that ease migration complexities. * Gain valuable strategies for implementing these extensions in PostgreSQL to achieve license freedom. * Discover how these key extensions can empower both developers and DBAs during the migration process. * Don't miss this chance to gain practical knowledge from an industry expert and stay updated on the latest open-source database trends. Mydbops Managed Services specializes in taking the pain out of database management while optimizing performance. Since 2015, we have been providing top-notch support and assistance for the top three open-source databases: MySQL, MongoDB, and PostgreSQL. Our team offers a wide range of services, including assistance, support, consulting, 24/7 operations, and expertise in all relevant technologies. We help organizations improve their database's performance, scalability, efficiency, and availability. Contact us: info@mydbops.com Visit: https://www.mydbops.com/ Follow us on LinkedIn: https://in.linkedin.com/company/mydbops For more details and updates, please follow up the below links. Meetup Page : https://www.meetup.com/mydbops-databa... Twitter: https://twitter.com/mydbopsofficial Blogs: https://www.mydbops.com/blog/ Facebook(Meta): https://www.facebook.com/mydbops/

Choosing The Best AWS Service For Your Website + API.pptx

Brandon Minnick, MBA

Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API? Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose? Which one is cheapest? Which one is fastest? Which one will scale to meet our needs? Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!

Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians

Neo4j

GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph

Neo4j

Session 1 - Intro to Robotic Process Automation.pdf

UiPathCommunity

👉 Check out our full 'Africa Series - Automation Student Developers (EN)' page to register for the full program: https://bit.ly/Automation_Student_Kickstart In this session, we shall introduce you to the world of automation, the UiPath Platform, and guide you on how to install and setup UiPath Studio on your Windows PC. 📕 Detailed agenda: What is RPA? Benefits of RPA? RPA Applications The UiPath End-to-End Automation Platform UiPath Studio CE Installation and Setup 💻 Extra training through UiPath Academy: Introduction to Automation UiPath Business Automation Platform Explore automation development with UiPath Studio 👉 Register here for our upcoming Session 2 on June 20: Introduction to UiPath Studio Fundamentals: https://community.uipath.com/events/details/uipath-lagos-presents-session-2-introduction-to-uipath-studio-fundamentals/

Christine's Supplier Sourcing Presentaion.pptx

christinelarrosa

Recently uploaded (20)

Skybuffer SAM4U tool for SAP license adoption

Y-Combinator seed pitch deck template PP

What is an RPA CoE? Session 1 – CoE Vision

Fueling AI with Great Data with Airbyte Webinar

"Choosing proper type of scaling", Olena Syrota

Northern Engraving | Nameplate Manufacturing Process - 2024

Christine's Product Research Presentation.pptx

Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels

Apps Break Data

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency

Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...

[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...

Dandelion Hashtable: beyond billion requests per second on a commodity server

High performance Serverless Java on AWS- GoTo Amsterdam 2024

Must Know Postgres Extension for DBA and Developer during Migration

Choosing The Best AWS Service For Your Website + API.pptx

Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians

GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph

Session 1 - Intro to Robotic Process Automation.pdf

Christine's Supplier Sourcing Presentaion.pptx

Elastic Stack Roadmap Deep Dive

1. Steve Kearns Product Lead, Elastic Stack The Elastic Stack Multiplier

2. Elasticsearch Kibana

3. Enterprise Search Observability Security Elasticsearch Kibana

6. Data Management Data Analysis Actions & Alerting

7. Data Management

9. Hot Warm Cold Snapshot

10. Hot Warm Cold Snapshot Lifecycle Management Index Lifecycle Management

11. myindex myindex-01 Alias

12. myindex myindex-01 Alias myindex-02

13. myindex-03 myindex myindex-01 Alias myindex-02

14. myindex-03 myindex myindex-01 Datastream myindex-02 • Auto-create from templates • Smarter rollups • Smarter query routing • Smarter index management

15. Hot Warm Cold Snapshot $$ $

16. Improved Memory Usage Improved Disk Usage

17. Heap File system cache Disk < 30GB

18. Heap File system cache Disk

19. Heap File system cache Disk

20. Heap File system cache Disk 6TB 10GB

21. Heap File system cache Disk 6TB 1.5GB

22. Disk

23. Disk

24. Archive Disk Cold

25. Doc Values Stored Fields Term Dictionary Term Proximity Normalization Factors Point Values Meta Lookup

26. Doc Values Stored Fields Term Dictionary Term Proximity Normalization Factors Point Values Meta Lookup

27. Archive Disk Frozen

28. Archive Disk Frozen

29. Smarter Query Routing

30. SELECT * FROM logs-* WHERE region = “us-east” AND service = “mysql” AND @timestamp > “2020-01-01” ORDER BY @timestamp DESC LIMIT 20

31. SELECT * FROM logs-* WHERE region = “us-east” AND service = “mysql” AND @timestamp > “2020-01-01” ORDER BY @timestamp DESC LIMIT 20

32. SELECT * FROM logs-* WHERE region = “us-east” AND service = “mysql” AND @timestamp > “2020-01-01” ORDER BY @timestamp DESC LIMIT 20 WHERE region = “us-east” AND service = “mysql” const_keyword fields

33. WHERE region = “us-east” AND service = “mysql” AND @timestamp > “2020-01-01” AND @timestamp > “2020-01-01” Range normalization

34. AND @timestamp > “2020-01-01” ORDER BY @timestamp DESC LIMIT 20 Sorted execution

35. Teaching Kibana patience

36. Elasticsearch is built for speed • Every field is indexed • Indexes built at ingest • Denormalized data, no joins • Distributed execution But needs disk, cpu, memory!

37. Warm Cold More data, for longer Cheap nodes Remote file store Frozen

38. 408 Request Time-out

39.

40.

41. Possibilities… (How I learned to love the bomb)

42. Find users who: • in the previous 12 months • have used an application • on today’s Malicious Apps list • with param “powershell.exe”

43. Find users who: • in the previous 12 months • have used an application • on today’s Malicious Apps list • with param “powershell.exe” Limited Join }

44. Find users who: • in the previous 12 months • have used an application • on today’s Malicious Apps list • with param “powershell.exe” Schema on Read}

45. Data Analysis

46. Speaking your language

47. GET logs-*/_search { “query”: { “bool”: { “filter”: { “range”: { “@timestamp”: { “gte”: “2020-01-01”, “lt”: “2020-02-01” } } } } } }

48. SELECT * FROM logs-* WHERE @timestamp BETWEEN “2020-01-01” AND “2020-02-01” SQL

49. @timestamp >= "2020-01-01" AND @timestamp < "2020-02-01" KQL

50. GET /api/v1/query_range? start=2020-01-01T00:00:00Z &end=2020-02-01T00:00:00Z &query=node_network_receive_bytes_total{device="eth1"} PromQL

51. sequence by unique_pid [process where process_name in ("mshta.exe", "rundll32.exe")] [image_load where image_name in ("jscript9.dll", "winhttp.dll")] EQL

52.

53. 53

54. 54

55. 55

56. 56