SlideShare a Scribd company logo
1 of 20
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
© 2022, Amazon Web Services, Inc. or its affiliates.
Similarity Search
Take your search engine to the next level
N O V E M B E R 1 8 T H – D S C E U R O P E 2 2
Francisco Losada
AWS Specialist Solutions Architect
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Agenda
• Text search: Analysis and queries
• Image search: Analysis and queries
• End-to-end architecture
2
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Text Search: Analysis
3
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Text Search: Analysis
4
Source: https://codingexplained.com/coding/elasticsearch/understanding-analysis-in-elasticsearch-analyzers
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Text Search: Query
5
GET /my-index-000001/_search
{
"query": {
"match": {
”texto": ”Europe"
}
}
}
1. DSL: Domain Specific Language
2. Analysis: Alicante  [europe]
3. Search
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image Search
6
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image Search: Analysis
7
• Characteristics:
• Dress
• White
• Bugs Bunny
• Round neck
• Short sleeve
• Black line around neck
• Tight
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
CNN: Feature Extraction
8
Source: https://neurohive.io/en/popular-networks/vgg16/
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates. 9
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image Search query: KNN algorithm
10
Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image Search query: KNN algorithm
11
Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image Search query: KNN algorithm
12
Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Image search pipeline
13
Source: https://aws.amazon.com/blogs/machine-learning/building-a-visual-search-application-with-amazon-sagemaker-and-amazon-es/
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
What is OpenSearch ?
OpenSearch is a community-driven, open source
search and analytics suite derived from Apache 2.0
licensed Elasticsearch 7.10.2 & Kibana 7.10.2.
The OpenSearch Project comprises OpenSearch, a
search engine daemon, OpenSearch Dashboards for
visualization and user interface, and tools and
plugins, providing additional functionality
https://opensearch.org/
14
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Amazon OpenSearch Service
Easy integration
Open source OpenSearch APIs, managed Opensearch
dashboards, integration with Logstash
Cost-effective
Pay only for resources used with choice of on-
demand and Reserved Instance compute pricing,
and save
up to 90% with Ultrawarm low-cost storage tier
Fully managed
Deployment in minutes, software installation and
patching, failure recovery, backups, and
monitoring
Scalable, secure, and compliant
Network isolation with Amazon VPC, encryption
at-rest and in transit, and compliant with
HIPPA PCI DSS, and ISO
15
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
AWS Cloud - region
VPC
Customer domain
Application Load
Balancing (ALB)
Data
nodes
Leader
nodes
UltraWarm
nodes
IAM, Cognito, SAML
for Dashboards Login
SAML
AWS
CloudTrail
Amazon
CloudWatch
OpenSearch Fine-grained access control
AWS Database
Migration
Service
Amazon Kinesis
Data Firehose
Amazon
CloudWatch Logs
Amazon Managed
Streaming for Kafka
16
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Pick
algorithm
Visualize in
notebooks
Label
data
Collect and
prepare data
Store
features
Check
data
Train
models
Tune
parameters
Deploy in
production
Manage
and monitor
CI/CD
Amazon SageMaker: Built to make ML more accessible
17
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Architecture
18
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Implementation
19
DSC EUROPE 22
© 2022, Amazon Web Services, Inc. or its affiliates.
Thank you!
Questions?
20
https://eventbox.dev/survey/IJKGHW4

More Related Content

Similar to [DSC Europe 22] Similarity search: Take your search engine to the next level - Francisco Losada

COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZER
Ashish Tanwer
 
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
Databricks
 
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_SingaporeCI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
Amazon Web Services
 

Similar to [DSC Europe 22] Similarity search: Take your search engine to the next level - Francisco Losada (20)

API workshop by AWS and 3scale
API workshop by AWS and 3scaleAPI workshop by AWS and 3scale
API workshop by AWS and 3scale
 
SEC301 - New AWS security services for container threat detection - final.pdf
SEC301 - New AWS security services for container threat detection - final.pdfSEC301 - New AWS security services for container threat detection - final.pdf
SEC301 - New AWS security services for container threat detection - final.pdf
 
Axsys Technologies Software Offerings
Axsys Technologies Software OfferingsAxsys Technologies Software Offerings
Axsys Technologies Software Offerings
 
Biometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWSBiometric Systems - Automate Video Streaming Analysis with Azure and AWS
Biometric Systems - Automate Video Streaming Analysis with Azure and AWS
 
Simplificando Arquiteturas Microsoft com os Serviços da AWS - ARC204 - Sao P...
Simplificando Arquiteturas Microsoft com os Serviços da AWS -  ARC204 - Sao P...Simplificando Arquiteturas Microsoft com os Serviços da AWS -  ARC204 - Sao P...
Simplificando Arquiteturas Microsoft com os Serviços da AWS - ARC204 - Sao P...
 
COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZER
 
RICOH THETA x IoT Developers Contest : Cloud API Seminar (2nd installation)
RICOH THETA x IoT Developers Contest : Cloud API Seminar (2nd installation)RICOH THETA x IoT Developers Contest : Cloud API Seminar (2nd installation)
RICOH THETA x IoT Developers Contest : Cloud API Seminar (2nd installation)
 
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
 
Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...
 
20210608 - Desarrollo de aplicaciones en la nube
20210608 - Desarrollo de aplicaciones en la nube20210608 - Desarrollo de aplicaciones en la nube
20210608 - Desarrollo de aplicaciones en la nube
 
The Scout24 Data Platform (A Technical Deep Dive)
The Scout24 Data Platform (A Technical Deep Dive)The Scout24 Data Platform (A Technical Deep Dive)
The Scout24 Data Platform (A Technical Deep Dive)
 
Confluent_AWS_ImmersionDay_Q42023.pdf
Confluent_AWS_ImmersionDay_Q42023.pdfConfluent_AWS_ImmersionDay_Q42023.pdf
Confluent_AWS_ImmersionDay_Q42023.pdf
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Look Before You Leap: Migrating On-Premises Hadoop to AWS
Look Before You Leap: Migrating On-Premises Hadoop to AWSLook Before You Leap: Migrating On-Premises Hadoop to AWS
Look Before You Leap: Migrating On-Premises Hadoop to AWS
 
AZ-204 A Top-Notch Exam Of Developing Solutions for Microsoft Azure.pdf
AZ-204 A Top-Notch Exam Of Developing Solutions for Microsoft Azure.pdfAZ-204 A Top-Notch Exam Of Developing Solutions for Microsoft Azure.pdf
AZ-204 A Top-Notch Exam Of Developing Solutions for Microsoft Azure.pdf
 
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_SingaporeCI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
CI-CD with AWS Developer Tools and Fargate_AWSPSSummit_Singapore
 
Your App Deserves More – The Art of App Modernization
Your App Deserves More – The Art of App ModernizationYour App Deserves More – The Art of App Modernization
Your App Deserves More – The Art of App Modernization
 
AWS Lambda Powertools walkthrough.pdf
AWS Lambda Powertools walkthrough.pdfAWS Lambda Powertools walkthrough.pdf
AWS Lambda Powertools walkthrough.pdf
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 
AWS Partner Data Analytics on AWS_Handout.pdf
AWS Partner Data Analytics on AWS_Handout.pdfAWS Partner Data Analytics on AWS_Handout.pdf
AWS Partner Data Analytics on AWS_Handout.pdf
 

More from DataScienceConferenc1

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
DataScienceConferenc1
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
DataScienceConferenc1
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
DataScienceConferenc1
 

More from DataScienceConferenc1 (20)

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
 
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
 
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
 
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
 
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
 
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
 
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
 
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
 
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
 
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
 
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
 
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
 
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
 
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
 
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
 
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
 
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
 
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
 

Recently uploaded

一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 

Recently uploaded (20)

一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptxMALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 

[DSC Europe 22] Similarity search: Take your search engine to the next level - Francisco Losada

  • 1. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. © 2022, Amazon Web Services, Inc. or its affiliates. Similarity Search Take your search engine to the next level N O V E M B E R 1 8 T H – D S C E U R O P E 2 2 Francisco Losada AWS Specialist Solutions Architect
  • 2. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Agenda • Text search: Analysis and queries • Image search: Analysis and queries • End-to-end architecture 2
  • 3. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Text Search: Analysis 3
  • 4. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Text Search: Analysis 4 Source: https://codingexplained.com/coding/elasticsearch/understanding-analysis-in-elasticsearch-analyzers
  • 5. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Text Search: Query 5 GET /my-index-000001/_search { "query": { "match": { ”texto": ”Europe" } } } 1. DSL: Domain Specific Language 2. Analysis: Alicante  [europe] 3. Search
  • 6. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image Search 6
  • 7. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image Search: Analysis 7 • Characteristics: • Dress • White • Bugs Bunny • Round neck • Short sleeve • Black line around neck • Tight
  • 8. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. CNN: Feature Extraction 8 Source: https://neurohive.io/en/popular-networks/vgg16/
  • 9. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. 9
  • 10. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image Search query: KNN algorithm 10 Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
  • 11. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image Search query: KNN algorithm 11 Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
  • 12. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image Search query: KNN algorithm 12 Source: https://medium.com/swlh/k-nearest-neighbor-ca2593d7a3c4
  • 13. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Image search pipeline 13 Source: https://aws.amazon.com/blogs/machine-learning/building-a-visual-search-application-with-amazon-sagemaker-and-amazon-es/
  • 14. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. What is OpenSearch ? OpenSearch is a community-driven, open source search and analytics suite derived from Apache 2.0 licensed Elasticsearch 7.10.2 & Kibana 7.10.2. The OpenSearch Project comprises OpenSearch, a search engine daemon, OpenSearch Dashboards for visualization and user interface, and tools and plugins, providing additional functionality https://opensearch.org/ 14
  • 15. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Amazon OpenSearch Service Easy integration Open source OpenSearch APIs, managed Opensearch dashboards, integration with Logstash Cost-effective Pay only for resources used with choice of on- demand and Reserved Instance compute pricing, and save up to 90% with Ultrawarm low-cost storage tier Fully managed Deployment in minutes, software installation and patching, failure recovery, backups, and monitoring Scalable, secure, and compliant Network isolation with Amazon VPC, encryption at-rest and in transit, and compliant with HIPPA PCI DSS, and ISO 15
  • 16. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. AWS Cloud - region VPC Customer domain Application Load Balancing (ALB) Data nodes Leader nodes UltraWarm nodes IAM, Cognito, SAML for Dashboards Login SAML AWS CloudTrail Amazon CloudWatch OpenSearch Fine-grained access control AWS Database Migration Service Amazon Kinesis Data Firehose Amazon CloudWatch Logs Amazon Managed Streaming for Kafka 16
  • 17. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Pick algorithm Visualize in notebooks Label data Collect and prepare data Store features Check data Train models Tune parameters Deploy in production Manage and monitor CI/CD Amazon SageMaker: Built to make ML more accessible 17
  • 18. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Architecture 18
  • 19. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Implementation 19
  • 20. DSC EUROPE 22 © 2022, Amazon Web Services, Inc. or its affiliates. Thank you! Questions? 20 https://eventbox.dev/survey/IJKGHW4

Editor's Notes

  1. https://www.youtube.com/shorts/n5t0B3N71vs
  2. Text Analysis: Es el proceso de convertir text no estructurado, a un formato estructurado que este optimizado para la busqueda. https://youtube.com/shorts/n5t0B3N71vs?feature=share
  3. Query DSL (Domain Specific Language)
  4. https://www.youtube.com/shorts/n5t0B3N71vs
  5. Purpose of the slide: Give a brief background on OpenSearch, the open-source engine that powers Amazon OpenSearch Service. In January 21, 2021, Elastic NV announced that they would change their software licensing strategy and not release new versions of Elasticsearch and Kibana under the permissive Apache License, Version 2.0 (ALv2) license. Instead, new versions of the software will be offered under the Elastic license, with source code available under the Elastic License or SSPL. These are not open source and do not offer users the freedoms of open source. To ensure that the open source community and our customers continue to have a secure, high-quality, fully open source search and analytics suite, AWS introduced OpenSearch; a community-driven, ALv2 licensed fork of open source Elasticsearch and Kibana. The ALv2 license gives the open source community and our customers the freedom to use, modify, extend, embed, monetize, resell, and offer OpenSearch as part of their products and services. The announcement of OpenSearch has garnered positive support from the community. Numerous organizations such as SAP, Capital One, Dow Jones, Logz.io, and Red Hat, and individual contributors have expressed interest in joining the project and helping develop OpenSearch.
  6. Amazon OpenSearch is a distributed system. We have number of different nodes , data nodes which hold the your data and indexing the data, responding to your quires, master nodes – orchestration the cluster and keep the cluster functioning as a whole Ultrawarm nodes – that are high density storage by S3 by much reduced cost. To store long tail data at much reduced cost. We have number of security features , we have IAM to provide access to cluster , we have open distro for OpenSearch plugin for providing fine grained access controls to your OpenSearch cluster We have integration with other services –On the Input side – We have Kinesis Firehose which can push your data to OpenSearch for log workloads, DMS – that can deliver database data into OpenSearch Cloud watch logs that supports lambda batch delivery to to OpenSearch On the Output side - metrics to cloud watch and audit data goes into cloud trail. We called it as domain.
  7. Amazon SageMaker is the most complete end-to-end ML service helping our customer through improved agility, productivity, and cost-effectiveness. We built Amazon SageMaker from the ground up to provide every developer and data scientist with the ability to build, train, and deploy ML models quickly and at lower cost by providing the tools required for every step of the ML development lifecycle in one integrated, fully managed service. In fact, we have launched 50+ capabilities in the past year alone, all aimed at making this process easier for our customers. And last year we launched Amazon SageMaker Studio to bring this all together in a single pane of glass so that you get access to all your tools in one place.
  8. 1. Dataset – Imagenes 2. Una red neuronal convolucional – para extraer el vector de las imagenes 3. Tenemos que tener el modelo accessible – Mejor si es a traves de un endpoint con una API 4. Tenemos que guardar todos los vectores del dataset en Opensearch