Amazon CloudSearch is a fully managed search service in the cloud that lets customers integrate fast, highly scalable search into their applications. This session covers the basics of search and search engines, gives an introductory look at CloudSearch, and then walks through building a CloudSearch-based web application in depth.
7. Search Domain (diagram labels: DNS / Load Balancing, AWS Query)
• SEARCH SERVICE (access control): reached through the Search API and the Console. Operations: search documents.
• DOCUMENT SERVICE (access control): reached through the Doc Svc API, the Command Line Tools, and the Console. Operations: add, update, and delete documents.
• CONFIG SERVICE (access control): reached through the Config API, the Command Line Tools, and the Console. Operations: create, configure, and delete domains.
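As a rough illustration of what a call to the search service might look like, the sketch below builds a GET URL for a simple text query. The endpoint host name is hypothetical (real endpoints are assigned per domain), and the API version string and the `q`, `return-fields`, and `size` parameters are assumptions based on the 2011-02-01 CloudSearch search API, not something shown on the slide.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: real search endpoints are assigned per search domain.
SEARCH_ENDPOINT = "search-songs-example.us-east-1.cloudsearch.amazonaws.com"
# Assumption: the 2011-02-01 API version.
API_VERSION = "2011-02-01"


def build_search_url(query, return_fields=("title", "artist_name"), size=10):
    """Build a GET URL for a simple text search against the search service."""
    params = {
        "q": query,                                # free-text query
        "return-fields": ",".join(return_fields),  # fields to include in each hit
        "size": size,                              # maximum number of hits
    }
    return f"http://{SEARCH_ENDPOINT}/{API_VERSION}/search?{urlencode(params)}"


if __name__ == "__main__":
    print(build_search_url("love me do", size=5))
```

Only the URL is constructed here; sending the request and parsing the response are left out so the sketch stays independent of any particular HTTP client.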
8. Search instances scale along two dimensions:
• DATA (document quantity and size): the index is split into Index Partition 1, Index Partition 2, ... Index Partition n, each on its own search instance.
• TRAFFIC (search request volume and complexity): every index partition is replicated as Copy 1, Copy 2, ... Copy n, each copy on its own search instance.
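The grid of partitions and copies can be pictured with a toy model. The sketch below is not CloudSearch's internal routing; it only assumes that one search request must consult every index partition once and that any copy of a partition can answer. The function names are hypothetical.

```python
import random


def instance_count(partitions, copies):
    """Total search instances in a grid of index partitions by copies."""
    return partitions * copies


def route_search(partitions, copies, rng=random):
    """Pick one copy of every index partition to serve a single search request.

    Returns a list of (partition, copy) pairs, both numbered from 1.
    """
    return [(p, rng.randint(1, copies)) for p in range(1, partitions + 1)]


if __name__ == "__main__":
    print(instance_count(3, 2))
    print(route_search(3, 2))
```

In this model, adding partitions lets the domain hold more data, while adding copies spreads requests across more instances without changing what each request has to touch.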
10. • The Challenge
• The Data: The Million Song Data Set
http://labrosa.ee.columbia.edu/millionsong/
• The Application
11. Field name: Description
• artist_mbid: The musicbrainz.org ID
• artist_name: Name of the artist
• audio_md5: Hash code of the audio
• danceability: According to The Echo Nest
• duration: In seconds
• loudness: General loudness of the track
• song_hottnesss: According to The Echo Nest
• title: Song title
• year: Song year
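To make the field list concrete, here is a minimal sketch of turning one song record into an SDF "add" operation. The envelope keys (`type`, `id`, `version`, `lang`, `fields`) are an assumption based on the 2011-02-01 SDF format. Using `audio_md5` as the document ID, indexing only four of the fields, and rounding `duration` to whole seconds (on the assumption that numeric index fields hold unsigned integers) are illustrative choices. The sample values are made up.

```python
import json

TEXT_FIELDS = ("artist_name", "title")
INT_FIELDS = ("duration", "year")


def song_to_sdf(song, version=1):
    """Convert one song record (a dict) into an SDF 'add' operation."""
    # Keep only non-empty text fields.
    fields = {name: song[name] for name in TEXT_FIELDS if song.get(name)}
    # Assumption: numeric index fields are unsigned integers, so round them.
    for name in INT_FIELDS:
        if song.get(name) is not None:
            fields[name] = int(round(song[name]))
    return {
        "type": "add",
        "id": song["audio_md5"].lower(),  # assumption: the hash serves as a unique ID
        "version": version,
        "lang": "en",
        "fields": fields,
    }


# Made-up record for illustration only.
example = {
    "audio_md5": "0123456789abcdef0123456789abcdef",
    "artist_name": "Example Artist",
    "title": "Example Song",
    "duration": 218.93,
    "loudness": -7.5,
    "year": 1999,
}

if __name__ == "__main__":
    print(json.dumps(song_to_sdf(example), indent=2))
```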
12. • Create an Amazon CloudSearch domain
• Identify use case and supporting data
• Upload data
• Configure the domain
• Improve document ranking
• Integrate with the front end
• Keep documents up-to-date
20. • Create an Amazon CloudSearch domain
• Identify use case and supporting data
• Prepare and upload data
• Configure the domain
• Improve document ranking
• Integrate with the front end
• Keep documents up-to-date
22. Million Song DataSet → SDF Batches → Amazon CloudSearch
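The batching step in this pipeline might look like the sketch below, which groups SDF operations into JSON arrays that stay under a size cap. The 5 MB default is an assumed per-batch upload limit, and `sdf_batches` is a hypothetical helper name. The actual upload to the document service is deliberately left out.

```python
import json

# Assumption: the document service caps each uploaded batch at about 5 MB.
MAX_BATCH_BYTES = 5 * 1024 * 1024


def sdf_batches(docs, max_bytes=MAX_BATCH_BYTES):
    """Yield JSON arrays of SDF operations, each within max_bytes.

    A single operation larger than max_bytes is still emitted, alone,
    so the caller should validate document sizes separately.
    """
    batch, size = [], 2  # 2 bytes for the enclosing "[" and "]"
    for doc in docs:
        encoded = json.dumps(doc)  # ASCII-only output by default
        doc_size = len(encoded.encode("utf-8")) + 1  # +1 for the separating comma
        if batch and size + doc_size > max_bytes:
            yield "[" + ",".join(batch) + "]"
            batch, size = [], 2
        batch.append(encoded)
        size += doc_size
    if batch:
        yield "[" + ",".join(batch) + "]"


if __name__ == "__main__":
    ops = [{"type": "delete", "id": str(i), "version": 1} for i in range(10)]
    for payload in sdf_batches(ops, max_bytes=120):
        print(len(payload), payload)
```

Writing the function as a generator means a data set of this size never has to be held in memory as one list of batches.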