Ads Personalization at Spotify - NYC Data Engineering 10/23 - Kinshuk Mishra
Spotify engineers (Kinshuk Mishra and Noel Cody) share their experiences of building personalized ad experiences for users through iterative engineering and product development. The slides explain their process of continuous problem discovery, hypothesis generation, product development and experimentation. They dive deep into the specific ad personalization problems Spotify is solving and explain their data infrastructure technology stack in detail. They also explain how they've experimented with various product hypotheses and iteratively evolved their infrastructure to keep up with the product requirements.
A brief insight into Spotify brands and ads: what ad formats they offer, their positive connection with brands, and how they target audiences.
Legal Issues For Online Communities - David Deakin - FeverBee Limited
Hosting an online community raises numerous legal and regulatory challenges. Are you, as the community host, legally responsible for content submitted or shared by your members? This content may infringe other people’s intellectual property rights; might be defamatory; or could amount to some form of criminal activity. But do you know your responsibilities in relation to these types of content and activity, and how do you ensure you protect yourself by taking advantage of the legal defences available to avoid liability? And what about Data Privacy? European Data Protection Laws are changing dramatically over the next 2 years. Collection and exploitation of personal data may be a significant asset and driver for establishing your community. And in any event trust, privacy by design and regulatory compliance should be at the heart of your strategy. These issues, along with key Marketing Regulations and understanding your relationship with and responsibilities to third parties - such as Facebook and YouTube, or your hosting provider, will be summarised.
What To Do In A Post Reach World (Attracting An Audience In A Competitive Fie... - FeverBee Limited
Understand the evolution of the community manager - from managing and building communities to CRM specialist and paid social strategist. The main responsibilities of a community manager over the past 3-4 years have focused on finding the best strategies to build communities and plan content that would both entertain the existing community (engagement) and drive additional users to the brand's community. However, as both users' behaviour on social platforms and the organic distribution algorithms of most social platforms have gradually changed, the skills and thinking required of a community manager have evolved to encourage proactive engagement outside the brand community and a much more hyper-targeted approach to communications. SPRINT London
Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ... - Marketo
In this presentation, you'll learn how to get the most out of your content by optimizing your campaigns and landing pages. See how you can take online advertising one step further and personalize your outbound digital ads with personalized retargeting.
This talk is an updated version of my earlier talk "Failing Up" that I presented at Tom Tom and at App Builders Switzerland, 2016. It's a talk about how to create a failsafe environment for software companies and teams. It's critical to acknowledge that failure is necessary for innovation. So, if failure is a given, how do you fail well?
This version of the talk was first presented at Seattle Code Camp 2016
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan - UXPA International
The User Experience community faces an all-new challenge in the rise of hyper-personalized web content, which allows sites to serve targeted messages to individual users based on behavioral or demographic data. While the technologies that support targeting grow more powerful by the day, the UX and content strategies behind them are often lacking -- or nonexistent. This has resulted in something we’ve all experienced online at one time or another: the creepy targeted ad.
It’s time to fight back. While targeted marketing isn't going anywhere soon, the advent of Web Experience Management systems has democratized the landscape, meaning more clients have the ability to design personalized digital experiences. By re-framing the motivations behind targeting, UX designers have a significant opportunity to direct this power in ways that are user-centric, not marketer-centric. In this talk we’ll cover new tools and techniques to get started in the effort to take back personalized content for Good.
The Real World is Posterscope's monthly market update, containing latest industry news, key facts and figures and some really cool out-of-home campaigns.
Spotify's Music Recommendations Lambda Architecture - Esh Vckay
Spotify's Collaborative Filtering platform powers our Discover Page. Given that we have been adding one new user every three seconds, it is paramount that we do things in real time. We redesigned our recommendation system and added a Storm-based real-time platform.
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ... - DATAVERSITY
Real-time content, offer and ad targeting decisions must happen quickly. When a user requests information from a web application, a processing clock starts, requiring a decision in as little as 40 msec. Delays in targeting decisions lead to delays in responding to the user. These delays can lead to user dissatisfaction and, ultimately, loss of audience and revenue.
This session describes how AOL Advertising uses Hadoop to create sophisticated user profiles and NoSQL database technology from Couchbase to access those profiles in real-time, with sub-millisecond latency. This architecture leaves the bulk of the processing time budget for improved content, offer and ad targeting and even real-time content customization.
An introduction to the Spotify matrix model including recent updates we've made as we have continued to grow. I presented this talk at the Spark the Change Conference in London, UK on July 1, 2015.
How Apache Drives Music Recommendations At Spotify - Josh Baer
The slides go through the high-level process of generating personalized playlists for all of Spotify's users, using Apache big data products extensively.
Presentation given at Apache: Big Data Europe conference on September 29th, 2015 in Budapest.
Why does Spotify use a microservices architecture? What are the benefits and challenges we've encountered? How does our organizational model support our architecture?
Video of the talk is posted on YouTube: https://youtu.be/7LGPeBgNFuU
Vinylmint is a non-union, freelance marketplace of voice actors, sound designers, and translators. Variety and quality in freelance sound talent are tough to find at an affordable rate. Vinylmint simplifies the search, transaction, and rights transfer for sounds.
Spotify: the largest on-demand music service in the world, and growing fast. They offer a “freemium” as well as a paid premium model, allowing users the option to avoid advertisements and listen offline.
Tens of millions of daily Spotify users generate data that gives Spotify a unique opportunity to develop its products and services based on user behavior.
This report will provide some background on Spotify and explore their Customer Value Proposition using our CVP Canvas.
We will identify the brand initiatives surrounding product innovation, added-value services, shopping experience, and communication that contributed to a new success.
Mobile development case study that chronicles a company's foray into mobile development, native app development, responsive design and mobile marketing. Highlights a publishing company overcoming resource, bandwidth and structural issues to deliver its content to mobile devices.
We have an explosion of wearables - AirPods Pro - and smart speakers are now everywhere. What would an audio-first experience be like?
A business idea generated while in Antler, Singapore, 2020.
We have a packed agenda with hands-on tutorials, customer spotlights, product roadmaps and peer-to-peer sessions for like-minded system users. We are also covering industry best practices that impact long-term success in today’s business environment, including retail execution, business intelligence, mobility and data synchronization.
This presentation, taken from our "Lifting the Lid on Performance Content" event examines:
- Defining demand across the key points in the user journey
- The value and role of platforms across the key digital touchpoints with your brand
- How to measure effectiveness when traditional metrics are no longer a true reflection of performance
Content publishers are being constantly challenged to find new ways of communicating with customers whilst keeping costs under control. Print and direct mail remain a vitally important part of the marketing mix but as more audiences migrate to mobile, are you capitalising on the opportunity that this brings? How do you go about setting up a customer-centric marketing strategy so you can deliver the right campaigns to the right audience at the right time?
Wyndeham and Rhapsody, a combined print and digital media organisation under the umbrella of the Walstead Group, discussed these issues at their last event in Manchester.
The presentation discusses the importance of data in delivering accurate, effective marketing campaigns and the opportunities of mobile as a marketing channel.
For more information please visit our website: www.rhapsodymedia.co.uk
As an ecommerce and web design agency, our design and development team stays on top of the trends in website design and digital marketing in order to give clients websites that beat their competition in both form and function for online marketing success in 2017.
We start with a solid foundation: Open source CMS platform, sensible custom design, a brand identity, and bring it all together in a structure that makes the most of modern search engine optimization techniques so those clients can get found on Google.
We pride ourselves on being industry agnostic. By not limiting ourselves to a particular industry, we can keep our eye on the ball and work toward making remarkable websites.
Similar to Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra (20)
Bringing Sequential Analysis to A/B Testing with examples from his work at Optimizely.
These slides are from a talk given at the SF Data Engineering meetup. http://www.meetup.com/SF-Data-Engineering/events/231047195/
DataEngConf SF16 - Multi-temporal Data Structures - Hakka Labs
A mind-bending way of dealing with time syncing when aggregating data from many disparate sources. Talk by Jasmine Tsai and Alyssa Kwan, Clover Health. To hear about future conferences go to http://dataengconf.com
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you... - Hakka Labs
Tips for succeeding in your data science job interview. Talk by Bridge Mellichamp, Stitch Labs. To hear about future conferences go to http://dataengconf.com
DataEngConf SF16 - Methods for Content Relevance at LinkedIn - Hakka Labs
Learn how LinkedIn makes article recommendations for its users. Talk by Ajit Singh, LinkedIn. To hear about future conferences go to http://dataengconf.com
Launch Your Streaming Platforms in Minutes - Roshan Dwivedi
The claim of launching a streaming platform in minutes might be a bit of an exaggeration, but there are services that can significantly streamline the process. Here's a breakdown:
Pros of Speedy Streaming Platform Launch Services:
No coding required: These services often use drag-and-drop interfaces or pre-built templates, eliminating the need for programming knowledge.
Faster setup: Compared to building from scratch, these platforms can get you up and running much quicker.
All-in-one solutions: Many services offer features like content management systems (CMS), video players, and monetization tools, reducing the need for multiple integrations.
Things to Consider:
Limited customization: These platforms may offer less flexibility in design and functionality compared to custom-built solutions.
Scalability: As your audience grows, you might need to upgrade to a more robust platform or encounter limitations with the "quick launch" option.
Features: Carefully evaluate which features are included and if they meet your specific needs (e.g., live streaming, subscription options).
Examples of Services for Launching Streaming Platforms:
Muvi [muvi.com]
Uscreen [uscreen.tv]
Alternatives to Consider:
Existing Streaming platforms: Platforms like YouTube or Twitch might be suitable for basic streaming needs, though monetization options might be limited.
Custom Development: While more time-consuming, custom development offers the most control and flexibility for your platform.
Overall, launching a streaming platform in minutes might not be entirely realistic, but these services can significantly speed up the process compared to building from scratch. Carefully consider your needs and budget when choosing the best option for you.
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|... - informapgpstrackings
Keep tabs on your field staff effortlessly with Informap Technology Centre LLC. Real-time tracking, task assignment, and smart features for efficient management. Request a live demo today!
For more details, visit us : https://informapuae.com/field-staff-tracking/
Software Engineering, Software Consulting, Tech Lead.
Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Security,
Spring Transaction, Spring MVC,
Log4j, REST/SOAP WEB-SERVICES.
Essentials of Automations: The Art of Triggers and Actions in FME - Safe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
May Marketo Masterclass, London MUG May 22 2024.pdf - Adele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G... - Globus
The U.S. Geological Survey (USGS) has made substantial investments in meeting evolving scientific, technical, and policy driven demands on storing, managing, and delivering data. As these demands continue to grow in complexity and scale, the USGS must continue to explore innovative solutions to improve its management, curation, sharing, delivering, and preservation approaches for large-scale research data. Supporting these needs, the USGS has partnered with the University of Chicago-Globus to research and develop advanced repository components and workflows leveraging its current investment in Globus. The primary outcome of this partnership includes the development of a prototype enterprise repository, driven by USGS Data Release requirements, through exploration and implementation of the entire suite of the Globus platform offerings, including Globus Flow, Globus Auth, Globus Transfer, and Globus Search. This presentation will provide insights into this research partnership, introduce the unique requirements and challenges being addressed and provide relevant project progress.
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx - rickgrimesss22
Discover the essential features to incorporate in your Winzo clone app to boost business growth, enhance user engagement, and drive revenue. Learn how to create a compelling gaming experience that stands out in the competitive market.
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam - takuyayamamoto1800
These slides show a simulation example and how to compile the solver.
The Helmholtz equation can be solved with helmholtzFoam, and the Helmholtz equation with uniformly dispersed bubbles can be simulated with helmholtzBubbleFoam.
How to Position Your Globus Data Portal for Success Ten Good Practices - Globus
Science gateways allow science and engineering communities to access shared data, software, computing services, and instruments. Science gateways have gained a lot of traction in the last twenty years, as evidenced by projects such as the Science Gateways Community Institute (SGCI) and the Center of Excellence on Science Gateways (SGX3) in the US, The Australian Research Data Commons (ARDC) and its platforms in Australia, and the projects around Virtual Research Environments in Europe. A few mature frameworks have evolved with their different strengths and foci and have been taken up by a larger community such as the Globus Data Portal, Hubzero, Tapis, and Galaxy. However, even when gateways are built on successful frameworks, they continue to face the challenges of ongoing maintenance costs and how to meet the ever-expanding needs of the community they serve with enhanced features. It is not uncommon that gateways with compelling use cases are nonetheless unable to get past the prototype phase and become a full production service, or if they do, they don't survive more than a couple of years. While there is no guaranteed pathway to success, it seems likely that for any gateway there is a need for a strong community and/or solid funding streams to create and sustain its success. With over twenty years of examples to draw from, this presentation goes into detail for ten factors common to successful and enduring gateways that effectively serve as best practices for any new or developing gateway.
GraphSummit Paris - The art of the possible with Graph Technology - Neo4j
Sudhir Hasbe, Chief Product Officer, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Developing Distributed High-performance Computing Capabilities of an Open Sci... - Globus
COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among public health practitioners, mathematical modelers, and scientific computing specialists, while revealing critical gaps in exploiting advanced computing systems to support urgent decision making. Informed by our team’s work in applying high-performance computing in support of public health decision makers during the COVID-19 pandemic, we present how Globus technologies are enabling the development of an open science platform for robust epidemic analysis, with the goal of collaborative, secure, distributed, on-demand, and fast time-to-solution analyses to support public health.
Software Engineering, Software Consulting, Tech Lead, Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Transaction, Spring MVC, OpenShift Cloud Platform, Kafka, REST, SOAP, LLD & HLD.
Globus Connect Server Deep Dive - GlobusWorld 2024 - Globus
We explore the Globus Connect Server (GCS) architecture and experiment with advanced configuration options and use cases. This content is targeted at system administrators who are familiar with GCS and currently operate—or are planning to operate—broader deployments at their institution.
Understanding Globus Data Transfers with NetSage - Globus
NetSage is an open privacy-aware network measurement, analysis, and visualization service designed to help end-users visualize and reason about large data transfers. NetSage traditionally has used a combination of passive measurements, including SNMP and flow data, as well as active measurements, mainly perfSONAR, to provide longitudinal network performance data visualization. It has been deployed by dozens of networks worldwide, and is supported domestically by the Engagement and Performance Operations Center (EPOC), NSF #2328479. We have recently expanded the NetSage data sources to include logs for Globus data transfers, following the same privacy-preserving approach as for Flow data. Using the logs for the Texas Advanced Computing Center (TACC) as an example, this talk will walk through several different example use cases that NetSage can answer, including: Who is using Globus to share data with my institution, and what kind of performance are they able to achieve? How many transfers has Globus supported for us? Which sites are we sharing the most data with, and how is that changing over time? How is my site using Globus to move data internally, and what kind of performance do we see for those transfers? What percentage of data transfers at my institution used Globus, and how did the overall data transfer performance compare to the Globus users?
Utilocate offers a comprehensive solution for locate ticket management by automating and streamlining the entire process. By integrating with Geospatial Information Systems (GIS), it provides accurate mapping and visualization of utility locations, enhancing decision-making and reducing the risk of errors. The system's advanced data analytics tools help identify trends, predict potential issues, and optimize resource allocation, making the locate ticket management process smarter and more efficient. Additionally, automated ticket management ensures consistency and reduces human error, while real-time notifications keep all relevant personnel informed and ready to respond promptly.
The system's ability to streamline workflows and automate ticket routing significantly reduces the time taken to process each ticket, making the process faster and more efficient. Mobile access allows field technicians to update ticket information on the go, ensuring that the latest information is always available and accelerating the locate process. Overall, Utilocate not only enhances the efficiency and accuracy of locate ticket management but also improves safety by minimizing the risk of utility damage through precise and timely locates.
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart... - Globus
The Earth System Grid Federation (ESGF) is a global network of data servers that archives and distributes the planet’s largest collection of Earth system model output for thousands of climate and environmental scientists worldwide. Many of these petabyte-scale data archives are located in proximity to large high-performance computing (HPC) or cloud computing resources, but the primary workflow for data users consists of transferring data and applying computations on a different system. As a part of the ESGF 2.0 US project (funded by the United States Department of Energy Office of Science), we developed pre-defined data workflows, which can be run on demand, capable of applying many data reduction and data analysis operations to the large ESGF data archives, transferring only the resultant analysis (e.g., visualizations, smaller data files). In this talk, we will showcase a few of these workflows, highlighting how Globus Flows can be used for petabyte-scale climate analysis.
Providing Globus Services to Users of JASMIN for Environmental Data Analysis - Globus
JASMIN is the UK’s high-performance data analysis platform for environmental science, operated by STFC on behalf of the UK Natural Environment Research Council (NERC). In addition to its role in hosting the CEDA Archive (NERC’s long-term repository for climate, atmospheric science & Earth observation data in the UK), JASMIN provides a collaborative platform to a community of around 2,000 scientists in the UK and beyond, providing nearly 400 environmental science projects with working space, compute resources and tools to facilitate their work. High-performance data transfer into and out of JASMIN has always been a key feature, with many scientists bringing model outputs from supercomputers elsewhere in the UK, to analyse against observational or other model data in the CEDA Archive. A growing number of JASMIN users are now realising the benefits of using the Globus service to provide reliable and efficient data movement and other tasks in this and other contexts. Further use cases involve long-distance (intercontinental) transfers to and from JASMIN, and collecting results from a mobile atmospheric radar system, pushing data to JASMIN via a lightweight Globus deployment. We provide details of how Globus fits into our current infrastructure, our experience of the recent migration to GCSv5.4, and of our interest in developing use of the wider ecosystem of Globus services for the benefit of our user community.
Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra
1. May 6th, 2014
Real-Time Personalization
with Ad Tech @Spotify
Kinshuk Mishra
kinshuk@spotify.com
@_kinshukmishra
2. About Me
• @Spotify since 2011
• Data and Backend engineering
• With the Monetization team
3. May 12, 2014
Spotify in numbers
Started in 2006, available in 56 markets
20+ million songs, 20,000 added daily
24+ million active users, 6+ million subscribers
1.5 billion playlists
4. Monetization at Spotify
• $1 billion paid to rights holders since launch
• 1 in every 4 Spotify users is a paying subscriber
• Ad revenue pays for free tier music streaming
• Build platform to power artist and music label promotions
5. What is our mission?
• Power the free tier with ads customers love.
– Delight users, brands and artists with standout promotional experiences.
– Deliver the right message for every moment.
6. Delight users, brands and artists with standout promotional experience
• Ad formats need to be
- Native
- Actionable
- Scalable
- Engaging
12. Things we had to do
• Make ads context aware.
• Create measurable ad formats.
• Build tools to help brands understand their audience.
• Build flexible targeting infrastructure.
• Analyze user behavior to improve ad relevancy.
13. Why is real time interesting?
• True assessment of the context
- current view, last track, etc.
• Immediate feedback
- ad clicked, followed a playlist, etc.
• Increased ad opportunities
- hyperlocal, current mood, etc.
Source: http://nativemobile.com/tag/real-time-bidding
14. How to serve the right message for every moment?
• Fix the WTFs for ads in the music context
- Realtime short window activity analysis
• Improve ad relevancy based on user behavior
- (Realtime + Batch) long window activity analysis
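The two analysis windows above can be illustrated with a minimal Python sketch. It is not Spotify's implementation: the `ShortWindow` class, the `blend` function, the genre labels and the 0.7 weighting are all hypothetical names and values chosen for illustration. The short window keeps only recent events (the real-time view), and its genre shares are then blended with long-window counts that a batch job would have produced.

```python
from collections import Counter, deque


class ShortWindow:
    """Keeps only events from the last `window_seconds` (hypothetical helper)."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.events = deque()  # (timestamp, genre) pairs, oldest first

    def add(self, timestamp, genre):
        self.events.append((timestamp, genre))

    def genre_counts(self, now):
        # Evict events that have fallen out of the window.
        while self.events and self.events[0][0] <= now - self.window_seconds:
            self.events.popleft()
        return Counter(genre for _, genre in self.events)


def blend(short_counts, long_counts, short_weight=0.7):
    """Mix normalized short-window and long-window genre shares."""
    def normalize(counts):
        total = sum(counts.values())
        return {g: c / total for g, c in counts.items()} if total else {}

    short, long_ = normalize(short_counts), normalize(long_counts)
    return {
        g: short_weight * short.get(g, 0.0) + (1 - short_weight) * long_.get(g, 0.0)
        for g in set(short) | set(long_)
    }


# Usage: a 10-minute real-time window blended with an assumed batch profile.
window = ShortWindow(window_seconds=600)
window.add(0, "metal")
window.add(500, "kids")
window.add(550, "kids")
recent = window.genre_counts(now=900)  # the event at t=0 has expired
batch_profile = Counter({"metal": 90, "kids": 10})
scores = blend(recent, batch_profile)
```

In this example the current session is all kids' music even though the long-term profile is mostly metal, so the blended score favors the session context.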
15. Some quick wins
• Fix the WTFs for ads in the music context
- Fix the “Not Safe For Kids” ads problem.
- Fix the relevancy of ads in your music session.
• Improve ad relevancy based on user behavior
- Fix ad relevancy based on user’s musical taste.
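One way to picture the "Not Safe For Kids" fix is a context filter applied at serve time. The sketch below is an assumption, not the approach described in the slides: the `kid_safe` flag on each ad, the `eligible_ads` function and the genre labels are hypothetical.

```python
def eligible_ads(ads, session_genres, kids_genres=frozenset({"kids", "lullabies"})):
    """Drop ads not marked kid-safe when the session contains kids' music."""
    kids_context = any(genre in kids_genres for genre in session_genres)
    return [ad for ad in ads if ad["kid_safe"] or not kids_context]


# Usage: the same inventory filtered against two different sessions.
ads = [
    {"id": "beer", "kid_safe": False},
    {"id": "toys", "kid_safe": True},
]
kids_session = eligible_ads(ads, ["kids", "pop"])
rock_session = eligible_ads(ads, ["rock"])
```

The filter only removes inventory when the session context calls for it, so ads outside kids' sessions are unaffected.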
18. Our requirements
• Scalable log collection
• Capability to process logs in batch and real-time modes and aggregate user activity
• Capability to store user profiles and enable serve-time lookups
• Ability to update and add new features to existing user profiles
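The requirements above can be sketched in a few lines of Python. This is an in-memory illustration under stated assumptions, not the production design: `ProfileStore`, `aggregate` and the tab-separated `user_id<TAB>event` log format are hypothetical, and a plain dict stands in for whatever key-value store serves the lookups.

```python
class ProfileStore:
    """In-memory stand-in for a key-value profile store (hypothetical)."""

    def __init__(self):
        self._profiles = {}  # user_id -> {feature_name: value}

    def merge_features(self, user_id, features):
        # Add new features or overwrite existing ones without
        # touching the rest of the profile.
        self._profiles.setdefault(user_id, {}).update(features)

    def lookup(self, user_id):
        # Serve-time read: return a copy, empty for unknown users.
        return dict(self._profiles.get(user_id, {}))


def aggregate(log_lines):
    """Count events per user from 'user_id<TAB>event' lines (assumed format)."""
    totals = {}
    for line in log_lines:
        user_id, event = line.rstrip("\n").split("\t")
        per_user = totals.setdefault(user_id, {})
        per_user[event] = per_user.get(event, 0) + 1
    return totals


# Usage: aggregate a small log batch, then add a new feature later.
store = ProfileStore()
logs = ["u1\tad_click", "u1\ttrack_play", "u1\ttrack_play", "u2\ttrack_play"]
for user_id, counts in aggregate(logs).items():
    store.merge_features(user_id, counts)
store.merge_features("u1", {"top_genre": "jazz"})
```

Because `merge_features` updates a profile in place, a new feature can be added without rewriting the features already stored for that user.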
19. Bigdata @Spotify and choices
• 700-node Hadoop cluster
• 400 GB service logs daily
• 4.5 TB user data daily
• 7,500 Hadoop jobs daily
• 64 TB data generated daily
• YARN Map-Reduce, Giraph, Storm, Spark, etc.
28. Measuring effectiveness
• Build ad quality scores from lower-level metrics
- Positive response, avoidance, etc.
• A/B test hypotheses, evaluate against quality scores
• Build quality score optimization into real-time system
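As a rough illustration of scoring and comparison, the sketch below builds a quality score from two lower-level metrics and compares two A/B variants. The metric choice (clicks as positive response, skips as avoidance), the weights and the function names are assumptions made for this example. The slides do not specify the formula.

```python
def quality_score(impressions, clicks, skips, click_weight=1.0, skip_weight=0.5):
    """Weighted positive response minus avoidance, per impression."""
    if impressions == 0:
        return 0.0
    return (click_weight * clicks - skip_weight * skips) / impressions


def compare_variants(control, treatment):
    """Return the score difference (treatment minus control)."""
    return quality_score(**treatment) - quality_score(**control)


# Usage: two variants with made-up counts.
control = {"impressions": 1000, "clicks": 20, "skips": 100}
treatment = {"impressions": 1000, "clicks": 30, "skips": 80}
lift = compare_variants(control, treatment)
```

A positive difference suggests the treatment scored better on these counts. A real experiment would also need a significance test before acting on the result.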
29. Want to join the band?
https://www.spotify.com/us/jobs/ or
https://twitter.com/Spotifyjobs
Kinshuk Mishra
kinshuk@spotify.com @_kinshukmishra