When using user signals to improve relevance, what should you use? Clicks are more frequent, but really only correspond to a search result looking attractive. A conversion is a powerful signal of true relevance but occurs less frequently. Can we combine shallow "this looks interesting" click events along with strong, but rare conversion signals in a robust fashion to generate learning to rank training data? In this talk, we introduce click models, an industry-proven way of measuring search result attractiveness from clicks, and propose a systematic way of incorporating conversion data into click models. Whether your industry is conversion heavy (like e-commerce), or lacking in any clear conversion signal (like publishing) you'll take away from this talk a system for turning any search analytics into robust judgments and training data. Because, after all, there is no AI-based Search without good training data!
Doug Turnbull, OpenSource Connections
Applying NLP and Machine Learning to Keyword AnalysisDan Segal
From a presentation at Text Analytics Forum, Washington, DC, Nov. 7, 2018. Keyword research allows companies to learn the voice of their customers and tune their marketing messages for them. One of the challenges in keyword research is to find collections of keywords that are topically relevant and in demand and therefore likely to draw search traffic and customer engagement. Data sources such as search logs and search engine result pages provide valuable sources of keywords, as well as insight into audience-specific language. Additionally, cognitive technologies such as natural language processing and machine learning provide capabilities for mining those sources at scale. With a few tools and some minimal coding, an analyst can generate clusters of best-bet keywords that are not only syntactically similar but semantically related. This how-to talk presents some practical techniques for automated analysis of keyword source data using off-the-shelf APIs.
Analyse de logs SEO : pour qui, pour quoi, comment ?Julien Deneuville
Conférence du 20 février 2016 au SEO Campus de Nantes.
En quoi consiste l'analyse de logs pour le SEO ?
A qui cela s'adresse-t-il ?
Comment s'y prendre ?
Quelles analyses effectuer, et comment réagir ?
Plus d'infos : http://blog.1-clic.info/referencement/analyse-de-logs-seocampus-nantes/
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...Qi Guo
*** Please check out our LinkedIn Engineering blog post: https://engineering.linkedin.com/blog/2019/04/ai-behind-linkedin-recruiter-search-and-recommendation-systems ***
LinkedIn Talent Solutions business contributes to around 65% of LinkedIn’s annual revenue, and provides tools for job providers to reach out to potential candidates and for job seekers to find suitable career opportunities. LinkedIn’s job ecosystem has been designed as a platform to connect job providers and job seekers, and to serve as a marketplace for efficient matching between potential candidates and job openings. A key mechanism to help achieve these goals is the LinkedIn Recruiter product, which enables recruiters to search for relevant candidates and obtain candidate recommendations for their job postings.
We highlight a few unique information retrieval, system, and modeling challenges associated with talent search and recommendation systems.
In this talk, we will present how we formulated and addressed the problems, the overall system design and architecture, the challenges encountered in practice, and the lessons learned from the production deployment of these systems at LinkedIn. By presenting our experiences of applying techniques at the intersection of recommender systems, information retrieval, machine learning, and statistical modeling in a large-scale industrial setting and highlighting the open problems, we hope to stimulate further research and collaborations within the SIGIR community.
Vemos cómo podemos crear clusters de URLs con el mismo significado para definir qué URLs deben redirigir a qué URLs y evitar contenido duplicado.
Ponencia SEO presentada por Lino Uruñuela en el Seonthebeach 2022 #SOB22 #SEO #SEOtecnico #seoavanzaado
OSA Con 2022 - Using ClickHouse Database to Power Analytics and Customer Enga...Altinity Ltd
OSA Con 2022: Using ClickHouse Database to Power Analytics and Customer Engagement Platform
Prafulla Gupta - Times Internet
This talk covers how we empowered Product Managers and Editors at Times Internet by developing an in-house product, GrowthRx, using Clickhouse Open Source Database to track and analyze user behavior to increase user retention and customer engagement. Times Internet is India's largest digital news publisher, which manages leading brands like Times of India, Economic Times, Navbharat Times, etc, where we are tracking more than 10 billion events per month in the ClickHouse Database.
Tree-like data relationships are common, but working with trees in SQL usually requires awkward recursive queries. This talk describes alternative solutions in SQL, including:
- Adjacency List
- Path Enumeration
- Nested Sets
- Closure Table
Code examples will show using these designs in PHP, and offer guidelines for choosing one design over another.
A Multi-Armed Bandit Framework For Recommendations at NetflixJaya Kawale
In this talk, we present a general multi-armed bandit framework for recommendations on the Netflix homepage. We present two example case studies using MABs at Netflix - a) Artwork Personalization to recommend personalized visuals for each of our members for the different titles and b) Billboard recommendation to recommend the right title to be watched on the Billboard.
Applying NLP and Machine Learning to Keyword AnalysisDan Segal
From a presentation at Text Analytics Forum, Washington, DC, Nov. 7, 2018. Keyword research allows companies to learn the voice of their customers and tune their marketing messages for them. One of the challenges in keyword research is to find collections of keywords that are topically relevant and in demand and therefore likely to draw search traffic and customer engagement. Data sources such as search logs and search engine result pages provide valuable sources of keywords, as well as insight into audience-specific language. Additionally, cognitive technologies such as natural language processing and machine learning provide capabilities for mining those sources at scale. With a few tools and some minimal coding, an analyst can generate clusters of best-bet keywords that are not only syntactically similar but semantically related. This how-to talk presents some practical techniques for automated analysis of keyword source data using off-the-shelf APIs.
Analyse de logs SEO : pour qui, pour quoi, comment ?Julien Deneuville
Conférence du 20 février 2016 au SEO Campus de Nantes.
En quoi consiste l'analyse de logs pour le SEO ?
A qui cela s'adresse-t-il ?
Comment s'y prendre ?
Quelles analyses effectuer, et comment réagir ?
Plus d'infos : http://blog.1-clic.info/referencement/analyse-de-logs-seocampus-nantes/
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...Qi Guo
*** Please check out our LinkedIn Engineering blog post: https://engineering.linkedin.com/blog/2019/04/ai-behind-linkedin-recruiter-search-and-recommendation-systems ***
LinkedIn Talent Solutions business contributes to around 65% of LinkedIn’s annual revenue, and provides tools for job providers to reach out to potential candidates and for job seekers to find suitable career opportunities. LinkedIn’s job ecosystem has been designed as a platform to connect job providers and job seekers, and to serve as a marketplace for efficient matching between potential candidates and job openings. A key mechanism to help achieve these goals is the LinkedIn Recruiter product, which enables recruiters to search for relevant candidates and obtain candidate recommendations for their job postings.
We highlight a few unique information retrieval, system, and modeling challenges associated with talent search and recommendation systems.
In this talk, we will present how we formulated and addressed the problems, the overall system design and architecture, the challenges encountered in practice, and the lessons learned from the production deployment of these systems at LinkedIn. By presenting our experiences of applying techniques at the intersection of recommender systems, information retrieval, machine learning, and statistical modeling in a large-scale industrial setting and highlighting the open problems, we hope to stimulate further research and collaborations within the SIGIR community.
Vemos cómo podemos crear clusters de URLs con el mismo significado para definir qué URLs deben redirigir a qué URLs y evitar contenido duplicado.
Ponencia SEO presentada por Lino Uruñuela en el Seonthebeach 2022 #SOB22 #SEO #SEOtecnico #seoavanzaado
OSA Con 2022 - Using ClickHouse Database to Power Analytics and Customer Enga...Altinity Ltd
OSA Con 2022: Using ClickHouse Database to Power Analytics and Customer Engagement Platform
Prafulla Gupta - Times Internet
This talk covers how we empowered Product Managers and Editors at Times Internet by developing an in-house product, GrowthRx, using Clickhouse Open Source Database to track and analyze user behavior to increase user retention and customer engagement. Times Internet is India's largest digital news publisher, which manages leading brands like Times of India, Economic Times, Navbharat Times, etc, where we are tracking more than 10 billion events per month in the ClickHouse Database.
Tree-like data relationships are common, but working with trees in SQL usually requires awkward recursive queries. This talk describes alternative solutions in SQL, including:
- Adjacency List
- Path Enumeration
- Nested Sets
- Closure Table
Code examples will show using these designs in PHP, and offer guidelines for choosing one design over another.
A Multi-Armed Bandit Framework For Recommendations at NetflixJaya Kawale
In this talk, we present a general multi-armed bandit framework for recommendations on the Netflix homepage. We present two example case studies using MABs at Netflix - a) Artwork Personalization to recommend personalized visuals for each of our members for the different titles and b) Billboard recommendation to recommend the right title to be watched on the Billboard.
Counterfactual Learning for RecommendationOlivier Jeunen
Slides for our presentation at the REVEAL workshop for RecSys '19 in Copenhagen and a Data Science Leuven Meetup, titled "Counterfactual Learning for Recommendation".
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Sease
RRE is an open-source search quality evaluation tool that can be used to produce a set of reports about the quality of a system, iteration after iteration, and that can be integrated within a continuous integration infrastructure to monitor quality metrics after each release.
Many aspects remained problematic though:
– how to directly evaluate a middle layer search-API that communicates with Apache Solr or Elasticsearch?
– how to easily generate explicit and implicit ratings without spending hours on tedious json files?
– how to better explore the evaluation results? with nice widgets and interesting insights?
Rated Ranking Evaluator Enterprise solves these problems and much more.
Join us as we introduce the next generation of open-source search quality evaluation tools, exploring the internals and real-world scenarios!
Learning to Rank Presentation (v2) at LexisNexis Search GuildSujit Pal
An introduction to Learning to Rank, with case studies using RankLib with and without plugins provided by Solr and Elasticsearch. RankLib is a library of learning to rank algorithms, which includes some popular LTR algorithms such as LambdaMART, RankBoost, RankNet, etc.
Deep neural methods have recently demonstrated significant performance improvements in several IR tasks. In this lecture, we will present a brief overview of deep models for ranking and retrieval.
This is a follow-up lecture to "Neural Learning to Rank" (https://www.slideshare.net/BhaskarMitra3/neural-learning-to-rank-231759858)
Deep Learning for Semantic Search in E-commerceSomnath Banerjee
Learn how deep learning is used in incorporating semantic understanding to solve the complex and challenging problem of e-commerce search. Get informed about the deep learning-based query understanding, image understanding and embedding generation systems developed at Walmart Labs. Gain insights on several practical aspects of building and deploying DL models on production to serve large scale live traffic.
Video SEO In Google & YouTube Search: Making the most out of it #SMXWestAleyda Solís
How can you optimize your videos for YouTube and Google search results? Take a look at the top criteria, tools and steps to maximize your search visibility and results!
Whilst passage indexing may seem like a small tweak to search ranking, it is potentially much more symptomatic of the beginning of a fundamental shift in the way that search engines understand unstructured content, determine relevance in natural language, and rank efficiently and effectively.
It could also be a means of assessing overall quality of content and a means of dynamic index pruning. We will look at the landscape, and also provide some takeaways for brands and business owners looking to improve quality in unstructured content overall in this fast changing landscape.
Little Big Data #1 다양한 사람들의 데이터 사이언스 이야기에서 발표한 자료입니다
궁금한 것은 언제나 문의주세요 :)
행사 후기는 https://zzsza.github.io/etc/2018/04/21/little-big-data/ 에 있습니다!
(2018.5 내용 추가) 현재 회사가 없으니, 제게 관심있으신 분들도 연락 환영합니다 :)
An overview of the quantitative and qualitative data provided by live chat, and how to measure the sales, marketing, and customer support ROI of a chat widget.
Counterfactual Learning for RecommendationOlivier Jeunen
Slides for our presentation at the REVEAL workshop for RecSys '19 in Copenhagen and a Data Science Leuven Meetup, titled "Counterfactual Learning for Recommendation".
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Sease
RRE is an open-source search quality evaluation tool that can be used to produce a set of reports about the quality of a system, iteration after iteration, and that can be integrated within a continuous integration infrastructure to monitor quality metrics after each release.
Many aspects remained problematic though:
– how to directly evaluate a middle layer search-API that communicates with Apache Solr or Elasticsearch?
– how to easily generate explicit and implicit ratings without spending hours on tedious json files?
– how to better explore the evaluation results? with nice widgets and interesting insights?
Rated Ranking Evaluator Enterprise solves these problems and much more.
Join us as we introduce the next generation of open-source search quality evaluation tools, exploring the internals and real-world scenarios!
Learning to Rank Presentation (v2) at LexisNexis Search GuildSujit Pal
An introduction to Learning to Rank, with case studies using RankLib with and without plugins provided by Solr and Elasticsearch. RankLib is a library of learning to rank algorithms, which includes some popular LTR algorithms such as LambdaMART, RankBoost, RankNet, etc.
Deep neural methods have recently demonstrated significant performance improvements in several IR tasks. In this lecture, we will present a brief overview of deep models for ranking and retrieval.
This is a follow-up lecture to "Neural Learning to Rank" (https://www.slideshare.net/BhaskarMitra3/neural-learning-to-rank-231759858)
Deep Learning for Semantic Search in E-commerceSomnath Banerjee
Learn how deep learning is used in incorporating semantic understanding to solve the complex and challenging problem of e-commerce search. Get informed about the deep learning-based query understanding, image understanding and embedding generation systems developed at Walmart Labs. Gain insights on several practical aspects of building and deploying DL models on production to serve large scale live traffic.
Video SEO In Google & YouTube Search: Making the most out of it #SMXWestAleyda Solís
How can you optimize your videos for YouTube and Google search results? Take a look at the top criteria, tools and steps to maximize your search visibility and results!
Whilst passage indexing may seem like a small tweak to search ranking, it is potentially much more symptomatic of the beginning of a fundamental shift in the way that search engines understand unstructured content, determine relevance in natural language, and rank efficiently and effectively.
It could also be a means of assessing overall quality of content and a means of dynamic index pruning. We will look at the landscape, and also provide some takeaways for brands and business owners looking to improve quality in unstructured content overall in this fast changing landscape.
Little Big Data #1 다양한 사람들의 데이터 사이언스 이야기에서 발표한 자료입니다
궁금한 것은 언제나 문의주세요 :)
행사 후기는 https://zzsza.github.io/etc/2018/04/21/little-big-data/ 에 있습니다!
(2018.5 내용 추가) 현재 회사가 없으니, 제게 관심있으신 분들도 연락 환영합니다 :)
An overview of the quantitative and qualitative data provided by live chat, and how to measure the sales, marketing, and customer support ROI of a chat widget.
The key to using the Internet as a business tool is to reduce that frustration and connect with customers in the easiest, most direct way possible. SEO (search engine optimization) is, perhaps, the fastest growing marketing tool available today and it works by putting your website at the top of search results page on Google and Bing when customers are searching for terms relevant to your business.
The Internet can work wonders: If you are looking for guidance on a home improvement project, there is a YouTube video for every conceivable “how to.” Need a last-minute recipe or want to find a unique gift item? Presto, with a few clicks results appear on your screen. Of course, it can also be frustrating. The key to using the Internet as a business tool is to reduce that frustration and connect with customers in the easiest, most direct way possible. In other words, don’t let potential leads get lost on their way to your website! (http://bit.ly/2ghsK8H)
Fight Back Against Back: How Search Engines & Social Networks' AI Impacts Mar...Rand Fishkin
Rand's presentation on machine learning and deep learning in Google, Facebook, and beyond, and how engagement reputation will become key to every online marketing effort.
Creative Career Hacking 2015: The not-so-well-known ways to find and apply fo...Red Bamboo Marketing
Applying for jobs can be a real drag. You apply to hundreds of jobs on the standard job boards, only to get silence in return. Its demoralizing, time consuming, and downright overdone.
In this new presentation, volunteer career coach and speaker Stephen Murphy shows you a different way to approach the job search process. Learn how to use a variety of digital tools, websites and forums to change the status quo and take a more creative approach to finding your next job online.
Analytics is more than "slap on the google analytics tag and we're done". Any good Digital project starts out with a good set of Goals & Objectives...but when was the last time that you measured the result of those goals & objectives? Lean Analytics is about integrating the analytics in the whole process...from the start. In a LEAN way
Is the conflicting advice you are getting on SEO keeping you from success? Attending this session will help you get straight answers on the SEO issues keeping you from the top of the search engines.
The art and science of improving a website's content, usability, keyword relevancy, and HTML to appear more prominently in search engine results is SEO.
Search is the Tip of the Spear for Your B2B eCommerce StrategyLucidworks
With ecommerce experiencing explosive growth, it seems intuitive that the B2B segment of that ecosystem is mirroring the same trajectory. That said, B2B has very different needs when it comes to transacting with the same style of experiences that we see in B2C. For instance, B2B ecommerce is about precision findability, whereas B2C customers can convert at higher rates when they’re just browsing online. In order for the B2B buying experience to be successful, search needs to be tuned to meet the unique needs of the segment.
In this webinar with Forrester senior analyst Joe Cicman, you’ll learn:
-Which verticals in B2B will drive the most growth, and how machine-learning powered personalization tactics can be deployed to support those specific verticals
-Why an omnichannel selling approach must be deployed in order to see success in B2B
-How deploying content search capabilities will support a longer sales cycle at scale
-What the next steps are to support a robust B2B commerce strategy supported by new technology
Speakers
Joe Cicman, Senior Analyst, Forrester
Jenny Gomez, VP of Marketing, Lucidworks
Customer loyalty starts with quickly responding to your customer’s needs. When it comes to resolving open support cases, time is of the essence. Time spent searching for answers adds up and creates inefficiencies in resolving cases at scale. Relevant answers need to be a few clicks away and easily accessible for agents directly from their service console.
We will explore how Lucidworks’ Agent Insights application automatically connects agents with the correct answers and resources. You’ll learn how to:
-Configure a proactive widget in an agent’s case view page to access resources across third-party systems (such as Sharepoint, Confluence, JIRA, Zendesk, and ServiceNow).
-Easily set up query pipelines to autonomously route assets and resources that are relevant to the case-at-hand—directly to the right agent.
-Identify subject matter experts within your support data and access tribal knowledge with lightning-fast speed.
How Crate & Barrel Connects Shoppers with Relevant ProductsLucidworks
Lunch and Learn during Retail TouchPoints #RIC21 virtual event.
***
Crate & Barrel’s previous search solution couldn’t provide its shoppers with an online search and browse experience consistent with the customer-centric Crate & Barrel brand. Meanwhile, Crate & Barrel merchandisers spent the bulk of their time manually creating and maintaining search rules. The search experience impacted customer retention, loyalty, and revenue growth.
Join this lunch & learn for an interactive chat on how Crate & Barrel partnered with Lucidworks to:
-Improve search and browse by modernizing the technology stack with ML-based personalization and merchandising solutions
-Enhance the experience for both shoppers and merchandisers
-Explore signals to transform the omnichannel shopping experience
Questions? Visit https://lucidworks.com/contact/
Learn how to guide customers to relevant products using eCommerce search, hyper-personalisation, and recommendations in our ‘Best-In-Class Retail Product Discovery’ webinar.
Nowadays, shoppers want their online experience to be engaging, inspirational and fulfilling. They want to find what they’re looking for quickly and easily. If the sought after item isn’t available, they want the next best product or content surfaced to them. They want a website to understand their goals as though they were talking to a sales assistant in person, in-store.
In this webinar, we explore IMRG industry data insights and a best-in-class example of retail product discovery. You’ll learn:
- How AI can drive increased revenue through hyper-personalised experiences
- How user intent can be easily understood and results displayed immediately
- How merchandisers can be empowered to curate results and product placement – all without having to rely on IT.
Presented by:
Dave Hawkins, Principal Sales Engineer - Lucidworks
Matthew Walsh, Director of Data & Retail - IMRG
Connected Experiences Are Personalized ExperiencesLucidworks
Many companies claim personalization and omnichannel capabilities are top priorities. Few are able to deliver on those experiences.
For a recent Lucidworks-commissioned study, Forrester Consulting surveyed 350+ global business decision-makers to see what gets in the way of achieving these goals. They discovered that inefficient technology, lack of behavioral insights, and failure to tie initiatives to enterprise-wide goals are some of the most frequent blockers to personalization success.
Join guest speaker, Forrester VP and Principal Analyst, Brendan Witcher, and Lucidworks CEO, Will Hayes, to hear the results of the Forrester Consulting study, how to avoid “digital blindness,” and how to apply VoC data in real-time to delight customers with personalized experiences connected across every touchpoint.
In this webinar, you’ll learn:
- Why companies who utilize real-time customer signals report more effective personalization
- How to connect employees and customers in a shared experience through search and browse
- How Lucidworks clients Lenovo, Morgan Stanley and Red Hat fast-tracked improvements in conversion, engagement and customer satisfaction
Featuring
- Will Hayes, CEO, Lucidworks
- Brendan Witcher, VP, Principal Analyst, Forrester
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...Lucidworks
Intelligent Policing. Leveraging Data to more effectively Serve Communities.
Policing in the next decade is anticipated to be very different from historical methods. More data driven, more focused on the intricacies of communities they serve and more open and collaborative to make informed recommendations a reality. Whether its social populations, NIBRS or organization improvement that’s the driver, the IT requirement is largely the same. Provide 360 access to large volumes of siloed data to gain a full 360 understanding of existing connections and patterns for improved insight and recommendation.
Join us for a round table discussion of how the Toronto Police Service is better serving their community through deploying a unified intelligent data platform.
Data innovation improves officers' engagement with existing data and streamlines investigation workflows by enhancing collaboration. This improved visibility into existing police data allows for a more intelligent and responsive police force.
In this webinar, we'll cover:
-The technology needs of an intelligent police force.
-How a Global Search improves an officer's interaction with existing data.
Featuring:
-Simon Taylor, VP, Worldwide Channels & Alliances, Lucidworks
-Michael Cizmar, Managing Director, MC+A
-Ian Williams, Manager of Analytics & Innovation, Toronto Police Service
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...Lucidworks
Policing in the next decade is anticipated to be very different from historical methods. More data driven, more focused on the intricacies of communities they serve and more open and collaborative to make informed recommendations a reality. Whether its social populations, NIBRS or organization improvement that’s the driver, the IT requirement is largely the same. Provide 360 access to large volumes of siloed data to gain a full 360 understanding of existing connections and patterns for improved insight and recommendation.
Join us for a round table discussion of how the Toronto Police Service is better serving their community through deploying a unified intelligent data platform.
Data innovation improves officers' engagement with existing data and streamlines investigation workflows by enhancing collaboration. This improved visibility into existing police data allows for a more intelligent and responsive police force.
In this webinar, we'll cover:
The technology needs of an intelligent police force.
How a Global Search improves an officer's interaction with existing data.
Featuring
-Simon Taylor, VP, Worldwide Channels & Alliances, Lucidworks
-Michael Cizmar, Managing Director, MC+A
-Ian Williams, Manager of Analytics & Innovation, Toronto Police Service
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...Lucidworks
Wish your conversion rates were higher? Can’t figure out how to efficiently and effectively serve all the visitors on your site? Embarrassed by the quality of your product discovery experience? The bar is high and the influx of online shopping over recent months has reminded us that the opportunities are real. We’re all deep in holiday prep, but let’s take a few minutes to think about January 2021 and beyond. How can we position ourselves for success with our customers and against our competition?
Grab your lunch and let’s dive into three strategies that need to be part of your 2021 roadmap. You don’t need an army to get there. But you do need to take action and capitalize on the shoppers abandoning the product discovery journey on your site.
In this session, attendees will find out how to:
-Take control of merchandising at scale;
-Implement hands-free search relevancy; and
-Address personalization challenges.
AI-Powered Linguistics and Search with Fusion and RosetteLucidworks
For a personalized search experience, search curation requires robust text interpretation, data enrichment, relevancy tuning and recommendations. In order to achieve this, language and entity identification are crucial.
For teams working on search applications, advanced language packages allow them to achieve greater recall without sacrificing precision.
Join us for a guided tour of our new Advanced Linguistics packages, available in Fusion, thanks to the technology partnership between Lucidworks and Basistech.
We’ll explore the application of language identification and entity extraction in the context of search, along with practical examples of personalizing search and enhancing entity extraction.
In this webinar, we’ll cover:
-How Fusion uses the Rosette Basic Linguistics and Entity Extraction packages
-Tips for improving language identification and treatment as well as data enrichment for personalization
-Speech2 demo modeling Active Recommendation
-Use Rosette’s packages with Fusion Pipelines to build custom entities for specific domain use cases
Featuring:
-Radu Miclaus, Director of Product, AI and Cloud, Lucidworks, Lucidworks
-Robert Lucarini, Senior Software Engineer, Lucidworks
-Nick Belanger, Solutions Engineer, Basis Technology
The Service Industry After COVID-19: The Soul of Service in a Virtual MomentLucidworks
Before COVID-19, almost 80% of the US workforce worked service in jobs that involve in-person interaction with strangers. Now, leaders of service organizations must reshape their offerings during the pandemic and prepare for whatever the new normal turns out to be. Our three panelists will share ideas for adapting their service businesses, now that closer-than-six-feet isn’t an option.
Join Lucidworks as we talk shop with 3 service business leaders, covering:
-Common impacts of the pandemic on service businesses (and what to do about them),
-How service teams can maintain a human touch across virtual channels, and
-Plans for the future, before and after the pandemic subsides.
Featuring
-Sara Nathan, President & CEO, AMIGOS
-Anthony Carruesco, Founder, AC Fly Fishing
-sara bradley, chef and proprietor, freight house
-Justin Sears, VP Product Marketing, Lucidworks
Webinar: Smart answers for employee and customer support after covid 19 - EuropeLucidworks
The COVID-19 pandemic has forced companies to support far more customers and employees through digital channels than ever before. Many are turning to chatbots to help meet increasing demand, but traditional rules-based approaches can’t keep up. Our new Smart Answers add-on to Lucidworks Fusion makes existing chatbots and virtual assistants more intelligent and more valuable to the people you serve.
Smart Answers for Employee and Customer Support After COVID-19Lucidworks
Watch our on-demand webinar showcasing Smart Answers on Lucidworks Fusion. This technology makes existing chatbots and virtual assistants more intelligent and more valuable to the people you serve.
In this webinar, we’ll cover off:
-How search and deep learning extend conversational frameworks for improved experiences
-How Smart Answers improves customer care, call deflection, and employee self-service
-A live demo of Smart Answers for multi-channel self-service support
Applying AI & Search in Europe - featuring 451 ResearchLucidworks
In the current climate, it’s now more important than ever to digitally enable your workforce and customers.
Hear from Simon Taylor, VP Global Partners & Alliances, Lucidworks and Matt Aslett, Research Vice President, 451 Research to get the inside scoop on how industry leaders in Europe are developing and executing their digital transformation strategies.
In this webinar, we’ll discuss:
The top challenges and aspirations European business and technology leaders are solving using AI and search technology
Which search and AI use cases are making the biggest impact in industries such as finance, healthcare, retail and energy in Europe
What technology buyers should look for when evaluating AI and search solutions
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce StrategyLucidworks
In this webinar with 451 Research, you'll understand how retailers are using AI to predict customer intent and learn which key performance metrics are used by more than 120 online retailers in Lucidworks’ 2019 Retail Benchmark Survey.
In this webinar, you’ll learn:
● What trends and opportunities are facing the ecommerce industry in 2020
● Why search is the universal path to understanding customer intent
● How large online retailers apply AI to maximize the effectiveness of their personalization efforts
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...Lucidworks
Nordstrom Rack | Hautelook curates and serves customers a wide selection of on-trend apparel, accessories, and shoes at an everyday savings of up to 75 percent off regular prices. With over a million visitors shopping across different platforms every day, and a realization that customers have become accustomed to robust and personalized search interactions, Nordstrom Rack | Hautelook launched an initiative over a year ago to provide data science-driven digital experiences to their customers.
In this session, we’ll discuss Nordstrom Rack | Hautelook’s journey of operationalizing a hefty strategy, optimizing a fickle infrastructure, and rallying troops around a single vision of building an expansible machine-learning driven product discovery engine.
The audience will learn about:
-The key technical challenges and outcomes that come with onboarding a solution
-The lessons learned of creating and executing operational design
-The use of Lucidworks Fusion to plug custom data science models into search and browse applications to understand user intent and deliver personalized experiences
Apply Knowledge Graphs and Search for Real-World Decision IntelligenceLucidworks
Knowledge graphs and machine learning are on the rise as enterprises hunt for more effective ways to connect the dots between the data and the business world. With newer technologies, the digital workplace can dramatically improve employee engagement, data-driven decisions, and actions that serve tangible business objectives.
In this webinar, you will learn
-- Introduction to knowledge graphs and where they fit in the ML landscape
-- How breakthroughs in search affect your business
-- The key features to consider when choosing a data discovery platform
-- Best practices for adopting AI-powered search, with real-world examples
Presenter: Justin Sears, Lucidworks and Shalin Mangar
Description: Apache Solr is one of the most trusted and performant search engines. Many of the world’s largest companies use Solr to power their e-commerce and workplace web applications. But obtaining the required expertise, managing and scaling infrastructure, administering security, and performing upgrades for an enterprise-grade Solr environment can be difficult and expensive — preventing infrastructure leaders from delivering Solr’s time-tested advantages to their organizations.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Conversion Models: A Systematic Method of Building Learning to Rank Training Data - Doug Turnbull, OpenSource Connections
1. Conversion Models
ABSOLUTELY AMAZING learning to rank training data ?
Activate 2019
Discount Code ctwact19 for 40% off!
Doug Turnbull, http://o19s.com
WE'RE HIRING!
2. Relevance Cornucopia🦃 Training Event:
http://o19s.com/blog/2019/09/11/announcing-relevance-cornucopia/
(Early Bird (gobble gobble) till end of Sept)
● Week of Nov 10
● "Think Like a Relevance Engineer" for Solr or Elasticsearch
● "Learning to Rank" & "Natural Language Search" training
● Delivered by our crack team of expert relevance consultants!
3. What I'm currently up to...
THEY'RE
HIRING!
(see Dennis Chaney's talk)
https://www.lexisnexis.com/en-us/about-us/careers.page
4. Outline
1. What holds orgs back from AI-Powered Search?
2. Click Models help?
3. Click Models for The Rest of Us
5. What holds orgs
back from this?
http://aipoweredsearch.com
Discount Code: ctwact19
6. How most 'Machine Learning Search' Projects
Fail
Our
Jerk-face
AI Search
Garbage
Training Data In
Garbage Results Out
7. This difficulty is a major theme in our
community
From User Actions to Better Rankings Agnes Van Belle, Haystack EU 2018; Learning Learning To Rank Torsten Köster & Fabian Klenk & René Kriegler,
Haystack EU 2018; Learning to rank (LTR) in an Activity Marketplace Ashraf Aaref & Felipe Besson - MICES 2018
Through 4 iterations of LtR
"Consistent theme of being hindered
by judgment quality"
V1 LTR model failed. We need to "Redefine our criteria for
measuring relevance" and "Judge the judgements very often"
(entire talk about this problem)
9. First: what is the training data?
grade,keywords,docId
4,Rambo,7555 # Rambo
3,Rambo,1370 # Rambo III
0,Rambo,102947 # First Daughter
4,Rocky,1366 # Rocky
...
Doc 7555 is
perfectly relevant
for query "Rambo"
Doc 102947 very
irrelevant for
"Rambo"
Judgment List:
10. Measuring how good is search...
grade,keywords,docId
4,Rambo,7555 # Rambo
3,Rambo,1370 # Rambo III
0,Rambo,102947 # First Daughter
4,Rocky,1366 # Rocky
...
Our
Search
Solution
Keywords NDCG@5 ERR@5
Rambo 0.95 0.56
Rocky 0.58 0.21
Offline testing: How is our tuning going?
Rambo: going pretty good!
Rocky: not so great… let's focus here
11. … and for training Learning to Rank
grade,keywords,docId
4,Rambo,7555 # Rambo
3,Rambo,1370 # Rambo III
0,Rambo,102947 # First Daughter
4,Rocky,1366 # Rocky
...
Our
LtR
Model
Keywords NDCG@5
Rambo 0.95
Rocky 0.58
Train
modelJudgments are training data...
Analyze
Results
Elite
Search
Team
12. Of course there's manual judgments
http://github.com/o19s/quepid
For a good talk on a robust human judgment program, see Tito Sierra and Tara
Diedrichson's Haystack Talk "Making the Case for Human Judgment Relevance
Testing" https://haystackconf.com/2019/human-judgement/
(Usually not enough data for LtR training
data)
13. For LtR: use implicit data from user behavior
Less
'Opinion'?
14. How to do this - maybe something like this!?
if purchased=True:
grade = 4
if clicked + dwell for 5 secs:
grade = 3
if click:
grade = 2
if shown, but not clicked:
grade = 1
Clickstream
grade,keywords,docId
4,Rambo,7555 # Rambo
3,Rambo,1370 # Rambo III
0,Rambo,102947 # First Daughter
4,Rocky,1366 # Rocky
...
Is this a good approach?
Thoughts?
15. Self reinforcing bad search
Search
Engine
'Santa Claus Conquers
Martians' most relevant!
Users only interact with what
the search engine shows them
ML reinforces search's current
(bad?) behavior
Position bias: 'Santa Claus…' clicked more as its in posn 1
Presentation bias: where is "The Martian"?
q=stuck on mars
16. Domain-specific considerations
Lack of a clear 'Conversion' - what if this is just IMDB getting info on the
movie?; what if users just want to research an expensive purchase first?
What are YOUR user's goals? Shopping vs research vs known-item search vs
passive browsing vs … all have different fingerprints
UI layout? How does a grid vs a list influence user's click behaviors? What
about a chat-bot system or Alexa-style question answering!??
'Good Abandonments' - what if your snippets answer the user's question
without them clicking on a thing!
17. How you get judgments is a model too!
Your
Intuition
<your assumptions go
here>
Clickstream
grade,keywords,docId
4,Rambo,7555 # Rambo
3,Rambo,1370 # Rambo III
0,Rambo,102947 # First Daughter
4,Rocky,1366 # Rocky
...
18. This means when you hear...
"I think that clicking
and spending > 5
seconds on the page
indicates relevant
document!"
"I think that we should
oversample clicks
farther down the page
to compensate for
position bias""Carefully inspecting
the product is an
indication of relevance"
19. NDCG - but based on what judgment methodology?
"We improved
NDCG 20% through
X ML search technique!"
Overconfident
search consultant
20. We need to study these models too
Hard-Coded
Ranking 2
Hard-Coded
Ranking 1 Clickstream
Judgment
Aggregation
Solution 1
Show users hard-coded
corresponding to judgment list
Judgment
Aggregation
Solution 2
A
B
- A/B Test the Judgment system
- Consensus with other judgment
systems (ie manual)
- Continue to evolve & improve
21. This is why this is so hard
- Search behaviors / UIs constantly
evolving
- Your domain & products
considerations dominate
- SERP UIs have biases
24. What is a click model
CLICKS
q=waffle maker
So hot right now
Really really really
ridiculously good
looking
What is this? A search
result for q=ANTS?
Click Models for Web Search by Chuklin, Markov, de Rijke
https://www.morganclaypool.com/doi/abs/10.2200/S00654ED1V01Y201507ICR043
25. Attractiveness vs Satisfaction
Attractiveness
~Perceived Relevance
Denoted 'A'
The snippet *looked*
useful/interesting for
what I need - tied to
clicks
All click models
provide A
≠
Satisfaction
~Actual Relevance
Denoted 'S'
The document satisfied
my information need
Some click models
attempt S
27. A=0.45 / 0.50
= 0.9
A = 0.25 / 0.20
= 1.25
A = 0.15 / 0.16
= 0.9375
CTR/Avg Posn CTR:
The World's Second Simplest Click Model
So Hot
Right
Now
(aka COEC - clicks over expected clicks)
Personalized Click Prediction in Sponsored Search, Chang, Cantu-Paz
http://www.wsdm-conference.org/2010/proceedings/docs/p351.pdf
Avg CTR for posn 1
over all queries
This Query's
CTR for posn 1
28. Probabilistic Models ~ e.g. Position Based Model
C
d
Ed
Ad
Ad User found doc d attractive
Ed User Examined document d
αdq
γr
αdq Attractiveness for doc d, query q
γr
Examine probability for rank r
across all queries
C
d
Document d
clicked
Observed:
Rank examine
prob
Doc attractiveness
for Query
P(Cd) = P(Ed) * P(Ad)
~ γr * αdq
29. PBM ~ Two Unknowns, One Equation
P(Cd) ~ γr * αdq
Find best
examine for
observed clicks
Find best
attractiveness for
doc/query pair It's definitely examined P(Ed)=1 if it's clicked!
It's definitely attractive P(Ad)=1 if it's clicked!
Unlikely something was examined if users never click on
that position (or is the document unattractive)?
Unlikely something is attractive, if users seem to examine
that position (see posn clicked a lot) but don't click this
particular document
Assumptions:
30. Assumptions -> TERRIFYING MAAAAAATH!!!
Iteratively improve attractiveness & examine probabilities over the search session until they
converge to most likely
Clicked 'assumptions'
Not Clicked, then probably not
attractive if this posn is
examined a lot (trust me 😊 )
For each session with
query/doc pair
(t - iteration)
32. Dynamic Bayesian Network
A Dynamic Bayesian Network Click Modelfor Web Search Ranking by Chapelle, Zhang
http://olivier.chapelle.cc/pub/DBN_www2009.pdf
Wikimedia Foundation's use of DBN:
https://blog.wikimedia.org/2017/10/17/elasticsearch-learning-to-rank-plugin/
Er
Cd
Ar
αdq
Sr
sdq
Er-1
Cd
Ar-1
αdq
Sr-1
sdq
We can compute 'attractiveness' and 'satisfaction' of doc for query
......
γ
You examine the next
result if you clicked but
were not satisfied, or at
probability γ if you were
satisfied
Simplified DBN: last
clicked result satisfied me
33. We are not building Web Search
● Low visibility just the SERP clicks, we
don't see what happens beyond...
● High volume simpler assumptions
help map just clicks to satisfaction
Web Search:
34. Most of us - 'Average Joes'
● More visibility clicks, conversions, and
more from the session after search!
● Lower volume may not be able to rely
on simpler assumptions for satisfaction
Most other search apps:
36. Click models for the rest of us
● Click Model CAN be used to overcome
SERP UI biases to derive
attractiveness for Average Joes
● What about satisfaction? Aka 'actual
relevance'
● Can we use our advantage to measure
that directly?
38. Most of us have some kind of 'post click' tracking
Conversions: Direct/explicit goal completed by user - like
"purchase"
Pseudo-conversions: "goals" not directly recognized by
user or clear in analytics - like "read article" or "add to cart"
Indications of interest: not quite "goals" but indications
user is happy - like "click plus dwell"
39. q=heart attack
0.7
'Shallow' events dense; 'deeper' events sparse
Attractiveness: click!
These clicks are fleeting to
users
Top of
funnel/path
Click+
Dwell
Click+
Dwell+
Scroll
Read
Reviews
Add to
Cart
Checkout
End of
funnel/path
Most people
should get here...
...a few will get all
the way through...
40. q=waffle maker
0.7
If user can't bother to do shallow event, attractiveness
discounted
Attractiveness:
User immediately hits back
button!
Time on page = 0.001s
Not actually relevant
41. q=waffle maker
0.7
If user moves deep into page, attractiveness confirmed
Attractiveness:
Add to Cart
Bought
Definitely relevant
42. q=heart attack
0.7
Discount attractiveness based on event not achieved
Attractiveness: click!
Click+
Dwell
Click+
Dwell+
Scroll
Read
Reviews
Add to
Cart
Checkout
Quit here?
Discount A: 0.01
Quit here?
Discount A: 0.95
43. Update over multiple sessions...
q=waffle maker
0.7
Attractiveness:
Bought
Session 1
Immediately
returned to
SERP
Session 2
Stayed on
page, read
reviews
Session 3
Further 'post query' evidence:
D=0.65 D=0.01 D=0.20
J = Discount * Attractiveness
Σ
num_sessions
J =0.7 x 0.65+0.01+0.2 = 0.29
3
44. User Value-Cost Model
What is the value of a page for the user
We can't really measure the value but we can indirectly the cost to the user in
time & money
...I can't be
bothered...
Click+
Dwell
Click+
Dwell+
Scroll
Read
Reviews… this was at least
worth some of my time
towards my goal...
Back immediately
Discount heavily
Discount
moderately
45.
46. Bayes justification to judgments
P (J | V) = P (V | J) * P(J)
P(V)
Prior, earlier belief in relevance given by
attractiveness as derived from click model
Probability of user getting value in the
context of it being deemed relevant
to this query
Probability of user getting value
regardless of query
Judgment in the
context of value
47. Bayes approach to judgments
J = avgPageValueForThisQuery * A
avgPageValue
48. When avg_page_value = 0.3
q=waffle maker
0.7
Attractiveness:
Bought
Session 1
Immediately
returned to
SERP
Session 2
Stayed on
page, read
reviews
Session 3
Further 'post query' evidence:
D=0.65
user_value=0.01 user_value=0.20
Discount * Attractiveness
Σ
num_sessions
J =0.7 x 0.65+0.01+0.2 = 0.95
3 / 0.3
avg_page_value
J =
49. Zhong, et. al. Incorporating Post-Click Behaviors into a Click Model
https://zhangyuc.github.io/files/zhang11kdd.pdf
Your Take home reading