Optimizing GenAI apps, by N. El Mawass and Maria Knorps

•

0 likes•75 views

Paris Women in Machine Learning and Data Science

Nour and Maria present the work they did at Tweag, Modus Create innovation arm, where the GenAI team developed an evaluation framework for Retrieval-Augmented Generation (RAG) systems. RAG systems provide an easy and low-cost way to extend the knowledge of Large Language Models (LLMs) but measuring their performance is not an easy task. The presentation will review existing evaluation frameworks, ranging from those based on the traditional ML approach of using groundtruth datasets, including Tweag's, to those that use LLMs to compute evaluation metrics. It will also delve into the practical implementation of Tweag's chatbot over two distinct documents datasets and provide insights on chunking, embedding and how open source and commercial LLMs compare.

Optimizing GenAI apps
Evaluating
Retrieval in RAGs
Nour El Mawass, Maria Knorps
7.Feb.2024

What are RAGs?
And why do we need them anyway?
3

4
Retrieval Augmented Generation
<your question here>
● LLMs have a learning cutoﬀ
● Fine-tuning is costly
● Adding relevant context to the
prompt is cheap and easy
● Find relevant context with
semantic search

Semantic search
● Vectorizing a documents base:
○ Chunking
○ Embedding/Vectorizing
○ Indexing
● Finding documents similar to a query:
○ Vectorize query
○ Find closest vectors
5

8
<your question here>
The GenAI team at Tweag has been working on
applying the Retrieval-Augmented Generation (RAG)
paradigm together with commercial and open source
LLMs to perform intelligent search and suggestion
over a collection of Conﬂuence and Bazel documents.
The LLM processing can be carried out within a virtual
private cloud domain (AWS in this case) so that no
information is shared with third parties.

Experimenting vs "eyeballing"
10
- No benchmark: No guarantee that
the introduced change did not
degrade performance on other
questions.
- No experiments tracking: Likely
none of the intermediate states was
committed or properly tracked.
- No evaluation metrics: We cannot
numerically compare the current RAG
state to any other possible state.
- No solution space: What alternatives
are we exploring?

Evaluation's golden quartet
12
Experiments tracking
Evaluation metrics Parameters space
Benchmark

Benchmark
● Benchmark over the documents database:
○ Questions
○ Pairs of (question, answers)
○ Pairs of (question, relevant_documents)
● Not easy: need representative and varied queries
13
Human-generated LLM-generated
● Can be automated with LLMs
○ generate questions over documents
○ reformulate questions

$Parameters space 14 "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, }$

Evaluation metrics
● Information Retrieval metrics (traditional ML)
○ Labeled dataset
○ Evaluate recall and precision at k
● LLM-based evaluation
○ Context relevance
■ ratio of relevant to total sentences in
the retrieved documents
○ Context recall
15
LLM-based RAG
metrics
Information
retrieval metrics

Experiments tracking
16
Data
(benchmark + vectors)
+
Experiment's
parameters
+
Version-controlled
code
Experiment
tracking
+ +
Parameters
- k
- embedding
- chunking
- …

Tweag’s evaluation framework
17
Experiments
tracking
Evaluation
metrics
Parameters
space
Benchmark

Key strategies: Chunking and Embedding
18
● Embedding models:
○ all-MiniLM-L12-v2
○ Multi-qa-mpnet-nase-dot-v1
○ All-mpnet-base-v2
○ SpacyEmbeddings
● Chunking models
○ RecursiveCharacterTextSplitter
○ SentenceTransformersTokenTextSplitter
● Benchmarks:
○ User questions
○ User questions reformulated with ChatGPT3.5

Takeaways
● You need to evaluate your system, no eyeballing!
● Many frameworks and tools: check our blog posts for an
introduction.
20
https://www.tweag.io/group/genai/

Conversational AI and Chatbots (or rather - and more extensively - Virtual Agents) offer great benefits, especially in combination with technologies like RPA or IDP. Corneliu Niculite (Presales Director - EMEA @DRUID AI) and Roman Tobler (CEO @Routinuum & UiPath MVP) are discussing Conversational AI and why Virtual Agents play a significant role in modern ways of working. Moreover, Corneliu will be displaying how to build a Workflow and showcase an Accounts Payable Use Case, integrating DRUID and UiPath Robots. 📙 Agenda: The focus of our meetup is around the following areas - with a lot of room to discuss and share experiences: - What is "Conversational AI" and why do we need Chatbots (Virtual Agents); - Deep-Dive to a DRUID-UiPath Integration via an Accounts Payable Use Case; - Discussion, Q&A Speakers: 👨🏻‍💻 Corneliu Niculite, Presales Director - EMEA DRUID AI 👨🏼‍💻 Roman Tobler, UiPath MVP, Co-Founder & CEO Routinuum GmbH This session streamed live on March 8, 2023, 16:00 PM CET. Check out our upcoming events at: community.uipath.com Contact us at: community@uipath.com

Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...

Mihai Criveti

Mihai is the Principal Architect for Platform Engineering and Technology Solutions at IBM, responsible for Cloud Native and AI Solutions. He is a Red Hat Certified Architect, CKA/CKS, a leader in the IBM Open Innovation community, and advocate for open source development. Mihai is driving the development of Retrieval Augmentation Generation platforms, and solutions for Generative AI at IBM that leverage WatsonX, Vector databases, LangChain, HuggingFace and open source AI models. Mihai will share lessons learned building Retrieval Augmented Generation, or “Chat with Documents” platforms and APIs that scale, and deploy on Kubernetes. His talk will cover use cases for Generative AI, limitations of Large Language Models, use of RAG, Vector Databases and Fine Tuning to overcome model limitations and build solutions that connect to your data and provide content grounding, limit hallucinations and form the basis of explainable AI. In terms of technology, he will cover LLAMA2, HuggingFace TGIS, SentenceTransformers embedding models using Python, LangChain, and Weaviate and ChromaDB vector databases. He’ll also share tips on writing code using LLM, including building an agent for Ansible and containers. Scaling factors for Large Language Model Architectures: • Vector Database: consider sharding and High Availability • Fine Tuning: collecting data to be used for fine tuning • Governance and Model Benchmarking: how are you testing your model performance over time, with different prompts, one-shot, and various parameters • Chain of Reasoning and Agents • Caching embeddings and responses • Personalization and Conversational Memory Database • Streaming Responses and optimizing performance. A fine tuned 13B model may perform better than a poor 70B one! • Calling 3rd party functions or APIs for reasoning or other type of data (ex: LLMs are terrible at reasoning and prediction, consider calling other models) • Fallback techniques: fallback to a different model, or default answers • API scaling techniques, rate limiting, etc. • Async, streaming and parallelization, multiprocessing, GPU acceleration (including embeddings), generating your API using OpenAPI, etc.

𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬

VINCI Digital - Industrial IoT (IIoT) Strategic Advisory

Best Practice on using Azure OpenAI Service

Kumton Suttiraksiri

Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)

Naoki (Neo) SATO

Generative AI: Past, Present, and Future – A Practitioner's Perspective

Huahai Yang

Generative AI: Past, Present, and Future – A Practitioner's Perspective As the academic realm grapples with the profound implications of generative AI and related applications like ChatGPT, I will present a grounded view from my experience as a practitioner. Starting with the origins of neural networks in the fields of logic, psychology, and computer science, I trace its history and align it within the wider context of the pursuit of artificial intelligence. This perspective will also draw parallels with historical developments in psychology. Against this backdrop, I chart a proposed trajectory for the future. Finally, I provide actionable insights for both academics and enterprising individuals in the field.

AI and ML Series - Introduction to Generative AI and LLMs - Session 1

DianaGray10

Session 1 👉This first session will cover an introduction to Generative AI & harnessing the power of large language models. The following topics will be discussed: Introduction to Generative AI & harnessing the power of large language models. What’s generative AI & what’s LLM. How are we using it in our document understanding & communication mining models? How to develop a trustworthy and unbiased AI model using LLM & GenAI. Personal Intelligent Assistant Speakers: 📌George Roth - AI Evangelist at UiPath 📌Sharon Palawandram - Senior Machine Learning Consultant @ Ashling Partners & UiPath MVP 📌Russel Alfeche - Technology Leader RPA @qBotica & UiPath MVP

Generative-AI-in-enterprise-20230615.pdf

Liming Zhu

Chat GPT 4 can pass the American state bar exam, but before you go expecting to see robot lawyers taking over the courtroom, hold your horses cowboys – we're not quite there yet. That being said, AI is becoming increasingly more human-like, and as a VC we need to start thinking about how this new wave of technology is going to affect the way we build and run businesses. What do we need to do differently? How can we make sure that our investment strategies are reflecting these changes? It's a brave new world out there, and we’ve got to keep the big picture in mind! Sharing here with you what we at Cavalry Ventures found out during our Generative AI deep dive.

Using the power of Generative AI at scale

Maxim Salnikov

In this session, you'll get all the answers about how ChatGPT and other GPT-X models can be applied to your current or future project. First, we'll put in order all the terms – OpenAI, GPT-3, ChatGPT, Codex, Dall-E, etc., and explain why Microsoft and Azure are often mentioned in this context. Then, we'll go through the main capabilities of the Azure OpenAI and respective usecases that might inspire you to either optimize your product or build a completely new one.

ChatGPT, Foundation Models and Web3.pptx

Jesus Rodriguez

The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!

taozen

H2O.ai's Driverless AI

Sri Ambati

Unlocking the Power of Generative AI An Executive's Guide.pdf

PremNaraindas1

Generative AI is here, and it can revolutionize your business. With its powerful capabilities, this technology can help companies create more efficient processes, unlock new insights from data, and drive innovation. But how do you make the most of these opportunities? This guide will provide you with the information and resources needed to understand the ins and outs of Generative AI, so you can make informed decisions and capitalize on the potential. It covers important topics such as strategies for leveraging large language models, optimizing MLOps processes, and best practices for building with Generative AI.

Understanding generative AI models A comprehensive overview.pdf

StephenAmell4

Generative AI Use cases for Enterprise - Second Session

Gene Leybzon

Leveraging Generative AI & Best practices

DianaGray10

Intro to LLMs

Loic Merckel

How do OpenAI GPT Models Work - Misconceptions and Tips for Developers

Ivo Andreev

Have you ever wondered why GPT models work? Do you ask questions like: ◉ How does GPT work? Why does the same problem receive different answers for different users? Is there a way to improve explainability? ◉ Can GPT model provide its sources? Why does Bing chat work differently? What are my ways to have better performance and improve completions? ◉ How can I work with data in my enterprise? What practical business cases could a generative AI model fit solving? If you are tired of sessions just scratching the surface of OpenAI GPT, this one will go deeper and answer questions like why, why not and how. Key Terms; ChatGPT Enterprise; Top Questions; Enterprise Data; Azure Search; Functions; Embeddings; Context Encoding; General Intelligence; Emerging Abilities; Chain of Thought; Plugins; Multimodal with DALL-E; Project Florence

AzureOpenAI.pptx

Udaiappa Ramachandran

Azure OpenAI Service provides REST API access to OpenAI's powerful language models, including the GPT-3, GPT-4, DALL-E, Codex, and Embeddings model series. These models can be easily adapted to any specific task, including but not limited to content generation, summarization, semantic search, translation, transformation, and code generation. Microsoft offers the accessibility of the service through REST APIs, Python or C# SDK, or the Azure OpenAI Studio.

Generative AI

Carlos J. Costa

ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...

DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions

GENERATIVE AI, THE FUTURE OF PRODUCTIVITY

Andre Muscat

Discuss the impact and opportunity of using Generative AI to support your development and creative teams * Explore business challenges in content creation * Cost-per-unit of different types of content * Use AI to reduce cost-per-unit * New partnerships being formed that will have a material impact on the way we search and engage with content Part 4 of a 9 Part Research Series named "What matters in AI" published on www.andremuscat.com

How Does Generative AI Actually Work? (a quick semi-technical introduction to...

ssuser4edc93

LLMs Bootcamp

Fiza987241

LanGCHAIN Framework

Keymate.AI

Langchain Framework is an innovative approach to linguistic data processing, combining the principles of language sciences, blockchain technology, and artificial intelligence. This deck introduces the groundbreaking elements of the framework, detailing how it enhances security, transparency, and decentralization in language data management. It discusses its applications in various fields, including machine learning, translation services, content creation, and more. The deck also highlights its key features, such as immutability, peer-to-peer networks, and linguistic asset ownership, that could revolutionize how we handle linguistic data in the digital age.

Microsoft AI Platform Overview

David Chou

The current state of generative AI

Benjaminlapid1

OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...

NETWAYS

At the intersection of search and AI, melding Large Language Models (LLMs) with OpenSearch opens transformative avenues. In this talk, we explore how LLMs can simplify the interaction between users and OpenSearch, converting natural language into OpenSearch queries. We will also leverage OpenSearch’s Vector Storage, enriching traditional term-based searches with semantic understanding. Dive into a future where search engines transcend being mere tools, becoming intuitive partners in knowledge discovery.

MongoDB .local London 2019: Fast Machine Learning Development with MongoDB

MongoDB

Today an increasingly large number of products use machine learning and AI to deliver a great personalized user experience, and workplace software is no exception. Spoke goes beyond traditional ticketing with their friendly, AI-powered chatbot that gives workplace teams hours of time back as it automatically responds to questions on Slack, email, SMS, and web. Learn how Spoke uses MongoDB to do dynamic model training in real time from user interaction data and serves thousands of models, with multiple customized models per client.

What's hot

Cavalry Ventures | Deep Dive: Generative AI

Cavalry Ventures

Using the power of Generative AI at scale

Maxim Salnikov

ChatGPT, Foundation Models and Web3.pptx

Jesus Rodriguez

The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!

taozen

H2O.ai's Driverless AI

Sri Ambati

Unlocking the Power of Generative AI An Executive's Guide.pdf

PremNaraindas1

Understanding generative AI models A comprehensive overview.pdf

StephenAmell4

Generative AI Use cases for Enterprise - Second Session

Gene Leybzon

Leveraging Generative AI & Best practices

DianaGray10

Intro to LLMs

Loic Merckel

How do OpenAI GPT Models Work - Misconceptions and Tips for Developers

Ivo Andreev

AzureOpenAI.pptx

Udaiappa Ramachandran

Generative AI

Carlos J. Costa

ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...

DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions

GENERATIVE AI, THE FUTURE OF PRODUCTIVITY

Andre Muscat

How Does Generative AI Actually Work? (a quick semi-technical introduction to...

ssuser4edc93

LLMs Bootcamp

Fiza987241

LanGCHAIN Framework

Keymate.AI

Microsoft AI Platform Overview

David Chou

The current state of generative AI

Benjaminlapid1

What's hot (20)

Cavalry Ventures | Deep Dive: Generative AI

Using the power of Generative AI at scale

ChatGPT, Foundation Models and Web3.pptx

The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!

H2O.ai's Driverless AI

Unlocking the Power of Generative AI An Executive's Guide.pdf

Understanding generative AI models A comprehensive overview.pdf

Generative AI Use cases for Enterprise - Second Session

Leveraging Generative AI & Best practices

Intro to LLMs

How do OpenAI GPT Models Work - Misconceptions and Tips for Developers

AzureOpenAI.pptx

Generative AI

ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...

GENERATIVE AI, THE FUTURE OF PRODUCTIVITY

How Does Generative AI Actually Work? (a quick semi-technical introduction to...

LLMs Bootcamp

LanGCHAIN Framework

Microsoft AI Platform Overview

The current state of generative AI

Similar to Optimizing GenAI apps, by N. El Mawass and Maria Knorps

OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...

NETWAYS

MongoDB .local London 2019: Fast Machine Learning Development with MongoDB

MongoDB

MongoDB .local London 2019: Fast Machine Learning Development with MongoDB

Lisa Roth, PMP

MongoDB World 2019: Fast Machine Learning Development with MongoDB

MongoDB

MongoDB World 2018: Tutorial - MongoDB & NodeJS: Zero to Hero in 80 Minutes

MongoDB

Elasticsearch & "PeopleSearch"

George Stathis

Big Query - Women Techmarkers (Ukraine - March 2014)

Ido Green

Webinar: Scaling MongoDB

MongoDB

Has your app taken off? Are you thinking about scaling? MongoDB makes it easy to horizontally scale out with built-in automatic sharding, but did you know that sharding isn't the only way to achieve scale with MongoDB? In this webinar, we'll review three different ways to achieve scale with MongoDB. We'll cover how you can optimize your application design and configure your storage to achieve scale, as well as the basics of horizontal scaling. You'll walk away with a thorough understanding of options to scale your MongoDB application. Topics covered include: - Scaling Vertically - Hardware Considerations - Index Optimization - Schema Design - Sharding

The need for sophistication in modern search engine implementations

Ben DeMott

Eagle6 mongo dc revisedMongoDB

Eagle6 Enterprise Situational Awareness

MongoDB

Eagle6 is a product that use system artifacts to create a replica model that represents a near real-time view of system architecture. Eagle6 was built to collect system data (log files, application source code, etc.) and to link system behaviors in such a way that the user is able to quickly identify risks associated with unknown or unwanted behavioral events that may result in unknown impacts to seemingly unrelated down-stream systems. This session is designed to present the capabilities of the Eagle6 modeling product and how we are using MongoDB to support near-real-time analysis of large disparate datasets.

How to Achieve Scale with MongoDB

MongoDB

Multiplatform Spark solution for Graph datasources by Javier Dominguez

Big Data Spain

[DSC Europe 23] Djordje Grozdic - Transforming Business Process Automation wi...

DataScienceConferenc1

In today's competitive business environment, automation of business processes, especially document processing workflows, has become critical for companies seeking to improve efficiency and reduce manual errors. Traditional methods often struggle to keep up with the volume and complexity of the tasks, while human-led processes are slow, error-prone, and may not always deliver consistent results. Large Language Models (LLMs) have made significant strides in handling complex tasks involving human-like text generation. However, they often face challenges with domain-specific data. Here's where Retrieval-Augmented Generation (RAG) steps in. RAG offers an exciting breakthrough, enabling the integration of domain-specific data in real-time without the need for constant model retraining or fine-tuning. It stands as a more affordable, secure, and explainable alternative to general-purpose LLMs, drastically reducing the likelihood of hallucination.

The Quest for an Open Source Data Science Platform

QAware GmbH

Cloud Native Night July 2019, Munich: Talk by Jörg Schad (@joerg_schad, Head of Engineering & ML at ArangoDB) === Please download slides if blurred! === Abstract: With the rapid and recent rise of data science, the Machine Learning Platforms being built are becoming more complex. For example, consider the various Kubeflow components: Distributed Training, Jupyter Notebooks, CI/CD, Hyperparameter Optimization, Feature store, and more. Each of these components is producing metadata: Different (versions) Datasets, different versions a of a jupyter notebooks, different training parameters, test/training accuracy, different features, model serving statistics, and many more. For production use it is critical to have a common view across all these metadata as we have to ask questions such as: Which jupyter notebook has been used to build Model xyz currently running in production? If there is new data for a given dataset, which models (currently serving in production) have to be updated? In this talk, we look at existing implementations, in particular MLMD as part of the TensorFlow ecosystem. Further, propose a first draft of a (MLMD compatible) universal Metadata API. We demo the first implementation of this API using ArangoDB.

Big data analysis in python @ PyCon.tw 2013

Jimmy Lai

When GenAI meets with Java with Quarkus and langchain4j

Jean-Francois James

stackconf 2022: Introduction to Vector Search with Weaviate

NETWAYS

In machine learning, e.g., recommendation tools or data classification, data is often represented as high-dimensional vectors. These vectors are stored in so-called vector databases. With vector databases you can efficiently run searching, ranking and recommendation algorithms. Therefore, vector databases became the backbone of ML deployments in industry. This session is all about vector databases. If you are a data scientist or data/software engineer this session would be interesting for you. You will learn how you can easily run your favourite ML models with the vector database Weaviate. You will get an overview of what a vector database like Weaviate can offer: such as semantic search, question answering, data classification, named entity recognition, multimodal search, and much more. After this session, you are able to load in your own data and query it with your preferred ML model! Session outline What is a vector database? You will learn the basic principles of vector databases. How data is stored, retrieved, and how that differs from other database types (SQL, knowledge graphs, etc). Performing your first semantic search with the vector database Weaviate. In this phase, you will learn how to set up a Weaviate vector database, how to make a data schema, how to load in data, and how to query data. You can follow along with examples, or you can use your own dataset. Advanced search with the vector database Weaviate. Finally, we will cover other functionalities of Weaviate: multi-modal search, data classification, connecting custom ML models, etc.

NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval...

Anant Corporation

Series: Using AI / ChatGPT at Work - GPT Automation Are you a small business owner or web developer interested in leveraging the power of GPT (Generative Pretrained Transformer) technology to enhance your business processes? If so, Join us for a series of events focused on using GPT in business. Whether you're a small business owner or a web developer, you'll learn how to leverage GPT to improve your workflow and provide better services to your customers. GPT Automation: What it is and How it Works How Time-Saving GPT Automation Can Improve Your Business Cost-Effective GPT Automation: How it Can Save Your Business Money Using GPT Automation for Customer Service: Benefits and Best Practices The Power of GPT Automation for Content Creation Data Analysis Made Easy with GPT Automation Top GPT-3 Automation Tools for Businesses The Ethical Considerations of GPT Automation Overcoming Bias in GPT Automation: Best Practices The Future of GPT Automation: Trends and Predictions Since we focus on "no code" here, we'll explore the tools that are already out there such as ChatGPT plugins for Chrome, OpenAI GPT API, low-code/no-code platforms like Make/Integromat and Zapier, existing apps like Jasper/Rytr, and ecosystem tools like Everyprompt. We'll also discuss the resources available for those interested in learning more about GPT, including other people’s prompts.

2019 StartIT - Boosting your performance with Blackfire

Marko Mitranić

A workshop held in StartIT as part of Catena Media learning sessions. We aim to dispel the notion that large PHP applications tend to be sluggish, resource-intensive and slow compared to what the likes of Python, Erlang or even Node can do. The issue is not with optimising PHP internals - it's the lack of proper introspection tools and getting them into our every day workflow that counts! In this workshop we will talk about our struggles with whipping PHP Applications into shape, as well as work together on some of the more interesting examples of CPU or IO drain.

Similar to Optimizing GenAI apps, by N. El Mawass and Maria Knorps (20)

OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...

MongoDB .local London 2019: Fast Machine Learning Development with MongoDB

MongoDB World 2019: Fast Machine Learning Development with MongoDB

MongoDB World 2018: Tutorial - MongoDB & NodeJS: Zero to Hero in 80 Minutes

Elasticsearch & "PeopleSearch"

Big Query - Women Techmarkers (Ukraine - March 2014)

Webinar: Scaling MongoDB

The need for sophistication in modern search engine implementations

Eagle6 mongo dc revised

Eagle6 Enterprise Situational Awareness

How to Achieve Scale with MongoDB

Multiplatform Spark solution for Graph datasources by Javier Dominguez

[DSC Europe 23] Djordje Grozdic - Transforming Business Process Automation wi...

The Quest for an Open Source Data Science Platform

Big data analysis in python @ PyCon.tw 2013

When GenAI meets with Java with Quarkus and langchain4j

stackconf 2022: Introduction to Vector Search with Weaviate

NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval...

2019 StartIT - Boosting your performance with Blackfire

More from Paris Women in Machine Learning and Data Science

Sequential and reinforcement learning for demand side management by Margaux B...

Paris Women in Machine Learning and Data Science

As electricity is difficult to store, it is crucial to strictly maintain the balance between production and consumption. The integration of intermittent renewable energies into the production mix has made the management of the balance more complex. However, access to near real-time data and communication with consumers via smart meters suggest demand response. Specifically, sending signals would encourage users to adjust their consumption according to the production of electricity. The algorithms used to select these signals must learn consumer reactions and optimize them while balancing exploration and exploitation. Various sequential or reinforcement learning approaches are being considered.

How and why AI should fight cybersexism, by Chloe Daudier

Paris Women in Machine Learning and Data Science

Anomaly detection and data imputation within time series

Paris Women in Machine Learning and Data Science

Managing international tech teams, by Natasha Dimban

Paris Women in Machine Learning and Data Science

Perspectives, by M. Pannegeon

Paris Women in Machine Learning and Data Science

Evaluation strategies for dealing with partially labelled or unlabelled data

Paris Women in Machine Learning and Data Science

Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...

Paris Women in Machine Learning and Data Science

An age-old question, by Caroline Jean-Pierre

Paris Women in Machine Learning and Data Science

Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré

Paris Women in Machine Learning and Data Science

How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier

Paris Women in Machine Learning and Data Science

Global Ambitions Local Realities, by Anna Abreu

Paris Women in Machine Learning and Data Science

Plug-and-Play methods for inverse problems in imagine, by Julie Delon

Paris Women in Machine Learning and Data Science

Sales Forecasting as a Data Product by Francesca Iannuzzi

Paris Women in Machine Learning and Data Science

Identifying and mitigating bias in machine learning, by Ruta Binkyte

Paris Women in Machine Learning and Data Science

“Turning your ML algorithms into full web apps in no time with Python" by Mar...

Paris Women in Machine Learning and Data Science

Abstract: Who hasn't heard of the "Pilot Syndrome"? 85% of Data Science Pilots remain pilots and do not make it to the production stage. Let's build a production-ready and end-user-friendly Data Science application. 100% python and 100% open source. Phase 1 | Building the GUI: create an interactive and powerful interface in a few lines of code Phase 2 | Integrated back end: Manage your models and pipelines and create scenarios the smart way

Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...

Paris Women in Machine Learning and Data Science

Sandrine Henry presents the BechdelAI project

Paris Women in Machine Learning and Data Science

Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...

Paris Women in Machine Learning and Data Science

Khrystyna Grynko WiMLDS - From marketing to Tech.pdf

Paris Women in Machine Learning and Data Science

Iana Iatsun_ML in production_20Dec2022.pdf

Paris Women in Machine Learning and Data Science

More from Paris Women in Machine Learning and Data Science (20)

Sequential and reinforcement learning for demand side management by Margaux B...

How and why AI should fight cybersexism, by Chloe Daudier

Anomaly detection and data imputation within time series

Managing international tech teams, by Natasha Dimban

Perspectives, by M. Pannegeon

Evaluation strategies for dealing with partially labelled or unlabelled data

Combinatorial Optimisation with Policy Adaptation using latent Space Search, ...

An age-old question, by Caroline Jean-Pierre

Applying Churn Prediction Approaches to the Telecom Industry, by Joëlle Lautré

How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier

Global Ambitions Local Realities, by Anna Abreu

Plug-and-Play methods for inverse problems in imagine, by Julie Delon

Sales Forecasting as a Data Product by Francesca Iannuzzi

Identifying and mitigating bias in machine learning, by Ruta Binkyte

“Turning your ML algorithms into full web apps in no time with Python" by Mar...

Nature Language Processing for proteins by Amélie Héliou, Software Engineer @...

Sandrine Henry presents the BechdelAI project

Anastasiia Tryputen_War in Ukraine or how extraordinary courage reshapes geop...

Khrystyna Grynko WiMLDS - From marketing to Tech.pdf

Iana Iatsun_ML in production_20Dec2022.pdf

Recently uploaded

Everything you wanted to know about LIHTC

Roger Valdez

Adjusting primitives for graph : SHORT REPORT / NOTES

Subhajit Sahu

Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is Multiply with different modes (map) 1. Performance of sequential execution based vs OpenMP based vector multiply. 2. Comparing various launch configs for CUDA based vector multiply. Sum with different storage types (reduce) 1. Performance of vector element sum using float vs bfloat16 as the storage type. Sum with different modes (reduce) 1. Performance of sequential execution based vs OpenMP based vector element sum. 2. Performance of memcpy vs in-place based CUDA based vector element sum. 3. Comparing various launch configs for CUDA based vector element sum (memcpy). 4. Comparing various launch configs for CUDA based vector element sum (in-place). Sum with in-place strategies of CUDA mode (reduce) 1. Comparing various launch configs for CUDA based vector element sum (in-place).

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样

axoqas

原版定制【Q微信:741003700】《(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书》【Q微信:741003700】成绩单、雅思、外壳、留信学历认证永久存档查询，采用学校原版纸张、特殊工艺完全按照原版一比一制作（包括：隐形水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠，文字图案浮雕，激光镭射，紫外荧光，温感，复印防伪）行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备，十五年致力于帮助留学生解决难题，业务范围有加拿大、英国、澳洲、韩国、美国、新加坡，新西兰等学历材料，包您满意。【业务选择办理准则】一、工作未确定，回国需先给父母、亲戚朋友看下文凭的情况，办理一份就读学校的毕业证【Q微信741003700】文凭即可二、回国进私企、外企、自己做生意的情况，这些单位是不查询毕业证真伪的，而且国内没有渠道去查询国外文凭的真假，也不需要提供真实教育部认证。鉴于此，办理一份毕业证【微信741003700】即可三、进国企，银行，事业单位，考公务员等等，这些单位是必需要提供真实教育部认证的，办理教育部认证所需资料众多且烦琐，所有材料您都必须提供原件，我们凭借丰富的经验，快捷的绿色通道帮您快速整合材料，让您少走弯路。留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才【关于价格问题（保证一手价格）】我们所定的价格是非常合理的，而且我们现在做得单子大多数都是代理和回头客户介绍的所以一般现在有新的单子我给客户的都是第一手的代理价格，因为我想坦诚对待大家不想跟大家在价格方面浪费时间对于老客户或者被老客户介绍过来的朋友，我们都会适当给一些优惠。

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理

oz8q3jxlp

原版定制【微信:41543339】【(Deakin毕业证书)迪肯大学毕业证】【微信:41543339】成绩单、外壳、offer、留信学历认证（永久存档真实可查）采用学校原版纸张、特殊工艺完全按照原版一比一制作（包括：隐形水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠，文字图案浮雕，激光镭射，紫外荧光，温感，复印防伪）行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备，十五年致力于帮助留学生解决难题，业务范围有加拿大、英国、澳洲、韩国、美国、新加坡，新西兰等学历材料，包您满意。【我们承诺采用的是学校原版纸张（纸质、底色、纹路），我们拥有全套进口原装设备，特殊工艺都是采用不同机器制作，仿真度基本可以达到100%，所有工艺效果都可提前给客户展示，不满意可以根据客户要求进行调整，直到满意为止！】【业务选择办理准则】一、工作未确定，回国需先给父母、亲戚朋友看下文凭的情况，办理一份就读学校的毕业证【微信41543339】文凭即可二、回国进私企、外企、自己做生意的情况，这些单位是不查询毕业证真伪的，而且国内没有渠道去查询国外文凭的真假，也不需要提供真实教育部认证。鉴于此，办理一份毕业证【微信41543339】即可三、进国企，银行，事业单位，考公务员等等，这些单位是必需要提供真实教育部认证的，办理教育部认证所需资料众多且烦琐，所有材料您都必须提供原件，我们凭借丰富的经验，快捷的绿色通道帮您快速整合材料，让您少走弯路。留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才留信网服务项目： 1、留学生专业人才库服务（留信分析） 2、国（境）学习人员提供就业推荐信服务 3、留学人员区块链存储服务 → 【关于价格问题（保证一手价格）】我们所定的价格是非常合理的，而且我们现在做得单子大多数都是代理和回头客户介绍的所以一般现在有新的单子我给客户的都是第一手的代理价格，因为我想坦诚对待大家不想跟大家在价格方面浪费时间对于老客户或者被老客户介绍过来的朋友，我们都会适当给一些优惠。选择实体注册公司办理，更放心，更安全！我们的承诺：客户在留信官方认证查询网站查询到认证通过结果后付款，不成功不收费！

一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理

mzpolocfi

原版定制【微信:41543339】【(Dalhousie毕业证书)达尔豪斯大学毕业证】【微信:41543339】成绩单、外壳、offer、留信学历认证（永久存档真实可查）采用学校原版纸张、特殊工艺完全按照原版一比一制作（包括：隐形水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠，文字图案浮雕，激光镭射，紫外荧光，温感，复印防伪）行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备，十五年致力于帮助留学生解决难题，业务范围有加拿大、英国、澳洲、韩国、美国、新加坡，新西兰等学历材料，包您满意。【我们承诺采用的是学校原版纸张（纸质、底色、纹路），我们拥有全套进口原装设备，特殊工艺都是采用不同机器制作，仿真度基本可以达到100%，所有工艺效果都可提前给客户展示，不满意可以根据客户要求进行调整，直到满意为止！】【业务选择办理准则】一、工作未确定，回国需先给父母、亲戚朋友看下文凭的情况，办理一份就读学校的毕业证【微信41543339】文凭即可二、回国进私企、外企、自己做生意的情况，这些单位是不查询毕业证真伪的，而且国内没有渠道去查询国外文凭的真假，也不需要提供真实教育部认证。鉴于此，办理一份毕业证【微信41543339】即可三、进国企，银行，事业单位，考公务员等等，这些单位是必需要提供真实教育部认证的，办理教育部认证所需资料众多且烦琐，所有材料您都必须提供原件，我们凭借丰富的经验，快捷的绿色通道帮您快速整合材料，让您少走弯路。留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才留信网服务项目： 1、留学生专业人才库服务（留信分析） 2、国（境）学习人员提供就业推荐信服务 3、留学人员区块链存储服务 → 【关于价格问题（保证一手价格）】我们所定的价格是非常合理的，而且我们现在做得单子大多数都是代理和回头客户介绍的所以一般现在有新的单子我给客户的都是第一手的代理价格，因为我想坦诚对待大家不想跟大家在价格方面浪费时间对于老客户或者被老客户介绍过来的朋友，我们都会适当给一些优惠。选择实体注册公司办理，更放心，更安全！我们的承诺：客户在留信官方认证查询网站查询到认证通过结果后付款，不成功不收费！

Ch03-Managing the Object-Oriented Information Systems Project a.pdf

haila53

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】

NABLAS株式会社

The affect of service quality and online reviews on customer loyalty in the E...

jerlynmaetalle

Learn SQL from basic queries to Advance queries

manishkhaire30

Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively. Key Highlights: Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation. Advanced Queries: Learn to craft complex queries to uncover deep insights from your data. Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets. Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios. Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making. Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data! #DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics

一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理

74nqk8xf

毕业原版【微信:41543339】【(Coventry毕业证书)考文垂大学毕业证】【微信:41543339】成绩单、外壳、offer、留信学历认证（永久存档真实可查）采用学校原版纸张、特殊工艺完全按照原版一比一制作（包括：隐形水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠，文字图案浮雕，激光镭射，紫外荧光，温感，复印防伪）行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备，十五年致力于帮助留学生解决难题，业务范围有加拿大、英国、澳洲、韩国、美国、新加坡，新西兰等学历材料，包您满意。【我们承诺采用的是学校原版纸张（纸质、底色、纹路），我们拥有全套进口原装设备，特殊工艺都是采用不同机器制作，仿真度基本可以达到100%，所有工艺效果都可提前给客户展示，不满意可以根据客户要求进行调整，直到满意为止！】【业务选择办理准则】一、工作未确定，回国需先给父母、亲戚朋友看下文凭的情况，办理一份就读学校的毕业证【微信41543339】文凭即可二、回国进私企、外企、自己做生意的情况，这些单位是不查询毕业证真伪的，而且国内没有渠道去查询国外文凭的真假，也不需要提供真实教育部认证。鉴于此，办理一份毕业证【微信41543339】即可三、进国企，银行，事业单位，考公务员等等，这些单位是必需要提供真实教育部认证的，办理教育部认证所需资料众多且烦琐，所有材料您都必须提供原件，我们凭借丰富的经验，快捷的绿色通道帮您快速整合材料，让您少走弯路。留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才留信网服务项目： 1、留学生专业人才库服务（留信分析） 2、国（境）学习人员提供就业推荐信服务 3、留学人员区块链存储服务 → 【关于价格问题（保证一手价格）】我们所定的价格是非常合理的，而且我们现在做得单子大多数都是代理和回头客户介绍的所以一般现在有新的单子我给客户的都是第一手的代理价格，因为我想坦诚对待大家不想跟大家在价格方面浪费时间对于老客户或者被老客户介绍过来的朋友，我们都会适当给一些优惠。选择实体注册公司办理，更放心，更安全！我们的承诺：客户在留信官方认证查询网站查询到认证通过结果后付款，不成功不收费！

Adjusting OpenMP PageRank : SHORT REPORT / NOTES

Subhajit Sahu

For massive graphs that fit in RAM, but not in GPU memory, it is possible to take advantage of a shared memory system with multiple CPUs, each with multiple cores, to accelerate pagerank computation. If the NUMA architecture of the system is properly taken into account with good vertex partitioning, the speedup can be significant. To take steps in this direction, experiments are conducted to implement pagerank in OpenMP using two different approaches, uniform and hybrid. The uniform approach runs all primitives required for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid approach runs certain primitives in sequential mode (i.e., sumAt, multiply).

Best best suvichar in gujarati english meaning of this sentence as Silk road ...

AbhimanyuSinha9

原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样

u86oixdj

学校原件一模一样【微信：741003700 】《(swinburne毕业证书)斯威本科技大学毕业证》【微信：741003700 】学位证，留信认证（真实可查，永久存档）原件一模一样纸张工艺/offer、雅思、外壳等材料/诚信可靠,可直接看成品样本，帮您解决无法毕业带来的各种难题！外壳，原版制作，诚信可靠，可直接看成品样本。行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备。十五年致力于帮助留学生解决难题，包您满意。本公司拥有海外各大学样板无数，能完美还原。 1:1完美还原海外各大学毕业材料上的工艺：水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。材料咨询办理、认证咨询办理请加学历顾问Q/微741003700 【主营项目】一.毕业证【q微741003700】成绩单、使馆认证、教育部认证、雅思托福成绩单、学生卡等！二.真实使馆公证(即留学回国人员证明,不成功不收费) 三.真实教育部学历学位认证（教育部存档！教育部留服网站永久可查）四.办理各国各大学文凭(一对一专业服务,可全程监控跟踪进度) 如果您处于以下几种情况： ◇在校期间，因各种原因未能顺利毕业……拿不到官方毕业证【q/微741003700】 ◇面对父母的压力，希望尽快拿到； ◇不清楚认证流程以及材料该如何准备； ◇回国时间很长，忘记办理； ◇回国马上就要找工作，办给用人单位看； ◇企事业单位必须要求办理的 ◇需要报考公务员、购买免税车、落转户口 ◇申请留学生创业基金留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...

Subhajit Sahu

Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.

一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理

dwreak4tg

原版定制【微信:41543339】【(BCU毕业证书)伯明翰城市大学毕业证】【微信:41543339】成绩单、外壳、offer、留信学历认证（永久存档真实可查）采用学校原版纸张、特殊工艺完全按照原版一比一制作（包括：隐形水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠，文字图案浮雕，激光镭射，紫外荧光，温感，复印防伪）行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备，十五年致力于帮助留学生解决难题，业务范围有加拿大、英国、澳洲、韩国、美国、新加坡，新西兰等学历材料，包您满意。【我们承诺采用的是学校原版纸张（纸质、底色、纹路），我们拥有全套进口原装设备，特殊工艺都是采用不同机器制作，仿真度基本可以达到100%，所有工艺效果都可提前给客户展示，不满意可以根据客户要求进行调整，直到满意为止！】【业务选择办理准则】一、工作未确定，回国需先给父母、亲戚朋友看下文凭的情况，办理一份就读学校的毕业证【微信41543339】文凭即可二、回国进私企、外企、自己做生意的情况，这些单位是不查询毕业证真伪的，而且国内没有渠道去查询国外文凭的真假，也不需要提供真实教育部认证。鉴于此，办理一份毕业证【微信41543339】即可三、进国企，银行，事业单位，考公务员等等，这些单位是必需要提供真实教育部认证的，办理教育部认证所需资料众多且烦琐，所有材料您都必须提供原件，我们凭借丰富的经验，快捷的绿色通道帮您快速整合材料，让您少走弯路。留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才留信网服务项目： 1、留学生专业人才库服务（留信分析） 2、国（境）学习人员提供就业推荐信服务 3、留学人员区块链存储服务 → 【关于价格问题（保证一手价格）】我们所定的价格是非常合理的，而且我们现在做得单子大多数都是代理和回头客户介绍的所以一般现在有新的单子我给客户的都是第一手的代理价格，因为我想坦诚对待大家不想跟大家在价格方面浪费时间对于老客户或者被老客户介绍过来的朋友，我们都会适当给一些优惠。选择实体注册公司办理，更放心，更安全！我们的承诺：客户在留信官方认证查询网站查询到认证通过结果后付款，不成功不收费！

Machine learning and optimization techniques for electrical drives.pptx

balafet

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...

Timothy Spann

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx

AnirbanRoy608946

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样

u86oixdj

学校原件一模一样【微信：741003700 】《(Deakin毕业证书)迪肯大学毕业证学位证》【微信：741003700 】学位证，留信认证（真实可查，永久存档）原件一模一样纸张工艺/offer、雅思、外壳等材料/诚信可靠,可直接看成品样本，帮您解决无法毕业带来的各种难题！外壳，原版制作，诚信可靠，可直接看成品样本。行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备。十五年致力于帮助留学生解决难题，包您满意。本公司拥有海外各大学样板无数，能完美还原。 1:1完美还原海外各大学毕业材料上的工艺：水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。材料咨询办理、认证咨询办理请加学历顾问Q/微741003700 【主营项目】一.毕业证【q微741003700】成绩单、使馆认证、教育部认证、雅思托福成绩单、学生卡等！二.真实使馆公证(即留学回国人员证明,不成功不收费) 三.真实教育部学历学位认证（教育部存档！教育部留服网站永久可查）四.办理各国各大学文凭(一对一专业服务,可全程监控跟踪进度) 如果您处于以下几种情况： ◇在校期间，因各种原因未能顺利毕业……拿不到官方毕业证【q/微741003700】 ◇面对父母的压力，希望尽快拿到； ◇不清楚认证流程以及材料该如何准备； ◇回国时间很长，忘记办理； ◇回国马上就要找工作，办给用人单位看； ◇企事业单位必须要求办理的 ◇需要报考公务员、购买免税车、落转户口 ◇申请留学生创业基金留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才

Analysis insight about a Flyball dog competition team's performance

roli9797

Recently uploaded (20)

Everything you wanted to know about LIHTC

Adjusting primitives for graph : SHORT REPORT / NOTES

做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样

一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理

一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理

Ch03-Managing the Object-Oriented Information Systems Project a.pdf

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】

The affect of service quality and online reviews on customer loyalty in the E...

Learn SQL from basic queries to Advance queries

一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理

Adjusting OpenMP PageRank : SHORT REPORT / NOTES

Best best suvichar in gujarati english meaning of this sentence as Silk road ...

原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...

一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理

Machine learning and optimization techniques for electrical drives.pptx

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样

Analysis insight about a Flyball dog competition team's performance

Optimizing GenAI apps, by N. El Mawass and Maria Knorps

1. Optimizing GenAI apps Evaluating Retrieval in RAGs Nour El Mawass, Maria Knorps 7.Feb.2024

2. www.tweag.io/group/genai 2

3. What are RAGs? And why do we need them anyway? 3

4. 4 Retrieval Augmented Generation <your question here> ● LLMs have a learning cutoﬀ ● Fine-tuning is costly ● Adding relevant context to the prompt is cheap and easy ● Find relevant context with semantic search

5. Semantic search ● Vectorizing a documents base: ○ Chunking ○ Embedding/Vectorizing ○ Indexing ● Finding documents similar to a query: ○ Vectorize query ○ Find closest vectors 5

6. You probably saw RAGs before 6

7. We've built a RAG too! 7

8. 8 <your question here> The GenAI team at Tweag has been working on applying the Retrieval-Augmented Generation (RAG) paradigm together with commercial and open source LLMs to perform intelligent search and suggestion over a collection of Conﬂuence and Bazel documents. The LLM processing can be carried out within a virtual private cloud domain (AWS in this case) so that no information is shared with third parties.

9. RAGs

10. Experimenting vs "eyeballing" 10 - No benchmark: No guarantee that the introduced change did not degrade performance on other questions. - No experiments tracking: Likely none of the intermediate states was committed or properly tracked. - No evaluation metrics: We cannot numerically compare the current RAG state to any other possible state. - No solution space: What alternatives are we exploring?

11. Evaluating retrieval: why? 11

12. Evaluation's golden quartet 12 Experiments tracking Evaluation metrics Parameters space Benchmark

13. Benchmark ● Benchmark over the documents database: ○ Questions ○ Pairs of (question, answers) ○ Pairs of (question, relevant_documents) ● Not easy: need representative and varied queries 13 Human-generated LLM-generated ● Can be automated with LLMs ○ generate questions over documents ○ reformulate questions

14. Parameters space 14 "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, } "retrieval": { "collection_name": "default", "embedding_model": { "name": "langchain.embeddings.SentenceTransformerEmbeddings", "parameters": { "model_name": "all-mpnet-base-v2" } }, "chunking_model": { "name": "langchain.text_splitter.RecursiveCharacterTextSplitter", "parameters": { "chunk_size": 500, "chunk_overlap": 5 } }, "top_k": 10, "preprocessing_model": { "name": "user_input_to_search_query" }, }

15. Evaluation metrics ● Information Retrieval metrics (traditional ML) ○ Labeled dataset ○ Evaluate recall and precision at k ● LLM-based evaluation ○ Context relevance ■ ratio of relevant to total sentences in the retrieved documents ○ Context recall 15 LLM-based RAG metrics Information retrieval metrics

16. Experiments tracking 16 Data (benchmark + vectors) + Experiment's parameters + Version-controlled code Experiment tracking + + Parameters - k - embedding - chunking - …

17. Tweag’s evaluation framework 17 Experiments tracking Evaluation metrics Parameters space Benchmark

18. Key strategies: Chunking and Embedding 18 ● Embedding models: ○ all-MiniLM-L12-v2 ○ Multi-qa-mpnet-nase-dot-v1 ○ All-mpnet-base-v2 ○ SpacyEmbeddings ● Chunking models ○ RecursiveCharacterTextSplitter ○ SentenceTransformersTokenTextSplitter ● Benchmarks: ○ User questions ○ User questions reformulated with ChatGPT3.5

19. Results on the Conﬂuence chatbot 19

20. Takeaways ● You need to evaluate your system, no eyeballing! ● Many frameworks and tools: check our blog posts for an introduction. 20 https://www.tweag.io/group/genai/

Optimizing GenAI apps, by N. El Mawass and Maria Knorps

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Optimizing GenAI apps, by N. El Mawass and Maria Knorps

Similar to Optimizing GenAI apps, by N. El Mawass and Maria Knorps (20)

More from Paris Women in Machine Learning and Data Science

More from Paris Women in Machine Learning and Data Science (20)

Recently uploaded

Recently uploaded (20)

Optimizing GenAI apps, by N. El Mawass and Maria Knorps