This document discusses finding and fixing production issues in a LAMP stack. It emphasizes having a clear aim or goal for the system and defining performance norms. It suggests monitoring tools, testing suites, and utilities to help audit and find problems outside of norms. The document also discusses optimizing through techniques like sharding, caching, and taking advantage of Moore's law.
This document discusses DevOps and continuous delivery. It defines DevOps as a cultural and professional movement to improve software delivery through automation, measurement, and collaboration between development and operations teams. The key aspects of DevOps discussed are adopting infrastructure as code practices, implementing continuous integration and deployment pipelines, and establishing a culture of automation, measurement, and sharing between teams. The overall goal is to enable software to be delivered safely and reliably in a matter of hours through establishing these collaborative practices.
An Agile Manager's Guide to a Nearly Peaceful Night's SleepVivek Ganesan
This document discusses strategies for agile managers to reduce stress and improve work-life balance. It addresses three main stress sources managers face: job security concerns, unpredictability in project timelines, and needing to enforce unpopular decisions. The document recommends using models that reflect the reality of work, teaching teams to adjust plans, and paying attention to both technical and motivational debt to maintain team motivation. The overall message is that agile management requires facilitating teams' work and building relationships, rather than rigid command-and-control.
Allan Crisp of Jet2.com delivered an interesting presentation at our networking event for Test Managers, discussing being a non-Tester and looking after Test, as well aas adventures in Automation.
The document explores the pros and cons of a "plug and play" approach to agile methodology. While ready-to-use and with proven practices, plug and play agile risks being too rigid or not customized enough for an organization's unique needs. It may disrupt existing processes and impose limitations. True plug and play agile requires seeing employees as volunteers, managers as coaches, and customers as co-creators, with a focus on creativity, adaptation, and continuous improvement over rigid compliance or perfection.
Are Automated Debugging Techniques Actually Helping ProgrammersChris Parnin
Automated debugging tools may help expert programmers but not others. A study of 34 programmers debugging two programs found:
1) Programmers did not follow ranked lists linearly but jumped around and zig-zagged.
2) When finding the bug, only 1 in 10 stopped investigating while others spent 10 extra minutes.
3) Overall, automated tools did not speed up debugging except for expert programmers who were 5 minutes faster.
4) Developers wanted explanations, not just recommendations from tools. More studies are needed on how programmers use tools.
TRIZ-Theory of Inventive Problem Solving.pptxSejalWasule
TRIZ (Theory of Inventive Problem Solving) is a problem-solving methodology developed by the Soviet engineer and inventor Genrich Altshuller. TRIZ is based on the analysis of thousands of patents and the identification of patterns of innovation and inventive principles.
The TRIZ methodology consists of several key components:
Problem formulation: This involves defining the problem and its parameters, as well as identifying the desired outcomes.
Analysis of the problem: This involves breaking down the problem into smaller parts and identifying the underlying contradictions and conflicts.
Use of inventive principles: TRIZ identifies a set of 40 inventive principles that can be used to solve problems. These principles are based on patterns of innovation that have been identified through the analysis of patents.
Ideation: This involves generating ideas and concepts for solving the problem, using the inventive principles and other ideation techniques.
Evaluation and implementation: This involves evaluating the ideas generated and selecting the most promising solutions for implementation.
The key benefit of TRIZ is that it provides a systematic approach to problem-solving, which can lead to more effective and efficient solutions. It also provides a structured framework for ideation, which can help to generate a larger number of creative ideas.The TRIZ methodology is particularly useful for addressing complex problems that seem unsolvable using traditional problem-solving approaches. It provides a structured framework for ideation and encourages innovative thinking, which can lead to breakthrough solutions. The TRIZ methodology has been used successfully in a wide range of industries, including aerospace, automotive, and consumer products.
This document discusses finding and fixing production issues in a LAMP stack. It emphasizes having a clear aim or goal for the system and defining performance norms. It suggests monitoring tools, testing suites, and utilities to help audit and find problems outside of norms. The document also discusses optimizing through techniques like sharding, caching, and taking advantage of Moore's law.
This document discusses DevOps and continuous delivery. It defines DevOps as a cultural and professional movement to improve software delivery through automation, measurement, and collaboration between development and operations teams. The key aspects of DevOps discussed are adopting infrastructure as code practices, implementing continuous integration and deployment pipelines, and establishing a culture of automation, measurement, and sharing between teams. The overall goal is to enable software to be delivered safely and reliably in a matter of hours through establishing these collaborative practices.
An Agile Manager's Guide to a Nearly Peaceful Night's SleepVivek Ganesan
This document discusses strategies for agile managers to reduce stress and improve work-life balance. It addresses three main stress sources managers face: job security concerns, unpredictability in project timelines, and needing to enforce unpopular decisions. The document recommends using models that reflect the reality of work, teaching teams to adjust plans, and paying attention to both technical and motivational debt to maintain team motivation. The overall message is that agile management requires facilitating teams' work and building relationships, rather than rigid command-and-control.
Allan Crisp of Jet2.com delivered an interesting presentation at our networking event for Test Managers, discussing being a non-Tester and looking after Test, as well aas adventures in Automation.
The document explores the pros and cons of a "plug and play" approach to agile methodology. While ready-to-use and with proven practices, plug and play agile risks being too rigid or not customized enough for an organization's unique needs. It may disrupt existing processes and impose limitations. True plug and play agile requires seeing employees as volunteers, managers as coaches, and customers as co-creators, with a focus on creativity, adaptation, and continuous improvement over rigid compliance or perfection.
Are Automated Debugging Techniques Actually Helping ProgrammersChris Parnin
Automated debugging tools may help expert programmers but not others. A study of 34 programmers debugging two programs found:
1) Programmers did not follow ranked lists linearly but jumped around and zig-zagged.
2) When finding the bug, only 1 in 10 stopped investigating while others spent 10 extra minutes.
3) Overall, automated tools did not speed up debugging except for expert programmers who were 5 minutes faster.
4) Developers wanted explanations, not just recommendations from tools. More studies are needed on how programmers use tools.
TRIZ-Theory of Inventive Problem Solving.pptxSejalWasule
TRIZ (Theory of Inventive Problem Solving) is a problem-solving methodology developed by the Soviet engineer and inventor Genrich Altshuller. TRIZ is based on the analysis of thousands of patents and the identification of patterns of innovation and inventive principles.
The TRIZ methodology consists of several key components:
Problem formulation: This involves defining the problem and its parameters, as well as identifying the desired outcomes.
Analysis of the problem: This involves breaking down the problem into smaller parts and identifying the underlying contradictions and conflicts.
Use of inventive principles: TRIZ identifies a set of 40 inventive principles that can be used to solve problems. These principles are based on patterns of innovation that have been identified through the analysis of patents.
Ideation: This involves generating ideas and concepts for solving the problem, using the inventive principles and other ideation techniques.
Evaluation and implementation: This involves evaluating the ideas generated and selecting the most promising solutions for implementation.
The key benefit of TRIZ is that it provides a systematic approach to problem-solving, which can lead to more effective and efficient solutions. It also provides a structured framework for ideation, which can help to generate a larger number of creative ideas.The TRIZ methodology is particularly useful for addressing complex problems that seem unsolvable using traditional problem-solving approaches. It provides a structured framework for ideation and encourages innovative thinking, which can lead to breakthrough solutions. The TRIZ methodology has been used successfully in a wide range of industries, including aerospace, automotive, and consumer products.
A hands-on session taking teams through a (not quite) real world scenario to learn Agile Scrum principles and practices. We'll form teams and walk through a Sprint Planning session, a Sprint, and a Retrospective. Although this is an intro-level workshop, we'll include some new games and ideas for more experienced practitioners.
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
This document provides instructions for a presentation on a system. It lists 6 points to cover in the presentation: 1) How the system works, 2) What parts make up the system, 3) What to do if the system slows down or stops, 4) Main problems, bugs and vulnerabilities of the system, 5) Who to invite to discuss the system, and 6) Why the presentation is being done. Presenters have 2 hours and are encouraged to have fun and use unlimited improvisation. The document also provides a schedule for presentation levels starting in November 2014 and ending in January 2015.
Introduction to Deep Learning | CloudxLabCloudxLab
( Machine Learning & Deep Learning Specialization Training: https://goo.gl/goQxnL )
This CloudxLab Deep Learning tutorial helps you to understand Deep Learning in detail. Below are the topics covered in this tutorial:
1) What is Deep Learning
2) Deep Learning Applications
3) Artificial Neural Network
4) Deep Learning Neural Networks
5) Deep Learning Frameworks
6) AI vs Machine Learning
Machine Learning for Designers - UX Camp SwitzerlandMemi Beltrame
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
Jason Yee - Chaos! - Codemotion Rome 2019Codemotion
As applications become more distributed and complex, so do our failure modes. In this presentation, I’ll share why you shouldn’t just embrace failure, but why you should induce it to intentionally cause and learn from failure. Together with the audience, I'll run a Chaos experiment to show how they can start their own Chaos engineering and make their systems more resilient.
Athletes, Firemen and Doctors train everyday to be the best at their chosen profession. As engineers, we spend much of our time getting stuff to production and making sure our infrastructure doesn’t burn down out right. In this talk, we'll discuss the need for and the options of creating a game day culture. Where we as engineers not only write, maintain and operate our software platforms but actively pursue ways to learn and predict its (non-functional) behavior. We'll look at tools like toxiproxy and the simian army for ways to prepare teams to tweak their testing and monitoring setup and work instructions to quickly observe, react to and resolve problems.
Scrum Plus Extreme Programming (XP) for Hyper ProductivityRon Quartel
If you want to go fast with Scrum, then your best option is to compliment it with Extreme Programming (XP). Inside is an activity that you can run your team and management through to prove and sell the concept.
The document discusses the key aspects of implementing Scrum at Paylogic. It describes why Scrum works through better control, incremental delivery of business value, and empowering self-managed teams. It outlines the Scrum process, including artifacts like the product and sprint backlogs, daily standups, sprints, and retrospectives. It defines the roles of product owner and Scrum master. Sprints are proposed to last 1 week to provide fine-grained control and ensure quality through frequent deliveries. The first sprints would focus on core features and getting initial feedback to adjust subsequent sprints.
Machine Learning for Designers - UX ScotlandMemi Beltrame
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
Coaching teams in creative problem solvingFlowa Oy
Agile has helped teams to collaborate and organize work better. That’s great. Better teamwork and better understanding of the work definitely helps a team to do right things. Agile has also lead the way toward technical practices such as Continuous Integration and Delivery, Test Driven Development and SOLID-architecture principles. Great, these things definitely help the team to do things right.
Then again, most of the time in software projects goes into problem solving and similar creative acts. Agile has relatively little to give on these areas. Currently, agile is not about creativity nor is it about problem solving.
This coaching circle session will focus on the creative core of software development: solving creatively novel, original and broad problems more effectively all the time. I will introduce some principles and tools I’ve found useful when helping people to solve hard problems and to find creative solutions.
This document presents an overview of maze path finding using backtracking. It defines backtracking as an algorithmic technique that solves problems incrementally using recursion to find all possible combinations. Examples where backtracking is used include puzzles, optimization problems, and logic programming languages. Applications discussed include optimization, constraint satisfaction, engineering, robotics, artificial intelligence, and solving puzzles. The document explains that backtracking is used when there are multiple options to choose from, with new options after each choice, repeating until a final state is reached. Advantages include effectiveness for some cases compared to dynamic programming, while disadvantages include inefficiency for strategic problems and slow overall runtime.
Adversarial Simulation Nickerson/Gates Wild West Hacking Fest Oct 2017Chris Gates
The evolution chain in security testing is fundamentally broken due to a lack of understanding, reduction of scope, and a reliance on vulnerability “whack a mole.” To help break the barriers of the common security program we are going to have to divorce ourselves from the metrics of vulnerability statistics and Pavlovian risk color charts and really get to work on how our security programs perform during a REAL event. To do so, we must create an entirely new set of metrics, tests, procedures, implementations and repeatable process. It is extremely rare that a vulnerability causes a direct risk to an environment, it is usually what the attacker DOES with the access gained that matters. In this talk, we will discuss the way that Internal and external teams have been created to simulate a REAL WORLD attack and work hand in hand with the Defensive teams to measure the environments resistance to the attacks. We will demonstrate attacks, capabilities, TTP’s tracking, trending, positive metrics, hunt integration and most of all we will lay out a road map to STOP this nonsense of Red vs BLUE and realize that we are all on the same team. Sparring and training every day to be ready for the fight when it comes to us. This is an update to our 2016 Brucon talk. We plan to discuss what have we accomplished regarding the above in the last year. We plan to show how we have progressed with the automation of attacker activities and event generation using MITRE’s Cyber Analytics Repository & CAR Exploration Tool (CARET) along with pumping these results to Unfetter (https://iadgov.github.io/unfetter/) for aggregation and display in a useful format.
Faster! Faster! Accelerate your business with blazing prototypesOSCON Byrum
Bring your ideas to life! Convince your boss to that open source development is faster and cheaper than the "safe" COTS solution they probably hate anyway. Let's investigate ways to get real-life, functional prototypes up with blazing speed. We'll look at and compare tools for truly rapid development including Python, Django, Flask, PHP, Amazon EC2 and Heroku.
Distributed ScrumMasters and the art of digital facilitationDavid Bland
This document discusses tips for facilitating distributed Scrum teams. It notes that distributed Scrum can be painful, especially for unprepared Scrum Masters, and that training often focuses on collocated teams. Some tips include using lightweight virtual Scrum boards, focusing on communication through video chat when possible, and humanizing the distributed experience by establishing trust and putting names to faces. The key messages are to start small with tools, prioritize communication effectively, and act as a champion to bind the distributed team together.
In every single day, we use the algorithm in our real life. Here we show the uses of the algorithm in our real life. What is an algorithm? Why we use algorithm? Algorithm to add number. Algorithm used in games, genetic algorithm, algorithm in programming, search algorithm, Fibonacci series algorithm and many topics we discuss in here.
Reinforcement Learning – a Rewards Based Approach to Machine Learning - Marko...Marko Lohert
Reinforcement learning is a branch of machine learning that relies on learning through the mechanism of rewards and punishments.
This is the presentation about reinforcement learning that I gave at DORS/CLUC conference in 2024 in Zagreb, Croatia (https://www.dorscluc.org).
The IT discipline of machine learning has become increasingly important in recent years. It promises to solve types of problems for which normal software development is considered unsuitable or too costly.
These slides are from a presentation on understanding Machine Learning at a high level. The talk touches on linear regression, neural networks, and how Deep Learning fits into Machine Learning.
Pragmatic ethical and fair AI for data scientistsDavid Graus
1. David Graus presented on pragmatic and fair AI for recruitment and news recommendations.
2. He discussed how algorithms can unintentionally learn and reflect human biases around gender and race. However, AI may also help address these biases, such as through representational ranking in recruitment to achieve demographic parity.
3. Graus also explored using editorial values like diversity, dynamism and serendipity to guide news recommendations, and found their system could increase dynamism without loss of accuracy through constrained intervention.
Slidedeck of my lecture at SIKS Course "Advances in Information Retrieval"
Read more here: https://graus.nu/blog/bias-in-recommendations-lecture-siks-course-on-advances-in-ir/
More Related Content
Similar to CAT/AI: Computer Assisted Translation Assessment for Impact
A hands-on session taking teams through a (not quite) real world scenario to learn Agile Scrum principles and practices. We'll form teams and walk through a Sprint Planning session, a Sprint, and a Retrospective. Although this is an intro-level workshop, we'll include some new games and ideas for more experienced practitioners.
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
This document provides instructions for a presentation on a system. It lists 6 points to cover in the presentation: 1) How the system works, 2) What parts make up the system, 3) What to do if the system slows down or stops, 4) Main problems, bugs and vulnerabilities of the system, 5) Who to invite to discuss the system, and 6) Why the presentation is being done. Presenters have 2 hours and are encouraged to have fun and use unlimited improvisation. The document also provides a schedule for presentation levels starting in November 2014 and ending in January 2015.
Introduction to Deep Learning | CloudxLabCloudxLab
( Machine Learning & Deep Learning Specialization Training: https://goo.gl/goQxnL )
This CloudxLab Deep Learning tutorial helps you to understand Deep Learning in detail. Below are the topics covered in this tutorial:
1) What is Deep Learning
2) Deep Learning Applications
3) Artificial Neural Network
4) Deep Learning Neural Networks
5) Deep Learning Frameworks
6) AI vs Machine Learning
Machine Learning for Designers - UX Camp SwitzerlandMemi Beltrame
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
Jason Yee - Chaos! - Codemotion Rome 2019Codemotion
As applications become more distributed and complex, so do our failure modes. In this presentation, I’ll share why you shouldn’t just embrace failure, but why you should induce it to intentionally cause and learn from failure. Together with the audience, I'll run a Chaos experiment to show how they can start their own Chaos engineering and make their systems more resilient.
Athletes, Firemen and Doctors train everyday to be the best at their chosen profession. As engineers, we spend much of our time getting stuff to production and making sure our infrastructure doesn’t burn down out right. In this talk, we'll discuss the need for and the options of creating a game day culture. Where we as engineers not only write, maintain and operate our software platforms but actively pursue ways to learn and predict its (non-functional) behavior. We'll look at tools like toxiproxy and the simian army for ways to prepare teams to tweak their testing and monitoring setup and work instructions to quickly observe, react to and resolve problems.
Scrum Plus Extreme Programming (XP) for Hyper ProductivityRon Quartel
If you want to go fast with Scrum, then your best option is to compliment it with Extreme Programming (XP). Inside is an activity that you can run your team and management through to prove and sell the concept.
The document discusses the key aspects of implementing Scrum at Paylogic. It describes why Scrum works through better control, incremental delivery of business value, and empowering self-managed teams. It outlines the Scrum process, including artifacts like the product and sprint backlogs, daily standups, sprints, and retrospectives. It defines the roles of product owner and Scrum master. Sprints are proposed to last 1 week to provide fine-grained control and ensure quality through frequent deliveries. The first sprints would focus on core features and getting initial feedback to adjust subsequent sprints.
Machine Learning for Designers - UX ScotlandMemi Beltrame
Artificial intelligence is more and more becoming the core of digital products. Designing for Products based on AI requires Designers to know about Machine Learning.
This talk is an easy walk through the most important elements of Machine Learning. It looks at the fundamental principles of using practical examples. It showcases applications of the different types of Machine Learning. The use-cases range from text categorization to image recognition, on to speech analysis. The goal is to show what is important for designers and why.
Coaching teams in creative problem solvingFlowa Oy
Agile has helped teams to collaborate and organize work better. That’s great. Better teamwork and better understanding of the work definitely helps a team to do right things. Agile has also lead the way toward technical practices such as Continuous Integration and Delivery, Test Driven Development and SOLID-architecture principles. Great, these things definitely help the team to do things right.
Then again, most of the time in software projects goes into problem solving and similar creative acts. Agile has relatively little to give on these areas. Currently, agile is not about creativity nor is it about problem solving.
This coaching circle session will focus on the creative core of software development: solving creatively novel, original and broad problems more effectively all the time. I will introduce some principles and tools I’ve found useful when helping people to solve hard problems and to find creative solutions.
This document presents an overview of maze path finding using backtracking. It defines backtracking as an algorithmic technique that solves problems incrementally using recursion to find all possible combinations. Examples where backtracking is used include puzzles, optimization problems, and logic programming languages. Applications discussed include optimization, constraint satisfaction, engineering, robotics, artificial intelligence, and solving puzzles. The document explains that backtracking is used when there are multiple options to choose from, with new options after each choice, repeating until a final state is reached. Advantages include effectiveness for some cases compared to dynamic programming, while disadvantages include inefficiency for strategic problems and slow overall runtime.
Adversarial Simulation Nickerson/Gates Wild West Hacking Fest Oct 2017Chris Gates
The evolution chain in security testing is fundamentally broken due to a lack of understanding, reduction of scope, and a reliance on vulnerability “whack a mole.” To help break the barriers of the common security program we are going to have to divorce ourselves from the metrics of vulnerability statistics and Pavlovian risk color charts and really get to work on how our security programs perform during a REAL event. To do so, we must create an entirely new set of metrics, tests, procedures, implementations and repeatable process. It is extremely rare that a vulnerability causes a direct risk to an environment, it is usually what the attacker DOES with the access gained that matters. In this talk, we will discuss the way that Internal and external teams have been created to simulate a REAL WORLD attack and work hand in hand with the Defensive teams to measure the environments resistance to the attacks. We will demonstrate attacks, capabilities, TTP’s tracking, trending, positive metrics, hunt integration and most of all we will lay out a road map to STOP this nonsense of Red vs BLUE and realize that we are all on the same team. Sparring and training every day to be ready for the fight when it comes to us. This is an update to our 2016 Brucon talk. We plan to discuss what have we accomplished regarding the above in the last year. We plan to show how we have progressed with the automation of attacker activities and event generation using MITRE’s Cyber Analytics Repository & CAR Exploration Tool (CARET) along with pumping these results to Unfetter (https://iadgov.github.io/unfetter/) for aggregation and display in a useful format.
Faster! Faster! Accelerate your business with blazing prototypesOSCON Byrum
Bring your ideas to life! Convince your boss to that open source development is faster and cheaper than the "safe" COTS solution they probably hate anyway. Let's investigate ways to get real-life, functional prototypes up with blazing speed. We'll look at and compare tools for truly rapid development including Python, Django, Flask, PHP, Amazon EC2 and Heroku.
Distributed ScrumMasters and the art of digital facilitationDavid Bland
This document discusses tips for facilitating distributed Scrum teams. It notes that distributed Scrum can be painful, especially for unprepared Scrum Masters, and that training often focuses on collocated teams. Some tips include using lightweight virtual Scrum boards, focusing on communication through video chat when possible, and humanizing the distributed experience by establishing trust and putting names to faces. The key messages are to start small with tools, prioritize communication effectively, and act as a champion to bind the distributed team together.
In every single day, we use the algorithm in our real life. Here we show the uses of the algorithm in our real life. What is an algorithm? Why we use algorithm? Algorithm to add number. Algorithm used in games, genetic algorithm, algorithm in programming, search algorithm, Fibonacci series algorithm and many topics we discuss in here.
Reinforcement Learning – a Rewards Based Approach to Machine Learning - Marko...Marko Lohert
Reinforcement learning is a branch of machine learning that relies on learning through the mechanism of rewards and punishments.
This is the presentation about reinforcement learning that I gave at DORS/CLUC conference in 2024 in Zagreb, Croatia (https://www.dorscluc.org).
The IT discipline of machine learning has become increasingly important in recent years. It promises to solve types of problems for which normal software development is considered unsuitable or too costly.
These slides are from a presentation on understanding Machine Learning at a high level. The talk touches on linear regression, neural networks, and how Deep Learning fits into Machine Learning.
Similar to CAT/AI: Computer Assisted Translation Assessment for Impact (20)
Pragmatic ethical and fair AI for data scientistsDavid Graus
1. David Graus presented on pragmatic and fair AI for recruitment and news recommendations.
2. He discussed how algorithms can unintentionally learn and reflect human biases around gender and race. However, AI may also help address these biases, such as through representational ranking in recruitment to achieve demographic parity.
3. Graus also explored using editorial values like diversity, dynamism and serendipity to guide news recommendations, and found their system could increase dynamism without loss of accuracy through constrained intervention.
Slidedeck of my lecture at SIKS Course "Advances in Information Retrieval"
Read more here: https://graus.nu/blog/bias-in-recommendations-lecture-siks-course-on-advances-in-ir/
RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.David Graus
The document summarizes research on recommender systems in the media industry. It discusses how FD Mediagroup uses recommender systems for their SMART Radio and SMART Journalism products. Key aspects of building a recommender system that FD focuses on include relevance, usefulness, and trust. Relevance is evaluated using metrics like NDCG, MAP, and R-Precision. Usefulness considers both algorithmic goals like diversity and business goals. Trust is evaluated based on whether users engage with the recommender system.
Zoeken, vinden, en aanbevelen: personalisatie vs. privacyDavid Graus
Lezing op de VOGIN-IP-lezing op 28 maart 2018 bij de Openbare Bibliotheek Amsterdam.
DISCLAIMER: dit praatje is een mooi stukje ouderwetse (menselijke) manipulatie: expert komt met een 5-tal aanbevelingen :-).
"Tegenwoordig kijkt men steeds vaker met argusogen naar technologiebedrijven die op grote schaal gebruikersgedrag verzamelen. In dit praatje zet ik uiteen waarom het inzetten van gebruikersgedrag van belang is, en hoe het wordt gebruikt om informatie effectief te kunnen ontsluiten en doorzoekbaar maken, of het nu gaat om een zoekmachine als Google, die zich een weg moet banen door een web van miljarden pagina’s, of een service als Spotify, die haar gebruikers graag de juiste muziek blijft aanbieden."
Layman's Talk: Entities of Interest --- Discovery in Digital TracesDavid Graus
The document outlines a program that includes a committee grilling a speaker at 10:00, the committee retreating afterwards, a ceremony at 10:15, and a reception downstairs from 11:00 to 12:30.
Slides of the talk I gave at PyData Amsterdam.
Abstract:
"The FD Mediagroep collects, analyses and filters valuable and relevant information, 24/7, for an influential group of professionals, business executives and high net worth individuals. Company.info (part of FDMG) provides complete, reliable, up-to-date company information and business news about no less than 2.7 million companies and other legal entities in the Netherlands. For Company.info we continuously monitor and crawl hundreds of (online) news sources, resulting in a large archive of (Dutch) business-related news, spanning hundreds of thousands of articles. These articles are automatically enriched, by linking the profiles of companies that are mentioned in the articles, using a custom in-house entity linking framework built in Python. In this talk, I will briefly explain the entity linking task, I will detail the implementation of our custom entity linking framework, and our pipeline for crawling and enriching news articles."
De Macht van Data --- Hoe algoritmen ons leven vormgevenDavid Graus
Slides of the introductory talk I gave at an event at De Balie: "De macht van data" on June 18th, 2017.
For a video recording of the talk see: http://graus.co/blog/mini-college-algoritmen/
Talk I gave at the Data Science Northeast Netherlands Meetup, where I detail the custom in-house entity linking framework, sentiment analysis, and entity salience scoring model we developed for Company.info, in addition to showing some example applications of our corpus of news articles linked to organization profiles.
Dynamic Collective Entity Representations for Entity RankingDavid Graus
This document proposes using collective intelligence to dynamically enrich entity representations from multiple sources like knowledge bases, anchors, tags, and tweets. It presents an adaptive ranking model that learns optimal weights for ranking features like field similarity and importance over time. An experiment on query logs shows expanding entities with different sources improves ranking and retraining the ranker with new content further enhances performance.
Dynamic Collective Entity Representations for Entity RankingDavid Graus
This document proposes using dynamic collective entity representations to improve entity ranking. It describes enriching static entity representations from knowledge bases with descriptions from dynamic sources like tweets, queries, and tags. An adaptive ranking model individually weights each description source and retrains over time using clicks. Experimental results show expanding representations and retraining the ranker improves ranking performance compared to a non-adaptive model, with different sources providing varying benefits depending on their dynamic nature and entity coverage.
David Graus presents his research on using semantic search techniques to improve information retrieval for digital forensic evidence from emails and other electronic documents. He discusses using social network analysis of communication patterns and language models of email content to predict likely recipients of emails. By combining these approaches, he is able to more accurately rank potential recipients than using either technique alone. Future work includes incorporating organizational structure and decay of communication patterns over time.
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27thDavid Graus
David Graus from the University of Amsterdam gave a presentation on entity linking at the Search Engines Amsterdam conference on June 27th. He began by defining entity linking as linking mentions of entities in text to their corresponding entities in a knowledge base. He then gave an example of entity linking and discussed ranking entity candidates based on their prior probabilities like link probability and commonness. Finally, he described using both local and global features in supervised learning models to improve entity linking accuracy.
This document discusses understanding email traffic patterns through recipient recommendation. It explores using social network analysis and language models of email content to predict likely recipients of a given email. Specifically, it examines using measures of node importance in the network, strength of connections between nodes, and similarity between language models of communication profiles to rank and select recipient nodes. The findings indicate that combining social network analysis and language modeling performs better than either approach individually, and that language model similarity is most important for interpersonal communication, while network metrics are more informative for highly active users. Recipient recommendation could help with applications like anomaly detection in e-discovery.
Generating Pseudo-ground Truth for Detecting New Concepts in Social StreamsDavid Graus
The manual curation of knowledge bases is a bottleneck in fast paced domains where new concepts constantly emerge. Identification of nascent concepts is important for improving early entity linking, content interpretation, and recommendation of new content in real-time applications. We present an unsupervised method for generating pseudo-ground truth for training a named entity recognizer to specifically identify entities that will become concepts in a knowledge base in the setting of social streams. We show that our method is able to deal with missing labels, justifying the use of pseudo-ground truth generation in this task. Finally, we show how our method significantly outperforms a lexical-matching baseline, by leveraging strategies for sampling pseudo-ground truth based on entity confidence scores and textual quality of input documents.
yourHistory - entity linking for a personalized timeline of historic eventsDavid Graus
The document describes an entity linking approach to generate a personalized timeline of historic events for a user. It involves 4 main parts: (1) fetching candidate historic events from DBpedia, (2) generating a user profile based on information extracted from the user's Facebook profile, (3) matching the candidate events to the user's interests in their profile, and (4) scoring and ranking the events to produce the final personalized timeline. The approach uses entity linking techniques to associate mentions of entities in the user's profile with the corresponding entries in a knowledge base, in order to identify the user's interests.
This document discusses research on applying text mining and information retrieval techniques for fact finding in regulatory investigations from electronic documents. The researchers are developing methods for semantic search in e-discovery to iteratively retrieve relevant evidence from emails, forums, and other sources by integrating structural context and extracting knowledge from unstructured text. Their current work includes using Twitter mining as a form of conversational search and entity linking to semantically enrich documents.
Semantic Annotation of the Cyttron DatabaseDavid Graus
Final Presentation for my MSc Graduation Project.
Abstract:
"Semantic annotation uses human knowledge formalized in ontologies to enrich texts, by providing structured and machine-understandable information of its content. This paper proposes an approach for automatically annotating texts of the Cyttron Scientific Image Database, using the NCI Thesaurus ontology. Several frequency-based keyword extraction algorithms were implemented and evaluated, aiming to extract important concepts and exclude less relevant ones. Furthermore, topic classification algorithms were applied to identify important concepts which do not occur in the text. The algorithms were evaluated by comparison to annotations provided by experts. Semantic networks were generated from these annotations and an ontology-based similarity metric was applied to perform the comparison. Finally the networks were visualized to provide further insights into the differences of the semantic structure generated by humans, and the algorithms."
More information: http://graus.nu/category/thesis
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceIndexBug
Imagine a world where machines not only perform tasks but also learn, adapt, and make decisions. This is the promise of Artificial Intelligence (AI), a technology that's not just enhancing our lives but revolutionizing entire industries.
Webinar: Designing a schema for a Data WarehouseFederico Razzoli
Are you new to data warehouses (DWH)? Do you need to check whether your data warehouse follows the best practices for a good design? In both cases, this webinar is for you.
A data warehouse is a central relational database that contains all measurements about a business or an organisation. This data comes from a variety of heterogeneous data sources, which includes databases of any type that back the applications used by the company, data files exported by some applications, or APIs provided by internal or external services.
But designing a data warehouse correctly is a hard task, which requires gathering information about the business processes that need to be analysed in the first place. These processes must be translated into so-called star schemas, which means, denormalised databases where each table represents a dimension or facts.
We will discuss these topics:
- How to gather information about a business;
- Understanding dictionaries and how to identify business entities;
- Dimensions and facts;
- Setting a table granularity;
- Types of facts;
- Types of dimensions;
- Snowflakes and how to avoid them;
- Expanding existing dimensions and facts.
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.