Building data pipelines: from simple to more advanced - hands-on experience /...Sergii Khomenko
Data is becoming one of the main decision-makers in an organisation. The more data we have the more challenges we face every day. Every decision we make will have long-term implications. In the talk we will go through different approaches to the data pipelines: from a simple in-house built, with comparison to open source solutions based on Apache stack(Apache Kafka, Apache Samza, Spark) and finally hosted auto-scaling solutions based Amazon(S3, Kinesis, Lambda, EMR) or Google(Pub/Sub, Dataflow, BigQuery). The talk covers the main aspects of data collecting processes altogether with further implications for data processing, highlighting appropriate solutions and architectures for the particular use-cases.
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...Sergii Khomenko
In the talk described our experience of setting up data reporting and Business Intelligence processes for an international company. Starting with an Excel file and bunch SQL queries, switching from in-house reporting solution to centralised hosted reports for building a flexible system for monitoring KPI of the company.
Attendees will learn from our experience how to integrate Tableau into the processes of a company, how to build independent ETL subsystems that scale to petabyte size and other useful learnings.
We will cover our early days with cloud solutions, that do not provide a DWH platform, so you can not expect any kind of production requirements. In the talk, we will go through the process of automatically duplicating our Tableau datasources to Amazon Redshift. That will enable us to be more flexible with scaling data, be sure about backup strategies and many-many more points. We will introduce our python toolchain that helps us in a daily management of our BI.
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...Sergii Khomenko
Nowadays there are plenty of solution to build a search subsystem. The question is how to keep such a system flexible and easy to react on data-driven decisions, constantly improve the quality. In talk are presented lessons learned from our experience of building lean ranking infrastructure, that could be used with data-driven approach in product development. With slides we walk through the process of scaling out the search system from a couple to 13 countries around the world, but keeping flexibility, that allows to test hypothesis on different levels and perform a/b testing in different dimensions.
Updated version of the WordPress Accessibility talk given at WordCamp Minneapolis. Understanding accessibility, implementing accessibility, WordPress Accessibility options and availability.
Schedule based network orchestration using opendaylightCoreStack
Orchestration of Project Onboarding in an IT/ITES Environment enabling the automated provisioning and deployment of networking infrastructure for project workspaces.
Webinar: Tailored Big Data Solutions using MapReduce Design Patterns Edureka!
The webinar on MapReduce Design Patterns titled " Tailored Big Data Solutions using MapReduce Design Patterns " conducted by Edureka on 26th February 2015
OLAP Basics and Fundamentals by Bharat Kalia Bharat Kalia
OLAP is a category of software technology that enables analysts, managers, and executives to gain insight into the data through fast, consistent, interactive, access in a wide variety of possible views of information that has been transformed from raw data to reflect the real dimensionality of the enterprise as understood by the user.
Graph Features in Spark 3.0: Integrating Graph Querying and Algorithms in Spa...Databricks
Spark 3.0 introduces a new module: Spark Graph. Spark Graph adds the popular query language Cypher, its accompanying Property Graph Model and Graph Algorithms to the data science toolbox. Graphs have a plethora of useful applications in recommendation, fraud detection and research. The tutorial aims to help understanding when graphs should be used and how Spark Graph can be used to extend analytical workflows. In this tutorial we will explore the concepts and motivations behind graph querying and graph algorithms, the components of the new Spark Graph module and their APIs, and how those APIs allow you to successfully write your own graph applications and integrate them in your data science workflows.
The tutorial is a mixture of presentation, code examples, and notebooks. We will demonstrate how to write an end-to-end Graph application that operates on different kinds of input data. We will show how Spark Graph interacts with Spark SQL and openCypher Morpheus, a Spark Graph extension that allows you to easily manage multiple graphs and provides built-in Property Graph Data Sources for the Neo4j graph database as well as Cypher language extensions.
At the end of the tutorial, attendees will have a good understanding of when to apply graphs in their data science workflows, how to bring Spark Graph into an existing Spark workflow and how to make best use of the new APIs. This tutorial will be both lead by the presenters and also hands-on interactive session. The tutorial material will be made available during the presentation.
Building data pipelines: from simple to more advanced - hands-on experience /...Sergii Khomenko
Data is becoming one of the main decision-makers in an organisation. The more data we have the more challenges we face every day. Every decision we make will have long-term implications. In the talk we will go through different approaches to the data pipelines: from a simple in-house built, with comparison to open source solutions based on Apache stack(Apache Kafka, Apache Samza, Spark) and finally hosted auto-scaling solutions based Amazon(S3, Kinesis, Lambda, EMR) or Google(Pub/Sub, Dataflow, BigQuery). The talk covers the main aspects of data collecting processes altogether with further implications for data processing, highlighting appropriate solutions and architectures for the particular use-cases.
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...Sergii Khomenko
In the talk described our experience of setting up data reporting and Business Intelligence processes for an international company. Starting with an Excel file and bunch SQL queries, switching from in-house reporting solution to centralised hosted reports for building a flexible system for monitoring KPI of the company.
Attendees will learn from our experience how to integrate Tableau into the processes of a company, how to build independent ETL subsystems that scale to petabyte size and other useful learnings.
We will cover our early days with cloud solutions, that do not provide a DWH platform, so you can not expect any kind of production requirements. In the talk, we will go through the process of automatically duplicating our Tableau datasources to Amazon Redshift. That will enable us to be more flexible with scaling data, be sure about backup strategies and many-many more points. We will introduce our python toolchain that helps us in a daily management of our BI.
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...Sergii Khomenko
Nowadays there are plenty of solution to build a search subsystem. The question is how to keep such a system flexible and easy to react on data-driven decisions, constantly improve the quality. In talk are presented lessons learned from our experience of building lean ranking infrastructure, that could be used with data-driven approach in product development. With slides we walk through the process of scaling out the search system from a couple to 13 countries around the world, but keeping flexibility, that allows to test hypothesis on different levels and perform a/b testing in different dimensions.
Updated version of the WordPress Accessibility talk given at WordCamp Minneapolis. Understanding accessibility, implementing accessibility, WordPress Accessibility options and availability.
Schedule based network orchestration using opendaylightCoreStack
Orchestration of Project Onboarding in an IT/ITES Environment enabling the automated provisioning and deployment of networking infrastructure for project workspaces.
Webinar: Tailored Big Data Solutions using MapReduce Design Patterns Edureka!
The webinar on MapReduce Design Patterns titled " Tailored Big Data Solutions using MapReduce Design Patterns " conducted by Edureka on 26th February 2015
OLAP Basics and Fundamentals by Bharat Kalia Bharat Kalia
OLAP is a category of software technology that enables analysts, managers, and executives to gain insight into the data through fast, consistent, interactive, access in a wide variety of possible views of information that has been transformed from raw data to reflect the real dimensionality of the enterprise as understood by the user.
Graph Features in Spark 3.0: Integrating Graph Querying and Algorithms in Spa...Databricks
Spark 3.0 introduces a new module: Spark Graph. Spark Graph adds the popular query language Cypher, its accompanying Property Graph Model and Graph Algorithms to the data science toolbox. Graphs have a plethora of useful applications in recommendation, fraud detection and research. The tutorial aims to help understanding when graphs should be used and how Spark Graph can be used to extend analytical workflows. In this tutorial we will explore the concepts and motivations behind graph querying and graph algorithms, the components of the new Spark Graph module and their APIs, and how those APIs allow you to successfully write your own graph applications and integrate them in your data science workflows.
The tutorial is a mixture of presentation, code examples, and notebooks. We will demonstrate how to write an end-to-end Graph application that operates on different kinds of input data. We will show how Spark Graph interacts with Spark SQL and openCypher Morpheus, a Spark Graph extension that allows you to easily manage multiple graphs and provides built-in Property Graph Data Sources for the Neo4j graph database as well as Cypher language extensions.
At the end of the tutorial, attendees will have a good understanding of when to apply graphs in their data science workflows, how to bring Spark Graph into an existing Spark workflow and how to make best use of the new APIs. This tutorial will be both lead by the presenters and also hands-on interactive session. The tutorial material will be made available during the presentation.
Have you heard about all the hot new features in SQL Server 2017? One of the game-changing features is Graph DB. Learn what it is, how you can use it, and what scenarios it excels in - specifically where data has strongly defined relationships and is more interconnected.
Back to the future : SQL 92 for Elasticsearch @nosqlmatters ParisLucian Precup
What if we would try to make Elasticsearch SQL 92 compliant (http://www.contrib.andrew.cmu.edu/~shadow/sql/sql1992.txt)? This wouldn't serve that much nowadays, you would say. Well, we actually tried to do the exercise and we have some interesting conclusions. While we take Elasticsearch as an example for this "side by side", the issues we are addressing also apply to nosql in general. With this unusual exercise, we take the occasion to compare relational databases / sql with Elasticsearch / nosql on all the levels : functionality, semantics, performance and user experience.
Did you know OutSystems enables you to use JavaScript to costumize how users interact with your applications? In this session, see how we've developed and extended patterns over the years and learn the best practices you can follow.
Link with resources: https://bit.ly/2F8P29j
UI Realigning & Refactoring by Jina BoltonCodemotion
Often designers and developers see Markup and CSS Refactoring as a dreaded, monolithic task. Organization, architecture, clean up, optimization, documentation all seem tedious and overwhelming. However, if you’re armed with the right tools and a solid foundation, you may find refactoring to be actually quite fun. Learn some Sass, markup, and documentation tips & tricks from a product designer’s perspective. Start making refactoring a regular part of your design process and development workflows.
Lucian Precup - Back to the Future: SQL 92 for Elasticsearch? - NoSQL matters...NoSQLmatters
What if we would try to make Elasticsearch SQL 92 compliant (http://www.contrib.andrew.cmu.edu/~shadow/sql/sql1992.txt)? This wouldn't serve that much nowadays, you would say. Well, we actually tried to do the exercise and we have some interesting conclusions. While we take Elasticsearch as an example for this "side by side", the issues we are addressing also apply to nosql in general. With this unusual exercise, we take the occasion to compare relational databases / sql with Elasticsearch / nosql on all the levels : functionality, semantics, performance and user experience.
Presented at JavaOne 2013, Tuesday September 24.
"Data Modeling Patterns" co-created with Ian Robinson.
"Pitfalls and Anti-Patterns" created by Ian Robinson.
BI Team @ LinkedIn hosted a user group meeting for MicroStrategy customers in bay area. Presentation includes information about LinkedIn, concepts of metadata driven model for business dashboards, customizations using SDK, JSP and JQUERY.
Back to Basics Webinar 5: Introduction to the Aggregation FrameworkMongoDB
This is the fifth webinar of a Back to Basics series that will introduce you to the MongoDB database. This webinar will introduce you to the aggregation framework.
Digital analytics with R - Sydney Users of R Forum - May 2015Johann de Boer
A presentation given to the Sydney Users of R forum about an open source R package I developed for querying Google Analytics data.
For instructions on getting started with ganalytics please refer to the readme file here: https://github.com/jdeboer/ganalytics/blob/master/README.md
It would be great to hear any feedback or questions you have about the ganalytics package or the presentation. Any difficulties you might encounter with installing or using the ganalytics package, please let me know so that it can be made easier for everyone. Submit issues here: https://github.com/jdeboer/ganalytics/issues/
Contributions to the package are welcome:
- Package documentation
- Adding examples and demos
- Testing and finding bugs to fix
- Ideas for improvements or new features
Thanks for your interest. Feel free to reach out to me via twitter: @johannux
Smarter Together - Bringing Relational Algebra, Powered by Apache Calcite, in...Julian Hyde
What if Looker saw the queries you just executed and could predict your next query? Could it make those queries faster, by smarter caching, or aggregate navigation? Could it read your past SQL queries and help you write your LookML model? Those are some of the reasons to add relational algebra into Looker’s query engine, and why Looker hired Julian Hyde, author of Apache Calcite, to lead the effort. In this talk about the internals of Looker’s query engine, Julian Hyde will describe how the engine works, how Looker queries are described in Calcite’s relational algebra, and some features that it makes possible.
A talk by Julian Hyde at JOIN 2019 in San Francisco.
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...Sergii Khomenko
Data cleaning is the first step of every Data Science project. Next one does Data Science. The talk covers a missing step of deployment and scaling Data Applications in production. We will go through all major steps of the process like Dockerizing application, Continuous Deployment with further AWS stack creation and rolling deploys although also covering new trends in Serverless architecture.
Have you heard about all the hot new features in SQL Server 2017? One of the game-changing features is Graph DB. Learn what it is, how you can use it, and what scenarios it excels in - specifically where data has strongly defined relationships and is more interconnected.
Back to the future : SQL 92 for Elasticsearch @nosqlmatters ParisLucian Precup
What if we would try to make Elasticsearch SQL 92 compliant (http://www.contrib.andrew.cmu.edu/~shadow/sql/sql1992.txt)? This wouldn't serve that much nowadays, you would say. Well, we actually tried to do the exercise and we have some interesting conclusions. While we take Elasticsearch as an example for this "side by side", the issues we are addressing also apply to nosql in general. With this unusual exercise, we take the occasion to compare relational databases / sql with Elasticsearch / nosql on all the levels : functionality, semantics, performance and user experience.
Did you know OutSystems enables you to use JavaScript to costumize how users interact with your applications? In this session, see how we've developed and extended patterns over the years and learn the best practices you can follow.
Link with resources: https://bit.ly/2F8P29j
UI Realigning & Refactoring by Jina BoltonCodemotion
Often designers and developers see Markup and CSS Refactoring as a dreaded, monolithic task. Organization, architecture, clean up, optimization, documentation all seem tedious and overwhelming. However, if you’re armed with the right tools and a solid foundation, you may find refactoring to be actually quite fun. Learn some Sass, markup, and documentation tips & tricks from a product designer’s perspective. Start making refactoring a regular part of your design process and development workflows.
Lucian Precup - Back to the Future: SQL 92 for Elasticsearch? - NoSQL matters...NoSQLmatters
What if we would try to make Elasticsearch SQL 92 compliant (http://www.contrib.andrew.cmu.edu/~shadow/sql/sql1992.txt)? This wouldn't serve that much nowadays, you would say. Well, we actually tried to do the exercise and we have some interesting conclusions. While we take Elasticsearch as an example for this "side by side", the issues we are addressing also apply to nosql in general. With this unusual exercise, we take the occasion to compare relational databases / sql with Elasticsearch / nosql on all the levels : functionality, semantics, performance and user experience.
Presented at JavaOne 2013, Tuesday September 24.
"Data Modeling Patterns" co-created with Ian Robinson.
"Pitfalls and Anti-Patterns" created by Ian Robinson.
BI Team @ LinkedIn hosted a user group meeting for MicroStrategy customers in bay area. Presentation includes information about LinkedIn, concepts of metadata driven model for business dashboards, customizations using SDK, JSP and JQUERY.
Back to Basics Webinar 5: Introduction to the Aggregation FrameworkMongoDB
This is the fifth webinar of a Back to Basics series that will introduce you to the MongoDB database. This webinar will introduce you to the aggregation framework.
Digital analytics with R - Sydney Users of R Forum - May 2015Johann de Boer
A presentation given to the Sydney Users of R forum about an open source R package I developed for querying Google Analytics data.
For instructions on getting started with ganalytics please refer to the readme file here: https://github.com/jdeboer/ganalytics/blob/master/README.md
It would be great to hear any feedback or questions you have about the ganalytics package or the presentation. Any difficulties you might encounter with installing or using the ganalytics package, please let me know so that it can be made easier for everyone. Submit issues here: https://github.com/jdeboer/ganalytics/issues/
Contributions to the package are welcome:
- Package documentation
- Adding examples and demos
- Testing and finding bugs to fix
- Ideas for improvements or new features
Thanks for your interest. Feel free to reach out to me via twitter: @johannux
Smarter Together - Bringing Relational Algebra, Powered by Apache Calcite, in...Julian Hyde
What if Looker saw the queries you just executed and could predict your next query? Could it make those queries faster, by smarter caching, or aggregate navigation? Could it read your past SQL queries and help you write your LookML model? Those are some of the reasons to add relational algebra into Looker’s query engine, and why Looker hired Julian Hyde, author of Apache Calcite, to lead the effort. In this talk about the internals of Looker’s query engine, Julian Hyde will describe how the engine works, how Looker queries are described in Calcite’s relational algebra, and some features that it makes possible.
A talk by Julian Hyde at JOIN 2019 in San Francisco.
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...Sergii Khomenko
Data cleaning is the first step of every Data Science project. Next one does Data Science. The talk covers a missing step of deployment and scaling Data Applications in production. We will go through all major steps of the process like Dockerizing application, Continuous Deployment with further AWS stack creation and rolling deploys although also covering new trends in Serverless architecture.
Building Data applications with Go: from Bloom filters to Data pipelines / FO...Sergii Khomenko
Many people use Go for different projects: WebDev, DevOps or other general-purpose tasks. On another hand, with all the beauty and performance of the language it could be a good challenger for Data applications. In the talk, we will go through the common problems of Data Engineering. Starting with high-performance caching and probabilistic data structures like Bloom filters, CountMin or Hyperloglog. We will cover all stages of Data Pipelining like writing data producers for open source Apache Kafka or proprietary Amazon Kinesis or Google Pub/Sub with further data consuming and processing.
The talk covers real-life use-cases of Data Applications and will provide an overview of existing possibilities of Golang as a language for data engineering. In the talk, we will cover basic ideas of building high-performance data application, creating your own data pipelines based on open source solutions and also hosted proprietary like Amazon Kinesis or Google Pub/Sub. The idea is to provide an overview how good is Golang for data engineering and what are Pros and Cons.
Secure Data Scalability at Stylight with Tableau Online and Amazon Redshift /...Sergii Khomenko
Tableau provides a rich feature set, in particular Tableau Server/Online data storage. It comes easy and cheap to use, allowing for a quick start and fast growth. However, it is no permanent storage solution. We painfully experienced this first hand at beginning of the year.
We provide a quick overview on how we initially set up our Tableau infrastructure and how this limited us. Focus will be on steps we came up with to improve our working environment.
Talk will be targeted at companies relying on Tableau environment. We will go through the process of automatically duplicating your Tableau data sources to Amazon Redshift. We will introduce our python toolchain for daily management. It enables you to be more flexible with scaling your data, being sure about backup strategies and many more points.
http://ontour15.tableau.com/berlin/schedule/tuesday#session-1456
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015Sergii Khomenko
Puppet is widely known in DevOps community, but not so popular in data teams. Nevertheless, Puppet could easily empower your data teams. In the talk presented hands-on experience of using Puppet for different data topics starting from configuring Windows machine for Business Intelligence and finishing with advanced ranking infrastructures based on Puppet.
The talk will walk you through the process of setting up a standalone Puppet configuration, that used for provisioning Windows machine to be utilized for Business Intelligence purposes like Tableau and Talend Big Data configurations, ETL scheduling etc. Second part of the talk will cover a use-case of Puppet for enabling a lean ranking infrastructure.
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...Sergii Khomenko
Initially storing data with Tableau Online storage could sound like a good idea, mainly because you get it for free. From another side Tableau doesn't say they provide a DWH platform, so you can not expect any kind of production use-cases from the storage of Tableau Online. In the talk, we will go through the process of automatically duplicating your Tableau datasources to Amazon Redshift. That will enable us to be more flexible with scaling your data, be sure about backup strategies and many-many more points. We will introduce our python toolchain that helps us in a daily management of our BI.
From simple to more advanced: Lessons learned in 13 months with TableauSergii Khomenko
In the talk described our experience of integration Tableau into our data reporting and Business Intelligence process.
Switching from in-house reporting solution to Tableau reporting, data refreshes with Command Line Utility
and other small tips and tricks. Using Tableau reports for building a flexible system for monitoring KPI of a company.
Crunching data with go: Tips, tricks, use-casesSergii Khomenko
Talk for the first meetup of Munich Golang User Group. Described use-cases from real Go development, covered fetching data from sql database, connecting to Google services like Google Analytics, Google BigQuery, other aspect of building a geolocation application.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Welocme to ViralQR, your best QR code generator.ViralQR
Welcome to ViralQR, your best QR code generator available on the market!
At ViralQR, we design static and dynamic QR codes. Our mission is to make business operations easier and customer engagement more powerful through the use of QR technology. Be it a small-scale business or a huge enterprise, our easy-to-use platform provides multiple choices that can be tailored according to your company's branding and marketing strategies.
Our Vision
We are here to make the process of creating QR codes easy and smooth, thus enhancing customer interaction and making business more fluid. We very strongly believe in the ability of QR codes to change the world for businesses in their interaction with customers and are set on making that technology accessible and usable far and wide.
Our Achievements
Ever since its inception, we have successfully served many clients by offering QR codes in their marketing, service delivery, and collection of feedback across various industries. Our platform has been recognized for its ease of use and amazing features, which helped a business to make QR codes.
Our Services
At ViralQR, here is a comprehensive suite of services that caters to your very needs:
Static QR Codes: Create free static QR codes. These QR codes are able to store significant information such as URLs, vCards, plain text, emails and SMS, Wi-Fi credentials, and Bitcoin addresses.
Dynamic QR codes: These also have all the advanced features but are subscription-based. They can directly link to PDF files, images, micro-landing pages, social accounts, review forms, business pages, and applications. In addition, they can be branded with CTAs, frames, patterns, colors, and logos to enhance your branding.
Pricing and Packages
Additionally, there is a 14-day free offer to ViralQR, which is an exceptional opportunity for new users to take a feel of this platform. One can easily subscribe from there and experience the full dynamic of using QR codes. The subscription plans are not only meant for business; they are priced very flexibly so that literally every business could afford to benefit from our service.
Why choose us?
ViralQR will provide services for marketing, advertising, catering, retail, and the like. The QR codes can be posted on fliers, packaging, merchandise, and banners, as well as to substitute for cash and cards in a restaurant or coffee shop. With QR codes integrated into your business, improve customer engagement and streamline operations.
Comprehensive Analytics
Subscribers of ViralQR receive detailed analytics and tracking tools in light of having a view of the core values of QR code performance. Our analytics dashboard shows aggregate views and unique views, as well as detailed information about each impression, including time, device, browser, and estimated location by city and country.
So, thank you for choosing ViralQR; we have an offer of nothing but the best in terms of QR code services to meet business diversity!
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
By Design, not by Accident - Agile Venture Bolzano 2024
Lean Ranking infrastructure with Solr
1. LEAN RANKING INFRASTRUCTURE WITH SOLR
Ranking infrastructure and using Solr for lean prototyping
sergii.khomenko@stylight.com, @lc0d3r
Slide 1 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
13. IMPROVING RELEVANCE
TF-IDF - default scoring model in Lucene/Solr
•
•
•
matching more query terms is better
more occurrences of a query term is better
more novel terms increase doc score more than common terms
Slide 13 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
14. IMPROVING RELEVANCE
• Stages to improve relevance in Solr
• Editorial voting (QueryEvaluationComponent)
• Indexing time
(analyzing content, text analysis)
• Query-time
(function queries, boosting)
Slide 14 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
17. IMPROVING RELEVANCE
Example of external file with boosting
coresde_DEproductsexternal_delta2.txt
15062471=0.5
15062479=0.2
15062507=0.41
Slide 17 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
18. LEAN APPROACH TO RANKING
Slide 18 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
19. LEAN APPROACH TO RANKING
Lean manufacturing, lean enterprise, or lean production, often simply,
"lean", is a production practice that considers the expenditure of resources for any goal
other than the creation of value for the end customer to be wasteful, and thus a target
for elimination.
Essentially, lean is centered on preserving value with less work.
Slide 19 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
20. LEAN APPROACH TO RANKING
Requirements:
• Decreasing time to implement new ranking model
• Possibility to use more dynamic ranking models
• Keeping working infrastructure alive
• A/B testing without changing entire infrastructure
• Performance level - “still fast” and “transparent”
Slide 20 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
21. LEAN APPROACH TO RANKING
Python benchmark
-h, --help
show this help message and exit
--gaid gaid, -g gaid Google analytics site id.
--gadate gadate a date to fetch the most popular pages from Google Analytics
–-solr solr, -s solr Solr server to benchmark performance.
--pages number, -p number a number of top pages from Google Analytics.
--repeats number, -r number a number of repeats for an every page.
--compare compare, -c compare Different rankings algorithms to compare.
--cmpmode CMPMODE run benchmark in comparison mode
python solr-benchmarkbenchmark.py -c RankingClassical,RankingDelta2
python solr-benchmarkbenchmark.py -c RankingClassical,RankingDelta2 --cmpmode 1
Slide 21 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
22. LEAN APPROACH TO RANKING
Common search infrastructure
nginx
Solr-loadbalancer
Slide 22 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
nginx
Solr
nginx
Jboss
Solr
Solr
24. LEAN APPROACH TO RANKING
nginx / templates / conf / solr-rewrites.conf.erb
include nginx
nginx::config { "solr_dev": }
nginx::solr-ranking { "delta2":
urls => [
"/search.action?gender=women&brand=2271&tag=1161&tag=877&tag=468",
"/search.action?gender=men&brand=11235&tag=10203&tag=10299&tag=10326"
],
Slide 24 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
25. LEAN APPROACH TO RANKING
nginx / templates / conf / solr-rewrites.conf.erb
<% urls.each do |url| -%>
if ($args ~* <% if url['gender'] > 0 -%>gender_id%3A<%= url['gender'] %>.*<% end %><% url['tags'].each do |tag| -%>tag_id%3A<%= tag %>.*<% end -%><% if
url['brand'] > 0 -%>brand_id%3A%28<%= url['brand'] %>%29<% end -%>) {
set $orig $args;
set $args "q={!boost+b=%24b+defType=dismax+v=%24qq}&qq=id:*";
rewrite ^(.*)$ "$1?$orig" break;
}
<% end -%>
Slide 25 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
31. REAL-WORLD EXAMPLES
Multiple points to evaluate
Stages to evaluate the model:
• R ranking model
• Independent Solr-node
• For internal use-cases
• Testing for some of pages
• A/B roll out for % of users
• Production roll out
Slide 31 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
32. THANKS FOR YOUR ATTENTION!
Questions?
Slide 32 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
33. Sergii Khomenko
Data Scientist
STYLIGHT GmbH
sergii.khomenko@stylight.com
@lc0d3r
http://www.stylight.com
Nymphenburger Straße 86
80636 Munich, Germany
Slide 33 | STYLIGHT | Proud to bleed purple | @lc0d3r and @stylight_eng
34. REFERENCE LIST
• Stack Overflow Tag Trends
http://hewgill.com/~greg/stackoverflow/stack_overflow/tags/#!l
ucene+solr+elasticsearch+sphinx
• Public websites using Solr
http://wiki.apache.org/solr/PublicServers
• CommonQueryParameters
http://wiki.apache.org/solr/CommonQueryParameters
• Thoughts in plain text http://lc0.github.io/
• STYLIGHT Engineering http://www.stylight.com/Engineering/