Lijun Chang, DECRA Fellow at the University of New South Wales talked about how to make subgraph matching more efficient thanks to postponing Cartesian products.
after a brief intro about indexes present in PostgreSQL, the new feature in PostgreSQL 10 are shown: parallel index scans, hash persistence, BRIN autosummarization, unbalanced support (SP-GiST) for inet data.
We introduce a sparse kernel learning framework for the Continuous Relevance Model (CRM). State-of-the-art image annotation models linearly combine evidence from several different feature types to improve image annotation accuracy. While previous authors have focused on learning the linear combination weights for these features, there has been no work examining the optimal combination of kernels. We address this gap by formulating a sparse kernel learning framework for the CRM, dubbed the SKL-CRM, that greedily selects an optimal combination of kernels. Our kernel learning framework rapidly converges to an annotation accuracy that substantially outperforms a host of state-of-the-art annotation models. We make two surprising conclusions: firstly, if the kernels are chosen correctly, only a very small number of features are required so to achieve superior performance over models that utilise a full suite of feature types; and secondly, the standard default selection of kernels commonly used in the literature is sub-optimal, and it is much better to adapt the kernel choice based on the feature type and image dataset.
after a brief intro about indexes present in PostgreSQL, the new feature in PostgreSQL 10 are shown: parallel index scans, hash persistence, BRIN autosummarization, unbalanced support (SP-GiST) for inet data.
We introduce a sparse kernel learning framework for the Continuous Relevance Model (CRM). State-of-the-art image annotation models linearly combine evidence from several different feature types to improve image annotation accuracy. While previous authors have focused on learning the linear combination weights for these features, there has been no work examining the optimal combination of kernels. We address this gap by formulating a sparse kernel learning framework for the CRM, dubbed the SKL-CRM, that greedily selects an optimal combination of kernels. Our kernel learning framework rapidly converges to an annotation accuracy that substantially outperforms a host of state-of-the-art annotation models. We make two surprising conclusions: firstly, if the kernels are chosen correctly, only a very small number of features are required so to achieve superior performance over models that utilise a full suite of feature types; and secondly, the standard default selection of kernels commonly used in the literature is sub-optimal, and it is much better to adapt the kernel choice based on the feature type and image dataset.
http://www.ncbi.nlm.nih.gov/pubmed/20236959
J R Soc Interface. 2010 Sep 6;7(50):1341-54. doi: 10.1098/rsif.2010.0063. Epub 2010 Mar 17.
Topological network alignment uncovers biological function and phylogeny.
Kuchaiev O, Milenkovic T, Memisevic V, Hayes W, Przulj N.
http://www.ncbi.nlm.nih.gov/pubmed/19259413
Cancer Inform. 2008;6:257-73. Epub 2008 Apr 14.
Uncovering biological network function via graphlet degree signatures.
Milenković T, Przulj N.
An Improved Optimization Techniques for Parallel Prefix Adder using FPGAIJMER
International Journal of Modern Engineering Research (IJMER) is Peer reviewed, online Journal. It serves as an international archival forum of scholarly research related to engineering and science education.
Design and implementation of Parallel Prefix Adders using FPGAsIOSR Journals
Abstract: Adders are known to have the frequently used in VLSI designs. In digital design we have half adder and full adder, main adders by using these adders we can implement ripple carry adders. RCA use to perform any number of addition. In this RCA is serial adder and it has commutation delay problem. If increase the ha&fa simultaneously delay also increase. That’s why we go for parallel adders(parallel prefix adders). IN the parallel prefix adder are ks adder(kogge-stone),sks adder(sparse kogge-stone),spaning tree and brentkung adder. These adders are designd and implemented on FPGA sparton3E kit. Simulated and synthesis by model sim6.4b, Xilinx ise10.1.
VLSI Projects for M. Tech, VLSI Projects in Vijayanagar, VLSI Projects in Bangalore, M. Tech Projects in Vijayanagar, M. Tech Projects in Bangalore, VLSI IEEE projects in Bangalore, IEEE 2015 VLSI Projects, FPGA and Xilinx Projects, FPGA and Xilinx Projects in Bangalore, FPGA and Xilinx Projects in Vijayangar
Advanced Analytics and Data Visualisation in Forest Management and PlanningNoli Sicad
Advanced Analytics and Data Visualisation in
Forest Management and Planning
E. Noli Sicad, PhD
ForestTECH 2017, 21-22 November 2017, Melbourne, Australia
In this talk we will describe a methodology to handle the causality to make inference on common-cause failure in a situation of missing data. The data are collected in the form of contingency table but the available information are only the numbers of CCF of different orders and the numbers of failure due to a given cause. Therefore only the margins of the contingency table are observed; thefrequencies in each cell are unknown. Assuming a Poisson model for the count, we suggest a Bayesian approach and we use the inverse Bayes formula (IBF) combined with a Metropolis-Hastings algorithm to make inference on the rate of occurrence for the different combination cause, order. The performance of the resulting algorithm is evaluated through simulations. A comparison is made with results obtained from the _-composition approach to deal with causality suggested by Zheng et al. (2013).
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...Asai Masataro
This is a presentation used in the aural session in AAAI-16. The original paper is available at http: guicho271828.github.io/publications/ .
# Abstract
Despite recent improvements in search techniques for cost-optimal classical planning, the exponential growth of
the size of the search frontier in A* is unavoidable. We investigate tiebreaking strategies for A*,
experimentally analyzing the performance of standard tiebreaking strategies that break ties according to the heuristic value of the nodes. We find that tiebreaking has a significant impact on search algorithm performance when there are zero-cost operators that induce large plateau regions in the search space. We develop a new framework for tiebreaking based on a depth metric which measures distance from the entrance to the plateau, and propose a new, randomized strategy which significantly outperforms standard strategies on domains with zero-cost actions.
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AESidescitation
Secure data transmission is very important in any communication systems.
Network Security provides many techniques for efficient data transmission through
unprotected network. Cryptography provides a method for securing the transmission of
information by the process of encryption. Encryption converts the message in to unreadable
form (Cipher Text) . Decryption converts this Cipher Text back to original message.
Advanced Encryption Standard (AES) has been used as the first choice of cryptographic
algorithm for many security based applications because of the high level of security and
flexibility of implementation in hardware and software. This paper presents an area
efficient, low power design for AES based on an 8-bit data path making it suitable for
wireless security applications. It has a significant power-area-latency performance
improvements over other existing AES designs. For high performance applications, AES S-
box and inverse S-box implemented using composite field Arithmetic (CFA). Also low
resource Mixcolumn structure is used in this structure. The 8 bit data path architecture is
implemented in XILINX 13.2 and simulated using MODELSIM 6.5 software. Also the
power and area calculation is done with the help of SYNOPSYS software.
1. Advantages of Karnaugh Maps
2. SOPS And POPS
3. Properties
4. Simplification Process
5. How to solve the Karenaugh map?
6. Different Types Of K-Maps
7. Presentation Introduction
8. Don't Care Condition
9. Conclution
http://www.ncbi.nlm.nih.gov/pubmed/20236959
J R Soc Interface. 2010 Sep 6;7(50):1341-54. doi: 10.1098/rsif.2010.0063. Epub 2010 Mar 17.
Topological network alignment uncovers biological function and phylogeny.
Kuchaiev O, Milenkovic T, Memisevic V, Hayes W, Przulj N.
http://www.ncbi.nlm.nih.gov/pubmed/19259413
Cancer Inform. 2008;6:257-73. Epub 2008 Apr 14.
Uncovering biological network function via graphlet degree signatures.
Milenković T, Przulj N.
An Improved Optimization Techniques for Parallel Prefix Adder using FPGAIJMER
International Journal of Modern Engineering Research (IJMER) is Peer reviewed, online Journal. It serves as an international archival forum of scholarly research related to engineering and science education.
Design and implementation of Parallel Prefix Adders using FPGAsIOSR Journals
Abstract: Adders are known to have the frequently used in VLSI designs. In digital design we have half adder and full adder, main adders by using these adders we can implement ripple carry adders. RCA use to perform any number of addition. In this RCA is serial adder and it has commutation delay problem. If increase the ha&fa simultaneously delay also increase. That’s why we go for parallel adders(parallel prefix adders). IN the parallel prefix adder are ks adder(kogge-stone),sks adder(sparse kogge-stone),spaning tree and brentkung adder. These adders are designd and implemented on FPGA sparton3E kit. Simulated and synthesis by model sim6.4b, Xilinx ise10.1.
VLSI Projects for M. Tech, VLSI Projects in Vijayanagar, VLSI Projects in Bangalore, M. Tech Projects in Vijayanagar, M. Tech Projects in Bangalore, VLSI IEEE projects in Bangalore, IEEE 2015 VLSI Projects, FPGA and Xilinx Projects, FPGA and Xilinx Projects in Bangalore, FPGA and Xilinx Projects in Vijayangar
Advanced Analytics and Data Visualisation in Forest Management and PlanningNoli Sicad
Advanced Analytics and Data Visualisation in
Forest Management and Planning
E. Noli Sicad, PhD
ForestTECH 2017, 21-22 November 2017, Melbourne, Australia
In this talk we will describe a methodology to handle the causality to make inference on common-cause failure in a situation of missing data. The data are collected in the form of contingency table but the available information are only the numbers of CCF of different orders and the numbers of failure due to a given cause. Therefore only the margins of the contingency table are observed; thefrequencies in each cell are unknown. Assuming a Poisson model for the count, we suggest a Bayesian approach and we use the inverse Bayes formula (IBF) combined with a Metropolis-Hastings algorithm to make inference on the rate of occurrence for the different combination cause, order. The performance of the resulting algorithm is evaluated through simulations. A comparison is made with results obtained from the _-composition approach to deal with causality suggested by Zheng et al. (2013).
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...Asai Masataro
This is a presentation used in the aural session in AAAI-16. The original paper is available at http: guicho271828.github.io/publications/ .
# Abstract
Despite recent improvements in search techniques for cost-optimal classical planning, the exponential growth of
the size of the search frontier in A* is unavoidable. We investigate tiebreaking strategies for A*,
experimentally analyzing the performance of standard tiebreaking strategies that break ties according to the heuristic value of the nodes. We find that tiebreaking has a significant impact on search algorithm performance when there are zero-cost operators that induce large plateau regions in the search space. We develop a new framework for tiebreaking based on a depth metric which measures distance from the entrance to the plateau, and propose a new, randomized strategy which significantly outperforms standard strategies on domains with zero-cost actions.
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AESidescitation
Secure data transmission is very important in any communication systems.
Network Security provides many techniques for efficient data transmission through
unprotected network. Cryptography provides a method for securing the transmission of
information by the process of encryption. Encryption converts the message in to unreadable
form (Cipher Text) . Decryption converts this Cipher Text back to original message.
Advanced Encryption Standard (AES) has been used as the first choice of cryptographic
algorithm for many security based applications because of the high level of security and
flexibility of implementation in hardware and software. This paper presents an area
efficient, low power design for AES based on an 8-bit data path making it suitable for
wireless security applications. It has a significant power-area-latency performance
improvements over other existing AES designs. For high performance applications, AES S-
box and inverse S-box implemented using composite field Arithmetic (CFA). Also low
resource Mixcolumn structure is used in this structure. The 8 bit data path architecture is
implemented in XILINX 13.2 and simulated using MODELSIM 6.5 software. Also the
power and area calculation is done with the help of SYNOPSYS software.
1. Advantages of Karnaugh Maps
2. SOPS And POPS
3. Properties
4. Simplification Process
5. How to solve the Karenaugh map?
6. Different Types Of K-Maps
7. Presentation Introduction
8. Don't Care Condition
9. Conclution
Scalable and Adaptive Graph Querying with MapReduceKyong-Ha Lee
We address the problem of processing multiple graph queries over a massive set of data graphs in this letter. As the number of data graphs is growing rapidly, it is often hard to process graph queries with serial algorithms in a timely manner. We propose a distributed graph querying algorithm, which employs feature-based comparison and a filterand-verify scheme working on the MapReduce framework. Moreover, we devise an ecient scheme that adaptively tunes a proper feature size at runtime by sampling data graphs. With various experiments, we show that the proposed method outperforms conventional algorithms in terms of both scalability and efficiency.
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks Ryan Rossi
Massively parallel architectures such as the GPU are becoming increasingly important due to the recent proliferation of data. In this paper, we propose a key class of hybrid parallel graphlet algorithms that leverages multiple CPUs and GPUs simultaneously for computing k-vertex induced subgraph statistics (called graphlets). In addition to the hybrid multi-core CPU-GPU framework, we also investigate single GPU methods (using multiple cores) and multi-GPU methods that leverage all available GPUs simultaneously for computing induced subgraph statistics. Both methods leverage GPU devices only, whereas the hybrid multi-core CPU-GPU framework leverages all available multi-core CPUs and multiple GPUs for computing graphlets in large networks. Compared to recent approaches, our methods are orders of magnitude faster, while also more cost effective enjoying superior performance per capita and per watt. In particular, the methods are up to 300+ times faster than a recent state-of-the-art method. To the best of our knowledge, this is the first work to leverage multiple CPUs and GPUs simultaneously for computing induced subgraph statistics.
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...Harshal Chaudhari
The presentation accompanying the paper at SDM 2018 - https://epubs.siam.org/doi/abs/10.1137/1.9781611975321.50
Github: https://github.com/chdhr-harshal/mc-monitor
In networking applications, one often wishes to obtain estimates about the number of objects at different parts of the network (e.g., the number of cars at an intersection of a road network or the number of packets expected to reach a node in a computer network) by monitoring the traffic in a small number of network nodes or edges. We formalize this task by defining the Markov Chain Monitoring problem. Given an initial distribution of items over the nodes of a Markov chain, we wish to estimate the distribution of items at subsequent times. We do this by asking a limited number of queries that retrieve, for example, how many items transitioned to a specific node or over a specific edge at a particular time. We consider different types of queries, each defining a different variant of the Markov Chain Monitoring. For each variant, we design efficient algorithms for choosing the queries that make our estimates as accurate as possible. In our experiments with synthetic and real datasets we demonstrate the efficiency and the efficacy of our algorithms in a variety of settings.
Uniformity in mechanical properties of the slab affects quality of subsequent rolling process. One of the most important factors deciding quality of the slab is fluctuation of the molten steel level in the mould. That is, smoothing pouring without fluctuating in the mould level means improvement in quality of the slab and protects break-out problem and allows high speed casting process. If molten steel surface fluctuates severely, the forming oscillation marks on the slab is unstable, solidification of molten steel is not uniform and there will be entrapment of mould powder in the solidified cast strand. It makes quality of the slab inferior and generates defects on the slab.
Textbook. The textbook Algorithms, 4th Edition by Robert Sedgewick and Kevin Wayne surveys the most important algorithms and data structures in use today.
http://algs4.cs.princeton.edu/
On the value of Sampling and Pruning for SBSEJianfeng Chen
Oral Prelim Exam slides (for publication).
Thesis statement: for the optimization of SE planning and replanning tasks, given appropriate separation operators, then oversampling and pruning is better than mutation based evolutionary approaches.
Fast and Scalable NUMA-based Thread Parallel Breadth-first SearchYuichiro Yasui
The 2015 International Conference on High Performance Computing & Simulation (HPCS2015)
Session 9A: July 22, 14:45 − 16:00
July 20 – 24, 2015, Amsterdam, the Netherlands
Similar to 8th TUC Meeting | Lijun Chang (University of New South Wales). Efficient Subgraph Matching by Postponing Cartesian Products. (20)
8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...LDBC council
Juan Sequeda, Co-founder of Capsenta, gave an interesting talk on how can we integrate data using graphs and semantics (semantic data virtualization). As Mr. Sequeda said, the idea is to integrate data without needing to move it around. Juan started off his presentation talking about the huge gap that exists between the IT departments, guardians of the data and the business development departments, trying to extract insights about the data.
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...LDBC council
During the 8th TUC Meeting held at Oracle’s facilities in Redwood City, California, Zhe Wu, Software Architect at Oracle Spatial and Graph, explained how is his team trying to bridge RDF Graph and Property Data Models.
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics EngineLDBC council
Yinlong started his talk with an introduction of his new position at Huawei, what is the company doing and more specifically how is it involved with Big Data Research and graphs. He also explained that his research center is currently working on Big Data Analytics and Management from 4 sides: Natural Language Processing, Graph analyrics, Machine Learning and Deep Learning.
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...LDBC council
George Fletcher, Associate Professor at the Eindhoven University of Technology, presented gMark, an open-source framework for generating synthetic graph instances and workloads. The main focus of gMark has been to tailor different graph data management scenarios, often driven by query workloads. Such as multi-query optimization, workload-driven graph database physical design or mapping discovery and query rewriting data integration systems.
8th TUC Meeting – Marcus Paradies (SAP) Social Network BenchmarkLDBC council
Marcus Paradies, Software developer at SAP extended the talk Arnau Prats gave about the SNB, in this case about the Intelligence workload. In contrast with the 17+4 queries the Interactive workload has, the Business Intelligence (BI) workload consists on 24 queries that can be seen as OLAP-style against the OLTP-style of the Interactive one. The BI focuses on analytic queries and they touch the whole graph.
Sergey Edunov, Software Engineer at Facebook gave a great talk on how and why his company generating large-scale social graphs. The underlying reasons to start such an ambitious project are capacity planning to make sure that their system will be able to handle a graph that keeps growing year after year and fair evaluation of their system against the ones being implemented by other companies.
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...LDBC council
Weining Qian, professor at East China Normal University presented his talk on Statistical Characteristics of Real-Life Knowledge graphs during the 8th TUC Meeting held at Oracle’s facilities in Redwood City, California.
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force statusLDBC council
Peter Boncz, Research Scientist at the Centrum Wiskunde & Informatica in the Netherlands, talked about the updates on the Graph Query Language Task Force after being alive for a year. This Task Force was created to answer an issue detected during the benchmark meetings, all the workload is created in English text because there is no common graph query language.
Jerven Bolleman, Lead Software Developer at Swiss-Prot Group, explained why are they offering a free SPARQL and RDF endpoint for the world to use and why is it hard to optimize it.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Welcome to the first live UiPath Community Day Dubai! Join us for this unique occasion to meet our local and global UiPath Community and leaders. You will get a full view of the MEA region's automation landscape and the AI Powered automation technology capabilities of UiPath. Also, hosted by our local partners Marc Ellis, you will enjoy a half-day packed with industry insights and automation peers networking.
📕 Curious on our agenda? Wait no more!
10:00 Welcome note - UiPath Community in Dubai
Lovely Sinha, UiPath Community Chapter Leader, UiPath MVPx3, Hyper-automation Consultant, First Abu Dhabi Bank
10:20 A UiPath cross-region MEA overview
Ashraf El Zarka, VP and Managing Director MEA, UiPath
10:35: Customer Success Journey
Deepthi Deepak, Head of Intelligent Automation CoE, First Abu Dhabi Bank
11:15 The UiPath approach to GenAI with our three principles: improve accuracy, supercharge productivity, and automate more
Boris Krumrey, Global VP, Automation Innovation, UiPath
12:15 To discover how Marc Ellis leverages tech-driven solutions in recruitment and managed services.
Brendan Lingam, Director of Sales and Business Development, Marc Ellis
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Enhancing Performance with Globus and the Science DMZGlobus
ESnet has led the way in helping national facilities—and many other institutions in the research community—configure Science DMZs and troubleshoot network issues to maximize data transfer performance. In this talk we will present a summary of approaches and tips for getting the most out of your network infrastructure using Globus Connect Server.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
The Metaverse and AI: how can decision-makers harness the Metaverse for their...Jen Stirrup
The Metaverse is popularized in science fiction, and now it is becoming closer to being a part of our daily lives through the use of social media and shopping companies. How can businesses survive in a world where Artificial Intelligence is becoming the present as well as the future of technology, and how does the Metaverse fit into business strategy when futurist ideas are developing into reality at accelerated rates? How do we do this when our data isn't up to scratch? How can we move towards success with our data so we are set up for the Metaverse when it arrives?
How can you help your company evolve, adapt, and succeed using Artificial Intelligence and the Metaverse to stay ahead of the competition? What are the potential issues, complications, and benefits that these technologies could bring to us and our organizations? In this session, Jen Stirrup will explain how to start thinking about these technologies as an organisation.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
8th TUC Meeting | Lijun Chang (University of New South Wales). Efficient Subgraph Matching by Postponing Cartesian Products.
1. Computer Science and Engineering
Lijun Chang
Efficient Subgraph Matching by
Postponing Cartesian Products
Lijun.Chang@unsw.edu.au
The University of New South Wales, Australia
2. 2
Outline
Ø Introduction & Existing Works
Ø Challenges of Subgraph Matching
Ø Our Approach: CFL-Match
v Core-First based Framework
v Compact Path Index (CPI) based Matching
Ø Experiment
Ø Conclusion
8. 8
Hardness Result
Ø Subgraph Isomorphism Testing
Ø Decide whether there is a subgraph of G that is isomophic to q
Ø NP-complete
Ø Enumerating all subgraph embeddings is harder
Ø This is the problem we study
9. 9
Existing Work
Ø Ullmann’s algorithm [J.ACM’76]
§ Iteratively maps query vertices one by one, following the input order of
query vertices.
§ Example: Input order could be (u1, u2, u3, u4, u5, u6)
§ Cartesian Products between vertices’ candidates.
Ø VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø TurboISO [SIGMOD’13]
Ø BoostISO [VLDB’15]
10. 10
Existing Work
Ø Ullmann’s algorithm [J.ACM’76]
Ø VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
§ Independently propose to enforce connectivity of the matching order to
reduce Cartesian products caused by disconnected query vertices.
§ QuickSI further removes false-positive candidates by first
processing infrequent query vertices and edges.
§ Connected order could be (u5, u1, u2, u3, u6, u4)
Ø TurboISO [SIGMOD’13]
Ø BoostISO [VLDB’15]
11. 11
Existing Work
Ø Ullmann’s algorithm [J.ACM’76]
Ø VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø TurboISO [SIGMOD’13]
§ Merge together query vertices with the same neighborhood.
§ Reduces Cartesian product caused by similar query vertices
§ Build a data structure online to facilitate the search process.
Ø BoostISO [VLDB’15]
12. 12
Existing Work
Ø Ullmann’s algorithm [J.ACM’76]
Ø VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø TurboISO [SIGMOD’13]
Ø BoostISO [VLDB’15]
§ Compress a data graph G by merging together similar vertices in G.
§ Develop query-dependent relationship between vertices in G.
It is still challenging for matching large query graphs.
13. 13
Challenges of Subgraph Matching
Matching order of QuickSI and TurboISO : (u1,u2,u3,u4,u5,u6).
Challenge I: Redundant Cartesian Products by Dissimilar Vertices.
Cartesian products:
Ø 100 mappings (v0,v2, v1000+i, v2100 +i) (3 ≤ i ≤ 102) of (u1,u2,u3,u4)
Ø 1000 mappings (v0, vj) (3 ≤ j ≤ 1002) of (u1,u5)
105 - 100 partial mappings
are redundant.
(u1,u2,u5,u3,u4,u6)
14. 14
Challenges of Subgraph Matching
Our Solution : Postpone Cartesian products.
Ø Decompose q into a dense subgraph and a forest, and
process the dense subgraph first.
Challenge I: Redundant Cartesian Products by Dissimilar Vertices.
15. 15
Challenges of Subgraph Matching
Challenge II: Exponential size of the path-based data structure in
TurboISO.
Ø TurboISO builds a data structure that materializes all embeddings of
query paths in a data graph
1. for generating matching order based on estimation of #candidates.
2. for enumerating subgraph isomorphic embeddings.
Ø Worst-case space complexity: O(|V(G)||v(q)-1|).
16. 16
Challenges of Subgraph Matching
Challenge II: Exponential size of the path-based data structure in
TurboISO.
Our Solution: Polynomial-size data structure, compact path-index (CPI) .
18. 18
CFL-Match
Ø A Core-First based Framework
§ Core-Forest Decomposition
Compute the minimal connected subgraph containing all non-tree edges of
q regarding any spanning tree.
§ Forest-Leaf Decomposition
Compute the set of leaf vertices by rooting each tree at its connection vertex.
19. 19
CFL-Match
Ø A Core-First based Framework
1) Core-Forest-Leaf Decomposition
2) CPI Construction
3) Mapping Extraction
i. Core-Match
ii. Forest-Match
iii. Leaf-Match
• Categorize leaf nodes according to label
• Perform combination instead of enumeration among different labels.
20. 20
Auxiliary Data Structure
Ø Compact Path-Index (CPI)
§ Compactly store candidate embeddings of query spanning trees.
§ Serve for computing an effective matching order.
Ø CPI Structure
§ Candidate sets
Each query node u has a candidate set u.C.
§ Edge sets
This is an edge between v ∈ u.C and v’ ∈ u’.C for adjacent query
nodes u and u’ in CPI if and only if (v, v’) exists in G.
21. 21
Auxiliary Data Structure
Ø Compact Path-Index (CPI)
§ Compactly store candidate embeddings of query spanning trees.
§ Serve for computing an effective matching order.
Ø CPI Structure
§ Example
22. 22
Auxiliary Data Structure
Ø Soundness of CPI
For every query node u in CPI, if there is an embedding of q in G that
maps u to v, then v must be in u.C.
Given a sound CPI, all embeddings of q in G can be computed by
traversing only the CPI while G is only probed for non-tree edge
checkings.
Ø It is NP-hard to build a minimum sound CPI.
Ø Aim to build a small and sound CPI.
Theorem
23. 23
CPI Construction
Ø General Idea
§ A heuristic approach:
1) u.C is initialized to contain all vertices in G with the same label as u
2) A data vertex v is pruned from u.C ,
if ∃u’ ∈ Nq(u), such that ∄v’ ∈ NG(v) & v’ ∈ u’.C.
Ø A two-phase CPI construction process:
§ Top-down construction, bottom-up refinement
§ Exploit the pruning power of both directions of every query edge.
§ Construct CPI of O(|E(G)| X |V(q)|) size in O(|E(G)| X |E(q)|) time
24. 24
CPI-based Match
Ø Compute path-based matching order using CPI
Ø Estimate #matches for each root-to-leaf path in CPI
Ø Add paths to the matching order in increasing order regarding #matches
Ø Traverse CPI to find mappings for query vertices
Ø Only probe G for non-tree edge validation
(u0,u1,u4,u3,u2, u5, u6, u7, u8, u9, u10)
25. 25
Experiment
Ø All algorithms are implemented in C++ and run on a machine with
3.2G CPU and 8G RAM.
Ø Datasets
§ Real Graphs
§ Synthetic Graphs
§ Randomly generate graphs with 100k vertices with average degree 8 and 50
distinct labels.
Ø Query Graphs
§ Randomly generate by random walk
§ Two Categories:
S: sparse (average degree ≤ 3). N: non-sparse (average degree > 3).
|V| |E| |∑| Degree
HPRD 9460 37081 307 7.8
Yeast 3112 12519 71 8.1
Human 4674 86282 44 36.9
26. 26
Comparing with Existing Techniques
Varying the size of query graph |V(q)|
CFL-Match: our proposed algorithm
27. 27
Effectiveness of Our New Framework
Evaluating our framework
Ø Match: subgraph matching algorithm with CPI but no query
decomposition.
Ø CF-Match: only core-forest decomposition with CPI.
Ø CFL-Match: our best algorithm.
29. 29
Conclusion
Ø We proposed a core-first framework for subgraph matching by
postponing Cartesian products
Ø We proposed a new polynomial-size path-based auxiliary data
structure CPI, and proposed efficient and effective technique for
constructing a small CPI
Ø We proposed efficient algorithms for subgraph matching based on
the core-first framework and the CPI
Ø Extensive empirical studies on real and synthetic graphs
demonstrate that our technique outperforms the state-of-the-art
algorithms.