Traditional web search forces the developers to leave their working environments and look for solutions in the web browsers. It often does not consider the context of their programming problems. The context-switching between the web browser and the working environment is time-consuming and distracting, and the keyword-based traditional search often does not help much in problem solving. In this paper, we propose an Eclipse IDE-based web search solution that collects the data from three web search APIs– Google, Yahoo, Bing and a programming Q & A site– StackOverflow. It then provides search results within IDE taking not only the content of the selected error into account but also the problem context, popularity and search engine recommendation of the result links. Experiments with 25 runtime errors and exceptions show that the proposed ap- proach outperforms the keyword-based search approaches with a recommendation accuracy of 96%. We also validate the results with a user study involving five prospective participants where we get a result agreement of 64.28%. While the preliminary results are promising, the approach needs to be further validated with more errors and exceptions followed by a user study with more participants to establish itself as a complete IDE-based web search solution.
An IDE-Based Context-Aware Meta Search EngineMasud Rahman
The document proposes an IDE-based meta search engine called SurfClipse that searches multiple search engines for solutions to programming exceptions. It considers the exception context, content relevance, link popularity, and search engine confidence to score and rank results. An experiment tested SurfClipse on 25 exceptions and found correct solutions for 24 cases. A user study with 5 exceptions had 64.28% agreement between recommended and participant solutions. Future work includes improving the metrics, conducting more experiments and user studies, and developing a distributed version of SurfClipse.
Some insights from a Systematic Mapping Study and a Systematic Review Study: ...Phu H. Nguyen
Doing literature reviews is a must for us (researchers) to avoid reinventing the wheel, and to expand the boundary of knowledge. Why not having fun with the snowballing technique and conducting the reviews systematically? This talk shares some insights from a Systematic Mapping Study (SMS) and a Systematic Literature Review (SLR). When to conduct a SMS? When to conduct a SLR? What are the differences?
This document discusses Foursquare's data analytics capabilities and team. It describes how Foursquare is able to perform massive intersection queries on user check-in and venue similarity data in under 100ms. It also outlines Foursquare's goals for their internal analytics dashboard, including a simple interface, scalability, and supporting most use cases fast while being cheap. The document notes that Foursquare's data team consists of a mix of analytics, science, and engineering skills and provides some references on building effective data science teams. It concludes by inviting the reader to join Foursquare's hiring opportunities.
The document summarizes Wittenberg University's Master of Science in Analytics program. The program provides networking opportunities and skills to solve real-world problems with minimal impact on work and family responsibilities. It combines analytics techniques with organizational leadership and ethics. The program is designed to provide critical skills for tackling challenging business problems and prepare students for high-paying data analyst roles that offer quick returns on tuition investment.
This document provides information about the selection of the most influential papers from the 30 year history of the International Symposium on Software Reliability Engineering (ISSRE). It describes the selection criteria, which included a call for proposals, bibliometric analysis of highly cited ISSRE papers, and opinions from former ISSRE program chairs. This resulted in 26 selected papers covering topics like software reliability, testing, aging, and failure analysis from 1992 to 2016. The document lists these selected papers and their authors.
On 2019, the 30th edition of the International Symposium on Software Reliability Engineering (ISSRE 2019) took place in Berlin, Germany, October 28-31. The first edition took place in Washington, DC, USA, in 1990.
To celebrate this very important anniversary, we promoted an initiative to identify the ISSRE most influential papers, called "Highlights from 30 years of ISSRE". We looked for ISSRE papers that had a great influence and impact in the community. The goal of the initiative is to remember those papers and their authors, which, in practice, tell a good part of the story of our conference.
Este documento presenta un resumen de la incidencia de la pobreza infantil en Argentina entre 2010 y 2015 desde una perspectiva multidimensional basada en el enfoque de derechos. La pobreza infantil se mide considerando privaciones en seis dimensiones: alimentación, saneamiento, vivienda, salud, información y estimulación/educación. Los resultados muestran que en 2015, el 40,4% de los niños en Argentina vivían en hogares por debajo de la línea de pobreza, una tasa superior a otros grupos etarios. Además, la
An IDE-Based Context-Aware Meta Search EngineMasud Rahman
The document proposes an IDE-based meta search engine called SurfClipse that searches multiple search engines for solutions to programming exceptions. It considers the exception context, content relevance, link popularity, and search engine confidence to score and rank results. An experiment tested SurfClipse on 25 exceptions and found correct solutions for 24 cases. A user study with 5 exceptions had 64.28% agreement between recommended and participant solutions. Future work includes improving the metrics, conducting more experiments and user studies, and developing a distributed version of SurfClipse.
Some insights from a Systematic Mapping Study and a Systematic Review Study: ...Phu H. Nguyen
Doing literature reviews is a must for us (researchers) to avoid reinventing the wheel, and to expand the boundary of knowledge. Why not having fun with the snowballing technique and conducting the reviews systematically? This talk shares some insights from a Systematic Mapping Study (SMS) and a Systematic Literature Review (SLR). When to conduct a SMS? When to conduct a SLR? What are the differences?
This document discusses Foursquare's data analytics capabilities and team. It describes how Foursquare is able to perform massive intersection queries on user check-in and venue similarity data in under 100ms. It also outlines Foursquare's goals for their internal analytics dashboard, including a simple interface, scalability, and supporting most use cases fast while being cheap. The document notes that Foursquare's data team consists of a mix of analytics, science, and engineering skills and provides some references on building effective data science teams. It concludes by inviting the reader to join Foursquare's hiring opportunities.
The document summarizes Wittenberg University's Master of Science in Analytics program. The program provides networking opportunities and skills to solve real-world problems with minimal impact on work and family responsibilities. It combines analytics techniques with organizational leadership and ethics. The program is designed to provide critical skills for tackling challenging business problems and prepare students for high-paying data analyst roles that offer quick returns on tuition investment.
This document provides information about the selection of the most influential papers from the 30 year history of the International Symposium on Software Reliability Engineering (ISSRE). It describes the selection criteria, which included a call for proposals, bibliometric analysis of highly cited ISSRE papers, and opinions from former ISSRE program chairs. This resulted in 26 selected papers covering topics like software reliability, testing, aging, and failure analysis from 1992 to 2016. The document lists these selected papers and their authors.
On 2019, the 30th edition of the International Symposium on Software Reliability Engineering (ISSRE 2019) took place in Berlin, Germany, October 28-31. The first edition took place in Washington, DC, USA, in 1990.
To celebrate this very important anniversary, we promoted an initiative to identify the ISSRE most influential papers, called "Highlights from 30 years of ISSRE". We looked for ISSRE papers that had a great influence and impact in the community. The goal of the initiative is to remember those papers and their authors, which, in practice, tell a good part of the story of our conference.
Este documento presenta un resumen de la incidencia de la pobreza infantil en Argentina entre 2010 y 2015 desde una perspectiva multidimensional basada en el enfoque de derechos. La pobreza infantil se mide considerando privaciones en seis dimensiones: alimentación, saneamiento, vivienda, salud, información y estimulación/educación. Los resultados muestran que en 2015, el 40,4% de los niños en Argentina vivían en hogares por debajo de la línea de pobreza, una tasa superior a otros grupos etarios. Además, la
Matematika tidak hanya dapat dikuasai oleh murid-murid berbakat saja, melainkan juga membutuhkan ketekunan. Murid dapat menguasai matematika dengan berlatih menyelesaikan soal secara mandiri, dan guru perlu menjelaskan pelajaran secara bertahap serta mudah dipahami. Keterlibatan orang tua dan pendekatan pembelajaran yang kreatif dapat membantu meningkatkan minat siswa terhadap matematika.
El resumen analiza la imagen pública de varios políticos argentinos como Sergio Massa, María Eugenia Vidal y Mauricio Macri según una encuesta nacional realizada en junio de 2016. Massa tiene la mejor imagen con más del 60% de opinión positiva, seguido por Vidal y Macri con opiniones positivas similares pero menor opinión negativa para Vidal. La encuesta también consultó sobre el caso de José López y si Cristina Fernández de Kirchner pudo haber estado al tanto de las irregularidades, con casi la mitad respondiendo que era imposible
O documento dá conselhos para aproveitar o ano novo de 2011, encorajando as pessoas a enfrentar desafios com cuidado e atenção, fazer novos amigos, curtir a vida e nunca esquecer de amar aqueles próximos a você.
Este documento resume un sermón sobre el Salmo 14. El sermón comienza explicando que el salmo trata sobre la necesidad de un cambio de corazón ya que describe la naturaleza humana sin discernimiento o deseo de buscar a Dios. Luego, el sermón analiza las primeras líneas del salmo y las conecta con pasajes de la Carta a los Romanos que también hablan de la necedad humana de negar a Dios a pesar de la evidencia de Su existencia en la creación. Finalmente, el sermón explica que el pue
This document contains size charts for various tall boot models from the brand Mountain Horse. It lists the available sizes, measurements for foot length and width, and specific notes on sizing for different models. Size options include regular, narrow, wide, short, long and plus/extra widths. Measurements are given in both European and US/UK sizes to help customers find the properly fitting boots.
La devaluación de diciembre de 2015 y enero de 2016 tuvo un fuerte impacto negativo en el consumo masivo en Argentina, con una caída del 9% en el bimestre. Sin embargo, el efecto fue menor que la devaluación de enero de 2014, a pesar de ser mayor en magnitud, debido a un menor aumento de precios. De todas formas, el consumo ya venía debilitándose desde 2013 y las ventas en supermercados volvieron a los niveles de 2009. A nivel provincial, todas mostraron caídas interanuales en dici
This document appears to be a trade catalogue for Horse Health Wessex Ltd, listing their products for sale including half pads, numnahs, squares, girth covers, and accessories made from merino lambskin. It provides information on sizing, colors, codes, and prices. Payment terms require settlement within 30 days, and policies are outlined for loss, damage, non-arrival of goods, and couriers. The mission of their dedicated wholesale team is also stated.
Un sistema operativo es un programa o conjunto de programas que gestiona los recursos del hardware y provee servicios a las aplicaciones. Se ejecuta con privilegios más altos que otras aplicaciones. El documento explica la diferencia entre el sistema operativo y las herramientas como el explorador de archivos. También describe cómo los entornos gráficos pueden instalarse de forma separada, como en las plataformas Amiga. Finalmente, señala que el núcleo Linux es un ejemplo prominente de esta distinción y cómo la informática se modernizó en los años 80 para
La factura de Tintorería la Impecable enumera los artículos y servicios proporcionados a Librerías Industriales S.A., incluyendo dos ordenadores Thosiba a 2.499 euros cada uno después de un descuento del 2%, 10 cintas de impresora a 650 euros, papel de 8x11 pulgadas a 203 euros con un 10% de descuento, un sistema de alimentación a 2.822 euros con un 5% de descuento, y más papel a 39 euros con un 2% de descuento, para un total bruto de 6.212 euros más IVA del
Este documento presenta un estudio bíblico del Salmo 15. El estudio analiza los requisitos para ser elegido mencionados en el salmo, los cuales incluyen andar en integridad, hacer justicia, hablar con verdad en el corazón, no calumniar con la lengua ni hacer mal al prójimo, no admitir reproches contra el vecino, despreciar al vil pero honrar a los que temen a Dios, y no dar dinero a usura ni admitir cohecho contra el inocente. El estudio explica que estos requisitos se refieren a las cual
El documento presenta los resultados de una encuesta de opinión pública en Argentina sobre la imagen de Cristina Fernández de Kirchner. El 64% de los encuestados la define con una palabra negativa, mientras que el 29,3% usa una palabra positiva y el 5,3% una palabra regular. La encuesta incluyó 3000 casos y se realizó del 27 al 28 de septiembre de 2016 a través de dispositivos móviles.
Este documento presenta un resumen y comentario del Salmo 18. El comentarista explica que David compuso este salmo cuando Dios lo liberó de sus enemigos, especialmente de Saúl. El salmo celebra la redención y salvación que Dios le brindó a David y a Israel. El comentarista analiza los versículos del salmo, destacando cómo David expresa su amor por Dios y cómo ve a Dios como su fortaleza, roca, escudo y refugio. Finalmente, el comentarista enfatiza que aunque David estuvo rodeado de en
Module 1 introduction and objectives presentationIRC
This document introduces a training module on contextualizing WASH services. The module aims to help participants understand the water cycle and processes that result in water and sanitation services. It will provide an overview of the institutional framework for water resource management and WASH services. Key players and the cost chain will be examined. Sustainability challenges in WASH services will also be discussed, with the overall goal of improving WASH governance. The module is divided into 4 sessions that will cover water processes, roles and responsibilities, costs, and sustainability issues.
Bringing New Experience with Openstack and Fuel (Ihor Dvoretskyi, Oleksandr M...IT Arena
The document discusses OpenStack and the Fuel installation tool. It provides an overview of OpenStack and cloud computing concepts like IaaS and PaaS. It then describes Fuel as an automated tool for deploying OpenStack that makes the process faster and less error-prone compared to manual deployment. The document also discusses Mirantis OpenStack and the benefits of their commercial support and services for managing OpenStack deployments.
This document outlines principles for hiring at ManpraX Software LLP. It recommends hiring the best "rockstar" candidates who will make fewer mistakes and require less management time despite a 25% higher salary. Ongoing hiring should be continuous rather than based on projects or requirements to attract more talented people. Employees should teach each other their areas of expertise to develop cumulative smartness and make smarter decisions.
This document summarizes a project management tool called Projectize. It discusses how conventional project management has limitations and there is a need for a tool to capture data, monitor performance, generate reports, and create a knowledge repository. Projectize is a network-based tool that brings management expertise, enhances change flexibility, and helps monitor project status through different metrics and explicit team accountability. It discusses the tool's expected modules, architecture, hierarchies, features, interfaces, security, operational scenario, constraints, and advantages.
This document provides an overview of customer relationship management (CRM). It defines CRM as the integration of sales, marketing, and service strategies. The goals of CRM are to provide better customer service, increase efficiency, help close deals faster, and discover new customers. CRM aims to use existing customer data to grow revenue, better serve customers through integrated systems, and create loyalty through excellent repeatable sales processes. It also discusses the importance of customer focus, CRM phases, benefits of CRM systems, necessary customer data, and keys to successful CRM implementation.
SurfClipse-- An IDE based context-aware Meta Search EngineMasud Rahman
Despite various debugging supports of the existing IDEs for programming errors and exceptions, software developers often look at web for working solutions or any up-to-date information. Traditional web search does not consider thecontext of the problems that they search solutions for, and thus it often does not help much in problem solving. In this paper, we propose a context-aware meta search tool, SurfClipse, that analyzes an encountered exception andits context in the IDE, and recommends not only suitable search queries but also relevant web pages for the exception (and its context). The tool collects results from three popular search engines and a programming Q & A site against the exception in the IDE, refines the results for relevance against the context of the exception, and then ranks them before recommendation. It provides two working modes--interactive and proactive to meet the versatile needs of the developers, and one can browse the result pages using a customized embedded browser provided by the tool.
R Tool for Visual Studio และการทำงานร่วมกันเป็นทีม โดย เฉลิมวงศ์ วิจิตรปิยะกุ...BAINIDA
R Tool for Visual Studio และการทำงานร่วมกันเป็นทีม โดย เฉลิมวงศ์ วิจิตรปิยะกุล MVP, Microsoft Thailand
THE FIRST NIDA BUSINESS ANALYTICS AND DATA SCIENCES CONTEST/CONFERENCE
Matematika tidak hanya dapat dikuasai oleh murid-murid berbakat saja, melainkan juga membutuhkan ketekunan. Murid dapat menguasai matematika dengan berlatih menyelesaikan soal secara mandiri, dan guru perlu menjelaskan pelajaran secara bertahap serta mudah dipahami. Keterlibatan orang tua dan pendekatan pembelajaran yang kreatif dapat membantu meningkatkan minat siswa terhadap matematika.
El resumen analiza la imagen pública de varios políticos argentinos como Sergio Massa, María Eugenia Vidal y Mauricio Macri según una encuesta nacional realizada en junio de 2016. Massa tiene la mejor imagen con más del 60% de opinión positiva, seguido por Vidal y Macri con opiniones positivas similares pero menor opinión negativa para Vidal. La encuesta también consultó sobre el caso de José López y si Cristina Fernández de Kirchner pudo haber estado al tanto de las irregularidades, con casi la mitad respondiendo que era imposible
O documento dá conselhos para aproveitar o ano novo de 2011, encorajando as pessoas a enfrentar desafios com cuidado e atenção, fazer novos amigos, curtir a vida e nunca esquecer de amar aqueles próximos a você.
Este documento resume un sermón sobre el Salmo 14. El sermón comienza explicando que el salmo trata sobre la necesidad de un cambio de corazón ya que describe la naturaleza humana sin discernimiento o deseo de buscar a Dios. Luego, el sermón analiza las primeras líneas del salmo y las conecta con pasajes de la Carta a los Romanos que también hablan de la necedad humana de negar a Dios a pesar de la evidencia de Su existencia en la creación. Finalmente, el sermón explica que el pue
This document contains size charts for various tall boot models from the brand Mountain Horse. It lists the available sizes, measurements for foot length and width, and specific notes on sizing for different models. Size options include regular, narrow, wide, short, long and plus/extra widths. Measurements are given in both European and US/UK sizes to help customers find the properly fitting boots.
La devaluación de diciembre de 2015 y enero de 2016 tuvo un fuerte impacto negativo en el consumo masivo en Argentina, con una caída del 9% en el bimestre. Sin embargo, el efecto fue menor que la devaluación de enero de 2014, a pesar de ser mayor en magnitud, debido a un menor aumento de precios. De todas formas, el consumo ya venía debilitándose desde 2013 y las ventas en supermercados volvieron a los niveles de 2009. A nivel provincial, todas mostraron caídas interanuales en dici
This document appears to be a trade catalogue for Horse Health Wessex Ltd, listing their products for sale including half pads, numnahs, squares, girth covers, and accessories made from merino lambskin. It provides information on sizing, colors, codes, and prices. Payment terms require settlement within 30 days, and policies are outlined for loss, damage, non-arrival of goods, and couriers. The mission of their dedicated wholesale team is also stated.
Un sistema operativo es un programa o conjunto de programas que gestiona los recursos del hardware y provee servicios a las aplicaciones. Se ejecuta con privilegios más altos que otras aplicaciones. El documento explica la diferencia entre el sistema operativo y las herramientas como el explorador de archivos. También describe cómo los entornos gráficos pueden instalarse de forma separada, como en las plataformas Amiga. Finalmente, señala que el núcleo Linux es un ejemplo prominente de esta distinción y cómo la informática se modernizó en los años 80 para
La factura de Tintorería la Impecable enumera los artículos y servicios proporcionados a Librerías Industriales S.A., incluyendo dos ordenadores Thosiba a 2.499 euros cada uno después de un descuento del 2%, 10 cintas de impresora a 650 euros, papel de 8x11 pulgadas a 203 euros con un 10% de descuento, un sistema de alimentación a 2.822 euros con un 5% de descuento, y más papel a 39 euros con un 2% de descuento, para un total bruto de 6.212 euros más IVA del
Este documento presenta un estudio bíblico del Salmo 15. El estudio analiza los requisitos para ser elegido mencionados en el salmo, los cuales incluyen andar en integridad, hacer justicia, hablar con verdad en el corazón, no calumniar con la lengua ni hacer mal al prójimo, no admitir reproches contra el vecino, despreciar al vil pero honrar a los que temen a Dios, y no dar dinero a usura ni admitir cohecho contra el inocente. El estudio explica que estos requisitos se refieren a las cual
El documento presenta los resultados de una encuesta de opinión pública en Argentina sobre la imagen de Cristina Fernández de Kirchner. El 64% de los encuestados la define con una palabra negativa, mientras que el 29,3% usa una palabra positiva y el 5,3% una palabra regular. La encuesta incluyó 3000 casos y se realizó del 27 al 28 de septiembre de 2016 a través de dispositivos móviles.
Este documento presenta un resumen y comentario del Salmo 18. El comentarista explica que David compuso este salmo cuando Dios lo liberó de sus enemigos, especialmente de Saúl. El salmo celebra la redención y salvación que Dios le brindó a David y a Israel. El comentarista analiza los versículos del salmo, destacando cómo David expresa su amor por Dios y cómo ve a Dios como su fortaleza, roca, escudo y refugio. Finalmente, el comentarista enfatiza que aunque David estuvo rodeado de en
Module 1 introduction and objectives presentationIRC
This document introduces a training module on contextualizing WASH services. The module aims to help participants understand the water cycle and processes that result in water and sanitation services. It will provide an overview of the institutional framework for water resource management and WASH services. Key players and the cost chain will be examined. Sustainability challenges in WASH services will also be discussed, with the overall goal of improving WASH governance. The module is divided into 4 sessions that will cover water processes, roles and responsibilities, costs, and sustainability issues.
Bringing New Experience with Openstack and Fuel (Ihor Dvoretskyi, Oleksandr M...IT Arena
The document discusses OpenStack and the Fuel installation tool. It provides an overview of OpenStack and cloud computing concepts like IaaS and PaaS. It then describes Fuel as an automated tool for deploying OpenStack that makes the process faster and less error-prone compared to manual deployment. The document also discusses Mirantis OpenStack and the benefits of their commercial support and services for managing OpenStack deployments.
This document outlines principles for hiring at ManpraX Software LLP. It recommends hiring the best "rockstar" candidates who will make fewer mistakes and require less management time despite a 25% higher salary. Ongoing hiring should be continuous rather than based on projects or requirements to attract more talented people. Employees should teach each other their areas of expertise to develop cumulative smartness and make smarter decisions.
This document summarizes a project management tool called Projectize. It discusses how conventional project management has limitations and there is a need for a tool to capture data, monitor performance, generate reports, and create a knowledge repository. Projectize is a network-based tool that brings management expertise, enhances change flexibility, and helps monitor project status through different metrics and explicit team accountability. It discusses the tool's expected modules, architecture, hierarchies, features, interfaces, security, operational scenario, constraints, and advantages.
This document provides an overview of customer relationship management (CRM). It defines CRM as the integration of sales, marketing, and service strategies. The goals of CRM are to provide better customer service, increase efficiency, help close deals faster, and discover new customers. CRM aims to use existing customer data to grow revenue, better serve customers through integrated systems, and create loyalty through excellent repeatable sales processes. It also discusses the importance of customer focus, CRM phases, benefits of CRM systems, necessary customer data, and keys to successful CRM implementation.
SurfClipse-- An IDE based context-aware Meta Search EngineMasud Rahman
Despite various debugging supports of the existing IDEs for programming errors and exceptions, software developers often look at web for working solutions or any up-to-date information. Traditional web search does not consider thecontext of the problems that they search solutions for, and thus it often does not help much in problem solving. In this paper, we propose a context-aware meta search tool, SurfClipse, that analyzes an encountered exception andits context in the IDE, and recommends not only suitable search queries but also relevant web pages for the exception (and its context). The tool collects results from three popular search engines and a programming Q & A site against the exception in the IDE, refines the results for relevance against the context of the exception, and then ranks them before recommendation. It provides two working modes--interactive and proactive to meet the versatile needs of the developers, and one can browse the result pages using a customized embedded browser provided by the tool.
R Tool for Visual Studio และการทำงานร่วมกันเป็นทีม โดย เฉลิมวงศ์ วิจิตรปิยะกุ...BAINIDA
R Tool for Visual Studio และการทำงานร่วมกันเป็นทีม โดย เฉลิมวงศ์ วิจิตรปิยะกุล MVP, Microsoft Thailand
THE FIRST NIDA BUSINESS ANALYTICS AND DATA SCIENCES CONTEST/CONFERENCE
- The document discusses the speaker's 25 years of experience applying AI techniques to software engineering projects. It covers early work in the 1990s on fault prediction and the challenges of applying machine learning at that time. It then discusses subsequent work in areas like search-based software engineering, natural language processing for requirements engineering, and using simulation and search techniques for testing autonomous vehicle systems. The speaker reflects on both the benefits and challenges of these different AI applications in software engineering.
CORRECT is a code reviewer recommendation tool that:
- Recommends appropriate code reviewers automatically by mining developers' contributions across projects
- Provides recommendation rationales that fit within developers' workflows
- Achieves over 90% accuracy in recommending reviewers based on library and technology experience
- Outperforms an existing technique (RevFinder) with 92.15% top-5 accuracy, 85.93% mean precision and 81.39% mean recall
- Performs similarly on open source projects with 85.20% top-5 accuracy, demonstrating effectiveness for public and private codebases
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...Spark Summit
Machine learning is being deployed in a growing number of applications which demand real-time, accurate, and robust predictions under heavy query load. However, most machine learning frameworks and systems only address model training and not deployment.
In this talk, we present Clipper, a general-purpose low-latency prediction serving system. Interposing between end-user applications and a wide range of machine learning frameworks, Clipper introduces a modular architecture to simplify model deployment across frameworks. Furthermore, by introducing caching, batching, and adaptive model selection techniques, Clipper reduces prediction latency and improves prediction throughput, accuracy, and robustness without modifying the underlying machine learning frameworks. We evaluated Clipper on four common machine learning benchmark datasets and demonstrate its ability to meet the latency, accuracy, and throughput demands of online serving applications. We also compared Clipper to the Tensorflow Serving system and demonstrate comparable prediction throughput and latency on a range of models while enabling new functionality, improved accuracy, and robustness.
From Model-based to Model and Simulation-based Systems ArchitecturesObeo
Achieving quality engineering through descriptive and analytical models
Systems architecture design is a key activity that affect the
overall systems engineering cost. It is hence fundamental
to ensure that the system architecture reaches a proper quality.
In this paper, we leverage on MBSE approaches and complement them
with simulation techniques, as a prom-ising way to improve the quality of the system architecture definition, and to come up with inno-vative solutions while securing the systems engineering process.
The document summarizes the results of benchmarking tests performed on the Blackboard Academic Suite to determine system sizing requirements. Key findings include:
- Tests showed a Unicode conversion taking minutes for small datasets, hours for moderate, and under 3 days for large datasets, meeting objectives.
- Regression performance from version 6.3 to 7.X met the objective of no more than a 5% degradation and potential for a 5% improvement.
- Benchmarking of different hardware platforms like Sun, Dell, and Windows showed performance varied based on configuration.
The document discusses how Blackboard sizes its Academic Suite software based on benchmarking. It provides details on the benchmarking methodology, including modeling user behavior, data growth, and performance objectives. The results showed how the software performed under different workload levels on various hardware configurations. The last part discusses using the benchmark results and sizing guide to determine an institution's adoption profile and appropriate hardware configuration based on factors like sessions per hour and page loads.
Question Answering System using machine learning approachGarima Nanda
In a compact form, this is a presentation reflecting how the machine learning approach can be used for the effective and efficient interaction using classification techniques.
Web-Based System for Software Requirements Quality Analysis Using Case-Based ...IOSR Journals
This document proposes a web-based system to analyze the quality of software requirements specifications (SRS) using case-based reasoning (CBR) and artificial neural networks (ANN). CBR solves new problems by comparing them to past, stored cases, but this can be inefficient when the case base is large. The proposed system improves the retrieval phase of CBR by using ANN to more efficiently measure the similarity between a new case and existing cases. This results in a web-based system that allows users to input SRS quality attributes and indicators, analyzes the SRS using CBR integrated with ANN, and presents a quality analysis report. The system is intended to help software developers better understand SRS quality and requirements.
Prov4J: A Semantic Web Framework for Generic Provenance Management Andre Freitas
Prov4J: A Semantic Web Framework for Generic Provenance Management
André Freitas, Arnaud Legendre, Sean O’Riain, Edward Curry
paper: http://andrefreitas.org/papers/Prov4J%20A%20Semantic%20Web%20Framework%20for%20Generic%20Provenance%20Management.pdf
This document summarizes a final report on a web recommender system project. It outlines the motivation, goals, requirements, design, algorithms, evaluation, results, techniques used, and lessons learned from the project. The project aimed to build a framework for web recommendation that provides basic algorithms and evaluation methods. It designed and implemented three recommendation algorithms and conducted an evaluation with five topics and three algorithms using modified average precision. The evaluation revealed topics strongly influenced results and further analysis of algorithms is needed.
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...Daniel Valcarce
Slides of the CERI 2014 paper:
Daniel Valcarce, Javier Parapar, Álvaro Barreiro. When Recommenders Met Big Data: an Architectural Proposal and Evaluation. Proceedings of the 3rd Spanish Conference on Information Retrieval, CERI 2014, pp. 73-84, A Coruña, Spain, 19 - 20 June, 2014. ISBN 978-84-9749-591-2.
http://www.dc.fi.udc.es/~dvalcarce/pubs/valcarce-etal-ceri2014.pdf
Intelligent Software Engineering: Synergy between AI and Software Engineering...Tao Xie
2018 Distinguished Speaker, the UC Irvine Institute for Software Research (ISR) Distinguished Speaker Series 2018-2019. "Intelligent Software Engineering: Synergy between AI and Software Engineering" http://isr.uci.edu/content/isr-distinguished-speaker-series-2018-2019
towards a model-based framework for development of engineering1 (1)Jinzhi Lu
This document proposes a model-based framework for developing engineering tool-chains that support cyber-physical systems modeling and simulation. It presents the SPIT framework, which takes a systems approach to support MBSE tool-chain development. The framework addresses functionalities of MBSE tool-chains from a systems engineering perspective. Demo tool-chains are developed to support co-simulation of CPS using MBSE. Future work includes extending tool integration languages to formalize co-simulation tool-chains and analyzing the functional dynamics of MBSE enterprise transitioning.
This document discusses effective web mining techniques for information retrieval on the World Wide Web. It proposes a hybrid algorithm to overcome drawbacks of traditional techniques like PageRank, clustering, Markov models and association rule mining. The proposed algorithm uses log data to create clickstream transactions, identify similar transactions through neighborhood-based clustering, and applies a Markov model to accurately predict the next page request. An experiment shows the proposed technique achieves 85% accuracy, higher than the 70% accuracy of traditional Markov models. The algorithm could help enhance search engine results delivery and online experiences.
Saving resources with simulation webinar 092011Scott Althouse
IBM Rational Rhapsody provides solutions to help reduce costs and risks when developing complex products and systems. It allows for early validation and verification of designs through model-based simulation and testing. This helps find defects earlier in the development process when they are cheaper to fix. Rational Rhapsody also improves collaboration, requirements management, and automation of testing.
Exploiting Context in Dealing with Programming Errors and ExceptionsMasud Rahman
This document proposes approaches to improve the process of debugging programming errors and exceptions. It summarizes existing ad-hoc approaches and their limitations. The proposed approaches leverage context from the integrated development environment to provide context-aware web search, query recommendation, content suggestion, code examples, and exception handling support. The approaches were evaluated in experiments and user studies and showed improvements over traditional search engines and existing approaches in areas like search accuracy, recall, and time to fix exceptions. The thesis contributes techniques like SurfClipse, QueryClipse, ContentSuggest, SurfExample, and ExcClipse to address different phases of the exception handling process.
Similar to SurfClipse-- An IDE based context-aware Meta Search Engine (ERA Track) (20)
RAISE Lab at Dalhousie University
aims to develop tools and technologies for intelligent automation in software engineering. An overview is presented by Dr. Masud Rahman, Assistant Professor, Faculty of Computer Science, Dalhousie University, Canada.
The Forgotten Role of Search Queries in IR-based Bug Localization: An Empiric...Masud Rahman
The document summarizes a study on improving search queries for bug localization using natural language text from bug reports. The study evaluated different keyword selection techniques, generated optimal search queries using a genetic algorithm, and compared optimal versus non-optimal queries. Key findings include: 1) Current approaches failed to identify keywords for 34% of bug reports, 2) A genetic algorithm produced optimal queries that achieved up to 80% higher performance than baselines, and 3) Optimal queries differed in using less frequent, less ambiguous, noun-heavy keywords located in bug report bodies.
The document outlines Masud Rahman's PhD thesis proposal on supporting source code search with context-aware, analytics-driven query reformulation. The proposal discusses three research questions: 1) evaluating term weighting techniques for keyword selection from source code and bug reports, 2) incorporating bug report quality for local code search, and 3) leveraging crowd knowledge and data analytics to deliver query keywords. The contribution summary highlights techniques for term dependence, quality-aware bug localization, and using crowd knowledge and large data analytics.
PhD Comprehensive exam of Masud RahmanMasud Rahman
This document presents a systematic literature review of automated query reformulations for source code search. It discusses seven research questions explored in the review, including the methods, algorithms, data sources, evaluation metrics, challenges, publication trends, and comparisons between local and internet-scale code search queries. The review analyzed over 50 primary studies identified through a multi-database search and filtering process. Key findings include the predominant use of term weighting, query expansion and reduction techniques, evaluations based on standard information retrieval metrics, and various challenges like vocabulary mismatch that remain unsolved. Opportunities for future work are also identified, such as leveraging bug reports for keyword selection and using semantic representations to address vocabulary issues.
This document summarizes a talk given by Masud Rahman, a PhD candidate at the University of Saskatchewan. The talk focused on Rahman's PhD thesis research, which aims to improve code search by generating context-aware, analytics-driven queries through effective reformulation. The talk outlined three research questions around improving keyword selection, incorporating bug report quality, and using crowd knowledge and data analytics. It provided an overview of Rahman's PhD thesis and publications addressing the research questions. Evaluation methods for the proposed approaches were also discussed.
This document summarizes a study on improving bug localization through considering the quality of bug reports and reformulating bug report queries. The study analyzes 5,500 bug reports from eight projects and finds that existing bug localization techniques perform poorly when bug reports lack useful information or contain excessive stack traces. Preliminary findings suggest context-aware query reformulation may help address these limitations by improving the quality and relevance of the queries used.
This document summarizes research into the impact of continuous integration (CI) on code reviews. The researchers studied over 500,000 pull requests and builds from open source projects to answer three questions: 1) Whether build status influences code review participation, 2) If frequent builds improve review quality, and 3) Predicting if a build will trigger new reviews. They found that passed builds were more associated with new reviews and comments. Projects with frequent builds received more review comments that remained steady over time, unlike less frequently built projects. Their machine learning model could predict if a build would trigger new reviews with up to 64% accuracy.
This document presents research on predicting the usefulness of code review comments using textual features and developer experience. The researchers analyzed 1,482 code review comments, manually classified as useful or non-useful. They found non-useful comments had more stop words and less code elements, while useful comments had higher conceptual similarity to changed code. More experienced reviewers provided more useful comments. The researchers also built a Random Forest model that predicts comment usefulness with 66% accuracy, outperforming baselines. Their work provides the first automated approach to assess code review comment usefulness.
The document describes a technique called STRICT that uses TextRank and POSRank algorithms to identify important terms from a software change task description to generate an effective initial search query. An experiment on 1,939 change tasks from 8 open source projects found that STRICT improved the query effectiveness in 57.84% of cases compared to baseline queries like title alone. STRICT also showed better retrieval performance based on metrics like mean average precision and mean recall compared to state-of-the-art techniques. The approach validates the use of graph-based ranking algorithms to address the challenge of generating relevant initial search queries from natural language change task descriptions.
The document analyzes why some questions on Stack Overflow remain unresolved and explores whether machine learning can predict which questions will be unresolved. It finds that unresolved questions have higher topic entropy, meaning they are less specific. Owners of unresolved questions reject answers more often, have lower reputation, and are less active on Stack Overflow. Models using features like topic entropy, answer rejection ratio, and owner reputation achieved up to 78% accuracy at predicting unresolved questions. The study aims to help improve question quality on Stack Overflow.
This document analyzes data from over 78,000 pull requests on GitHub to understand why pull request failure rates are high. It finds that 57.05% of pull requests failed, most often due to issues with recursion/refactoring, database queries, arrays/functions. Programming languages like Java, JavaScript and Ruby saw more failed pull requests on average than PHP. Projects in IDE and framework domains had the most pull request activity. Older projects, projects with more forks/developers, and projects where developers had 20-50 months of experience saw the highest numbers of pull requests and failures. The study aims to help understand and address common reasons for pull request failures on GitHub.
The document describes a technique called CodeInsight that mines insightful code comments from crowdsourced knowledge on Stack Overflow. An exploratory study of Stack Overflow discussions found that around 22% of comments discuss tips, bugs, or warnings related to code examples. CodeInsight uses heuristics like popularity, relevance, comment rank, sentiment, and word count to retrieve these insightful comments for a given code segment. An empirical evaluation showed the technique could recall over 80% of relevant comments on average. A user study with professional developers found that 80% of the comments recommended by CodeInsight were accurate and useful.
This document proposes using TextRank to identify initial search terms for software change tasks. It adapts TextRank, originally used for keyword extraction and text summarization, to build a graph of terms from development artifacts and rank them. An evaluation on 349 change tasks from two systems identifies search terms, which outperform an existing approach in solving more tasks with higher precision and recall. The approach recommends initial search queries to help developers find relevant code artifacts when performing change tasks.
This document discusses a method called BRACK for identifying bug-prone API methods using crowdsourced knowledge from Stack Overflow. BRACK ranks API method invocations based on two heuristics: API Context-Susceptibility (ACS) which estimates how context can impact an invocation, and API Error-Associativity (AEA) which calculates the co-occurrence of an invocation in defective and corrected code segments. An evaluation of BRACK on 8 open source systems found that it achieved a top-3 accuracy of 75.93% in identifying bug-prone invocations, and that ACS was more effective than AEA. The evaluation also showed BRACK had no significant bias towards system size or API package and performed comparably
The document presents research on RACK, a tool that uses crowdsourced knowledge from Stack Overflow to reformulate natural language code search queries into relevant API names. The researchers analyzed Stack Overflow data to find that answers frequently refer to APIs by name and cover a high percentage of core APIs. They also found question titles contain terms relevant to real code search queries. RACK maps query terms to API names using this data, then searches GitHub code examples. An evaluation showed RACK returns relevant examples with 79% top-10 accuracy, outperforming existing techniques.
RACK is an approach that automatically recommends relevant APIs for code search queries using crowdsourced knowledge from Stack Overflow questions, answers, and titles. An exploratory study found that accepted Stack Overflow answers frequently mention API names and cover a large percentage of standard APIs. Question titles often contain keywords relevant to code search. RACK constructs an API-token mapping database from Stack Overflow and ranks APIs for a given query based on heuristics measuring keyword-API co-occurrence and coherence. An evaluation found RACK achieved around 79% top-10 accuracy and outperformed existing techniques, demonstrating the potential of leveraging crowdsourced technical knowledge for API recommendation.
QUICKAR is a technique for automatically reformulating code search queries using crowdsourced knowledge from Stack Overflow. It constructs an adjacency list database of terms from Stack Overflow question titles. For an initial search query, it identifies reformulation candidates by comparing the query terms to terms in the adjacency list database and project source code. In experiments, QUICKAR significantly outperformed a baseline technique, improving over 50% of queries while worsening less than 50%, by leveraging vocabulary from Stack Overflow to address mismatches between developer queries and code.
The document describes CORRECT, a technique for recommending code reviewers for pull requests on GitHub based on developers' cross-project and technology experience. It evaluates CORRECT using codebases from both a commercial software company and open source projects. The results show that CORRECT achieves over 90% accuracy in recommending reviewers, outperforming a baseline technique. Library and technology experience are also found to be good proxies for code review skills. CORRECT performs equally well on both private and public codebases without bias toward any development framework.
How to Download & Install Module From the Odoo App Store in Odoo 17Celine George
Custom modules offer the flexibility to extend Odoo's capabilities, address unique requirements, and optimize workflows to align seamlessly with your organization's processes. By leveraging custom modules, businesses can unlock greater efficiency, productivity, and innovation, empowering them to stay competitive in today's dynamic market landscape. In this tutorial, we'll guide you step by step on how to easily download and install modules from the Odoo App Store.
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...EduSkills OECD
Andreas Schleicher, Director of Education and Skills at the OECD presents at the launch of PISA 2022 Volume III - Creative Minds, Creative Schools on 18 June 2024.
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumMJDuyan
(𝐓𝐋𝐄 𝟏𝟎𝟎) (𝐋𝐞𝐬𝐬𝐨𝐧 𝟏)-𝐏𝐫𝐞𝐥𝐢𝐦𝐬
𝐃𝐢𝐬𝐜𝐮𝐬𝐬 𝐭𝐡𝐞 𝐄𝐏𝐏 𝐂𝐮𝐫𝐫𝐢𝐜𝐮𝐥𝐮𝐦 𝐢𝐧 𝐭𝐡𝐞 𝐏𝐡𝐢𝐥𝐢𝐩𝐩𝐢𝐧𝐞𝐬:
- Understand the goals and objectives of the Edukasyong Pantahanan at Pangkabuhayan (EPP) curriculum, recognizing its importance in fostering practical life skills and values among students. Students will also be able to identify the key components and subjects covered, such as agriculture, home economics, industrial arts, and information and communication technology.
𝐄𝐱𝐩𝐥𝐚𝐢𝐧 𝐭𝐡𝐞 𝐍𝐚𝐭𝐮𝐫𝐞 𝐚𝐧𝐝 𝐒𝐜𝐨𝐩𝐞 𝐨𝐟 𝐚𝐧 𝐄𝐧𝐭𝐫𝐞𝐩𝐫𝐞𝐧𝐞𝐮𝐫:
-Define entrepreneurship, distinguishing it from general business activities by emphasizing its focus on innovation, risk-taking, and value creation. Students will describe the characteristics and traits of successful entrepreneurs, including their roles and responsibilities, and discuss the broader economic and social impacts of entrepreneurial activities on both local and global scales.
How to Manage Reception Report in Odoo 17Celine George
A business may deal with both sales and purchases occasionally. They buy things from vendors and then sell them to their customers. Such dealings can be confusing at times. Because multiple clients may inquire about the same product at the same time, after purchasing those products, customers must be assigned to them. Odoo has a tool called Reception Report that can be used to complete this assignment. By enabling this, a reception report comes automatically after confirming a receipt, from which we can assign products to orders.
🔥🔥🔥🔥🔥🔥🔥🔥🔥
إضغ بين إيديكم من أقوى الملازم التي صممتها
ملزمة تشريح الجهاز الهيكلي (نظري 3)
💀💀💀💀💀💀💀💀💀💀
تتميز هذهِ الملزمة بعِدة مُميزات :
1- مُترجمة ترجمة تُناسب جميع المستويات
2- تحتوي على 78 رسم توضيحي لكل كلمة موجودة بالملزمة (لكل كلمة !!!!)
#فهم_ماكو_درخ
3- دقة الكتابة والصور عالية جداً جداً جداً
4- هُنالك بعض المعلومات تم توضيحها بشكل تفصيلي جداً (تُعتبر لدى الطالب أو الطالبة بإنها معلومات مُبهمة ومع ذلك تم توضيح هذهِ المعلومات المُبهمة بشكل تفصيلي جداً
5- الملزمة تشرح نفسها ب نفسها بس تكلك تعال اقراني
6- تحتوي الملزمة في اول سلايد على خارطة تتضمن جميع تفرُعات معلومات الجهاز الهيكلي المذكورة في هذهِ الملزمة
واخيراً هذهِ الملزمة حلالٌ عليكم وإتمنى منكم إن تدعولي بالخير والصحة والعافية فقط
كل التوفيق زملائي وزميلاتي ، زميلكم محمد الذهبي 💊💊
🔥🔥🔥🔥🔥🔥🔥🔥🔥
How to Setup Default Value for a Field in Odoo 17Celine George
In Odoo, we can set a default value for a field during the creation of a record for a model. We have many methods in odoo for setting a default value to the field.
SurfClipse-- An IDE based context-aware Meta Search Engine (ERA Track)
1. AN IDE-BASED CONTEXT-AWARE
META SEARCH ENGINE
Mohammad Masudur Rahman, Shamima Yeasmin, and
Chanchal K. Roy
Department of Computer Science
University of Saskatchewan
20th Working Conference on Reverse Engineering
(WCRE 2013), Koblenz, Germany
6. IDE-BASED WEB SEARCH
About 80% effort on Software Maintenance
Bug fixation– error and exception handling
Developers spend about 19% of time in web search
Traditional web search
Does not consider context of search (No ties between
IDE and web browser)
Context-switching and distracting
Time consuming
Often not much productive
o IDE-Based context-aware search addresses
those issues.
7. EXISTING RELATED WORKS
Cordeiro et al. (RSSE’ 2012)– Context-based
recommendation system
Ponzanelli et al. (ICSE 2013)– Seahawk
Poshyvanyk et al. (IWICSS 2007)– COTS (Google
Desktop) into Eclipse IDE
Brandt et al. (SIGCHI 2010)– Integrating Google
web search into IDE
11. PROPOSED IDE-BASED META SEARCH
MODEL
Distinguished Features
Meta search engine– captures data from multiple
search engines
More precise context– both stack trace and associated
code as exception context
Popularity and confidence of result links
Complete web browsing experience within the IDE
12. PROPOSED METRICS & SCORES
Title to title Matching Score (Stitle)– Cosine similarity
measurement
Stack trace Matching Score (Sst)– SimHash based
similarity measurement
Code context Matching Score (Scc)– SimHash
based similarity measurement
StackOverflow Vote Score (Sso)– Summation of
differences between up and down votes for all
posts in the link
13. PROPOSED METRICS & SCORES
Top Ten Score (Stt)– Position of result link in the top
10 of each provider.
Page Rank Score (Spr)-- Relative popularity among
all links in the corpus using Page Rank algorithm.
Site Traffic Rank Score (Sstr)-- Alexa and Compete
Rank of each link
Search Engine weight (Ssew)---Relative reliability or
importance of each search engine. Experiments
with 75 programming queries against the search
engines.
14. METRICS NORMALIZATION
Normalization applied to -- Sst , Scc , Sso , Stt , Spr
and Sstr
Avoiding bias to any particular aspect
)min()max(
)min(
,
ii
ii
normalizedi
SS
SS
S
15. FINAL SCORE COMPONENTS
Content Relevance
Scnt=Stitle
Context Relevance
Scxt=(Sst + Scc)/2
Link Popularity
Spop=(Sso +Spr + Sstr)/3
Search Engine Confidence
Sser=(Ssew x Stt)
16. EXPERIMENT OVERVIEW
25 Exceptions collected from Eclipse IDE
workspaces.
Related to Eclipse plug-in framework and Java
Application Development
Solutions chosen from exhaustive web search with
cross validations by peers
Recommended results manually validated.
17. EXPERIMENTAL RESULTS
Score Top 10 Rank10 Top 20 Rank20
Scnt 10 3.60 16 8.63
Scnt, Scxt 11 3.00 16 7.43
Scnt, Spop 13 4.69 18 8.11
Scnt, Sser 23 4.39 23 4.39
Scnt, Scxt, Spop 13 4.07 18 7.61
Scnt, Scxt, Sser 24 4.45 24 4.45
Scnt, Scxt, Sser, Spop 23 4.26 24 4.54
Top10: No. of test cases solved when the top 10 results
considered
Rank10: Average rank of solutions when the top 10 results considered
18. USER STUDY
Five interesting exception test cases.
Five CS graduates research students as
participants.
Top 10 results from SurfClipse randomly presented
to the participants.
To avoid the bias of choosing top rated solutions.
64.28% agreement found.
19. USER STUDY RESULTS
Question ID ANSR ANSM Agreement
Q1 2.8 2.0 71.43%
Q2 4.6 2.8 60.87%
Q3 4.6 2.4 52.17%
Q4 4.2 3.0 71.43%
Q5 5.8 3.8 65.52%
Overall 4.4 2.8 64.28%
ANSR: Avg. no. of solutions recommended by the participants.
ANSM: Avg. no. of solution matched with that by our approach.
Agreement: % of agreement between solutions.
20. THREATS TO VALIDITY
Search is not real time yet.
Different aspects need different weights.
21. LATEST UPDATES
A Distributed model for IDE-Based web search–
client-server architecture, remotely hosted web
service
Parallel processing in computation
Two modes of operations– proactive and interactive
Granular refinement of metrics and assigning
relative weights (i.e., importance)
Complete IDE-based web search solution.
22. CONCLUSION & FUTURE WORKS
A novel IDE-Based search with meta search
capabilities
Exploits existing search service providers
Considers content, context, popularity and
search engine confidence of a result.
Recommends correct solution for 24(96%) out of 25
test cases.
64.28% agreement in user study.
Needs more extended experiments and user study.
Metrics need to be fine-tuned and more granulated.
Good Morning everyone
I am Masudur Rahman from University of Saskatchewan. Welcome to my presentation.
Here, I am going to present our paper titled as “An IDE-Based Context-Aware Meta Search Engine”
Basically, here, we proposed an IDE-Based recommendation system that works like a meta search engine, that means, it captures results from multiple search engines against a selected exception, and then analyze them to produce a better and context-relevant result set.
Study shows that about 50%-80% effort is spent on software maintenance,
And one of major concern during maintenance is bug fixation.
Software bugs are generally associated with runtime different errors and exceptions.
To deal with that errors and exceptions, developers spend a lot of time, its about 19% of their programming time.
Why? Its because of the traditional web search which has no ties with the IDE.
It does not consider the context of the problem developer is facing
and developer has to include the context information into the search query, which is challenging, because which term is more important than others is not clear; so, basically, this is a trial and error approach for the developer which is time-consuming.
Besides, the switching between IDE and the web browser is often not very interesting if you are trying to concentrate on a problem in the IDE.
So, what is the solution???
IDE-based search engine, and it has to consider the problem context of course.
There are some existing studies that try to address the issues of traditional web search.
However, they are basically based on StackOverflow, for example the first two works.
StackOverflow is a big source of information and recently it has 1.9 million users with 12 million posts;
However, we cannot ignore the whole web for information, and that is why our approach comes into play.
The rest two works basically tries to integrate Google desktop search and Google web search in the IDE;
However, we are interested to exploit multiple search engines to get more confident set of results for the developer.
The baseline idea is to leverage the existing resources for solving technical challenges in a smart way.
This is our proposed meta search model for IDE-based recommendation.
It has tow modules:
Client module
Computation module.
Once the developer selects an exception from Error log or console view, the client module captures the error message, stack trace and the context code likely responsible for exception and sends to the computation module.
Upon getting the search request, the computation module sends the error message to multiple search engines. We use Google, Bing, Yahoo and the StackOverflow API to collect results and use them to develop the corpus. Once the corpus is developed, we apply our proposed metrics and algorithms to produce a result set that is relevant to the encountered exception.
Now lets assume, what the traditional search engines do? Do they consider context? No. It’s the developer who has to represent the context besides the error message in the search query.
So, basically, we are providing four interesting and essential things in this model.
It exploits the idea of meta search. Why meta search? Lets discuss in the Q/A session.
We are considering more precise context: both stack trace and context code.
We are also considering popularity and confidence of a result link.
And the developer can readily browse the web link recommended with in IDE.
These are metrics we consider to determine the relevance of a result page against the query exception.
Please note that we collected the exception message and exception context in the form of stack trace and context code during search request.
So, title to tile matching basically tries to determine the content similarity between exception message and the result page title. We use cosine similarity measurement for that.
Then comes the context information.
We did HTML scraping and extract the content from different tags like pre, code, blockquote as they are likely to contain the context information about the discussed problems in the page.
Once extracted, we use SimHash based similarity to determine the relevance of the discussed problem with the query exception. SimHash basically produces a Hash value for a block of content, and if the hash value of two blocks are closer, they are considered similar. We use this metric for both stack trace and context code matching.
We also consider other metrics like
Top ten score – it marks if a result is found within the top 10 results of any search engine.
Page Rank score – we develop a artificial network among the result links in the corpus to determine their relative importance using PageRank algorithm.
Site Traffic Rank score – we collect Alexa rank for each result link.
Search engine weight– we calculate the support for each result from the search engines.
For most of the metrics
We use this formula to perform the normalization if it is not already normalized.
So, we have got different perspectives of each result link
And now, we use those perspectives to determine different types of scores.
We get:
Content-relevance
Context-relevance
Popularity and
Search engine confidence for each result.
Then, we give all of them equal share in the final score, and now we are working on their relative weights.
We design a limited experiment with 25 exceptions related to Eclipse plug-in framework
And we got interesting results.
Here, we decompose different component scores and show how much effective they are in recommendation.
We collect the top 10 and top 20 results and found that our algorithm can recommend up to 24 exceptions.
More interestingly, solutions are found within the top 5 positions mostly.
We also try to test the recommendations through a user study, because, the approach is all about the users benefit.
So, we collect the top 10 recommended results for five exceptions and present the results to the participants in a randomized order.
The idea is to check how developers apply their sense of relevance.
Here, we got 64.28% agreement between our results and their confirmation.
Basically, what we did is– we tried to map their selection to the top 5 results of each exception and found that agreement.
We also noticed that the result which they mark as relevant are found mostly in the top 5 results of our recommendation.
So, it shows that the tool is working quite accurately in relevance computation as an initial attempt.
However, it requires more extended experiment and extended user study to claim something solid.
Here are some latest updates about what we did by this time:
We applied parallel processing in the computation model to make it work faster. Present version is quite slower and not like real time.
We implemented a client-server architecture for this search so that it can be platform-independent and any IDE can leverage the search as a web service
We implemented two modes of operation– proactive and interactive, where the proactive version automatically triggers upon an exception.
We did more analysis with the metrics and scores.
To summarize,
We proposed a novel IDE-based search approach that exploits problem context and collects results from a meta search engine.
Our preliminary experiments show some interesting results.
However, of course the idea needs to be further experimented and tested to discover its potential which is our future work
And we are working on that.
So, that’s all about my talk. Thanks to all for your time.
Questions ??