This presents Falcons Explorer at Semantic Web Challenge (SWC) 2010. Falcons Explorer is a tabular and relational interface for exploring the Web of data.
This document discusses different types of measurement scales used in research including nominal, ordinal, interval, and ratio scales. It provides examples of how each scale can be used to measure variables like gender, rankings, test scores, and defines how data on each scale should be analyzed. Links are included to additional resources on using Excel for data analysis and scoring methods.
Career Path is a platform which provides data-driven personalized guidance to students for their career management and preparation as well as also provides student & alumni data management and career performance tracking.
Rui Pereira's thesis proposes a new approach to querying model-driven spreadsheets. Existing spreadsheet querying methods like Google Query are restrictive and counterintuitive. The proposed QuerySheet system infers the data model from spreadsheet structure and allows intuitive SQL-like queries. A preliminary study found QuerySheet was faster and more intuitive than Google Query. The thesis contributes a prototype and papers on model-driven spreadsheet querying. Future work includes improving denormalization handling and developing a graphical querying interface.
This document provides a bibliography of sources on research methods. It lists several URLs that discuss the strengths and weaknesses of qualitative and quantitative research, advantages and disadvantages of each, and differences between the two approaches. Other sources listed cover primary and secondary research, objectives of marketing research, audience research, and case studies on new product research.
HIEDS: A Generic and Efficient Approach to Hierarchical Dataset SummarizationGong Cheng
This document summarizes the HIEDS approach to hierarchical dataset summarization. HIEDS aims to provide multigranular summaries that preserve dataset structure and are comprehensible. It models summarization as a multidimensional knapsack problem to maximize subgroup cohesion and moderateness while disallowing large overlap. HIEDS uses a greedy strategy for efficient solving but requires non-trivial implementation. Experiments show HIEDS outperforms the baseline by generating hierarchical rather than flat groups with better trade-offs and less redundancy.
The document discusses the semantic web and its potential uses for liberal arts campuses. It provides an overview of semantic web technologies like RDF, OWL, and SPARQL. Examples are given of how semantic web tools could be used for campus projects, pedagogy, and research by exposing metadata and linking data. Challenges mentioned include complexity, lack of visible applications, and the ecological growth needed for widespread adoption.
This document discusses different types of measurement scales used in research including nominal, ordinal, interval, and ratio scales. It provides examples of how each scale can be used to measure variables like gender, rankings, test scores, and defines how data on each scale should be analyzed. Links are included to additional resources on using Excel for data analysis and scoring methods.
Career Path is a platform which provides data-driven personalized guidance to students for their career management and preparation as well as also provides student & alumni data management and career performance tracking.
Rui Pereira's thesis proposes a new approach to querying model-driven spreadsheets. Existing spreadsheet querying methods like Google Query are restrictive and counterintuitive. The proposed QuerySheet system infers the data model from spreadsheet structure and allows intuitive SQL-like queries. A preliminary study found QuerySheet was faster and more intuitive than Google Query. The thesis contributes a prototype and papers on model-driven spreadsheet querying. Future work includes improving denormalization handling and developing a graphical querying interface.
This document provides a bibliography of sources on research methods. It lists several URLs that discuss the strengths and weaknesses of qualitative and quantitative research, advantages and disadvantages of each, and differences between the two approaches. Other sources listed cover primary and secondary research, objectives of marketing research, audience research, and case studies on new product research.
HIEDS: A Generic and Efficient Approach to Hierarchical Dataset SummarizationGong Cheng
This document summarizes the HIEDS approach to hierarchical dataset summarization. HIEDS aims to provide multigranular summaries that preserve dataset structure and are comprehensible. It models summarization as a multidimensional knapsack problem to maximize subgroup cohesion and moderateness while disallowing large overlap. HIEDS uses a greedy strategy for efficient solving but requires non-trivial implementation. Experiments show HIEDS outperforms the baseline by generating hierarchical rather than flat groups with better trade-offs and less redundancy.
The document discusses the semantic web and its potential uses for liberal arts campuses. It provides an overview of semantic web technologies like RDF, OWL, and SPARQL. Examples are given of how semantic web tools could be used for campus projects, pedagogy, and research by exposing metadata and linking data. Challenges mentioned include complexity, lack of visible applications, and the ecological growth needed for widespread adoption.
Overview of Indiana University's Advanced Science Gateway support activities for drug discovery, computational chemistry, and other Web portals. For a broader overview of the OGCE project, see http://www.collab-ogce.org/ogce/index.php
Introduction to Software Defined Networking (SDN)rjain51
Class lecture by Prof. Raj Jain on Introduction to . The talk covers Origins of SDN, What is SDN?, Original Definition of SDN, What = Why We need SDN?, SDN Definition, XMPP, XMPP in Data Centers, Path Computation Element, PCE, Forwarding and Control Element, Sample ForCES Exchanges, Application Layer Traffic Optimization, ALTO, ALTO Extension, Current SDN Debate: What vs. How?, SDN Controller Functions, RESTful APIs, OSGi Framework, Open Daylight SDN Controller, OpenDaylight Tools, Affinity Metadata Service, SDN Related Organizations and Projects, SDN Web Sites, Hierarchy of Operations, Introduction to, Origins of SDN, What is SDN?, Original Definition of SDN, What = Why We need SDN?, SDN Definition, XMPP, XMPP in Data Centers, Path Computation Element, PCE, Forwarding and Control Element, Sample ForCES Exchanges, Application Layer Traffic Optimization, ALTO, ALTO Extension, Current SDN Debate: What vs. How?, SDN Controller Functions, RESTful APIs, OSGi Framework, Open Daylight SDN Controller, OpenDaylight Tools, Affinity Metadata Service, SDN Related Organizations and Projects, SDN Web Sites. Video recording available in YouTube.
The document discusses model risk management considerations for machine learning models. It begins with an overview of machine learning and artificial intelligence applications in finance. It then covers key elements of model risk management for machine learning such as model governance structure, model lifecycle management, tracking, metadata management, scaling, reproducibility, interpretability, and testing. The presentation concludes with a discussion on quantifying model risk.
This document introduces Marco Roos and discusses his transition from traditional molecular biology and bioinformatics work to e-science. It describes how e-science approaches can help address challenges in biology by enabling greater data and knowledge sharing, reuse of tools and workflows, and integrated analysis across multiple data types and sources. Examples discussed include semantic web technologies, workflow systems, and proposed e-laboratory platforms to empower scientists with virtual collaborative environments and intelligent assistance. The goal is to help biologists better exploit computational resources and expertise through enhanced and standardized e-science frameworks.
The document discusses Microsoft Research's ORECHEM project, which aims to integrate chemistry scholarship with web architectures, grid computing, and the semantic web. It involves developing infrastructure to enable new models for research and dissemination of scholarly materials in chemistry. Key aspects include using OAI-ORE standards to describe aggregations of web resources related to crystallography experiments. The objective is to build a pipeline that extracts 3D coordinate data from feeds, performs computations on resources like TeraGrid, and stores resulting RDF triples in a triplestore. RESTful web services are implemented to access different steps in the workflow.
This document summarizes the work done by Jie Bao and Li Ding at Rensselaer Polytechnic Institute on developing semantic wiki applications using Semantic MediaWiki. It outlines several semantic wiki applications created including a map mashup, policy testbed, CNL ontology editor, and distributed query extension. It also summarizes a meetup with Stanford researchers to discuss topics like integrity checking, templates, and using semantic wikis for business. Future work areas discussed include collaborative data sharing, access control, effective user interfaces, and handling large scale distributed data.
This document provides an overview of Nascent Applied Methods & Endeavors (NAME), a California-based company that develops technologies like electronic commerce applications, enterprise work architectures, and autonomous knowledge worker systems. It then lists numerous web links categorized under topics like autonomous agent research and development programs, specifications and engineering tools, intelligence theories and applications, and information retrieval systems. The document serves as an appendix providing structural references and authorities on NAME's organizational terminologies and autonomous agent development processes and programs.
This document provides an overview of ontologies and linked open data, including some real-world applications. It discusses how ontologies add semantics to data storage and querying by defining types of entities and their relationships. Popular ontologies like DCTerms and FOAF are mentioned. Examples of linked open data in the Facebook API and DBpedia are provided. The differences between relational and graph databases are outlined. SPARQL is introduced as the query language for graph databases, and Virtuoso is presented as an example of a non-relational, graph database.
Jean-Claude Bradley presents on "SMIRP: Effective use of a self-evolving database for information capture and retrieval in an R&D environment" on August 14, 2002 at the Barnett International Conference on Laboratory Notebooks. Specific implementations of integrating human and automated workflows in chemistry and nanotechnology applications are detailed.
Web2.0 2012 - lesson 7 - technologies and mashups Carlo Vaccari
This document discusses key concepts of Web 2.0 technologies including blogs, wikis, tags, social networks, AJAX, APIs, mashups, and frameworks. It provides examples of popular mashups that combine data from multiple sources to create new applications. Both the strengths and weaknesses of mashups are outlined, noting their potential for lightweight development but also dependence on external data sources and APIs.
The document summarizes Florida State University's Research Computing Center (RCC) and its Sky virtual cluster system. The RCC provides high performance and high throughput computing, data storage, visualization, and consulting services to FSU researchers. Sky is an OpenStack-based virtual machine system that allows researchers to customize computing resources and run applications not supported by the main RCC systems, such as software requiring root access or the Windows operating system. Several FSU research groups currently use Sky for tasks such as data dissemination, software development, and hosting bioinformatics tools.
Vaadin is a Java-based web application framework that allows building rich client-side web UIs using server-side Java. It features rich UI components, embraces the Java programming language, and uses a server-push architecture where the server-side code is instantly reflected to the client-side experience without page reloads. The document provides an overview of Vaadin, including what it is, why it was created to address challenges with traditional web development, how it works through its widget and component architecture, and how developers can get started using it with various IDEs and build tools.
Build Secure Cloud-Hosted Apps for SharePoint 2013Danny Jessee
Apps for SharePoint were introduced in SharePoint 2013 to maximize the level of capability and flexibility that developers can deliver without risking compromise to the farm. In this session, we will delve into apps that leverage resources running outside the SharePoint farm—whether in another on-premises web server or in the cloud. We will use server-side and client-side code to demonstrate how cloud-hosted apps can securely access data stored in SharePoint using the client object model (CSOM/JSOM) and REST APIs, along with the pros and cons associated with each approach. We will discuss the various permissions models associated with apps for SharePoint including types of app permissions, permission request scopes, and how app developers can manage permissions. We will conclude by building and provisioning a provider-hosted app for SharePoint to Office 365.
QCon São Paulo: Real-Time Analytics with Spark StreamingPaco Nathan
The document provides an overview of real-time analytics using Spark Streaming. It discusses Spark Streaming's micro-batch approach of treating streaming data as a series of small batch jobs. This allows for low-latency analysis while integrating streaming and batch processing. The document also covers Spark Streaming's fault tolerance mechanisms and provides several examples of companies like Pearson, Guavus, and Sharethrough using Spark Streaming for real-time analytics in production environments.
The document discusses future plans for iRODS at the Sanger Institute. It plans to expand the deployment of iRODS across multiple sites for increased redundancy and disaster recovery. This will involve setting up an additional zone at a new offsite location and replicating the data between zones. It also discusses various features that could be added to iRODS like improved instrumentation, caching plugins, and tighter integration with object storage vendors.
This document contains personal and educational details of Srinivasan Kannan. It lists his name, aliases, date of birth, address, email addresses, phone numbers, academic qualifications including degrees from PSG College of Technology and Chennai Mathematical Institute. It also outlines his work experience at companies like Sun Microsystems, SoftwareAG, and as an independent researcher. The document shares draft research publications and papers written by Srinivasan Kannan spanning topics in theoretical computer science and number theory, as well as designs for various open source products. Team patents filed during employment at Sun Microsystems are also referenced.
Towards Content-Based Dataset Search - Test Collections and BeyondGong Cheng
The document discusses content-based dataset search (CBDS) as an improvement over metadata-based dataset search (MBDS). It presents ACORDAR, a test collection for ad hoc CBDS using synthetic and TREC queries on RDF datasets. Evaluation results showed that both metadata and dataset content are useful, and that TREC queries are more difficult. CBDS faces challenges including scalability, tractability, and heterogeneity, but is likely to trend as it provides higher relevance and explainability than MBDS.
Overview of Indiana University's Advanced Science Gateway support activities for drug discovery, computational chemistry, and other Web portals. For a broader overview of the OGCE project, see http://www.collab-ogce.org/ogce/index.php
Introduction to Software Defined Networking (SDN)rjain51
Class lecture by Prof. Raj Jain on Introduction to . The talk covers Origins of SDN, What is SDN?, Original Definition of SDN, What = Why We need SDN?, SDN Definition, XMPP, XMPP in Data Centers, Path Computation Element, PCE, Forwarding and Control Element, Sample ForCES Exchanges, Application Layer Traffic Optimization, ALTO, ALTO Extension, Current SDN Debate: What vs. How?, SDN Controller Functions, RESTful APIs, OSGi Framework, Open Daylight SDN Controller, OpenDaylight Tools, Affinity Metadata Service, SDN Related Organizations and Projects, SDN Web Sites, Hierarchy of Operations, Introduction to, Origins of SDN, What is SDN?, Original Definition of SDN, What = Why We need SDN?, SDN Definition, XMPP, XMPP in Data Centers, Path Computation Element, PCE, Forwarding and Control Element, Sample ForCES Exchanges, Application Layer Traffic Optimization, ALTO, ALTO Extension, Current SDN Debate: What vs. How?, SDN Controller Functions, RESTful APIs, OSGi Framework, Open Daylight SDN Controller, OpenDaylight Tools, Affinity Metadata Service, SDN Related Organizations and Projects, SDN Web Sites. Video recording available in YouTube.
The document discusses model risk management considerations for machine learning models. It begins with an overview of machine learning and artificial intelligence applications in finance. It then covers key elements of model risk management for machine learning such as model governance structure, model lifecycle management, tracking, metadata management, scaling, reproducibility, interpretability, and testing. The presentation concludes with a discussion on quantifying model risk.
This document introduces Marco Roos and discusses his transition from traditional molecular biology and bioinformatics work to e-science. It describes how e-science approaches can help address challenges in biology by enabling greater data and knowledge sharing, reuse of tools and workflows, and integrated analysis across multiple data types and sources. Examples discussed include semantic web technologies, workflow systems, and proposed e-laboratory platforms to empower scientists with virtual collaborative environments and intelligent assistance. The goal is to help biologists better exploit computational resources and expertise through enhanced and standardized e-science frameworks.
The document discusses Microsoft Research's ORECHEM project, which aims to integrate chemistry scholarship with web architectures, grid computing, and the semantic web. It involves developing infrastructure to enable new models for research and dissemination of scholarly materials in chemistry. Key aspects include using OAI-ORE standards to describe aggregations of web resources related to crystallography experiments. The objective is to build a pipeline that extracts 3D coordinate data from feeds, performs computations on resources like TeraGrid, and stores resulting RDF triples in a triplestore. RESTful web services are implemented to access different steps in the workflow.
This document summarizes the work done by Jie Bao and Li Ding at Rensselaer Polytechnic Institute on developing semantic wiki applications using Semantic MediaWiki. It outlines several semantic wiki applications created including a map mashup, policy testbed, CNL ontology editor, and distributed query extension. It also summarizes a meetup with Stanford researchers to discuss topics like integrity checking, templates, and using semantic wikis for business. Future work areas discussed include collaborative data sharing, access control, effective user interfaces, and handling large scale distributed data.
This document provides an overview of Nascent Applied Methods & Endeavors (NAME), a California-based company that develops technologies like electronic commerce applications, enterprise work architectures, and autonomous knowledge worker systems. It then lists numerous web links categorized under topics like autonomous agent research and development programs, specifications and engineering tools, intelligence theories and applications, and information retrieval systems. The document serves as an appendix providing structural references and authorities on NAME's organizational terminologies and autonomous agent development processes and programs.
This document provides an overview of ontologies and linked open data, including some real-world applications. It discusses how ontologies add semantics to data storage and querying by defining types of entities and their relationships. Popular ontologies like DCTerms and FOAF are mentioned. Examples of linked open data in the Facebook API and DBpedia are provided. The differences between relational and graph databases are outlined. SPARQL is introduced as the query language for graph databases, and Virtuoso is presented as an example of a non-relational, graph database.
Jean-Claude Bradley presents on "SMIRP: Effective use of a self-evolving database for information capture and retrieval in an R&D environment" on August 14, 2002 at the Barnett International Conference on Laboratory Notebooks. Specific implementations of integrating human and automated workflows in chemistry and nanotechnology applications are detailed.
Web2.0 2012 - lesson 7 - technologies and mashups Carlo Vaccari
This document discusses key concepts of Web 2.0 technologies including blogs, wikis, tags, social networks, AJAX, APIs, mashups, and frameworks. It provides examples of popular mashups that combine data from multiple sources to create new applications. Both the strengths and weaknesses of mashups are outlined, noting their potential for lightweight development but also dependence on external data sources and APIs.
The document summarizes Florida State University's Research Computing Center (RCC) and its Sky virtual cluster system. The RCC provides high performance and high throughput computing, data storage, visualization, and consulting services to FSU researchers. Sky is an OpenStack-based virtual machine system that allows researchers to customize computing resources and run applications not supported by the main RCC systems, such as software requiring root access or the Windows operating system. Several FSU research groups currently use Sky for tasks such as data dissemination, software development, and hosting bioinformatics tools.
Vaadin is a Java-based web application framework that allows building rich client-side web UIs using server-side Java. It features rich UI components, embraces the Java programming language, and uses a server-push architecture where the server-side code is instantly reflected to the client-side experience without page reloads. The document provides an overview of Vaadin, including what it is, why it was created to address challenges with traditional web development, how it works through its widget and component architecture, and how developers can get started using it with various IDEs and build tools.
Build Secure Cloud-Hosted Apps for SharePoint 2013Danny Jessee
Apps for SharePoint were introduced in SharePoint 2013 to maximize the level of capability and flexibility that developers can deliver without risking compromise to the farm. In this session, we will delve into apps that leverage resources running outside the SharePoint farm—whether in another on-premises web server or in the cloud. We will use server-side and client-side code to demonstrate how cloud-hosted apps can securely access data stored in SharePoint using the client object model (CSOM/JSOM) and REST APIs, along with the pros and cons associated with each approach. We will discuss the various permissions models associated with apps for SharePoint including types of app permissions, permission request scopes, and how app developers can manage permissions. We will conclude by building and provisioning a provider-hosted app for SharePoint to Office 365.
QCon São Paulo: Real-Time Analytics with Spark StreamingPaco Nathan
The document provides an overview of real-time analytics using Spark Streaming. It discusses Spark Streaming's micro-batch approach of treating streaming data as a series of small batch jobs. This allows for low-latency analysis while integrating streaming and batch processing. The document also covers Spark Streaming's fault tolerance mechanisms and provides several examples of companies like Pearson, Guavus, and Sharethrough using Spark Streaming for real-time analytics in production environments.
The document discusses future plans for iRODS at the Sanger Institute. It plans to expand the deployment of iRODS across multiple sites for increased redundancy and disaster recovery. This will involve setting up an additional zone at a new offsite location and replicating the data between zones. It also discusses various features that could be added to iRODS like improved instrumentation, caching plugins, and tighter integration with object storage vendors.
This document contains personal and educational details of Srinivasan Kannan. It lists his name, aliases, date of birth, address, email addresses, phone numbers, academic qualifications including degrees from PSG College of Technology and Chennai Mathematical Institute. It also outlines his work experience at companies like Sun Microsystems, SoftwareAG, and as an independent researcher. The document shares draft research publications and papers written by Srinivasan Kannan spanning topics in theoretical computer science and number theory, as well as designs for various open source products. Team patents filed during employment at Sun Microsystems are also referenced.
Similar to Falcons Explorer: Tabular and Relational End-user Programming for the Web of Data (20)
Towards Content-Based Dataset Search - Test Collections and BeyondGong Cheng
The document discusses content-based dataset search (CBDS) as an improvement over metadata-based dataset search (MBDS). It presents ACORDAR, a test collection for ad hoc CBDS using synthetic and TREC queries on RDF datasets. Evaluation results showed that both metadata and dataset content are useful, and that TREC queries are more difficult. CBDS faces challenges including scalability, tractability, and heterogeneity, but is likely to trend as it provides higher relevance and explainability than MBDS.
Generating Compact and Relaxable Answers to Keyword Queries over Knowledge Gr...Gong Cheng
This document presents an algorithm called CORE for generating compact yet relaxable answers to keyword queries over knowledge graphs. CORE aims to balance answer compactness, defined as having a bounded diameter, with answer completeness, defined as covering the most query keywords. It provides theoretical foundations for the existence of such answers and uses a best-first search approach. An evaluation shows CORE efficiently computes answers that are more complete than alternatives while remaining compact.
Semantic Data Retrieval: Search, Ranking, and SummarizationGong Cheng
Gong Cheng presented on semantic data retrieval, including entity retrieval and association retrieval from semantic graphs. He discussed two main challenges: efficiently searching large graphs for associations within a diameter bound, and ranking the retrieved associations. For the first challenge, he proposed algorithms using path finding, pruning, and result deduplication. For the second challenge, he conducted a user study and found that association size was the most important ranking factor. Other proposed measures like entity homogeneity and relation heterogeneity had mixed user preferences.
Semantic Web related top conference reviewGong Cheng
The document summarizes key topics in semantic web and knowledge graph research from 2014-2017, including conferences, hot research areas, applications, and papers. It discusses trends such as increasing focus on knowledge graph applications, integration, and construction using techniques like neural networks. Notable news includes Google calling for dataset metadata and Wikidata creating its 31 millionth entity. The road ahead may involve greater knowledge graph commercialization, enrichment, and making knowledge graphs more accessible on the Web.
The document proposes a new approach called relatedness-based multi-entity summarization (MES) to generate concise summaries of related entities. It formulates MES as a quadratic multidimensional knapsack problem (QMKP) to select important and diverse intra-entity features while also selecting inter-entity features that indicate relatedness. It presents an algorithm called REMES based on the grasp heuristic to solve the QMKP formulation. A user study shows REMES outperforms other entity summarization methods at multi-entity summarization tasks.
Generating Illustrative Snippets for Open Data on the WebGong Cheng
We propose generating illustrative snippets from datasets to serve with metadata on dataset search engines. Currently, only metadata is shown. Snippets would help users understand the contents faster by covering important types and entities, using familiar entities, and keeping entities related. We formulate the snippet generation as a maximum-weight-and-coverage connected graph problem to optimize for these qualities. Experimental results show our snippets outperform baselines.
Efficient Algorithms for Association Finding and Frequent Association Pattern...Gong Cheng
The document presents efficient algorithms for association finding and frequent association pattern mining in large graph data. It describes the problems of finding all associations connecting a set of query entities within a diameter constraint and mining frequent association patterns. The basic solutions and optimizations for association finding using distance-based pruning and distance oracles are discussed. For frequent pattern mining, it addresses generating a canonical code to uniquely represent patterns and counting code occurrences to determine frequency. Experiments on real datasets demonstrate the efficiency and scalability of the approaches.
This document discusses summarizing semantic data, including entity descriptions, entity associations, and semantic datasets. It describes extractive and abstractive summarization methods. For entity descriptions, intrinsic metrics like frequency, centrality, informativeness, and diversity are used to rank property-value pairs for the summary. Extrinsic metrics also utilize external knowledge and context. Similar methods are applied to summarizing entity associations by ranking paths between entities. Summarizing semantic datasets involves selecting a representative subset of the data.
Taking up the Gaokao Challenge: An Information Retrieval ApproachGong Cheng
This document describes an information retrieval approach for answering questions from the Gaokao, China's national college entrance exam. It retrieves relevant concept pages, quotes, and disambiguates terms from Wikipedia. It ranks pages based on centrality within vector spaces of words, links, and categories, filtering within relevant historical categories. It assesses answer options based on the extent the question and pages can entail each option. In experiments, it correctly answered 43.09% of questions answerable from Wikipedia and 31.28% of questions outside Wikipedia's scope.
Summarizing Entity Descriptions for Effective and Efficient Human-centered En...Gong Cheng
Presented at WWW'15, Florence.
Gong Cheng, Danyun Xu, Yuzhong Qu. Summarizing Entity Descriptions for Effective and Efficient Human-centered Entity Linking. In Proceedings of the 24th International World Wide Web Conference (WWW), pages 184--194, 2015.
Explass: Exploring Associations between Entities via Top-K Ontological Patter...Gong Cheng
This document describes Explass, a system for exploring associations between entities via top-k ontological patterns and facets. It discusses challenges in exploring the over 1,000 associations within 4 hops in DBpedia and proposes two exploration methods: clustering associations into patterns and using entity/property classes as facets. The key steps involve mining significant patterns as frequent itemsets and selecting k patterns based on frequency, informativeness, and overlap. A demo of Explass on DBpedia is presented along with results of a user study comparing it to other approaches.
Facilitating Human Intervention in Coreference Resolution with Comparative En...Gong Cheng
The document presents a method for facilitating human intervention in coreference resolution by providing comparative entity summaries. It describes using properties and values of candidate coreferent entities to generate summaries that reflect their commonality and differences. The optimal summary maximizes commonality, difference, identity information and diversity, subject to a length limit. An evaluation involved human subjects verifying coreferent relationships for different summarization approaches. The comparative summary approach was found to improve verification efficiency without affecting accuracy as much as only showing common properties or entire descriptions.
Towards Exploratory Relationship Search: A Clustering-based ApproachGong Cheng
This document presents an approach for exploratory relationship search through hierarchical clustering. It aims to address the challenge of too many relationship search results by organizing them into a cluster hierarchy based on common relationship patterns. An evaluation with participants performing lookup and exploratory search tasks on DBpedia data found that the clustering approach outperformed simple listing and faceted categorization alternatives. User feedback suggested areas for improvement like more concise visualizations and cognitive support. The authors conclude it is a promising approach and future work could combine facets and clustering or explore alternatives.
Essentials of Automations: Exploring Attributes & Automation ParametersSafe Software
Building automations in FME Flow can save time, money, and help businesses scale by eliminating data silos and providing data to stakeholders in real-time. One essential component to orchestrating complex automations is the use of attributes & automation parameters (both formerly known as “keys”). In fact, it’s unlikely you’ll ever build an Automation without using these components, but what exactly are they?
Attributes & automation parameters enable the automation author to pass data values from one automation component to the next. During this webinar, our FME Flow Specialists will cover leveraging the three types of these output attributes & parameters in FME Flow: Event, Custom, and Automation. As a bonus, they’ll also be making use of the Split-Merge Block functionality.
You’ll leave this webinar with a better understanding of how to maximize the potential of automations by making use of attributes & automation parameters, with the ultimate goal of setting your enterprise integration workflows up on autopilot.
Conversational agents, or chatbots, are increasingly used to access all sorts of services using natural language. While open-domain chatbots - like ChatGPT - can converse on any topic, task-oriented chatbots - the focus of this paper - are designed for specific tasks, like booking a flight, obtaining customer support, or setting an appointment. Like any other software, task-oriented chatbots need to be properly tested, usually by defining and executing test scenarios (i.e., sequences of user-chatbot interactions). However, there is currently a lack of methods to quantify the completeness and strength of such test scenarios, which can lead to low-quality tests, and hence to buggy chatbots.
To fill this gap, we propose adapting mutation testing (MuT) for task-oriented chatbots. To this end, we introduce a set of mutation operators that emulate faults in chatbot designs, an architecture that enables MuT on chatbots built using heterogeneous technologies, and a practical realisation as an Eclipse plugin. Moreover, we evaluate the applicability, effectiveness and efficiency of our approach on open-source chatbots, with promising results.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/how-axelera-ai-uses-digital-compute-in-memory-to-deliver-fast-and-energy-efficient-computer-vision-a-presentation-from-axelera-ai/
Bram Verhoef, Head of Machine Learning at Axelera AI, presents the “How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-efficient Computer Vision” tutorial at the May 2024 Embedded Vision Summit.
As artificial intelligence inference transitions from cloud environments to edge locations, computer vision applications achieve heightened responsiveness, reliability and privacy. This migration, however, introduces the challenge of operating within the stringent confines of resource constraints typical at the edge, including small form factors, low energy budgets and diminished memory and computational capacities. Axelera AI addresses these challenges through an innovative approach of performing digital computations within memory itself. This technique facilitates the realization of high-performance, energy-efficient and cost-effective computer vision capabilities at the thin and thick edge, extending the frontier of what is achievable with current technologies.
In this presentation, Verhoef unveils his company’s pioneering chip technology and demonstrates its capacity to deliver exceptional frames-per-second performance across a range of standard computer vision networks typical of applications in security, surveillance and the industrial sector. This shows that advanced computer vision can be accessible and efficient, even at the very edge of our technological ecosystem.
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...Jason Yip
The typical problem in product engineering is not bad strategy, so much as “no strategy”. This leads to confusion, lack of motivation, and incoherent action. The next time you look for a strategy and find an empty space, instead of waiting for it to be filled, I will show you how to fill it in yourself. If you’re wrong, it forces a correction. If you’re right, it helps create focus. I’ll share how I’ve approached this in the past, both what works and lessons for what didn’t work so well.
Main news related to the CCS TSI 2023 (2023/1695)Jakub Marek
An English 🇬🇧 translation of a presentation to the speech I gave about the main changes brought by CCS TSI 2023 at the biggest Czech conference on Communications and signalling systems on Railways, which was held in Clarion Hotel Olomouc from 7th to 9th November 2023 (konferenceszt.cz). Attended by around 500 participants and 200 on-line followers.
The original Czech 🇨🇿 version of the presentation can be found here: https://www.slideshare.net/slideshow/hlavni-novinky-souvisejici-s-ccs-tsi-2023-2023-1695/269688092 .
The videorecording (in Czech) from the presentation is available here: https://youtu.be/WzjJWm4IyPk?si=SImb06tuXGb30BEH .
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor IvaniukFwdays
At this talk we will discuss DDoS protection tools and best practices, discuss network architectures and what AWS has to offer. Also, we will look into one of the largest DDoS attacks on Ukrainian infrastructure that happened in February 2022. We'll see, what techniques helped to keep the web resources available for Ukrainians and how AWS improved DDoS protection for all customers based on Ukraine experience
The Microsoft 365 Migration Tutorial For Beginner.pptxoperationspcvita
This presentation will help you understand the power of Microsoft 365. However, we have mentioned every productivity app included in Office 365. Additionally, we have suggested the migration situation related to Office 365 and how we can help you.
You can also read: https://www.systoolsgroup.com/updates/office-365-tenant-to-tenant-migration-step-by-step-complete-guide/
Dandelion Hashtable: beyond billion requests per second on a commodity serverAntonios Katsarakis
This slide deck presents DLHT, a concurrent in-memory hashtable. Despite efforts to optimize hashtables, that go as far as sacrificing core functionality, state-of-the-art designs still incur multiple memory accesses per request and block request processing in three cases. First, most hashtables block while waiting for data to be retrieved from memory. Second, open-addressing designs, which represent the current state-of-the-art, either cannot free index slots on deletes or must block all requests to do so. Third, index resizes block every request until all objects are copied to the new index. Defying folklore wisdom, DLHT forgoes open-addressing and adopts a fully-featured and memory-aware closed-addressing design based on bounded cache-line-chaining. This design offers lock-free index operations and deletes that free slots instantly, (2) completes most requests with a single memory access, (3) utilizes software prefetching to hide memory latencies, and (4) employs a novel non-blocking and parallel resizing. In a commodity server and a memory-resident workload, DLHT surpasses 1.6B requests per second and provides 3.5x (12x) the throughput of the state-of-the-art closed-addressing (open-addressing) resizable hashtable on Gets (Deletes).
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
In the realm of cybersecurity, offensive security practices act as a critical shield. By simulating real-world attacks in a controlled environment, these techniques expose vulnerabilities before malicious actors can exploit them. This proactive approach allows manufacturers to identify and fix weaknesses, significantly enhancing system security.
This presentation delves into the development of a system designed to mimic Galileo's Open Service signal using software-defined radio (SDR) technology. We'll begin with a foundational overview of both Global Navigation Satellite Systems (GNSS) and the intricacies of digital signal processing.
The presentation culminates in a live demonstration. We'll showcase the manipulation of Galileo's Open Service pilot signal, simulating an attack on various software and hardware systems. This practical demonstration serves to highlight the potential consequences of unaddressed vulnerabilities, emphasizing the importance of offensive security practices in safeguarding critical infrastructure.
AppSec PNW: Android and iOS Application Security with MobSFAjin Abraham
Mobile Security Framework - MobSF is a free and open source automated mobile application security testing environment designed to help security engineers, researchers, developers, and penetration testers to identify security vulnerabilities, malicious behaviours and privacy concerns in mobile applications using static and dynamic analysis. It supports all the popular mobile application binaries and source code formats built for Android and iOS devices. In addition to automated security assessment, it also offers an interactive testing environment to build and execute scenario based test/fuzz cases against the application.
This talk covers:
Using MobSF for static analysis of mobile applications.
Interactive dynamic security assessment of Android and iOS applications.
Solving Mobile app CTF challenges.
Reverse engineering and runtime analysis of Mobile malware.
How to shift left and integrate MobSF/mobsfscan SAST and DAST in your build pipeline.
Discover top-tier mobile app development services, offering innovative solutions for iOS and Android. Enhance your business with custom, user-friendly mobile applications.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Skybuffer SAM4U tool for SAP license adoptionTatiana Kojar
Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool.
SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.
Falcons Explorer: Tabular and Relational End-user Programming for the Web of Data
1. Falcons Explorer: Tabular and Relational
End-user Programming for the Web of Data
Gong Cheng, Huiyao Wu, Saisai Gong, Weiyi Ge, Yuzhong Qu
Websoft Research Group, Nanjing University
Institute of Web Science, Southeast University
2. Gong Cheng (程龚) gcheng@seu.edu.cn
http://upload.wikimedia.org/wikipedia/commons/5/58/Rdf_graph_for_Eric_Miller.png
How many Web users inside this auditorium
know how to play with RDF data?
80%
90%
100%
3. Gong Cheng (程龚) gcheng@seu.edu.cn
How many Web users outside this auditorium
know how to play with RDF data?
20%
10%
5%
4. Gong Cheng (程龚) gcheng@seu.edu.cn
But…
60% of end-user workers use
spreadsheets or databases
30% use conditional statements
(if-else) or formulas
5. Gong Cheng (程龚) gcheng@seu.edu.cn
Spreadsheet & Relational DB
http://www.mysql.com/common/logos/logo-mysql-110x57.png
http://www.oracleimg.com/admin/images/ocom/hp/oralogo_small.gif
http://why.openoffice.org/images/calc-big.png
6. Gong Cheng (程龚) gcheng@seu.edu.cn
Playing with RDF data
Just As
Playing with spreadsheet or relational DB
7. Gong Cheng (程龚) gcheng@seu.edu.cn
A tabular and relational interface for exploring RDF data
Browsing
Searching
Querying
http://ws.nju.edu.cn/explorer
Falcons Explorer is …
End-user programming