The document discusses listening to library data by representing it in various forms such as visualizations and sound. It provides examples of mapping different types of library data, like questions asked, to musical instruments to create melodies. The document also includes a sample of raw library usage data and references for visualizing and sonifying data.
Implementing Durham Etheses - Sebastian Palucha (Pecha Kucha)Repository Fringe
The document summarizes the implementation and development of Durham University's institutional repository, Durham E-Theses, from its launch in 2009 to present. Key milestones included registering with search services, implementing an OAI-PMH harvest for inclusion in the EThOS database, addressing UTF-8 encoding issues, and adding features like Creative Commons licensing and a local digitization project. Ongoing priorities include sustainability, integrating additional workflows, and migrating to a new repository framework.
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata APITerry Reese
This document summarizes OCLC's WorldCat Metadata API, which allows for the creation, reading, updating, and deletion of bibliographic records and institutional holdings in WorldCat. It describes available operations through the API as well as challenges around authentication, documentation, and lack of a sandbox environment. Potential uses are also outlined, such as automating technical services processes, batch record ingestion, and building pipelines between library systems and WorldCat. Integration with the tool MarcEdit is discussed for batch institutional holdings management and bibliographic record creation/updating through the API.
Where did my layer come from? The semantics of data releaseAdam Leadbetter
This document discusses the semantics of spatial data release and provenance metadata. It introduces Adam Leadbetter from the Marine Institute and provides several relevant links on topics like linked data, the PROV ontology, and information on data publication and citation. Several citations and the author's contact details are also included.
The document discusses issues around privacy and the internet. It provides examples of how personal data is collected and shared online through social media, smartphones, smart devices, and other means. It also discusses concerns about data security, data breaches, government surveillance, and the lack of strong privacy laws and regulations. Overall it examines the tensions between privacy rights and the widespread collection and sharing of personal data in the digital age.
The document discusses setting up makerspaces in libraries. It provides examples of makerspaces at several libraries like Westport Public Library and Chattanooga Public Library. The rest of the document outlines workshops that could be offered, including Arduino/robotics workshops using Arduino kits, crafting workshops using donated materials, and Raspberry Pi workshops teaching programming. It emphasizes finding instructors and giving support to help makerspaces be successful in libraries.
The document provides technology resources and contact information for the CCSD technology team, including the district website and intranet, eSupport helpdesk, and online teaching resources like Discovery Streaming and Storytown. It also provides login instructions for Discovery Streaming and notes that 4th, 5th, and 6th grade teacher computers will be upgraded to laptops with Windows 7 and Office 2010 by the holidays.
Digitization Process by Audra Eagle YunCraig FANSLER
The document provides an overview of getting started with digitization for an organization. It discusses organizing a digitization project, including prioritizing projects and creating a team. It also covers choosing equipment like hardware and software, and setting up a digital production station. The document outlines the scanning process, file organization, and storage. Finally, it discusses options for publishing and sharing digitized content both locally and through free online platforms.
The document provides an overview of getting started with digitization for an organization. It discusses organizing a digitization project, including prioritizing projects and creating a team. It also covers choosing equipment like hardware and software, setting up a scanning station, and scanning processes. Finally, it discusses options for publishing and storing digital files, including free services like Flickr and preserving content through the North Carolina Digital Heritage Center.
Implementing Durham Etheses - Sebastian Palucha (Pecha Kucha)Repository Fringe
The document summarizes the implementation and development of Durham University's institutional repository, Durham E-Theses, from its launch in 2009 to present. Key milestones included registering with search services, implementing an OAI-PMH harvest for inclusion in the EThOS database, addressing UTF-8 encoding issues, and adding features like Creative Commons licensing and a local digitization project. Ongoing priorities include sustainability, integrating additional workflows, and migrating to a new repository framework.
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata APITerry Reese
This document summarizes OCLC's WorldCat Metadata API, which allows for the creation, reading, updating, and deletion of bibliographic records and institutional holdings in WorldCat. It describes available operations through the API as well as challenges around authentication, documentation, and lack of a sandbox environment. Potential uses are also outlined, such as automating technical services processes, batch record ingestion, and building pipelines between library systems and WorldCat. Integration with the tool MarcEdit is discussed for batch institutional holdings management and bibliographic record creation/updating through the API.
Where did my layer come from? The semantics of data releaseAdam Leadbetter
This document discusses the semantics of spatial data release and provenance metadata. It introduces Adam Leadbetter from the Marine Institute and provides several relevant links on topics like linked data, the PROV ontology, and information on data publication and citation. Several citations and the author's contact details are also included.
The document discusses issues around privacy and the internet. It provides examples of how personal data is collected and shared online through social media, smartphones, smart devices, and other means. It also discusses concerns about data security, data breaches, government surveillance, and the lack of strong privacy laws and regulations. Overall it examines the tensions between privacy rights and the widespread collection and sharing of personal data in the digital age.
The document discusses setting up makerspaces in libraries. It provides examples of makerspaces at several libraries like Westport Public Library and Chattanooga Public Library. The rest of the document outlines workshops that could be offered, including Arduino/robotics workshops using Arduino kits, crafting workshops using donated materials, and Raspberry Pi workshops teaching programming. It emphasizes finding instructors and giving support to help makerspaces be successful in libraries.
The document provides technology resources and contact information for the CCSD technology team, including the district website and intranet, eSupport helpdesk, and online teaching resources like Discovery Streaming and Storytown. It also provides login instructions for Discovery Streaming and notes that 4th, 5th, and 6th grade teacher computers will be upgraded to laptops with Windows 7 and Office 2010 by the holidays.
Digitization Process by Audra Eagle YunCraig FANSLER
The document provides an overview of getting started with digitization for an organization. It discusses organizing a digitization project, including prioritizing projects and creating a team. It also covers choosing equipment like hardware and software, and setting up a digital production station. The document outlines the scanning process, file organization, and storage. Finally, it discusses options for publishing and sharing digitized content both locally and through free online platforms.
The document provides an overview of getting started with digitization for an organization. It discusses organizing a digitization project, including prioritizing projects and creating a team. It also covers choosing equipment like hardware and software, setting up a scanning station, and scanning processes. Finally, it discusses options for publishing and storing digital files, including free services like Flickr and preserving content through the North Carolina Digital Heritage Center.
Social Media Confusion? How to Choose the Right NetworkKyle Buyers
A webinar broadcast in the summer of 2012 (before I went back to school for Marketing), this presentation was the most highly-registered in our company during my time there and received very positive feedback from its attendees.
Today there are things I would edit and improve, including my overall strategic approach to social media - but that is the fun of working in an emerging field. Also, now that I have a solid educational background on the subject, I would use my new knowledge to bolster this presentation.
This document discusses data citation and using identifiers to cite datasets. It explains that identifiers provide exposure, transparency, citation tracking and verification for datasets. Identifiers associate an alphanumeric string with the location of an object, like a dataset, and can include optional metadata. Common identifier systems like DOIs provide a precise way to identify and cite datasets. Services like EZID make it easy to create and manage identifiers for datasets. The document encourages attendees to get started with data citation by creating test identifiers and discussing options with librarians.
This PPT went through several revisions. I worked and worked on finding the best, most representative images wanting to create memorable pictures for my students. I shared the PPT with my mentor who commented on how much she enjoyed the presentation and how soothing it was. At that point I realized that adding music would improve the presentation. I searched through my entire library of music in an effort to find music that best fit the presentation. I struggled and finally learned how to add more than one song to a presentation. I am very proud of my final creation and believe my students will enjoy learning about symbols in literature as they view this Power Point presentation.
This document contains links to images of various robots and artificial intelligence characters from movies, television shows, and comics. It includes images of Cyborg from DC comics, C-3PO and R2-D2 from Star Wars, Bender from Futurama, WALL-E from the movie of the same name, a robot from IRobot, a robot from an unknown source, Eve from WALL-E, Optimus Prime from Transformers, a tortured robot from an unknown source, and robots from the movie Robots.
This document provides an overview of resources for librarians to self-educate on data science basics, software, and the library's role in data management. It recommends introductory readings on cyberinfrastructure, data challenges, and evolving library services. More advanced readings include syllabi on digital curation. The document also lists blogs, conferences, and organizations for continuing education, as well as tools for tasks like data curation, metadata, and visualization.
This document contains links to various images including photos of space, the Earth, code icons, game wallpapers, and photos related to artificial intelligence and controllers. It provides a collection of external links to visual media without any connecting text or explanation.
Within 60 seconds:
- 710 computers are sold worldwide with 555 containing Intel processors
- 1,820 terabytes of new data are created, filling over 2.6 million CDs
- 232 computers worldwide become infected with malware
- Over $75,000 is added to Google's revenue
- 2 million internet users watch online pornography
- 1.1 million instant messages are sent
- 103 Blackberries are sold
EZID makes it simple for researchers and others to obtain and manage long-term identifiers for their digital content. The service can create and resolve identifiers, and it also allows entry and maintenance of information about the identifier (metadata). This presentation was given as part of a webinar series.
This document discusses machine learning and deep learning concepts in a very simple way. It provides links to resources about languages like R and Python, the basics of neural networks and perceptrons, and a live demo of a simple perceptron algorithm implemented in Swift. It also briefly touches on pros and cons of frameworks like TensorFlow and CoreML for building machine learning models on Apple platforms.
Memory Forensics for IR - Leveraging Volatility to Hunt Advanced ActorsJared Greenhill
This presentation outlined how performing memory forensics on a single memory image broke open an extremely large intrusion in the non-profit space. Tools, techniques and procedures (TTP’s) of an advanced actor intrusion will be highlighted during a technical deep-dive of memory analysis and related workflow.
Gathering content that is free of copyright, can be difficult if you don't know where to start. This session will teach participants about resources that are available online that are copyright free and can be used by students.
This document discusses how webpages and internet usage looked in the year 2000, with 3 main points. First, webpages were self-hosted on personal computers rather than online servers. Second, internet browsers like Internet Explorer and Firefox were used to access these basic webpages. Third, concepts like cloud computing, collaboration, and online applications did not yet exist, and the internet was less integrated into daily life and business compared to today.
This is the presentation material used for the VOGIN-IP lezing 28 februari 2013 by Marina Noordegraaf. If you want to hear more about the context and meaning of the images, you know whom you might ask ;-) For the version WITH animations go to http://dl.dropbox.com/u/18649633/VOGINIP280213Slideshare.pptx
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
RO-Crate: A framework for packaging research products into FAIR Research Objects presented to Research Data Alliance RDA Data Fabric/GEDE FAIR Digital Object meeting. 2021-02-25
The document discusses open data from Fingal County Council's perspective. It provides details on Fingal's open data portal including the 170 datasets across 12 categories and apps created through an open data app competition. It also discusses Dublin region's open data network, examples of data reuse, and steps for government agencies to publish open data including assigning responsibility, releasing data without restrictions, and engaging communities.
PPT created by Kathleen Curran on Traditional Symbols in Literature. The slide show will be accompanied by a lecture. Students will have a handout with the symbol listed. They will take notes on what the symbol represents as they view the slide show and listen to my lecture. The next step will be having students find these symbols in literature we read in class.
InterPlanetary Wayback: The Next Step Towards Decentralized Web ArchivingSawood Alam
InterPlanetary Wayback (IPWB) facilitates permanence and collaboration in web archives by disseminating the contents of WARC files into the InterPlanetary File System (IPFS) network. IPFS is a peer-to-peer content-addressable file system that inherently allows deduplication and facilitates opt-in replication. IPWB splits the header and payload of WARC response records before disseminating into IPFS to leverage the deduplication, builds a CDXJ index with references to the IPFS hashes returns, and combines the headers and payload from IPFS at the time of replay. We also explore the possibility of an index-free, fully decentralized collaborative web archiving system as the next step.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Social Media Confusion? How to Choose the Right NetworkKyle Buyers
A webinar broadcast in the summer of 2012 (before I went back to school for Marketing), this presentation was the most highly-registered in our company during my time there and received very positive feedback from its attendees.
Today there are things I would edit and improve, including my overall strategic approach to social media - but that is the fun of working in an emerging field. Also, now that I have a solid educational background on the subject, I would use my new knowledge to bolster this presentation.
This document discusses data citation and using identifiers to cite datasets. It explains that identifiers provide exposure, transparency, citation tracking and verification for datasets. Identifiers associate an alphanumeric string with the location of an object, like a dataset, and can include optional metadata. Common identifier systems like DOIs provide a precise way to identify and cite datasets. Services like EZID make it easy to create and manage identifiers for datasets. The document encourages attendees to get started with data citation by creating test identifiers and discussing options with librarians.
This PPT went through several revisions. I worked and worked on finding the best, most representative images wanting to create memorable pictures for my students. I shared the PPT with my mentor who commented on how much she enjoyed the presentation and how soothing it was. At that point I realized that adding music would improve the presentation. I searched through my entire library of music in an effort to find music that best fit the presentation. I struggled and finally learned how to add more than one song to a presentation. I am very proud of my final creation and believe my students will enjoy learning about symbols in literature as they view this Power Point presentation.
This document contains links to images of various robots and artificial intelligence characters from movies, television shows, and comics. It includes images of Cyborg from DC comics, C-3PO and R2-D2 from Star Wars, Bender from Futurama, WALL-E from the movie of the same name, a robot from IRobot, a robot from an unknown source, Eve from WALL-E, Optimus Prime from Transformers, a tortured robot from an unknown source, and robots from the movie Robots.
This document provides an overview of resources for librarians to self-educate on data science basics, software, and the library's role in data management. It recommends introductory readings on cyberinfrastructure, data challenges, and evolving library services. More advanced readings include syllabi on digital curation. The document also lists blogs, conferences, and organizations for continuing education, as well as tools for tasks like data curation, metadata, and visualization.
This document contains links to various images including photos of space, the Earth, code icons, game wallpapers, and photos related to artificial intelligence and controllers. It provides a collection of external links to visual media without any connecting text or explanation.
Within 60 seconds:
- 710 computers are sold worldwide with 555 containing Intel processors
- 1,820 terabytes of new data are created, filling over 2.6 million CDs
- 232 computers worldwide become infected with malware
- Over $75,000 is added to Google's revenue
- 2 million internet users watch online pornography
- 1.1 million instant messages are sent
- 103 Blackberries are sold
EZID makes it simple for researchers and others to obtain and manage long-term identifiers for their digital content. The service can create and resolve identifiers, and it also allows entry and maintenance of information about the identifier (metadata). This presentation was given as part of a webinar series.
This document discusses machine learning and deep learning concepts in a very simple way. It provides links to resources about languages like R and Python, the basics of neural networks and perceptrons, and a live demo of a simple perceptron algorithm implemented in Swift. It also briefly touches on pros and cons of frameworks like TensorFlow and CoreML for building machine learning models on Apple platforms.
Memory Forensics for IR - Leveraging Volatility to Hunt Advanced ActorsJared Greenhill
This presentation outlined how performing memory forensics on a single memory image broke open an extremely large intrusion in the non-profit space. Tools, techniques and procedures (TTP’s) of an advanced actor intrusion will be highlighted during a technical deep-dive of memory analysis and related workflow.
Gathering content that is free of copyright, can be difficult if you don't know where to start. This session will teach participants about resources that are available online that are copyright free and can be used by students.
This document discusses how webpages and internet usage looked in the year 2000, with 3 main points. First, webpages were self-hosted on personal computers rather than online servers. Second, internet browsers like Internet Explorer and Firefox were used to access these basic webpages. Third, concepts like cloud computing, collaboration, and online applications did not yet exist, and the internet was less integrated into daily life and business compared to today.
This is the presentation material used for the VOGIN-IP lezing 28 februari 2013 by Marina Noordegraaf. If you want to hear more about the context and meaning of the images, you know whom you might ask ;-) For the version WITH animations go to http://dl.dropbox.com/u/18649633/VOGINIP280213Slideshare.pptx
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
RO-Crate: A framework for packaging research products into FAIR Research Objects presented to Research Data Alliance RDA Data Fabric/GEDE FAIR Digital Object meeting. 2021-02-25
The document discusses open data from Fingal County Council's perspective. It provides details on Fingal's open data portal including the 170 datasets across 12 categories and apps created through an open data app competition. It also discusses Dublin region's open data network, examples of data reuse, and steps for government agencies to publish open data including assigning responsibility, releasing data without restrictions, and engaging communities.
PPT created by Kathleen Curran on Traditional Symbols in Literature. The slide show will be accompanied by a lecture. Students will have a handout with the symbol listed. They will take notes on what the symbol represents as they view the slide show and listen to my lecture. The next step will be having students find these symbols in literature we read in class.
InterPlanetary Wayback: The Next Step Towards Decentralized Web ArchivingSawood Alam
InterPlanetary Wayback (IPWB) facilitates permanence and collaboration in web archives by disseminating the contents of WARC files into the InterPlanetary File System (IPFS) network. IPFS is a peer-to-peer content-addressable file system that inherently allows deduplication and facilitates opt-in replication. IPWB splits the header and payload of WARC response records before disseminating into IPFS to leverage the deduplication, builds a CDXJ index with references to the IPFS hashes returns, and combines the headers and payload from IPFS at the time of replay. We also explore the possibility of an index-free, fully decentralized collaborative web archiving system as the next step.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on integration of Salesforce with Bonterra Impact Management.
Interested in deploying an integration with Salesforce for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Skybuffer SAM4U tool for SAP license adoptionTatiana Kojar
Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool.
SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...alexjohnson7307
Predictive maintenance is a proactive approach that anticipates equipment failures before they happen. At the forefront of this innovative strategy is Artificial Intelligence (AI), which brings unprecedented precision and efficiency. AI in predictive maintenance is transforming industries by reducing downtime, minimizing costs, and enhancing productivity.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
Welcome to my lightning talk ‘Listening to library data’Many of you know me but for those that don’t, my name is Katie Legere and I’m systems coordinator at the Queen’s University Library in Kingston. I have somewhere around 7 years of library experience – 5 years as developer at a public library and a year and a bit in an academic library. I also have around 10 years of computing experience as well as an undergrad and masters in computer science .But what I really have a lot of is experience in music – over 35 years studying and performing and thinking about various kinds of music. So, it really isn’t surprising that when I get a chance to think about a way to combine these things, I get pretty excited. Today I want to tell you about an emerging field in data representation and a little project I did at my library.
Librarians love statistics and no wonder! But, while access to information is important for planning and evaluating what we do, the amazing advances in information technology have enabled an explosion in the sheer amount of data available (Badrakhan 2010). Creating new ways of understanding this data has become an important field of research
The field of visualization is a rapidly growing one these days. Even data that is pretty easy to understand by looking at the numbers can be made clearer and more engaging by using familiar tools like graphs and pie charts.
As the sheer amount of data grows so do our ways of visualizing it using things like tag clouds, maps and infographics.Visualization is increasingly a hot topic at conferences.
The concept of using sound or ‘sonification’ to interpret large amounts of data in real time is a relatively recent one though.Sonification is most commonly defined as the “use of non-speech audio to convey information” (Hermann and Ritter 1999), The concept of gathering information through sound is hardly new. Humans and our animal friends have been using our ears as well as our eyes and noses as long as we’ve been around.And it’s unlikely that we would have survived too long if we could not hear the snap of a twig behind us in the forest and decide that running away or climbing a tree might be in order. Sonification takes advantage of people’s innate ability to detect subtle differences in sounds and perceive cycles, rhythms, patterns and short events by listening, even allowing data to be monitored while the listener is doing something else.
And indeed using sound to gather information is not new. Medical practitioners have been stethoscopes as a normal part of their equipment for hundreds of years to help diagnose dangerous conditions (Barrass and Kramer 1999). Geiger counters measure radiation levels and transmit information both through a visual interface and audibly through clicks (Hunt 2011). And the ping of sonar is a familiar sound to many from watching moviesAs the amount of information available to organizations increases, new ways of analyzing and understanding the data must also appear in order that meaningful information can be drawn from it. Sonification also allows for adapting the way users interact with information (Diaz-Merced et al. 2012) increasing accessibility by employing our highly developed hearing as an option as well as a complement to visualization techniques in understanding data.
The usual approach to the representation of data as sound is through parameter mapping. Data elements are mapped to particular elements of sound such as pitch, duration, volume, and timbre and what entered as a stream of numbers emerges as sound.
For examples – Etna VolcanoThe MusicaInaudita sound laboratory of the University of Salerno (Italy), in collaboration with the Catania INFN Section and the Technologies and Research for Contemporary Arts has a project which sonifies Geophysical data collected by a digital seismograph placed on the Etna volcano in Catania (Italy). They maintain that sonic representations are particularly useful when dealing with complex, high-dimensional data, or in data monitoring tasks where it is practically impossible to use the visual inspection and are interested in exploring the possibility of describing patterns or trends, through sound, which were hardly perceivable otherwise.
Using the scientific graph data that the ATLAS experiment presented on 4th July, scientists at Cambridge created a sonification algorithm which offers the same qualitative and quantitative information contained in the graph , only translated into notes. They mapped the numbers to the notes using two principles:1. the same number is associated to the same note2. the melody is “covariant” with the data, i.e. the melody changes following exactly the same profile of the scientific data, exactly as shown in the attached picture.
Using the same sort of data mapping process, I set out to see what our library reference statistics might tell us.
We use a slightly modified version of libstats to gather reference information from the various libraries and store all of this in a mySql database.
So it was fairly straightforward to extract the data and do some mapping.Each library was mapped to an instrument… and when one hears the final product it is easy to identify the libraries where there is a lot of activity because you hear that particular instrument.
Each of the question types was mapped to a melodic fragment. And the date of the question was mapped to a bar number – each ‘new day’ became a new bar number and any duplications on the same day increased the volume as well.So, in the same way as the library to instrument mapping, when one hears the final product it is easy to identify the kinds of questions that recur from the repetition of the melodic fragment.
The result is a musical score (using finale as the notation software) which, conceivably could be performed but is easily exported to an mp3 file to listen to… and here’s a fragment.
Clearly I could go a great deal further in the project but it was definitely an interesting exercise and, I think that sonification offers us an interesting new way of looking at data. Are there any questions?