Copy of getting into ai event slides (PDF)Matthew Miller
1) The document discusses key principles of AI discussed at a conference: AI should enhance rather than replace humans, be transparent to build trust, and consider social impact.
2) Speakers addressed topics like the future of creativity, business, jobs, and education with AI, and that humans and computers together can better solve problems.
3) To be a data scientist requires skills like asking questions, experimenting, understanding failure, passion for a sector, and enjoying learning.
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebJames Hendler
These slides, based on a presentation at distinguished lecture at IBM Almaden in March, 2017 explore some of the challenges to machine learning and some recent work. It is a newer version of the slides originally presented at IJCAI 2016.
This document provides an overview of artificial intelligence (AI) including:
1. It discusses common myths about AI and clarifies that while AI systems can learn, they require significant human guidance. Neural networks are inspired by the human brain but function differently.
2. It outlines the history of AI from its origins in the 1940s to modern approaches using large datasets and increased computing power.
3. It defines key AI concepts like machine learning, deep learning, artificial narrow intelligence, and artificial general intelligence.
4. It explains different machine learning methods like supervised, unsupervised, and reinforcement learning and what is meant by "learning" in machine learning systems.
5. It provides examples of AI applications
The document discusses the "black box problem" in artificial intelligence and neural networks. Specifically, it notes that while these systems can perform complex tasks, the inner workings and decision-making processes are not fully understood. It argues that developing theoretical frameworks grounded in other domains, like physics, could help increase transparency and interpretability of these technologies. More work is needed to better understand and explain how artificial intelligence systems learn and operate.
Hector Guerrero- Road to Business AnalyticsErika Marr
This document provides an overview of key concepts in business analytics including:
- Definitions of data science, data scientist, and analytics which involve extracting insights from data.
- A process map of data science including data collection, cleaning, modeling, and communication.
- A brief history and timeline of developments in computer technology, statistics, and analytics from the 1960s to present.
- Emerging areas like artificial intelligence, autonomous systems, and the impact of technology on jobs and society.
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
IJCAI 16 keynote on the need to bring modern AI accomplishments of recent years into connection with the more traditional goals of symbolic AI (and vice versa).
Presentation given at the Linguistic Data Consortium (LDC), University of Pennsylvania, April 2019. Based on presentations at the 6th ACM Collective Intelligence Conference, 2018 and the 6th AAAI Conference on Human Computation & Crowdsourcing (HCOMP), 2018. Blog post: https://blog.humancomputation.com/?p=9932.
Social Machines - 2017 Update (University of Iowa)James Hendler
This is an update to the talk entitled "Social Machines: the coming collision of artificial intelligence, social networks and humanity." It was presented as an ACM Distinguished Speaker lecture at the "University of Iowa Computing Conference" 2017-02-24
Copy of getting into ai event slides (PDF)Matthew Miller
1) The document discusses key principles of AI discussed at a conference: AI should enhance rather than replace humans, be transparent to build trust, and consider social impact.
2) Speakers addressed topics like the future of creativity, business, jobs, and education with AI, and that humans and computers together can better solve problems.
3) To be a data scientist requires skills like asking questions, experimenting, understanding failure, passion for a sector, and enjoying learning.
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebJames Hendler
These slides, based on a presentation at distinguished lecture at IBM Almaden in March, 2017 explore some of the challenges to machine learning and some recent work. It is a newer version of the slides originally presented at IJCAI 2016.
This document provides an overview of artificial intelligence (AI) including:
1. It discusses common myths about AI and clarifies that while AI systems can learn, they require significant human guidance. Neural networks are inspired by the human brain but function differently.
2. It outlines the history of AI from its origins in the 1940s to modern approaches using large datasets and increased computing power.
3. It defines key AI concepts like machine learning, deep learning, artificial narrow intelligence, and artificial general intelligence.
4. It explains different machine learning methods like supervised, unsupervised, and reinforcement learning and what is meant by "learning" in machine learning systems.
5. It provides examples of AI applications
The document discusses the "black box problem" in artificial intelligence and neural networks. Specifically, it notes that while these systems can perform complex tasks, the inner workings and decision-making processes are not fully understood. It argues that developing theoretical frameworks grounded in other domains, like physics, could help increase transparency and interpretability of these technologies. More work is needed to better understand and explain how artificial intelligence systems learn and operate.
Hector Guerrero- Road to Business AnalyticsErika Marr
This document provides an overview of key concepts in business analytics including:
- Definitions of data science, data scientist, and analytics which involve extracting insights from data.
- A process map of data science including data collection, cleaning, modeling, and communication.
- A brief history and timeline of developments in computer technology, statistics, and analytics from the 1960s to present.
- Emerging areas like artificial intelligence, autonomous systems, and the impact of technology on jobs and society.
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
IJCAI 16 keynote on the need to bring modern AI accomplishments of recent years into connection with the more traditional goals of symbolic AI (and vice versa).
Presentation given at the Linguistic Data Consortium (LDC), University of Pennsylvania, April 2019. Based on presentations at the 6th ACM Collective Intelligence Conference, 2018 and the 6th AAAI Conference on Human Computation & Crowdsourcing (HCOMP), 2018. Blog post: https://blog.humancomputation.com/?p=9932.
Social Machines - 2017 Update (University of Iowa)James Hendler
This is an update to the talk entitled "Social Machines: the coming collision of artificial intelligence, social networks and humanity." It was presented as an ACM Distinguished Speaker lecture at the "University of Iowa Computing Conference" 2017-02-24
The document discusses the need for ontologies that can better support linking and mapping between large, distributed databases on the semantic web. While OWL has been successful in some domains, it lacks expressivity for tasks like representing part-whole relations, temporal reasoning, and procedural knowledge. A new generation of ontology languages may need to relax requirements like decidability in order to more powerfully represent relationships that are important for data integration and discovery across multiple knowledge sources.
Machine Learning for Non-technical Peopleindico data
Machine learning is one of the most promising and most difficult to understand fields of the modern age. Here are the slides from Slater Victoroff's (CEO of indico) talk at General Assembly Boston for non-technical folks on how to separate the signal from the noise -- stay tuned for the next time he speaks:
https://generalassemb.ly/education/machine-learning-for-non-technical-people
This document discusses trends and issues in interdisciplinary research between ICT and social sciences. It touches on several topics including:
- The emergence of data-driven science and use of digital tools for research
- Debate around claims that large datasets can replace theories and models
- Development of computational social science and e-science tools
- New roles for data and need for contextualization of big data findings
- Challenges of big data such as data gaps, biases, and ethical issues
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Matthew Lease
This document summarizes a presentation about designing human-AI partnerships for fact-checking misinformation. It discusses using crowdsourced rationales to improve the accuracy and cost-efficiency of annotation tasks. It also addresses challenges in designing interfaces for automatic fact-checking models, such as integrating human knowledge and reasoning to correct errors and account for bias. The goal is to develop mixed-initiative systems where humans and AI can jointly reason and personalize fact-checking.
Talk given at Delft University speaker series on "Crowd Computing & Human-Centered AI" (https://www.academicfringe.org/). November 23, 2020. Covers two 2020 works:
(1) Anubrata Das, Brandon Dang, and Matthew Lease. Fast, Accurate, and Healthier: Interactive Blurring Helps Moderators Reduce Exposure to Harmful Content. In Proceedings of the 8th AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2020.
Alexander Braylan and Matthew Lease. Modeling and Aggregation of Complex Annotations via Annotation Distances. In Proceedings of the Web Conference, pages 1807--1818, 2020.
This document summarizes a talk on data science for software engineering. It discusses how data science involves various fields like statistics, machine learning, and data mining. It notes that while "big data" is often discussed, software engineering data is typically small and sparse. Domain knowledge is important for data mining to avoid misinterpreting data. Data science with software engineering data requires understanding organizations and their willingness to share data given privacy concerns. The document outlines sharing data, models, and methods for learning across different organizations and discusses techniques for balancing privacy and utility when sharing data.
Social Machines: The coming collision of Artificial Intelligence, Social Netw...James Hendler
Jim Hendler discusses social machines, which he defines as networks of machines supporting networks of people working together in ways that impact the real world. He argues that social networking consumes huge amounts of human time and that this time could be harnessed through social machines to solve problems like curing disease and feeding the hungry. Examples of early social machines include games with a purpose that harness human computation and citizen science projects like Galaxy Zoo. Moving forward, social machines may blend more with artificial intelligence, and their study requires multidisciplinary perspectives from computing, social science, and other fields. Realizing their potential faces both social challenges around online communities and technical challenges in platform design.
Introduction to Data Science and Large-scale Machine LearningNik Spirin
This document is a presentation about data science and artificial intelligence given by James G. Shanahan. It provides an outline that covers topics such as machine learning, data science applications, architecture, and future directions. Shanahan has over 25 years of experience in data science and currently works as an independent consultant and teaches at UC Berkeley. The presentation provides background on artificial intelligence and machine learning techniques as well as examples of their successful applications.
This is the slideshow for a presentation I gave as part of my graduate coursework at the Institute for Innovation and Public Purpose at University College London (UCL IIPP). Drawing on the work of IIPP professors including Carlota Perez (techno-economic paradigms), Mariana Mazzucato (“The Entrepreneurial State”), and Tim O’Reilly, I evaluate the innovation trajectory of Deep Neural Networks as a method of machine learning. I trace the history of machine learning to its present-day and conclude that while Deep Neural Networks have not yet reached technological maturity, they are already starting to encounter barriers to exponential growth and innovation. These slides were designed to be read independently from the spoken portion. If you found this useful or interesting, please message me on LinkedIn! - Justin Beirold
Designing Human-AI Partnerships to Combat Misinfomation Matthew Lease
The document discusses designing human-AI partnerships to combat misinformation. It describes a prototype partnership where a human and AI work together to fact-check claims. The partnership aims to make the AI more transparent and address user bias by allowing the user to adjust the perceived reliability of news sources, which then changes the AI's political leaning analysis and fact checking results. The discussion wraps up by noting challenges like avoiding echo chambers and assessing potential harms, as well as opportunities for AI to reduce bias and increase trust through explainable, interactive systems.
Why Watson Won: A cognitive perspectiveJames Hendler
In this talk, we present how the Watson program, IBM's famous Jeopardy playing computer, works (based on papers published by IBM), we look at some aspects of potential scoring approaches, and we examine how Watson compares to several well known systems and some preliminary thoughts on using it in future artificial intelligence and cognitive science approaches.
Lessons learned from building practical deep learning systemsXavier Amatriain
1. There are many lessons to be learned from building practical deep learning systems, including choosing the right evaluation metrics, being thoughtful about your data and potential biases, and understanding dependencies between data, models, and systems.
2. It is important to optimize only what matters and beware of biases in your data. Simple models are often better than complex ones, and feature engineering is crucial.
3. Both supervised and unsupervised learning are important, and ensembles often perform best. Your AI infrastructure needs to support both experimentation and production.
Talk given at Los Alamos National Labs in Fall 2015.
As research becomes more data-intensive and platforms become more heterogeneous, we need to shift focus from performance to productivity.
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Matthew Lease
Presented at the 31st ACM User Interface Software and Technology Symposium (UIST), 2018. Paper: https://www.ischool.utexas.edu/~ml/papers/nguyen-uist18.pdf
In this talk I review some of the early visions of the Semantic Web, some of the different views, and I follow through on a thread of how Semantic Web technology has been adopted in search engines (and other companies). I end with a challenge to the research community to keep pursuing this research, rather than letting industry take over the "low end" and keep new work from flourishing.
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Dhiana Deva
The document discusses introducing machine learning and the challenges that come with it. It likens introducing machine learning to opening Pandora's box, as it brings problems like constraints, assumptions, risks, and issues. It recommends starting with simple approaches, addressing these challenges through iteration, and aiming high with vision while avoiding algorithmic bias. The overall message is to have fun on the journey of machine learning and focus on creating customer value.
The Unreasonable Effectiveness of MetadataJames Hendler
Invited talk at VIVO 2017 conference - explores the view of the semantic web as enriched metadata, and how that kind of information can be used in new and interesting ways.
Search as Communication: Lessons from a Personal JourneyDaniel Tunkelang
The document discusses lessons learned from the author's personal journey in search engineering. It covers insights from library science about treating search as an information-seeking context and communicating with users. It also discusses the importance of entity detection and how to leverage corpus features to improve extraction. The author realized that queries vary in difficulty and systems need to recognize this and adapt accordingly. The key takeaway is that search should be treated as a communication problem rather than just a ranking task.
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...advantika
Вы считаете, что создание сайта - это несколько месяцев работы, большие вложения и недоступно обычному малому бизнесу? Мы расскажем, как запустить собственный сайт всего за несколько дней без лишних расходов и покажем все многообразие готовых решений для сайтов на 1С-Битрикс. Вам останется только выбрать подходящее вам!
This two-day lesson plan teaches 9th grade students about assessing personal health and fitness. On day one, students are introduced to key terms like flexibility, muscle endurance, BMI, personal fitness, health, and wellness. They play a matching game and take a mini quiz. Day two focuses on flexibility, with students learning how to perform and score the sit-and-reach test. The importance of flexibility and maintaining it is emphasized. Students will be assessed through quizzes, exams, and tracking their own fitness scores over time.
The document discusses the need for ontologies that can better support linking and mapping between large, distributed databases on the semantic web. While OWL has been successful in some domains, it lacks expressivity for tasks like representing part-whole relations, temporal reasoning, and procedural knowledge. A new generation of ontology languages may need to relax requirements like decidability in order to more powerfully represent relationships that are important for data integration and discovery across multiple knowledge sources.
Machine Learning for Non-technical Peopleindico data
Machine learning is one of the most promising and most difficult to understand fields of the modern age. Here are the slides from Slater Victoroff's (CEO of indico) talk at General Assembly Boston for non-technical folks on how to separate the signal from the noise -- stay tuned for the next time he speaks:
https://generalassemb.ly/education/machine-learning-for-non-technical-people
This document discusses trends and issues in interdisciplinary research between ICT and social sciences. It touches on several topics including:
- The emergence of data-driven science and use of digital tools for research
- Debate around claims that large datasets can replace theories and models
- Development of computational social science and e-science tools
- New roles for data and need for contextualization of big data findings
- Challenges of big data such as data gaps, biases, and ethical issues
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Matthew Lease
This document summarizes a presentation about designing human-AI partnerships for fact-checking misinformation. It discusses using crowdsourced rationales to improve the accuracy and cost-efficiency of annotation tasks. It also addresses challenges in designing interfaces for automatic fact-checking models, such as integrating human knowledge and reasoning to correct errors and account for bias. The goal is to develop mixed-initiative systems where humans and AI can jointly reason and personalize fact-checking.
Talk given at Delft University speaker series on "Crowd Computing & Human-Centered AI" (https://www.academicfringe.org/). November 23, 2020. Covers two 2020 works:
(1) Anubrata Das, Brandon Dang, and Matthew Lease. Fast, Accurate, and Healthier: Interactive Blurring Helps Moderators Reduce Exposure to Harmful Content. In Proceedings of the 8th AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2020.
Alexander Braylan and Matthew Lease. Modeling and Aggregation of Complex Annotations via Annotation Distances. In Proceedings of the Web Conference, pages 1807--1818, 2020.
This document summarizes a talk on data science for software engineering. It discusses how data science involves various fields like statistics, machine learning, and data mining. It notes that while "big data" is often discussed, software engineering data is typically small and sparse. Domain knowledge is important for data mining to avoid misinterpreting data. Data science with software engineering data requires understanding organizations and their willingness to share data given privacy concerns. The document outlines sharing data, models, and methods for learning across different organizations and discusses techniques for balancing privacy and utility when sharing data.
Social Machines: The coming collision of Artificial Intelligence, Social Netw...James Hendler
Jim Hendler discusses social machines, which he defines as networks of machines supporting networks of people working together in ways that impact the real world. He argues that social networking consumes huge amounts of human time and that this time could be harnessed through social machines to solve problems like curing disease and feeding the hungry. Examples of early social machines include games with a purpose that harness human computation and citizen science projects like Galaxy Zoo. Moving forward, social machines may blend more with artificial intelligence, and their study requires multidisciplinary perspectives from computing, social science, and other fields. Realizing their potential faces both social challenges around online communities and technical challenges in platform design.
Introduction to Data Science and Large-scale Machine LearningNik Spirin
This document is a presentation about data science and artificial intelligence given by James G. Shanahan. It provides an outline that covers topics such as machine learning, data science applications, architecture, and future directions. Shanahan has over 25 years of experience in data science and currently works as an independent consultant and teaches at UC Berkeley. The presentation provides background on artificial intelligence and machine learning techniques as well as examples of their successful applications.
This is the slideshow for a presentation I gave as part of my graduate coursework at the Institute for Innovation and Public Purpose at University College London (UCL IIPP). Drawing on the work of IIPP professors including Carlota Perez (techno-economic paradigms), Mariana Mazzucato (“The Entrepreneurial State”), and Tim O’Reilly, I evaluate the innovation trajectory of Deep Neural Networks as a method of machine learning. I trace the history of machine learning to its present-day and conclude that while Deep Neural Networks have not yet reached technological maturity, they are already starting to encounter barriers to exponential growth and innovation. These slides were designed to be read independently from the spoken portion. If you found this useful or interesting, please message me on LinkedIn! - Justin Beirold
Designing Human-AI Partnerships to Combat Misinfomation Matthew Lease
The document discusses designing human-AI partnerships to combat misinformation. It describes a prototype partnership where a human and AI work together to fact-check claims. The partnership aims to make the AI more transparent and address user bias by allowing the user to adjust the perceived reliability of news sources, which then changes the AI's political leaning analysis and fact checking results. The discussion wraps up by noting challenges like avoiding echo chambers and assessing potential harms, as well as opportunities for AI to reduce bias and increase trust through explainable, interactive systems.
Why Watson Won: A cognitive perspectiveJames Hendler
In this talk, we present how the Watson program, IBM's famous Jeopardy playing computer, works (based on papers published by IBM), we look at some aspects of potential scoring approaches, and we examine how Watson compares to several well known systems and some preliminary thoughts on using it in future artificial intelligence and cognitive science approaches.
Lessons learned from building practical deep learning systemsXavier Amatriain
1. There are many lessons to be learned from building practical deep learning systems, including choosing the right evaluation metrics, being thoughtful about your data and potential biases, and understanding dependencies between data, models, and systems.
2. It is important to optimize only what matters and beware of biases in your data. Simple models are often better than complex ones, and feature engineering is crucial.
3. Both supervised and unsupervised learning are important, and ensembles often perform best. Your AI infrastructure needs to support both experimentation and production.
Talk given at Los Alamos National Labs in Fall 2015.
As research becomes more data-intensive and platforms become more heterogeneous, we need to shift focus from performance to productivity.
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Matthew Lease
Presented at the 31st ACM User Interface Software and Technology Symposium (UIST), 2018. Paper: https://www.ischool.utexas.edu/~ml/papers/nguyen-uist18.pdf
In this talk I review some of the early visions of the Semantic Web, some of the different views, and I follow through on a thread of how Semantic Web technology has been adopted in search engines (and other companies). I end with a challenge to the research community to keep pursuing this research, rather than letting industry take over the "low end" and keep new work from flourishing.
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019Dhiana Deva
The document discusses introducing machine learning and the challenges that come with it. It likens introducing machine learning to opening Pandora's box, as it brings problems like constraints, assumptions, risks, and issues. It recommends starting with simple approaches, addressing these challenges through iteration, and aiming high with vision while avoiding algorithmic bias. The overall message is to have fun on the journey of machine learning and focus on creating customer value.
The Unreasonable Effectiveness of MetadataJames Hendler
Invited talk at VIVO 2017 conference - explores the view of the semantic web as enriched metadata, and how that kind of information can be used in new and interesting ways.
Search as Communication: Lessons from a Personal JourneyDaniel Tunkelang
The document discusses lessons learned from the author's personal journey in search engineering. It covers insights from library science about treating search as an information-seeking context and communicating with users. It also discusses the importance of entity detection and how to leverage corpus features to improve extraction. The author realized that queries vary in difficulty and systems need to recognize this and adapt accordingly. The key takeaway is that search should be treated as a communication problem rather than just a ranking task.
Экономим время и деньги. Тиражные решения и готовые интернет-магазины на «1С-...advantika
Вы считаете, что создание сайта - это несколько месяцев работы, большие вложения и недоступно обычному малому бизнесу? Мы расскажем, как запустить собственный сайт всего за несколько дней без лишних расходов и покажем все многообразие готовых решений для сайтов на 1С-Битрикс. Вам останется только выбрать подходящее вам!
This two-day lesson plan teaches 9th grade students about assessing personal health and fitness. On day one, students are introduced to key terms like flexibility, muscle endurance, BMI, personal fitness, health, and wellness. They play a matching game and take a mini quiz. Day two focuses on flexibility, with students learning how to perform and score the sit-and-reach test. The importance of flexibility and maintaining it is emphasized. Students will be assessed through quizzes, exams, and tracking their own fitness scores over time.
The document defines and provides examples of different types of information systems:
Transaction Processing Systems (TPS) process day-to-day transactions for organizations like banks and retailers. Office Automation Systems (OAS) use computer networks to enable office functions. Knowledge Work Systems (KWS) aid knowledge workers like engineers and scientists. Decision Support Systems (DSS) analyze business data to help managers make decisions. Executive Support Systems (ESS) generate summarized reports for executives. Group Decision Support Systems (GDSS) use digital tools to facilitate collaboration during group meetings and projects.
Управление проектами, или как успеть в срок, не упуская мелочейadvantika
Часто ли в вашей компании возникает необходимость ставить несколько задач одновременно? Сколько времени вы тратите на контроль и планирование работы внутри компании? Хотели бы вы видеть наглядную картину по всем контактам с клиентами и контролировать отработку каждой сделки в единой системе?
Из данного доклада вы узнаете про возможности системы Битрикс24, которая позволяет автоматизировать эти и многие другие потребности современного бизнеса.
The document discusses Florida's no helmet law for motorcyclists and the dangers of not wearing a helmet. It notes that statistical evidence shows Florida is the most dangerous state to drive in, with a fatality rate of 1.25 per 100 million vehicle miles traveled. Hundreds of families lose loved ones to motorcycle accidents in Florida each year. The advantages of wearing a helmet to protect the skull from impact are presented, yet some argue a helmet law would be unconstitutional. Reform is called for to enact a helmet law and reduce accidents.
The document discusses humanity's future in space exploration and discovery. It references the Kepler space telescope and its planet hunting mission, highlighting that it has discovered plenty of planets and continues its work despite being damaged. It encourages visiting the Planet Hunters website to participate in the analysis of Kepler data and the search for new planets. Images included show scenes from space and depictions of sci-fi and reality to contrast imagination with our growing knowledge of the universe.
PENGARUH BAHAN TAMBAH (POLYMER P102) TERHADAP KUAT TEKAN DAN MODULUS ELASTISI...Agil Handayani
Penelitian ini bertujuan mengetahui pengaruh penambahan bahan kimia polymer P102 terhadap karakteristik beton self-compacting baik segar maupun kuat tekan. Dilakukan variasi dosis polymer 0%, 0,5%, 1%, 1,5%, 2% dari berat semen dengan jumlah benda uji 15 buah. Hasil pengujian slump flow menunjukkan campuran 1% polymer memenuhi syarat SCC dengan diameter aliran 661 mm. Pengujian kuat tekan menunjukkan maksimum 27
Cоздать интернет-магазин? – Легко! Возможности платформы «1С-Битрикс» для упр...advantika
В данном докладе, мы покажем как легко и быстро вы можете создать, настроить и запустить интернет-магазин на платформе 1С-Битрикс. Также расскажем о новых возможностях интернет-магазина от 1С-Битрикс и технологии, которая позволит ускорить ваш сайт в 100 раз.
How Many Dimensions of Compatibility?: Discovering What's Right for Your Users Marliese Thomas
How Many Dimensions of Compatibility: Discovering What's Right for Your Users
This was the keynote address at University of Houston Library's Discovery Day Camp on June 10, 2011. Some extra screenshots of admin interfaces have been added after the actual presentation.
Reinventing Laboratory Data To Be Bigger, Smarter & FasterOSTHUS
• Big Data technologies, especially Data Lakes are spreading across many industries at the moment with the hopes that they will provide unprecedented capabilities for data integration and data analytics
• In spite of the popularity and promise of these technology approaches, many early adopters are not seeking immediate solutions to their complex problems. Answers are not simply appearing – this talk will explore this issue more thoroughly
• Of the 4 V’s of Big Data, Data Variety and Data Veracity (uncertainty) are of increasing importance. These can cause barriers to successful integration strategies , which, in turn, can lead to poorly performing analytics.
• The problems of Variety and Veracity can be tackled using a new form of Data Science which combines formal ontologies with statistical heuristics. This talk will explore some key features of these approaches and how they can be developed together in symbiosis – leading to complex models that allow for improved analytics – or as we call it Big Analysis.
• The end result is improved capture of data types/sources, from laboratory instrument data, to clinical data, to regulatory rules & submissions, all the way to business drivers for the enterprise. In the end providing advanced analytics capabilities that can be built as modules and expand across an enterprise.
Current challenges facing the implementation of NoSQL-type databases involve how to use advanced rule-based analytics on large tables and key value stores, where metadata is often sparse. Graph databases or triple stores are great for utilizing one’s metadata, but are often computationally inefficient compared to NoSQL stores. To combat this problem, Modus Operandi will showcase a Predicate Store inside of its MOVIA product that can run advanced, first-order level, logical rule sets and queries against large tables or column stores directly to provide a scalable, rapid and advanced data analytics for cloud applications. This provides graph complexity in terms of content with the performance and scalability of NoSQL data approaches. The system also allows for both statistical algorithms as well as logic-based rule sets to be run concurrently, meaning that a host of parallel analytics can be run at once, providing deep analysis over a multitude of important pattern types.
The document discusses collaborating across disciplines like data science, behavioral science, design, and product management. It provides examples of how machine learning and behavioral insights are used in products to personalize experiences. It advocates bringing these fields together to design using a transdisciplinary approach, with an emphasis on testing hypotheses, designing experiments collaboratively, and defining problems jointly. The document outlines principles for collaborating across teams and designing with human behaviors and limitations in mind.
Data science and analytics have evolved significantly in recent years. While tools and techniques have advanced, failure and frustration remain common in many data science projects. Only 8% of projects are described as successful, despite 73% of executives believing data science will revolutionize their business. Common reasons for failure include high costs, dependence on legacy systems, siloed data, and a lack of clear business objectives or executive support. To improve outcomes, the document argues that data science must apply other disciplines beyond just tools and techniques. It discusses concepts like data philosophy, expertise, networks, identity, and space that could help solve shortcomings if integrated into how problems are approached and teams are structured.
This document provides an overview of a community and directory for all things related to artificial intelligence. It summarizes discussions from various speakers on topics like the future of AI, how AI will impact jobs and business, creativity and art, ethics and privacy, and career opportunities in AI. The document also provides advice on how to prepare for a career in AI through attending events, learning from newsletters and podcasts, getting experience, and networking with people working in the field.
This document discusses several ethical issues related to technology enhanced learning (TEL). It addresses issues that may arise as educational technology becomes more sophisticated, as well as existing issues due to increasing cultural diversity. Some key ethical frameworks and approaches discussed include Judeo-Christian ethics, Buddhist ethics, Kant's categorical imperative, human rights approaches, and care ethics. Technical effects of TEL related to issues of transparency, accountability, data protection, and more. Cultural issues discussed include cultural dominance, assumptions around interaction and performance, and potential ethnocentric biases. The document advocates examining cultural assumptions and recognizing that ethics are not one-size-fits-all.
The document provides an overview of data science, artificial intelligence, and machine learning. It discusses the differences between AI and machine learning, as well as what constitutes data science. Examples are given of applying data science in healthcare to study the impact of remote patient monitoring devices and identify high-risk patients. State-of-the-art machine learning techniques like neural networks, deep learning, and deep reinforcement learning are also overviewed. Finally, the document discusses how companies are using data science and AI and provides next steps for learning and applying these fields.
Opening talk at the "Interdisciplinary Data Resources to Address the Challenges of Urban Living” Workshop at the Urban Big Data Centre, University of Glasgow, 4 April 2016
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
This is my presentation on the Topic "Data Science - An emerging Stream of Science with its Spreading Reach & Impact". I have compiled and collected different statistics and data from different sources. This may be useful for students and those who might be interested in this field of Study.
Data Science Institutes : kelly technologies is the best Data Science Training Institutes in Hyderabad. Providing Data Science training by real time faculty in Hyderabad.
Presentation given at the HEA Social Sciences learning and teaching summit 'Exploring the implications of ‘the era of big data’ for learning and teaching'.
A blog post outlining the issues discussed at the summit is available via: http://bit.ly/1lCBUIB
This document provides a SWOT (Strengths, Weaknesses, Opportunities, Threats) analysis of open-source software (OSS). It notes that while OSS has strengths like low costs and a large community, its biggest challenge is supporting software without strict governance rules. OSS also faces threats from established institutions resistant to change and past perceptions that liken it to outdated systems. The document advocates for "next generation" library catalogs that go beyond just finding information to helping users understand content through services like analyzing word frequencies, phrases and numeric metadata.
This document provides an overview and introduction to text analysis using the GATE (General Architecture for Text Engineering) toolkit. It discusses key concepts in natural language processing (NLP) like entity recognition, relation extraction, and event recognition. It also describes GATE's rule-based information extraction system called ANNIE and how it can be used to perform named entity recognition on text data using gazetteers and handcrafted grammars. The document demonstrates running ANNIE on a sample text document in GATE to annotate and extract named entities.
Framework for understanding data science.pdfMichael Brodie
The objective of my research is to provide a framework with which the data science community can understand, define, and develop data science as a field of inquiry. The framework is based on the classical reference framework (axiology, ontology, epistemology, methodology) used for 200 years to define knowledge discovery paradigms and disciplines in the humanities, sciences, algorithms, and now data science. I augmented it for automated problem-solving with (methods, technology, community) [1][2]. The resulting data science reference framework is used to define the data science knowledge discovery paradigm in terms of the philosophy of data science addressed in [1] and the data science problem-solving paradigm, i.e., the data science method, and the data science problem-solving workflow, addressed in [2][3]. The framework is a much called for unifying framework for data science as it contains the components required to define data science. For insights to better understand data science, this paper uses the framework to define the emerging, often enigmatic, data science problem-solving paradigm and workflow, and to compare them with their well-understood scientific counterparts – scientific problem-solving paradigm and workflow.
The objective of my current research [4] is to develop a 21st C re-conception of data. Unlike 20th C data that are assets, 21st C data science data is phenomenological – a resource in which to discover phenomena and their properties, previously and otherwise impossible.
[1] Brodie, M.L., Defining data science: a new field of inquiry, arXiv preprint https://doi.org/10.48550/arXiv.2306.16177 Harvard University, July 2023.
[2] Brodie, M.L., A data science axiology: the nature, value, and risks of data science, arXiv preprint http://arxiv.org/abs/2307.10460 Harvard University, July 2023.
[3] Brodie, M.L., A framework for understanding data science, arXiv preprint https://arxiv.org/abs/2403.00776 Harvard University, March 2024.
[4] Brodie, M.L., Re-conceiving data in the 21st Century. Work in progress, Harvard University.
The document provides instructions for using slides that introduce generative AI and its use in assessments. The slides are designed as a 1-hour lecture that can be delivered live or circulated asynchronously. Staff should delete instruction slides before using the presentation with students. The AI and You teaching toolkit provides further guidance.
What are Cognitive Applications? What is exciting about them? They represent a whole new way of human computer interaction and acting on data insights. Introducing IBM Watson and how to develop Cognitive applications. AI, Machine Learning compared and contrasted.
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
This document discusses the changing landscape of data science and AI in biomedicine. Some key points:
- We are at a tipping point where data science is becoming a driver of biomedical research rather than just a tool. Biomedical researchers need to become data scientists.
- Data science is interdisciplinary and touches every field due to the rise of digital data. It requires openness, translation of findings, and consideration of responsibilities like algorithmic bias.
- Advances like AlphaFold2 show the power of large collaborative efforts combining data, computing resources, engineering, and domain expertise. This points to the need for public-private partnerships and new models of open data sharing.
- The definition of
2820181Phil 2 Puzzles and ParadoxesProf. Sven B.docxlorainedeserre
This document discusses Grelling's Paradox, which is a semantic paradox similar to the liar paradox. It defines the terms "heterological" and "autological" and examines whether the term "heterological" is itself heterological. It leads to a contradiction, as both assuming that "heterological" is and is not heterological results in a contradiction. The document then shifts topics to discuss future trends in training and development, including increased use of new technologies, sustainability initiatives, and advances in areas like neuroscience and data analysis that will influence the field.
Similar to Data Models And Details About Open Data (20)
Dandelion Hashtable: beyond billion requests per second on a commodity serverAntonios Katsarakis
This slide deck presents DLHT, a concurrent in-memory hashtable. Despite efforts to optimize hashtables, that go as far as sacrificing core functionality, state-of-the-art designs still incur multiple memory accesses per request and block request processing in three cases. First, most hashtables block while waiting for data to be retrieved from memory. Second, open-addressing designs, which represent the current state-of-the-art, either cannot free index slots on deletes or must block all requests to do so. Third, index resizes block every request until all objects are copied to the new index. Defying folklore wisdom, DLHT forgoes open-addressing and adopts a fully-featured and memory-aware closed-addressing design based on bounded cache-line-chaining. This design offers lock-free index operations and deletes that free slots instantly, (2) completes most requests with a single memory access, (3) utilizes software prefetching to hide memory latencies, and (4) employs a novel non-blocking and parallel resizing. In a commodity server and a memory-resident workload, DLHT surpasses 1.6B requests per second and provides 3.5x (12x) the throughput of the state-of-the-art closed-addressing (open-addressing) resizable hashtable on Gets (Deletes).
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor IvaniukFwdays
At this talk we will discuss DDoS protection tools and best practices, discuss network architectures and what AWS has to offer. Also, we will look into one of the largest DDoS attacks on Ukrainian infrastructure that happened in February 2022. We'll see, what techniques helped to keep the web resources available for Ukrainians and how AWS improved DDoS protection for all customers based on Ukraine experience
Must Know Postgres Extension for DBA and Developer during MigrationMydbops
Mydbops Opensource Database Meetup 16
Topic: Must-Know PostgreSQL Extensions for Developers and DBAs During Migration
Speaker: Deepak Mahto, Founder of DataCloudGaze Consulting
Date & Time: 8th June | 10 AM - 1 PM IST
Venue: Bangalore International Centre, Bangalore
Abstract: Discover how PostgreSQL extensions can be your secret weapon! This talk explores how key extensions enhance database capabilities and streamline the migration process for users moving from other relational databases like Oracle.
Key Takeaways:
* Learn about crucial extensions like oracle_fdw, pgtt, and pg_audit that ease migration complexities.
* Gain valuable strategies for implementing these extensions in PostgreSQL to achieve license freedom.
* Discover how these key extensions can empower both developers and DBAs during the migration process.
* Don't miss this chance to gain practical knowledge from an industry expert and stay updated on the latest open-source database trends.
Mydbops Managed Services specializes in taking the pain out of database management while optimizing performance. Since 2015, we have been providing top-notch support and assistance for the top three open-source databases: MySQL, MongoDB, and PostgreSQL.
Our team offers a wide range of services, including assistance, support, consulting, 24/7 operations, and expertise in all relevant technologies. We help organizations improve their database's performance, scalability, efficiency, and availability.
Contact us: info@mydbops.com
Visit: https://www.mydbops.com/
Follow us on LinkedIn: https://in.linkedin.com/company/mydbops
For more details and updates, please follow up the below links.
Meetup Page : https://www.meetup.com/mydbops-databa...
Twitter: https://twitter.com/mydbopsofficial
Blogs: https://www.mydbops.com/blog/
Facebook(Meta): https://www.facebook.com/mydbops/
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...Jason Yip
The typical problem in product engineering is not bad strategy, so much as “no strategy”. This leads to confusion, lack of motivation, and incoherent action. The next time you look for a strategy and find an empty space, instead of waiting for it to be filled, I will show you how to fill it in yourself. If you’re wrong, it forces a correction. If you’re right, it helps create focus. I’ll share how I’ve approached this in the past, both what works and lessons for what didn’t work so well.
The Department of Veteran Affairs (VA) invited Taylor Paschal, Knowledge & Information Management Consultant at Enterprise Knowledge, to speak at a Knowledge Management Lunch and Learn hosted on June 12, 2024. All Office of Administration staff were invited to attend and received professional development credit for participating in the voluntary event.
The objectives of the Lunch and Learn presentation were to:
- Review what KM ‘is’ and ‘isn’t’
- Understand the value of KM and the benefits of engaging
- Define and reflect on your “what’s in it for me?”
- Share actionable ways you can participate in Knowledge - - Capture & Transfer
How information systems are built or acquired puts information, which is what they should be about, in a secondary place. Our language adapted accordingly, and we no longer talk about information systems but applications. Applications evolved in a way to break data into diverse fragments, tightly coupled with applications and expensive to integrate. The result is technical debt, which is re-paid by taking even bigger "loans", resulting in an ever-increasing technical debt. Software engineering and procurement practices work in sync with market forces to maintain this trend. This talk demonstrates how natural this situation is. The question is: can something be done to reverse the trend?
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillLizaNolte
HERE IS YOUR WEBINAR CONTENT! 'Mastering Customer Journey Management with Dr. Graham Hill'. We hope you find the webinar recording both insightful and enjoyable.
In this webinar, we explored essential aspects of Customer Journey Management and personalization. Here’s a summary of the key insights and topics discussed:
Key Takeaways:
Understanding the Customer Journey: Dr. Hill emphasized the importance of mapping and understanding the complete customer journey to identify touchpoints and opportunities for improvement.
Personalization Strategies: We discussed how to leverage data and insights to create personalized experiences that resonate with customers.
Technology Integration: Insights were shared on how inQuba’s advanced technology can streamline customer interactions and drive operational efficiency.
ScyllaDB is making a major architecture shift. We’re moving from vNode replication to tablets – fragments of tables that are distributed independently, enabling dynamic data distribution and extreme elasticity. In this keynote, ScyllaDB co-founder and CTO Avi Kivity explains the reason for this shift, provides a look at the implementation and roadmap, and shares how this shift benefits ScyllaDB users.
In our second session, we shall learn all about the main features and fundamentals of UiPath Studio that enable us to use the building blocks for any automation project.
📕 Detailed agenda:
Variables and Datatypes
Workflow Layouts
Arguments
Control Flows and Loops
Conditional Statements
💻 Extra training through UiPath Academy:
Variables, Constants, and Arguments in Studio
Control Flow in Studio
"Scaling RAG Applications to serve millions of users", Kevin GoedeckeFwdays
How we managed to grow and scale a RAG application from zero to thousands of users in 7 months. Lessons from technical challenges around managing high load for LLMs, RAGs and Vector databases.
"Choosing proper type of scaling", Olena SyrotaFwdays
Imagine an IoT processing system that is already quite mature and production-ready and for which client coverage is growing and scaling and performance aspects are life and death questions. The system has Redis, MongoDB, and stream processing based on ksqldb. In this talk, firstly, we will analyze scaling approaches and then select the proper ones for our system.
"NATO Hackathon Winner: AI-Powered Drug Search", Taras KlobaFwdays
This is a session that details how PostgreSQL's features and Azure AI Services can be effectively used to significantly enhance the search functionality in any application.
In this session, we'll share insights on how we used PostgreSQL to facilitate precise searches across multiple fields in our mobile application. The techniques include using LIKE and ILIKE operators and integrating a trigram-based search to handle potential misspellings, thereby increasing the search accuracy.
We'll also discuss how the azure_ai extension on PostgreSQL databases in Azure and Azure AI Services were utilized to create vectors from user input, a feature beneficial when users wish to find specific items based on text prompts. While our application's case study involves a drug search, the techniques and principles shared in this session can be adapted to improve search functionality in a wide range of applications. Join us to learn how PostgreSQL and Azure AI can be harnessed to enhance your application's search capability.
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsDianaGray10
Join us to learn how UiPath Apps can directly and easily interact with prebuilt connectors via Integration Service--including Salesforce, ServiceNow, Open GenAI, and more.
The best part is you can achieve this without building a custom workflow! Say goodbye to the hassle of using separate automations to call APIs. By seamlessly integrating within App Studio, you can now easily streamline your workflow, while gaining direct access to our Connector Catalog of popular applications.
We’ll discuss and demo the benefits of UiPath Apps and connectors including:
Creating a compelling user experience for any software, without the limitations of APIs.
Accelerating the app creation process, saving time and effort
Enjoying high-performance CRUD (create, read, update, delete) operations, for
seamless data management.
Speakers:
Russell Alfeche, Technology Leader, RPA at qBotic and UiPath MVP
Charlie Greenberg, host
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Session 1 - Intro to Robotic Process Automation.pdfUiPathCommunity
👉 Check out our full 'Africa Series - Automation Student Developers (EN)' page to register for the full program:
https://bit.ly/Automation_Student_Kickstart
In this session, we shall introduce you to the world of automation, the UiPath Platform, and guide you on how to install and setup UiPath Studio on your Windows PC.
📕 Detailed agenda:
What is RPA? Benefits of RPA?
RPA Applications
The UiPath End-to-End Automation Platform
UiPath Studio CE Installation and Setup
💻 Extra training through UiPath Academy:
Introduction to Automation
UiPath Business Automation Platform
Explore automation development with UiPath Studio
👉 Register here for our upcoming Session 2 on June 20: Introduction to UiPath Studio Fundamentals: https://community.uipath.com/events/details/uipath-lagos-presents-session-2-introduction-to-uipath-studio-fundamentals/
Session 1 - Intro to Robotic Process Automation.pdf
Data Models And Details About Open Data
1. Datasets, Data Models, and Useful
Facts In Drawing conclusions
If there's a delay and you have your phone keep in mind that
Facebook has a language setting called English(Pirate)
Banging your head against a wall burns 150 calories an hour.
By Michael Bostwick
2. You've had some experience with technology.
We cant cover it all, so be sure to read the manual.
Google is a great search engine!
“Never memorize something that you can look up.”
― Albert Einstein
Feel free to ask questions or share insight.
Assumptions
4. The study of where
information comes
from, what it
represents and how it
can be turned into a
valuable resource in
the creation of
business and IT
strategies.
Data Science
5.
6. The Great Chain Of Being By Plato
Over time we moved to Porphyrian tree
An example is someone's family tree
History Of Human Knowledge
Described in more detail
very well in a TED Talk by
Manuel Lima
7. More and More things are based
on Networks
http://cpan-explorer.org/
Networks Metaphor
8. From a Ted Talk By Eric Berlow and Sean Gourley
(this model expands to see connections, it processed language)
TED Talk Map
9. More Maps At socialexplorer.com
Mapping Of Social Ties
From A Ted Talk By Dave Troy About Social Mapping
10. kdnuggets.com listing(and industry site)
caesar0301 awesome public datasets
data.gov
code for kansas city github wiki
reddit.com datasets
blog.visual.ly data sources
Open StreetMaps Data
opensecrets.org a watchdog group
wiki's linked data
wiki data a nosql listing of different relationships
trulia local map
Data Sets
12. Freedom of Information Act (FOIA)
Missouri Sunshine Law
Many states have similar laws.
We the people petitions
Media and Other Public Posting Options
instigative journalism postings
craigslist
Getting More And New Data
13. A theory that seeks
to explain how,
why, and at what
rate new ideas and
technology spread
through cultures.
Diffusion Of Innovation
14. Innovation
Any idea, practice, or object that is perceived as new by an
individual or other unit of adoption could be considered an
innovation available for study.
Adopters
In most case adopters are individuals, but can also be
organizations
Communication channels
Time
Social system
Components Of Diffusion Of
Innovations
15. Knowledge
The individual first view into to the innovation
Persuasion
The individual is interested in the innovation and actively seeks related information/details.
Decision
The individual takes the concept of the change and weighs the advantages/disadvantages of
using the innovation and decides whether to adopt or reject the innovation.
This stage is very hard to get metrics on.
Implementation
They use it, how much and where is based on the situation. During this stage the individual also
determines the usefulness of the innovation and may search for further information about it.
Confirmation
The individual his/her decision to continue using the innovation. This stage is both intrapersonal
(may cause cognitive dissonance) and interpersonal, confirmation the group has made the right
decision.
Adoption Process For Diffusion
Of Innovation
16. We heard about it on the news
Officials began to push the use of it
Some people couldn't get it to work, so they contact
there friends and family
Issues made some people upset
Now its working, and lots of people use it
Some people still just use there work insurance
Example Change.gov
17. We all should remember this from science class
Empirical Research Model
18. The model describes the meaning of its instances.
A conceptual schema is a high-level description of a
business's informational needs.
Normally described with entity relationship(E.R)
diagrams
Common in software development
Semantic Data Model
20. A very good explanation is given
on ted
The Math is lengthy
Cities are very hard to kill
All companies die
Companies and cities are like animals
Growth is linked to population
The growth of population is linked to
other factors
Geoffrey West Urban Scaling Model
21. pace of life decree's
as you get bigger
double the size of a
city, and get a 15%
increase of things
Growth Is Hard To Sustain
22. To avoid collapse, major
innovation take place
Cycles of innovation to avoid
collapse, you have to keep
innovating, or collapse
Cycles Of Innovation
23. Please Feel Free To Share Models Or
Patterns
Anti patterns are also approaches that
shouldn’t be taken!
Open Invitation To Share Models
24. UML
Universal Modeling Language
PlantUML
draw.io
Different subjects have there own models!
Algorithms were excluded
Open MIT Courses
Khanacademy Algorithms
Making New Models
25. If we have data, let’s look at data. If all we have
are opinions, let’s go with mine.
– Jim Barksdale, former Netscape CEO
Torture the data, and it will confess to anything
– Ronald Coase, Economics, Nobel Prize Laureate
Data Can Be BAD!
26. Business Failures
Systemic Failure
Loss Of Trust
An example is the bad data used in the
autism drug study
Possible Harms
27. False Correlation
Emotional Appeal
Question demographics
Look at relationships instead
Confirmation Bias
Posta Hac Ergo Proctor Hoc
(after which therefore because of which)
after something not because of it
Common Mistakes
28. Show Your work so others can see the
mistakes
Showing your work also shows what you
might have missed
Data has limits
Data doesn't create meaning, people do
Use Critical Thinking
29. Should get its own talk
Good Starting Tour of M.I.
Microsoft has a very cool cloud framework
out
Python and R, are very common tools for
using machine learning
Machine learning is great for making
predictive calls
Machine Learning
30. Some databases have this feature built in!
QGIS
An open source GIS viewer
ArchGIS
A very common GIS application, that’s licensed
Google Earth
KML Files
Open Street Maps
Can take user submit information
It also allows downloading of all of there data
Spatial Mapping
31. Relation Database Management System
ACID (Atomicity, Consistency, Isolation, Durability)
Commonly SQL
Standard Query Language
Most RDMS systems have a bulk loading tool
Lots Of Examples
Oracle
MS Sql Server
Postgress
RDMS
32. These databases duplicate data
There great for large data sets
Big Data is normally done by this
There great for distributed information
They can scale to handle endless data
Lots of Open Source Options
MongoDB
CouchDB
Non Relational Databases
33. R
Python
Perl
Bash
Windows Powershell
PHP And Javascript
Scripting Languages
34. C#
C++ (and C)
Java
There are a lot of solutions on github and sourceforge,
that can be changed and expanded
Some are frameworks like eclipse frameworks
Compiled Programming
Languages