The document summarizes a keynote presentation about scholarly document processing and how it can support both fast, intuitive thinking (System 1) as well as slow, analytical thinking (System 2). It discusses three main challenges for supporting System 2 thinking: 1) discovering new ideas and connections, 2) uncovering discrepancies in existing research, and 3) understanding the provenance of ideas. It argues that tools are needed to better support exploring alternative perspectives, systematic reviews of negative results, and tracing the history of terms and concepts. The goal is for document processing to facilitate both intuitive leaps and deeper analytical thinking in research.
All Things Open 2014 - Day 1
Wednesday, October 22nd, 2014
Arfon Smith
Chief Scientist for GitHub
Open Government/Open Data
What Academia Can Learn from Open Source
Find more by Arfon here: https://speakerdeck.com/arfon
Provocation talk given by David De Roure at the e-Science Institute, Edinburgh, 19 November 2007 as part of the ESI Strategic Advisory Board Workshop or "e-Science Think Tank"
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Big Data Spain
This document discusses trends in data science in 2016, including how data science is moving into new use cases such as medicine, politics, government, and neuroscience. It also covers trends in hardware, generalized libraries, leveraging workflows, and frameworks that could enable a big leap ahead. The document discusses learning trends like MOOCs, inverted classrooms, collaborative learning, and how O'Reilly Media is embracing Jupyter notebooks. It also covers measuring distance between learners and subject communities, and the importance of both people and automation working together.
This document discusses anonymous peer review and its benefits. It recommends using peer review for student assignments by having students apply rubrics and criteria to evaluate past samples without knowing whose work they are reviewing. The document outlines the steps for conducting anonymous peer review, including having students submit assignments anonymously using numbered tickets and then drawing tickets to review and provide feedback on anonymized assignments. The goal of anonymous peer review is to help students learn to apply objective criteria to evaluate work and give constructive feedback without bias.
The document outlines the weekly content and activities for a course on emerging practices. It includes 12 weeks of content covering topics like technology and society, social robots, human-robot interaction, adoption and diffusion, and speculative futures. Students will analyze and evaluate case studies of emerging technologies, present findings, and participate in activities like co-design workshops. Assignments include analysis reports, evaluation reports, and a final synthesis report. Resources provided include links to example technologies and videos for additional research.
All Things Open 2014 - Day 1
Wednesday, October 22nd, 2014
Arfon Smith
Chief Scientist for GitHub
Open Government/Open Data
What Academia Can Learn from Open Source
Find more by Arfon here: https://speakerdeck.com/arfon
Provocation talk given by David De Roure at the e-Science Institute, Edinburgh, 19 November 2007 as part of the ESI Strategic Advisory Board Workshop or "e-Science Think Tank"
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Big Data Spain
This document discusses trends in data science in 2016, including how data science is moving into new use cases such as medicine, politics, government, and neuroscience. It also covers trends in hardware, generalized libraries, leveraging workflows, and frameworks that could enable a big leap ahead. The document discusses learning trends like MOOCs, inverted classrooms, collaborative learning, and how O'Reilly Media is embracing Jupyter notebooks. It also covers measuring distance between learners and subject communities, and the importance of both people and automation working together.
This document discusses anonymous peer review and its benefits. It recommends using peer review for student assignments by having students apply rubrics and criteria to evaluate past samples without knowing whose work they are reviewing. The document outlines the steps for conducting anonymous peer review, including having students submit assignments anonymously using numbered tickets and then drawing tickets to review and provide feedback on anonymized assignments. The goal of anonymous peer review is to help students learn to apply objective criteria to evaluate work and give constructive feedback without bias.
The document outlines the weekly content and activities for a course on emerging practices. It includes 12 weeks of content covering topics like technology and society, social robots, human-robot interaction, adoption and diffusion, and speculative futures. Students will analyze and evaluate case studies of emerging technologies, present findings, and participate in activities like co-design workshops. Assignments include analysis reports, evaluation reports, and a final synthesis report. Resources provided include links to example technologies and videos for additional research.
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...jybufgofasfbkpoovh
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. Descriptive statistics. Inferential statistics. Python Libraries for Data Science.
The document discusses the development of K-HAL, an AI assistant, through three versions. K-HAL V1.0 used a simple ontology and knowledge base to represent spacecraft piloting knowledge. K-HAL V2.0 leveraged existing online ontologies and linked data to expand its knowledge. It also used crowdsourcing to add new facts. K-HAL V3.0 would represent processes by modeling interactions between a virtual choir, conductor, and listeners. The conclusion advocates reusing and sharing ontologies and data to benefit the semantic web community.
This document discusses open science and how NASA can accelerate and expand its scientific impact through more open practices. It defines open science as collaborative sharing of data, information and knowledge to accelerate research. It advocates for making the scientific process, data, software, and findings more accessible and transparent through practices like open data, open source software, open cyberinfrastructure, open access publications, and public communication. It acknowledges open science can be difficult but provides examples of how different groups are implementing more open approaches.
Klaus Tochtermann presented on science 2.0-enabled paradigms for information delivery. He discussed how the social and semantic web will impact research and publication processes. Specifically, he examined how tools like social media, blogs and wikis can support open science practices. Tochtermann also explored challenges of reputation management, citizen science, open data licensing and ensuring reliability of linked open data. The presentation proposed moving from taking users to content to bringing personalized content to users through viral information delivery enabled by semantic technologies.
The document discusses steps in the social research process, including moving from a research question to research design. It covers choosing indicators to measure concepts, and planning the project by breaking it into phases and using a Gantt chart to manage timelines and resources. Philosophy influences research design and outcomes, so researchers should reflect on their assumptions. Overall planning and defining indicators are important for effective research design.
"Strategies for Scientific Communication" part of #SSNCE Research Scholars’ C...IIIT Hyderabad
Topics covered:
Picking a real-world problem
Blogademia
Create public good
Be active about research on social media
Develop / be part of a group / community
Use tools / platforms effectively
Presenting
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...GigaScience, BGI Hong Kong
This document discusses the growing reproducibility crisis in scientific research and proposes open data and transparent methods as solutions. It notes several studies finding a lack of reproducibility in published research due to inaccessible data and methods. Consequences of this include a large and growing number of retractions as well as perceptions that some regions have higher rates of fraudulent research due to lack of transparency. The document argues that open data, software and peer review can help address these issues by enabling credit for sharing and reusing research objects. Examples of initiatives that aim to reward open practices and improve reproducibility through open data publishing and peer review are also provided.
The document discusses the evolution of e-Research, from early forms like supercomputing and grid computing to current approaches like big data. It argues that e-Research will become so integrated into normal research practices that it will effectively disappear as a separate field. The document also provides examples of how computational approaches are transforming different domains like the sciences, social sciences, humanities, and arts. It analyzes the digitization of cultural artifacts and large-scale text analysis as novel advances enabled by e-Research.
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...Erin Robinson
Robinson, Erin. (2020, December). Putting Data to Work: Moving science forward together beyond where we thought possible!. Presented at the 2020 Fall American Geophysical Union Meeting (AGU), Remote: Zenodo. http://doi.org/10.5281/zenodo.4315009
The Evolution of e-Research: Machines, Methods and MusicDavid De Roure
The document summarizes the evolution of e-research over three generations from 1981 to the present. The first generation saw early adopters using tools within their disciplines with some reuse. The second generation was characterized by increased reuse of tools, data and methods across areas. The third generation is defined by radical sharing of resources globally across any discipline through social networks and reusable research objects. The document also discusses several specific projects and tools that exemplify each generation of e-research including myExperiment, Galaxy, and SALAMI.
This document discusses the emergence of "Data Science" programs and initiatives at major universities over the past few years. It notes that while the activities described in these new Data Science programs often overlap significantly with traditional statistics work, the programs tend to marginalize or distance themselves from academic statistics departments. The document explores different perspectives on the relationship between Data Science and Statistics, with some statisticians arguing they are essentially doing the same work, while others see Data Science as broader or separate from Statistics. The author aims to present a vision for Data Science that better incorporates the historical contributions and ongoing work of Statistics.
The document discusses reproducible research and outlines steps to achieve it. It describes Jon Claerbout's early work on reproducible research using LaTeX and makefiles at Stanford University. It also discusses the Madagascar project for multidimensional data analysis using open-source tools. Finally, it summarizes several articles from a special issue of CiSE magazine on reproducible computational experiments using tools like R, MATLAB, Python and Fortran.
The document discusses tools used for forecasting the future, including trend analysis, the Delphi method, and scenario planning. It then provides an overview of scenario planning and describes how to develop scenarios around a focal issue and critical uncertainties. Examples of professional forecasting products are also presented, such as the Horizon Report and Map of Future Forces, which analyze trends to predict emerging technologies.
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA DATASCIENCE
The document provides a brief history of data science as an academic field, outlines current data science educational programs, and predicts future trends. It notes that data science emerged from a series of academic concepts in the 1960s and gained popularity in the 2000s. Currently, there are three main educational paradigms for data science: business analytics, data science, and data engineering. The document also discusses best practices in data science education and predicts that future trends may include increased specialization, collaboration between education and industry, and evolving skills demands.
Being an Open Scholar in a Connected WorldStian Håklev
This document discusses the benefits of open scholarship in a connected world. It argues that open access to research articles makes information more accessible to broader audiences, including the general public and students. When data and research notes are openly shared online, it can enable unexpected reuse and collaboration. However, the current academic publishing and reward systems may not fully incentivize open scholarship. The document calls for exploring new models of peer review, metrics of impact, and ways of publishing research to make the scholarly process more transparent and collaborative.
Leveraging Generative AI to Drive Nonprofit InnovationTechSoup
In this webinar, participants learned how to utilize Generative AI to streamline operations and elevate member engagement. Amazon Web Service experts provided a customer specific use cases and dived into low/no-code tools that are quick and easy to deploy through Amazon Web Service (AWS.)
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...TechSoup
Whether you're new to SEO or looking to refine your existing strategies, this webinar will provide you with actionable insights and practical tips to elevate your nonprofit's online presence.
More Related Content
Similar to Scholarly Document Processing Research in the Age of AI
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. D...jybufgofasfbkpoovh
Data; Data manipulation, sorting, grouping, rearranging. Plotting the data. Descriptive statistics. Inferential statistics. Python Libraries for Data Science.
The document discusses the development of K-HAL, an AI assistant, through three versions. K-HAL V1.0 used a simple ontology and knowledge base to represent spacecraft piloting knowledge. K-HAL V2.0 leveraged existing online ontologies and linked data to expand its knowledge. It also used crowdsourcing to add new facts. K-HAL V3.0 would represent processes by modeling interactions between a virtual choir, conductor, and listeners. The conclusion advocates reusing and sharing ontologies and data to benefit the semantic web community.
This document discusses open science and how NASA can accelerate and expand its scientific impact through more open practices. It defines open science as collaborative sharing of data, information and knowledge to accelerate research. It advocates for making the scientific process, data, software, and findings more accessible and transparent through practices like open data, open source software, open cyberinfrastructure, open access publications, and public communication. It acknowledges open science can be difficult but provides examples of how different groups are implementing more open approaches.
Klaus Tochtermann presented on science 2.0-enabled paradigms for information delivery. He discussed how the social and semantic web will impact research and publication processes. Specifically, he examined how tools like social media, blogs and wikis can support open science practices. Tochtermann also explored challenges of reputation management, citizen science, open data licensing and ensuring reliability of linked open data. The presentation proposed moving from taking users to content to bringing personalized content to users through viral information delivery enabled by semantic technologies.
The document discusses steps in the social research process, including moving from a research question to research design. It covers choosing indicators to measure concepts, and planning the project by breaking it into phases and using a Gantt chart to manage timelines and resources. Philosophy influences research design and outcomes, so researchers should reflect on their assumptions. Overall planning and defining indicators are important for effective research design.
"Strategies for Scientific Communication" part of #SSNCE Research Scholars’ C...IIIT Hyderabad
Topics covered:
Picking a real-world problem
Blogademia
Create public good
Be active about research on social media
Develop / be part of a group / community
Use tools / platforms effectively
Presenting
Scott Edmunds talk at AIST: Overcoming the Reproducibility Crisis: and why I ...GigaScience, BGI Hong Kong
This document discusses the growing reproducibility crisis in scientific research and proposes open data and transparent methods as solutions. It notes several studies finding a lack of reproducibility in published research due to inaccessible data and methods. Consequences of this include a large and growing number of retractions as well as perceptions that some regions have higher rates of fraudulent research due to lack of transparency. The document argues that open data, software and peer review can help address these issues by enabling credit for sharing and reusing research objects. Examples of initiatives that aim to reward open practices and improve reproducibility through open data publishing and peer review are also provided.
The document discusses the evolution of e-Research, from early forms like supercomputing and grid computing to current approaches like big data. It argues that e-Research will become so integrated into normal research practices that it will effectively disappear as a separate field. The document also provides examples of how computational approaches are transforming different domains like the sciences, social sciences, humanities, and arts. It analyzes the digitization of cultural artifacts and large-scale text analysis as novel advances enabled by e-Research.
AGU Leptoukh Lecture: Putting Data to Work: Moving science forward together b...Erin Robinson
Robinson, Erin. (2020, December). Putting Data to Work: Moving science forward together beyond where we thought possible!. Presented at the 2020 Fall American Geophysical Union Meeting (AGU), Remote: Zenodo. http://doi.org/10.5281/zenodo.4315009
The Evolution of e-Research: Machines, Methods and MusicDavid De Roure
The document summarizes the evolution of e-research over three generations from 1981 to the present. The first generation saw early adopters using tools within their disciplines with some reuse. The second generation was characterized by increased reuse of tools, data and methods across areas. The third generation is defined by radical sharing of resources globally across any discipline through social networks and reusable research objects. The document also discusses several specific projects and tools that exemplify each generation of e-research including myExperiment, Galaxy, and SALAMI.
This document discusses the emergence of "Data Science" programs and initiatives at major universities over the past few years. It notes that while the activities described in these new Data Science programs often overlap significantly with traditional statistics work, the programs tend to marginalize or distance themselves from academic statistics departments. The document explores different perspectives on the relationship between Data Science and Statistics, with some statisticians arguing they are essentially doing the same work, while others see Data Science as broader or separate from Statistics. The author aims to present a vision for Data Science that better incorporates the historical contributions and ongoing work of Statistics.
The document discusses reproducible research and outlines steps to achieve it. It describes Jon Claerbout's early work on reproducible research using LaTeX and makefiles at Stanford University. It also discusses the Madagascar project for multidimensional data analysis using open-source tools. Finally, it summarizes several articles from a special issue of CiSE magazine on reproducible computational experiments using tools like R, MATLAB, Python and Fortran.
The document discusses tools used for forecasting the future, including trend analysis, the Delphi method, and scenario planning. It then provides an overview of scenario planning and describes how to develop scenarios around a focal issue and critical uncertainties. Examples of professional forecasting products are also presented, such as the Horizon Report and Map of Future Forces, which analyze trends to predict emerging technologies.
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA DATASCIENCE
The document provides a brief history of data science as an academic field, outlines current data science educational programs, and predicts future trends. It notes that data science emerged from a series of academic concepts in the 1960s and gained popularity in the 2000s. Currently, there are three main educational paradigms for data science: business analytics, data science, and data engineering. The document also discusses best practices in data science education and predicts that future trends may include increased specialization, collaboration between education and industry, and evolving skills demands.
Being an Open Scholar in a Connected WorldStian Håklev
This document discusses the benefits of open scholarship in a connected world. It argues that open access to research articles makes information more accessible to broader audiences, including the general public and students. When data and research notes are openly shared online, it can enable unexpected reuse and collaboration. However, the current academic publishing and reward systems may not fully incentivize open scholarship. The document calls for exploring new models of peer review, metrics of impact, and ways of publishing research to make the scholarly process more transparent and collaborative.
Similar to Scholarly Document Processing Research in the Age of AI (20)
Leveraging Generative AI to Drive Nonprofit InnovationTechSoup
In this webinar, participants learned how to utilize Generative AI to streamline operations and elevate member engagement. Amazon Web Service experts provided a customer specific use cases and dived into low/no-code tools that are quick and easy to deploy through Amazon Web Service (AWS.)
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...TechSoup
Whether you're new to SEO or looking to refine your existing strategies, this webinar will provide you with actionable insights and practical tips to elevate your nonprofit's online presence.
Chapter wise All Notes of First year Basic Civil Engineering.pptxDenish Jangid
Chapter wise All Notes of First year Basic Civil Engineering
Syllabus
Chapter-1
Introduction to objective, scope and outcome the subject
Chapter 2
Introduction: Scope and Specialization of Civil Engineering, Role of civil Engineer in Society, Impact of infrastructural development on economy of country.
Chapter 3
Surveying: Object Principles & Types of Surveying; Site Plans, Plans & Maps; Scales & Unit of different Measurements.
Linear Measurements: Instruments used. Linear Measurement by Tape, Ranging out Survey Lines and overcoming Obstructions; Measurements on sloping ground; Tape corrections, conventional symbols. Angular Measurements: Instruments used; Introduction to Compass Surveying, Bearings and Longitude & Latitude of a Line, Introduction to total station.
Levelling: Instrument used Object of levelling, Methods of levelling in brief, and Contour maps.
Chapter 4
Buildings: Selection of site for Buildings, Layout of Building Plan, Types of buildings, Plinth area, carpet area, floor space index, Introduction to building byelaws, concept of sun light & ventilation. Components of Buildings & their functions, Basic concept of R.C.C., Introduction to types of foundation
Chapter 5
Transportation: Introduction to Transportation Engineering; Traffic and Road Safety: Types and Characteristics of Various Modes of Transportation; Various Road Traffic Signs, Causes of Accidents and Road Safety Measures.
Chapter 6
Environmental Engineering: Environmental Pollution, Environmental Acts and Regulations, Functional Concepts of Ecology, Basics of Species, Biodiversity, Ecosystem, Hydrological Cycle; Chemical Cycles: Carbon, Nitrogen & Phosphorus; Energy Flow in Ecosystems.
Water Pollution: Water Quality standards, Introduction to Treatment & Disposal of Waste Water. Reuse and Saving of Water, Rain Water Harvesting. Solid Waste Management: Classification of Solid Waste, Collection, Transportation and Disposal of Solid. Recycling of Solid Waste: Energy Recovery, Sanitary Landfill, On-Site Sanitation. Air & Noise Pollution: Primary and Secondary air pollutants, Harmful effects of Air Pollution, Control of Air Pollution. . Noise Pollution Harmful Effects of noise pollution, control of noise pollution, Global warming & Climate Change, Ozone depletion, Greenhouse effect
Text Books:
1. Palancharmy, Basic Civil Engineering, McGraw Hill publishers.
2. Satheesh Gopi, Basic Civil Engineering, Pearson Publishers.
3. Ketki Rangwala Dalal, Essentials of Civil Engineering, Charotar Publishing House.
4. BCP, Surveying volume 1
This document provides an overview of wound healing, its functions, stages, mechanisms, factors affecting it, and complications.
A wound is a break in the integrity of the skin or tissues, which may be associated with disruption of the structure and function.
Healing is the body’s response to injury in an attempt to restore normal structure and functions.
Healing can occur in two ways: Regeneration and Repair
There are 4 phases of wound healing: hemostasis, inflammation, proliferation, and remodeling. This document also describes the mechanism of wound healing. Factors that affect healing include infection, uncontrolled diabetes, poor nutrition, age, anemia, the presence of foreign bodies, etc.
Complications of wound healing like infection, hyperpigmentation of scar, contractures, and keloid formation.
How Barcodes Can Be Leveraged Within Odoo 17Celine George
In this presentation, we will explore how barcodes can be leveraged within Odoo 17 to streamline our manufacturing processes. We will cover the configuration steps, how to utilize barcodes in different manufacturing scenarios, and the overall benefits of implementing this technology.
A Visual Guide to 1 Samuel | A Tale of Two HeartsSteve Thomason
These slides walk through the story of 1 Samuel. Samuel is the last judge of Israel. The people reject God and want a king. Saul is anointed as the first king, but he is not a good king. David, the shepherd boy is anointed and Saul is envious of him. David shows honor while Saul continues to self destruct.
🔥🔥🔥🔥🔥🔥🔥🔥🔥
إضغ بين إيديكم من أقوى الملازم التي صممتها
ملزمة تشريح الجهاز الهيكلي (نظري 3)
💀💀💀💀💀💀💀💀💀💀
تتميز هذهِ الملزمة بعِدة مُميزات :
1- مُترجمة ترجمة تُناسب جميع المستويات
2- تحتوي على 78 رسم توضيحي لكل كلمة موجودة بالملزمة (لكل كلمة !!!!)
#فهم_ماكو_درخ
3- دقة الكتابة والصور عالية جداً جداً جداً
4- هُنالك بعض المعلومات تم توضيحها بشكل تفصيلي جداً (تُعتبر لدى الطالب أو الطالبة بإنها معلومات مُبهمة ومع ذلك تم توضيح هذهِ المعلومات المُبهمة بشكل تفصيلي جداً
5- الملزمة تشرح نفسها ب نفسها بس تكلك تعال اقراني
6- تحتوي الملزمة في اول سلايد على خارطة تتضمن جميع تفرُعات معلومات الجهاز الهيكلي المذكورة في هذهِ الملزمة
واخيراً هذهِ الملزمة حلالٌ عليكم وإتمنى منكم إن تدعولي بالخير والصحة والعافية فقط
كل التوفيق زملائي وزميلاتي ، زميلكم محمد الذهبي 💊💊
🔥🔥🔥🔥🔥🔥🔥🔥🔥
Gender and Mental Health - Counselling and Family Therapy Applications and In...PsychoTech Services
A proprietary approach developed by bringing together the best of learning theories from Psychology, design principles from the world of visualization, and pedagogical methods from over a decade of training experience, that enables you to: Learn better, faster!
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Scholarly Document Processing Research in the Age of AI
1. Scholarly Document Processing
Research in the Age of AI
Min-Yen Kan
National University of Singapore
Slides @ http://bit.ly/kan-sdp22
17 Oct 2022 3rd SDProc @ COLING 2022 1
2. Warning: This is a participatory keynote!
…that is, there is a pop quiz. You have been warned! 🤣🤣
Please access the poll at
http://pollev.com/knmnyn
Do skip the name registration
17 Oct 2022 3rd SDProc @ COLING 2022 2
3. Fast and Slow
Kahneman and Tversky, Thinking Fast and Slow
Daniel Kahneman
System 1 System 2
Fast Slow
Automatic Controlled
Intuitive Analytical
Parallel Serial
Associative Logical
Slides @ http://bit.ly/kan-sdp22
17 Oct 2022 3rd SDProc @ COLING 2022 3
4. Neural Nets – System 1
Andrew Ng
System 1 System 2
Fast Slow
Automatic Controlled
Intuitive Analytical
Parallel Serial
Associative Logical
Slides @ http://bit.ly/kan-sdp22
17 Oct 2022 3rd SDProc @ COLING 2022 4
✏Your Turn: What do you think the
loss function of research should be?
5. The Age of Accelerations
Friedman, Thank You for Being Late
His three accelerations
• Moore’s law
• Globalization
• Mother Nature
Kurzweil’s “Second half of the
chessboard”
Thomas Friedman
17 Oct 2022 3rd SDProc @ COLING 2022 5
6. The Age of Accelerations
Friedman, Thank You for Being Late
Our three accelerants (take your pick)
• arXiv
• PapersWithCode
• (Semantic) Scholar
Kurzweil’s “Second half of the
chessboard”
Thomas Friedman
17 Oct 2022 3rd SDProc @ COLING 2022 6
7. What about System 2?
Are there scholarly problems that
require more analytical, logical,
and sustained thinking?
Absolutely!
17 Oct 2022 3rd SDProc @ COLING 2022 7
System 1 System 2
Fast Slow
Automatic Controlled
Intuitive Analytical
Parallel Serial
Associative Logical
8. A Brief
History of
Science
A fast primer
Photo Credit: 오힘찬 @ WikimediaCommons (CC SA-BY 4.0)
17 Oct 2022 3rd SDProc @ COLING 2022 8
9. From Astrology to Astronomy
Ptolemy Copernicus
17 Oct 2022 3rd SDProc @ COLING 2022 9
10. Astronomy 2.0 and 3.0
Galileo and Kepler Newton
17 Oct 2022 3rd SDProc @ COLING 2022 10
11. Kuhn – challenging accumulative growth
Kuhn, The Structure of Scientific Revolutions
Paradigm Shift
Normal Science
To think about: What age is SDP in now?
17 Oct 2022 3rd SDProc @ COLING 2022 11
Thomas Kuhn
12. Science is a verb
… in the sense that it is a method (activity) involving the making of hypotheses,
the design of experiments and the analysis of data. But a critical part of the
scientific process is the conversation phase after the experimentation is
done. Scientists share their findings with the broader community through
publications or presentations at meetings. What happens next is a back-and-
forth discussion including a critique of methods or interpretation, and a
comparison with previous findings.
If there are flaws in the experimental design or interpretation, … scientists
need to be willing to hear and respond to feedback. If there are conflicting
results, it may require additional hypothesis making and
experimentation. Only when the conversation runs its course do the conclusions
become a part of accepted scientific understanding.
17 Oct 2022 3rd SDProc @ COLING 2022 12
Steve Savage’s post on Science 2.0
13. Science in the Age of AI
17 Oct 2022 3rd SDProc @ COLING 2022 13
Video Source: Video by RedEye450 from Pexels
14. Loss function of research
Beam search analogy
Accelerations make the
gradient steeper
Overload favors System 1
Publish or Perish
Suboptimal local minima
17 Oct 2022 3rd SDProc @ COLING 2022 14
15. What affordances does AI yield?
Better System 1!
e.g., Neural Architecture
Search (NAS)
17 Oct 2022 3rd SDProc @ COLING 2022 15
Figures from Ren et al. 2021 ACM Comput. Surv. 37(4)
16. System 1 and 2 work together
One way: System 1 brings data for System 2 to deliberate with
System 2 gives feedback (end-to-end) to System 1
Neither system is perfect but the whole is better than the parts
(multi-view learning)
Let’s connect it back to our societal research loss function
17 Oct 2022 3rd SDProc @ COLING 2022 16
19. 1. Discovering Adjacent Possibles
Liquid Networks
The Slow Hunch
Serendipity
Exaptation
Steven Johnson
17 Oct 2022 3rd SDProc @ COLING 2022 19
Johnson, Where Good Ideas Come From
20. Confirmation Bias in Recommender Systems
We train search and recommender systems, but on historical data
This results in confirmation bias (more like this)
But if we want to afford System 2 thinking, we want
serendipitous recommendation (to learn what we don’t know)
Need to capture multimodal evidence and laborious human assessment
17 Oct 2022 3rd SDProc @ COLING 2022 20
21. Next Gen Platforms
For discoverability:
• Setting exploration criteria
• Reproducible search
• Suggesting alternative paths and terminologies
For discussion, collaboration and crediting:
• “Calm” for Scientists (arXiv off)
• MIT Deliberatorium
• Big Science initiatives
17 Oct 2022 3rd SDProc @ COLING 2022 21
& Toolkits (not everyone wants to do it
globally and publicly)
24. 2. Uncovering Discrepancies –
Countering the Streetlight Effect
What happens next is a back-and-forth discussion including a
critique of methods or interpretation, and a comparison with
previous findings.
If there are flaws in the experimental design or interpretation,
… scientists need to be willing to hear and respond to feedback.
Communities do not sufficiently report negative
results
Difficult to organize discrepancies for systematic
exploration, thus we cannot question the
establishment
17 Oct 2022 3rd SDProc @ COLING 2022 24
Related: Davies et al. Promoting inclusive metrics of success and
impact to dismantle a discriminatory reward system in science.
Photo by Guilherme Rossi @ Pexels
25. Aids for Paradigm Shifts
Systematic reviews for what doesn’t work
“Our techniques improve on Dataset X but less well on Y.
Uncover choices left (un)stated by authors
“We compare against current relevant baselines [1, 2, 3]”
Machine reading of Limitations and Ethical Consideration sections
17 Oct 2022 3rd SDProc @ COLING 2022 25
, 3
26. 17 Oct 2022 3rd SDProc @ COLING 2022 26
https://symplectic.co.uk/guest-blog/research-data-mechanics/
✏Your Turn: What
about citation
half-life? How is it
changing?
27. 3. Finding Provenance
Perhaps surprisingly, citations half-life has lengthened in most fields.
Does this mean that we are finding the right works?
17 Oct 2022 3rd SDProc @ COLING 2022 27
Martín-Martín et al. Back to the past: on the shoulders of an academic search engine giant
Davis and Cochran Cited Half-Life of the Journal Literature
28. Aids for Finding Provenance
Paraphrase, terminology and simplification services in situ
(stay tuned for Head’s keynote)
Lower the barrier for communication. Platforms for easier means for
discussing problems and knowing of furthering research
Who cares about my research?
(Multi-hop) Trace terms and ideas back to their source
17 Oct 2022 3rd SDProc @ COLING 2022 28
29. We need to participate in Science!
This is the last activity, I promise!
http://pollev.com/knmnyn
17 Oct 2022 3rd SDProc @ COLING 2022 29
✏Your Turn: Please use
your own judgement to
rank the three
challenges presented
30. Conclusion: SDP needs to
get involved in Science
Let’s be deliberate about our tools for
science. Care to discuss?
Diversity and inclusion are also important
for holistic progress in science.
Thanks to:
WING members:
George Huang Po-Wei
Yajing Yang
Abhinav Ramesh Kashyap
Muthu Kumar Chandrasekaran
Collaborators:
Min Song
Namhee Kim
and many more previous WING members,
and my family, and
all of you who’ve attended physically and
virtually to listen!
Thank you!
Yanxia Qin
Aminesh Prasad
Kazunari Sugiyama
Juyoung An
Slides @ http://bit.ly/kan-sdp22
17 Oct 2022 3rd SDProc @ COLING 2022 30