SlideShare a Scribd company logo
1 of 38
Download to read offline
Research Fast and Slow
Min-Yen Kan
National University of Singapore
Slides available at http://bit.ly/kan-coling18
Fast and Slow
Daniel Kahneman
System 1 System 2
Fast Slow
Automatic Controlled
Intuitive Analytical
Parallel Serial
Associative Logical
24 Aug 2018 COLING (Santa Fe) 2/38
Slides: http://bit.ly/kan-coling18
All tracings CC BY 4 by Min
The Neural Net – judgements in 1 second
Andrew Ng
System 1 System 2
Fast Slow
Automatic Controlled
Intuitive Analytical
Parallel Serial
Associative Logical
To think about: what’s the loss function
of research?
24 Aug 2018 COLING (Santa Fe) 3/38
Slides: http://bit.ly/kan-coling18
The Age of Accelerations
Thank You for Being Late
Kurzweil’s “Second half of the
chessboard”
His three accelerations
• Moore’s law
• Globalization
• Mother Nature
Thomas Friedman24 Aug 2018 COLING (Santa Fe) 4/38
Slides: http://bit.ly/kan-coling18
Research Fast
Nearly instantaneous galaxy-wide communications
accelerates exchange of information and ideas, greatly
enhancing the research capabilities of scientists and spying
power of agents (+100% Research)
“arXiv”
Research Fast
Image originally from Microprose software
24 Aug 2018 COLING (Santa Fe) 5/38
https://symplectic.co.uk/guest-blog/research-data-mechanics/
Research Faster
24 Aug 2018 COLING (Santa Fe) 6/38
Research Too Fast?
We have our own age of
accelerations for research
1. arXiv – “kiasu”
2. GitHub / Jupyter Notebook
3. Shared Tasks
Overheard Yejin Choi: “Working on
sQUaD makes people feel better”
“We shape our buildings, thereafter our
buildings shapes us.” – Winston Churchill
Image courtesy Recursion Pharmaceuticals
24 Aug 2018 COLING (Santa Fe) 7/38
Reviewer 2 must be stopped
Reviewers also favor defensible positions.
Safer to use standard metrics for acceptance.
Understandable for journals but for
conferences?
Provenance unknown. Source: Reviewer 2 Must Be Stopped Facebook Group
Here is a neat summary of the current state of
interpretations of skull comparisons in biological
anthropology:
24 Aug 2018 COLING (Santa Fe) 8/38
Loss function of research
Beam search analogy
Accelerations make the gradient
steeper
Overload favors the convenient
But scholars can’t do random
restart
Suboptimal local minima
24 Aug 2018 COLING (Santa Fe) 9/38
Q: When your ML doesn’t outperform,
what do you do?
Your turn: select an answer.
1. Sigh.
2. Gather more data. Maybe your model needs more data to find
proper weights.
3. Simplify your model. Maybe its easier just to do more
hyperparameter searching.
4. Read your arXiv feed.
5. Study your problem more. Let your text speak.
24 Aug 2018 COLING (Santa Fe) 10/38
CL = Why?
NLP = What?
*
Jason Eisner has
a better answer
on Quora
24 Aug 2018 COLING (Santa Fe) 11/38
Let your text speak
How would a NLP system help scholars with this?
Digital library systems today are still monolithic scalable CMS
• They serve authors: prestige, directed discovery (search)
• And provision appropriate access and provenance: OpenURL, DOI
What about the users? We can do better.
Let’s turn our science loose on our science.
Poster credit: JH Miller of Westinghouse Electric via Wikipedia
…but ask the right questions
24 Aug 2018 COLING (Santa Fe) 12/38
Digital Library
Towards the introspective
Photo Credits: sizumaru @ Flickr24 Aug 2018 COLING (Santa Fe) 13
The (Slow) Process of Research
Enthusiasm
Discovering
Reading
Collecting Data
Sensemaking
Comparing
Publishing
Communicating
Maintaining
Title
Authors
Abstract
Introduction
Related Work
Method
Evaluation
References
Ancillary Artifacts
24 Aug 2018 COLING (Santa Fe) 14/38
Discovering:
Argumentative Zoning
Teufel and Kan (2011) Robust Argumentative Zoning
for Sensemaking in Scholarly Documents.
24 Aug 2018 COLING (Santa Fe) 15/38
COLING (Santa Fe)
Reading:
Summarizing
Scholarly Documents
Photo Credits: ElvisWalksWithUs @ Tumbler
Reading and
understanding is hard.
Scientific discoveries
are written for peer
experts to understand,
but not for novices to
learn.
Let’s use NLP to digest
scholarly work to make
them easier to understand
for beginners
24 Aug 2018 16/38
Summarizing Scholarly Documents
Why not just use the abstract?
24 Aug 2018 COLING (Santa Fe) 17/38
Let’s look at our example
It follows a specific presentation convention
There is a logical document structure
INTRODUCTION
…
THE BASICS OF THE BǍ CONSTRUCTION
…
ANALYSIS
Rhetorical Structure Theory*
*
Apologies to Mann and Thompson24 Aug 2018 COLING (Santa Fe) 18/38
Background
Aim
Abstracting abstracts
These are the “ground
truth” for a summary of a
single paper
Want to select sentences
that overlap significantly
with the abstract
But even here there is
structure within an
abstract
24 Aug 2018 COLING (Santa Fe) 19/38
Revisiting Text Rank
Capitalize on the
conventional structure in
documents
Identify the logical
structure of a scientific
document
Find the best sentences
within the sections of the
document
Aim
Analysis
Background
Jun Ping Ng, Praveen Bysani, Ziheng Lin, Min-Yen Kan and Chew Lim
Tan (2011) SWING: Exploiting Category-Specific Information for
Guided Summarization. In Proceedings of the Text Analysis
Conference 2011 (TAC 2011). Gaithersburg, Maryland, USA.
24 Aug 2018 COLING (Santa Fe) 20/38
What about scholarly documents?
They have references and citations
“citation sentences”
Often describe a paper from the
community’s point of view
A representation of a key point of a work
Citation sentences and in- article
sentences have complementary purposes:
Results and evaluations usually not
mentioned in citation sentences.
Sentences in a paper that describe its
method are usually too detailed for a
summary.
24 Aug 2018 COLING (Santa Fe) 21/38
What do they refer to when
they cite?
- Citation Provenance.
The whole paper? A specific
section of the paper?
Why do people cite?
- Citation Function.
What functions are there?
Sensemaking:
Citation Analysis
24 Aug 2018 COLING (Santa Fe) 22/38
The context of the citation
influences its importance: via
its function and referent
• Its location (which section)
• Its syntactic structure
For discussion of the modern Mandarin disposal construction the reader
is referred to Li and Thompson (1981), Cheng (1988), Sybesma (1999),
Bender (2000), among many others.
In addition, we proposed a set of new features that used verb class
information induced from the frame files of the Chinese PropBank, as well
as features that were designed to exploit the grammatical constructions that
Semantic Role Labeling of Chinese Predicates are unique to Chinese,
specifically the BA (Bender 2000) and BEI (Huang 1999) constructions.
Low, H. W. (2011). Citation provenance. Bachelor’s thesis, National University of Singapore.
Yulianto, E. (2012). Citation typing. Bachelor’s thesis, National University of Singapore.
24 Aug 2018 COLING (Santa Fe) 23/38
Discovery:
Serendipitous
Recommendation
Photo Credits: Virtueel Platform @ Flickr, CC BY-SA 2.0
Interactions with others play an
important role in populating the
slow hunch
24 Aug 2018 COLING (Santa Fe) 24/38
Tell me something I don’t know
Collaborative Filtering: “People like you also read…”
Q: Whom do we ask for help?
Your Co-Author NetworkDissimilar Users in the Community
Kazunari Sugiyama and Min-Yen Kan (2011) Serendipitous Recommendation for Scholarly Papers Considering Relations Among
Researchers. In Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2011) Short Papers.
24 Aug 2018 COLING (Santa Fe) 25/38
We attend conferences (like this one) in part
to help learn from each other.
A key artifact is the slide presentation, which
often summarizes the work in an accessible
manner.
But they:
• Miss important technical
details
Idea: Use both together
Sensemaking:
Conference
Presentations
Photo by myself, cleared to use by subject
24 Aug 2018 COLING (Santa Fe) 26/38
Presentations and their Relationship to Documents
Document in focus Presentation in Focus24 Aug 2018 COLING (Santa Fe) 27/38
Better to juxtapose both media together in a fine-grained manner.
Output: an alignment map
Aligning Documents to their Presentations
24 Aug 2018
COLING (Santa Fe)
Bamdad Bahrani and Min-Yen Kan (2013) Multimodal Alignment of Scholarly Documents and Their Presentations. In Proceedings
of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA. pp. 281-284. Short Paper.
Min-Yen Kan (2007) SlideSeer: A Digital Library of Aligned Document and Presentation Pairs, In Proceedings of the Joint
Conference on Digital Libraries (JCDL '07). Vancouver, Canada, June, pp. 81-90.
28/38
Slide Demographics
24 Aug 2018 COLING (Santa Fe) 29/38
Error Analysis: It’s a multimodal problem
Slide Type Common reason % Incorrectly Aligned by
Baseline
Nil Doesn’t know where to align
à align to best fit
64%
Outline Name of some sections in it
à align to longest one
36%
Image Very little text available 81%
Drawing Noisy data: lots of shapes and text
boxes
53%
Table Little text, noisy data 50%
Text 24%
24 Aug 2018 COLING (Santa Fe) 30/38
Research Slow
Ignore the gradient:
Open up the Adjacent Possible
Develops the Slow Hunch through
persistent exposure to ancillary evidence
The Swedish Fika: collision allows scholars
to benefit from each other
24 Aug 2018 COLING (Santa Fe) 31/38
Eureka Moment
Not the
Photo Credit: Daniel Dionne @ Flickr, CC BY-SA 2.0
24 Aug 2018 COLING (Santa Fe) 32
Where Good Ideas Come From
The Adjacent Possible
Liquid Networks
The Slow Hunch
Serendipity
Error
Exaptation
Platforms
Steven Johnson
24 Aug 2018 COLING (Santa Fe) 33/38
Fuben-Eki
FUrther BENEfit of
a Kind of Inconvenience
Make key operations deliberate
Fosters ownership, self-affirmation
Key idea: synergize with human computational abilities
Kawakami H. Toward System Design based on Benefit of Inconvenience,
Journal of Human Interface, 11(1), 125-134 (2009) (in Japanese).
24 Aug 2018 COLING (Santa Fe) 34/38
Acceleration has it out for us
Adapted from Astro Teller’s graph
Others are public
advocates, but what
about ourselves?
We need to ramp up
our research slow
Use the time gifted
by research fast
Time
RateofChange
Human adaptability
Technology
Research Slow
Research Fast
Government
Individuals
Companies
Humanity
Opportunity
Gap
24 Aug 2018 COLING (Santa Fe) 35/38
Weapons of Math Destruction
A WMD is
• Massive
• Opaque
• No feedback loop
The Class Break
Cathy O’Neil
24 Aug 2018 COLING (Santa Fe) 36/38
Reprise: Research Slow – The Challenge of Story
Be an advocate
Write the story, wonder “what if?”
Blog your opinion or write columns
Stories give context
We are human
Aids association, catalyzes the slow
hunch
Ensembles are better than any
single classifier
Hiyao Miyazaki: Castle in the Sky
24 Aug 2018 COLING (Santa Fe) 37/38
Conclusion: We combine
theory and practice
… nothing works and we don’t know why
Let’s fix that:
Fast begets fast, but we need to consciously
support slow
NLP to do intelligence augmentation (IA),
put scholars on the slow hunch
Close the loop: Let data speak and inform our
models
… v2: It works but we don’t know why… v3: It works and we think we know why
AND SlowResearch Fast
… v4: It works and we think we know why
and we’ll advocate for it
Thanks to:
WING members:
Bamdad Bahrani
Muthu Chandrasekaran
Tao Chen
Emily Bender
Leon Derczynski
Pierre Isabelle
Marti Hearst
Gertjan van Noord
Chris Manning
all those students and faculty at NUS whom
I subjected to a dry run
And research slow technology of
audiobooks and digital commonplace books
Xiangnan He
Aminesh Prasad
Kazunari Sugiyama
Aobo Wang
Yohei Seki
24 Aug 2018 COLING (Santa Fe) 38/38
Slides: http://bit.ly/kan-coling18

More Related Content

What's hot

2011 06-14 cristhian-parra_u_count
2011 06-14 cristhian-parra_u_count2011 06-14 cristhian-parra_u_count
2011 06-14 cristhian-parra_u_countCristhian Parra
 
Reputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersReputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersMicah Altman
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...SC CTSI at USC and CHLA
 
Moving beyond sameAs with PLATO: Partonomy detection for Linked Data
Moving beyond sameAs with PLATO: Partonomy detection for Linked DataMoving beyond sameAs with PLATO: Partonomy detection for Linked Data
Moving beyond sameAs with PLATO: Partonomy detection for Linked DataPrateek Jain
 
A Social Network System based on an Ontology in the Korea Institute of Orient...
A Social Network System based on an Ontology in the Korea Institute of Orient...A Social Network System based on an Ontology in the Korea Institute of Orient...
A Social Network System based on an Ontology in the Korea Institute of Orient...Sang-Kyun Kim
 
Interactive visualization and exploration of network data with gephi
Interactive visualization and exploration of network data with gephiInteractive visualization and exploration of network data with gephi
Interactive visualization and exploration of network data with gephiBernhard Rieder
 
SOC2002 Lecture 12
SOC2002 Lecture 12SOC2002 Lecture 12
SOC2002 Lecture 12Bonnie Green
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveMicah Altman
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in WikidataElena Simperl
 

What's hot (10)

2011 06-14 cristhian-parra_u_count
2011 06-14 cristhian-parra_u_count2011 06-14 cristhian-parra_u_count
2011 06-14 cristhian-parra_u_count
 
Reputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersReputation Management for Early Career Researchers
Reputation Management for Early Career Researchers
 
Sandusky, "Deep Indexing and Discover of Tables and Figures"
Sandusky, "Deep Indexing and Discover of Tables and Figures"Sandusky, "Deep Indexing and Discover of Tables and Figures"
Sandusky, "Deep Indexing and Discover of Tables and Figures"
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
 
Moving beyond sameAs with PLATO: Partonomy detection for Linked Data
Moving beyond sameAs with PLATO: Partonomy detection for Linked DataMoving beyond sameAs with PLATO: Partonomy detection for Linked Data
Moving beyond sameAs with PLATO: Partonomy detection for Linked Data
 
A Social Network System based on an Ontology in the Korea Institute of Orient...
A Social Network System based on an Ontology in the Korea Institute of Orient...A Social Network System based on an Ontology in the Korea Institute of Orient...
A Social Network System based on an Ontology in the Korea Institute of Orient...
 
Interactive visualization and exploration of network data with gephi
Interactive visualization and exploration of network data with gephiInteractive visualization and exploration of network data with gephi
Interactive visualization and exploration of network data with gephi
 
SOC2002 Lecture 12
SOC2002 Lecture 12SOC2002 Lecture 12
SOC2002 Lecture 12
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspective
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in Wikidata
 

Similar to Research Fast and Slow

Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkOCLC Research
 
ARLIS 2010 RLG Partnership Round Table
ARLIS 2010 RLG Partnership Round TableARLIS 2010 RLG Partnership Round Table
ARLIS 2010 RLG Partnership Round TableOCLC Research
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
 
2008 10 Millennials Presentation
2008 10 Millennials Presentation2008 10 Millennials Presentation
2008 10 Millennials PresentationLisa Metzer
 
Annotated Bibliography On Research Methodologies
Annotated Bibliography On Research MethodologiesAnnotated Bibliography On Research Methodologies
Annotated Bibliography On Research MethodologiesJeff Nelson
 
Ethnography, Grounded Theory and Systems Analysis
Ethnography, Grounded Theory and Systems AnalysisEthnography, Grounded Theory and Systems Analysis
Ethnography, Grounded Theory and Systems Analysislinlinlin
 
Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainAngelo Salatino
 
Visual Analysis of Concept Change and Information Diffusion
Visual Analysis of Concept Change and Information DiffusionVisual Analysis of Concept Change and Information Diffusion
Visual Analysis of Concept Change and Information Diffusioninscit2006
 
Big Data and Artificial Intelligence
Big Data and Artificial IntelligenceBig Data and Artificial Intelligence
Big Data and Artificial IntelligenceArthur Charpentier
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesLudo Waltman
 
Open access impact
Open access impactOpen access impact
Open access impactIryna Kuchma
 
Case study jackeline oviedo
Case study   jackeline oviedoCase study   jackeline oviedo
Case study jackeline oviedokaterinegomezj
 
Case study jackeline oviedo
Case study   jackeline oviedoCase study   jackeline oviedo
Case study jackeline oviedokaterinegomezj
 
Epistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsEpistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsSimon Knight
 

Similar to Research Fast and Slow (20)

Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery Network
 
ARLIS 2010 RLG Partnership Round Table
ARLIS 2010 RLG Partnership Round TableARLIS 2010 RLG Partnership Round Table
ARLIS 2010 RLG Partnership Round Table
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
2008 10 Millennials Presentation
2008 10 Millennials Presentation2008 10 Millennials Presentation
2008 10 Millennials Presentation
 
Annotated Bibliography On Research Methodologies
Annotated Bibliography On Research MethodologiesAnnotated Bibliography On Research Methodologies
Annotated Bibliography On Research Methodologies
 
Ethnography, Grounded Theory and Systems Analysis
Ethnography, Grounded Theory and Systems AnalysisEthnography, Grounded Theory and Systems Analysis
Ethnography, Grounded Theory and Systems Analysis
 
Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domain
 
Visual Analysis of Concept Change and Information Diffusion
Visual Analysis of Concept Change and Information DiffusionVisual Analysis of Concept Change and Information Diffusion
Visual Analysis of Concept Change and Information Diffusion
 
Theory in PhD
Theory in PhDTheory in PhD
Theory in PhD
 
Big Data and Artificial Intelligence
Big Data and Artificial IntelligenceBig Data and Artificial Intelligence
Big Data and Artificial Intelligence
 
Peer Review and Science2.0
Peer Review and Science2.0Peer Review and Science2.0
Peer Review and Science2.0
 
Big Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
Big Data Research Trend and Forecast (2005-2015): An Informetrics PerspectiveBig Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
Big Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
 
Valutazione della ricerca
Valutazione della ricerca Valutazione della ricerca
Valutazione della ricerca
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Open access impact
Open access impactOpen access impact
Open access impact
 
Case study jackeline oviedo
Case study   jackeline oviedoCase study   jackeline oviedo
Case study jackeline oviedo
 
Presentación pavel
Presentación  pavelPresentación  pavel
Presentación pavel
 
Case study jackeline oviedo
Case study   jackeline oviedoCase study   jackeline oviedo
Case study jackeline oviedo
 
Epistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsEpistemic networks for Epistemic Commitments
Epistemic networks for Epistemic Commitments
 

Recently uploaded

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 

Research Fast and Slow

  • 1. Research Fast and Slow Min-Yen Kan National University of Singapore Slides available at http://bit.ly/kan-coling18
  • 2. Fast and Slow Daniel Kahneman System 1 System 2 Fast Slow Automatic Controlled Intuitive Analytical Parallel Serial Associative Logical 24 Aug 2018 COLING (Santa Fe) 2/38 Slides: http://bit.ly/kan-coling18 All tracings CC BY 4 by Min
  • 3. The Neural Net – judgements in 1 second Andrew Ng System 1 System 2 Fast Slow Automatic Controlled Intuitive Analytical Parallel Serial Associative Logical To think about: what’s the loss function of research? 24 Aug 2018 COLING (Santa Fe) 3/38 Slides: http://bit.ly/kan-coling18
  • 4. The Age of Accelerations Thank You for Being Late Kurzweil’s “Second half of the chessboard” His three accelerations • Moore’s law • Globalization • Mother Nature Thomas Friedman24 Aug 2018 COLING (Santa Fe) 4/38 Slides: http://bit.ly/kan-coling18
  • 5. Research Fast Nearly instantaneous galaxy-wide communications accelerates exchange of information and ideas, greatly enhancing the research capabilities of scientists and spying power of agents (+100% Research) “arXiv” Research Fast Image originally from Microprose software 24 Aug 2018 COLING (Santa Fe) 5/38
  • 7. Research Too Fast? We have our own age of accelerations for research 1. arXiv – “kiasu” 2. GitHub / Jupyter Notebook 3. Shared Tasks Overheard Yejin Choi: “Working on sQUaD makes people feel better” “We shape our buildings, thereafter our buildings shapes us.” – Winston Churchill Image courtesy Recursion Pharmaceuticals 24 Aug 2018 COLING (Santa Fe) 7/38
  • 8. Reviewer 2 must be stopped Reviewers also favor defensible positions. Safer to use standard metrics for acceptance. Understandable for journals but for conferences? Provenance unknown. Source: Reviewer 2 Must Be Stopped Facebook Group Here is a neat summary of the current state of interpretations of skull comparisons in biological anthropology: 24 Aug 2018 COLING (Santa Fe) 8/38
  • 9. Loss function of research Beam search analogy Accelerations make the gradient steeper Overload favors the convenient But scholars can’t do random restart Suboptimal local minima 24 Aug 2018 COLING (Santa Fe) 9/38
  • 10. Q: When your ML doesn’t outperform, what do you do? Your turn: select an answer. 1. Sigh. 2. Gather more data. Maybe your model needs more data to find proper weights. 3. Simplify your model. Maybe its easier just to do more hyperparameter searching. 4. Read your arXiv feed. 5. Study your problem more. Let your text speak. 24 Aug 2018 COLING (Santa Fe) 10/38
  • 11. CL = Why? NLP = What? * Jason Eisner has a better answer on Quora 24 Aug 2018 COLING (Santa Fe) 11/38
  • 12. Let your text speak How would a NLP system help scholars with this? Digital library systems today are still monolithic scalable CMS • They serve authors: prestige, directed discovery (search) • And provision appropriate access and provenance: OpenURL, DOI What about the users? We can do better. Let’s turn our science loose on our science. Poster credit: JH Miller of Westinghouse Electric via Wikipedia …but ask the right questions 24 Aug 2018 COLING (Santa Fe) 12/38
  • 13. Digital Library Towards the introspective Photo Credits: sizumaru @ Flickr24 Aug 2018 COLING (Santa Fe) 13
  • 14. The (Slow) Process of Research Enthusiasm Discovering Reading Collecting Data Sensemaking Comparing Publishing Communicating Maintaining Title Authors Abstract Introduction Related Work Method Evaluation References Ancillary Artifacts 24 Aug 2018 COLING (Santa Fe) 14/38
  • 15. Discovering: Argumentative Zoning Teufel and Kan (2011) Robust Argumentative Zoning for Sensemaking in Scholarly Documents. 24 Aug 2018 COLING (Santa Fe) 15/38
  • 16. COLING (Santa Fe) Reading: Summarizing Scholarly Documents Photo Credits: ElvisWalksWithUs @ Tumbler Reading and understanding is hard. Scientific discoveries are written for peer experts to understand, but not for novices to learn. Let’s use NLP to digest scholarly work to make them easier to understand for beginners 24 Aug 2018 16/38
  • 17. Summarizing Scholarly Documents Why not just use the abstract? 24 Aug 2018 COLING (Santa Fe) 17/38
  • 18. Let’s look at our example It follows a specific presentation convention There is a logical document structure INTRODUCTION … THE BASICS OF THE BǍ CONSTRUCTION … ANALYSIS Rhetorical Structure Theory* * Apologies to Mann and Thompson24 Aug 2018 COLING (Santa Fe) 18/38
  • 19. Background Aim Abstracting abstracts These are the “ground truth” for a summary of a single paper Want to select sentences that overlap significantly with the abstract But even here there is structure within an abstract 24 Aug 2018 COLING (Santa Fe) 19/38
  • 20. Revisiting Text Rank Capitalize on the conventional structure in documents Identify the logical structure of a scientific document Find the best sentences within the sections of the document Aim Analysis Background Jun Ping Ng, Praveen Bysani, Ziheng Lin, Min-Yen Kan and Chew Lim Tan (2011) SWING: Exploiting Category-Specific Information for Guided Summarization. In Proceedings of the Text Analysis Conference 2011 (TAC 2011). Gaithersburg, Maryland, USA. 24 Aug 2018 COLING (Santa Fe) 20/38
  • 21. What about scholarly documents? They have references and citations “citation sentences” Often describe a paper from the community’s point of view A representation of a key point of a work Citation sentences and in- article sentences have complementary purposes: Results and evaluations usually not mentioned in citation sentences. Sentences in a paper that describe its method are usually too detailed for a summary. 24 Aug 2018 COLING (Santa Fe) 21/38
  • 22. What do they refer to when they cite? - Citation Provenance. The whole paper? A specific section of the paper? Why do people cite? - Citation Function. What functions are there? Sensemaking: Citation Analysis 24 Aug 2018 COLING (Santa Fe) 22/38
  • 23. The context of the citation influences its importance: via its function and referent • Its location (which section) • Its syntactic structure For discussion of the modern Mandarin disposal construction the reader is referred to Li and Thompson (1981), Cheng (1988), Sybesma (1999), Bender (2000), among many others. In addition, we proposed a set of new features that used verb class information induced from the frame files of the Chinese PropBank, as well as features that were designed to exploit the grammatical constructions that Semantic Role Labeling of Chinese Predicates are unique to Chinese, specifically the BA (Bender 2000) and BEI (Huang 1999) constructions. Low, H. W. (2011). Citation provenance. Bachelor’s thesis, National University of Singapore. Yulianto, E. (2012). Citation typing. Bachelor’s thesis, National University of Singapore. 24 Aug 2018 COLING (Santa Fe) 23/38
  • 24. Discovery: Serendipitous Recommendation Photo Credits: Virtueel Platform @ Flickr, CC BY-SA 2.0 Interactions with others play an important role in populating the slow hunch 24 Aug 2018 COLING (Santa Fe) 24/38
  • 25. Tell me something I don’t know Collaborative Filtering: “People like you also read…” Q: Whom do we ask for help? Your Co-Author NetworkDissimilar Users in the Community Kazunari Sugiyama and Min-Yen Kan (2011) Serendipitous Recommendation for Scholarly Papers Considering Relations Among Researchers. In Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2011) Short Papers. 24 Aug 2018 COLING (Santa Fe) 25/38
  • 26. We attend conferences (like this one) in part to help learn from each other. A key artifact is the slide presentation, which often summarizes the work in an accessible manner. But they: • Miss important technical details Idea: Use both together Sensemaking: Conference Presentations Photo by myself, cleared to use by subject 24 Aug 2018 COLING (Santa Fe) 26/38
  • 27. Presentations and their Relationship to Documents Document in focus Presentation in Focus24 Aug 2018 COLING (Santa Fe) 27/38
  • 28. Better to juxtapose both media together in a fine-grained manner. Output: an alignment map Aligning Documents to their Presentations 24 Aug 2018 COLING (Santa Fe) Bamdad Bahrani and Min-Yen Kan (2013) Multimodal Alignment of Scholarly Documents and Their Presentations. In Proceedings of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA. pp. 281-284. Short Paper. Min-Yen Kan (2007) SlideSeer: A Digital Library of Aligned Document and Presentation Pairs, In Proceedings of the Joint Conference on Digital Libraries (JCDL '07). Vancouver, Canada, June, pp. 81-90. 28/38
  • 29. Slide Demographics 24 Aug 2018 COLING (Santa Fe) 29/38
  • 30. Error Analysis: It’s a multimodal problem Slide Type Common reason % Incorrectly Aligned by Baseline Nil Doesn’t know where to align à align to best fit 64% Outline Name of some sections in it à align to longest one 36% Image Very little text available 81% Drawing Noisy data: lots of shapes and text boxes 53% Table Little text, noisy data 50% Text 24% 24 Aug 2018 COLING (Santa Fe) 30/38
  • 31. Research Slow Ignore the gradient: Open up the Adjacent Possible Develops the Slow Hunch through persistent exposure to ancillary evidence The Swedish Fika: collision allows scholars to benefit from each other 24 Aug 2018 COLING (Santa Fe) 31/38
  • 32. Eureka Moment Not the Photo Credit: Daniel Dionne @ Flickr, CC BY-SA 2.0 24 Aug 2018 COLING (Santa Fe) 32
  • 33. Where Good Ideas Come From The Adjacent Possible Liquid Networks The Slow Hunch Serendipity Error Exaptation Platforms Steven Johnson 24 Aug 2018 COLING (Santa Fe) 33/38
  • 34. Fuben-Eki FUrther BENEfit of a Kind of Inconvenience Make key operations deliberate Fosters ownership, self-affirmation Key idea: synergize with human computational abilities Kawakami H. Toward System Design based on Benefit of Inconvenience, Journal of Human Interface, 11(1), 125-134 (2009) (in Japanese). 24 Aug 2018 COLING (Santa Fe) 34/38
  • 35. Acceleration has it out for us Adapted from Astro Teller’s graph Others are public advocates, but what about ourselves? We need to ramp up our research slow Use the time gifted by research fast Time RateofChange Human adaptability Technology Research Slow Research Fast Government Individuals Companies Humanity Opportunity Gap 24 Aug 2018 COLING (Santa Fe) 35/38
  • 36. Weapons of Math Destruction A WMD is • Massive • Opaque • No feedback loop The Class Break Cathy O’Neil 24 Aug 2018 COLING (Santa Fe) 36/38
  • 37. Reprise: Research Slow – The Challenge of Story Be an advocate Write the story, wonder “what if?” Blog your opinion or write columns Stories give context We are human Aids association, catalyzes the slow hunch Ensembles are better than any single classifier Hiyao Miyazaki: Castle in the Sky 24 Aug 2018 COLING (Santa Fe) 37/38
  • 38. Conclusion: We combine theory and practice … nothing works and we don’t know why Let’s fix that: Fast begets fast, but we need to consciously support slow NLP to do intelligence augmentation (IA), put scholars on the slow hunch Close the loop: Let data speak and inform our models … v2: It works but we don’t know why… v3: It works and we think we know why AND SlowResearch Fast … v4: It works and we think we know why and we’ll advocate for it Thanks to: WING members: Bamdad Bahrani Muthu Chandrasekaran Tao Chen Emily Bender Leon Derczynski Pierre Isabelle Marti Hearst Gertjan van Noord Chris Manning all those students and faculty at NUS whom I subjected to a dry run And research slow technology of audiobooks and digital commonplace books Xiangnan He Aminesh Prasad Kazunari Sugiyama Aobo Wang Yohei Seki 24 Aug 2018 COLING (Santa Fe) 38/38 Slides: http://bit.ly/kan-coling18