Aslin.discussion


Jesse Lingeman

Technology Education

Summary

Data coding, analysis, archiving, and sharing for open collaboration

Richard Aslin
University of Rochester

1. What is your hypothesis?

•  9/11 occurred because the intelligence community suffered from a “failure of imagination”
–  Bottom-up data mining (“connecting the dots”)
–  Top-down predictions (“what are vulnerabilities??”)
•  Clearly, you need both
•  Must apply approaches iteratively and repeatedly
2. Observations are DVs (dependent variables)

•  Are the patterns you “see” the ones that are “relevant” or causal?
•  Problem of data sparsity and false correlations
•  Hypothesis testing requires an experiment (manipulating an IV, an independent variable)
•  Tension between “ecology” and “control of variables” (sociology of preferred methods)
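The data-sparsity bullet can be illustrated with a small simulation. This is only a sketch under assumed numbers (10 vs. 1000 samples, 200 candidate variables, Gaussian noise), none of which come from the slides: every variable is generated independently of the outcome, so any correlation found is false by construction.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def max_spurious_r(n_samples, n_variables, rng):
    """Largest |r| between a random outcome and unrelated random variables."""
    outcome = [rng.gauss(0, 1) for _ in range(n_samples)]
    best = 0.0
    for _ in range(n_variables):
        # Each candidate is pure noise, independent of the outcome.
        candidate = [rng.gauss(0, 1) for _ in range(n_samples)]
        best = max(best, abs(pearson_r(candidate, outcome)))
    return best


rng = random.Random(0)  # fixed seed so the run is repeatable
sparse_max = max_spurious_r(n_samples=10, n_variables=200, rng=rng)
dense_max = max_spurious_r(n_samples=1000, n_variables=200, rng=rng)
print(f"10 samples:   max |r| = {sparse_max:.2f}")
print(f"1000 samples: max |r| = {dense_max:.2f}")
```

With few samples and many candidate variables, the strongest correlation is typically large even though nothing is related to anything; with many samples it shrinks toward zero. Observation alone cannot separate such patterns from causal ones, which is why the slide calls for manipulating an IV.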

3. How to expand the hypothesis space?

•  If everyone uses large/standard datasets, then evaluation becomes stagnant (methods are only evaluated with that dataset)
•  If evaluation only uses standard (statistical) tools, same problem of stagnation
•  Is clever visualization the key to hypothesis formation, even with “simple” variables?

TED talk by Deb Roy from MIT

4. When do you give up?

•  Reliance on visual pattern recognition by human coder may not reveal relevant (informative) features (sound spectrogram cannot be “read”)
•  Failure at macro level prompts search for info at micro level (fMRI univariate vs. multivariate analysis): need to “drill down”
•  Failure at micro level may indicate indeterminacy of causal hierarchy (Fodor)
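The macro vs. micro bullet can be sketched with simulated data. Everything here is an assumption made for illustration, not taken from the slides: a region of 20 "voxels", a condition pattern that sums to zero, unit Gaussian noise, and a nearest-centroid classifier standing in for multivariate analysis. "Univariate" is simplified to mean the region-average signal.

```python
import random

N_VOXELS = 20
# Hypothetical condition pattern: alternating +0.5 / -0.5, so it sums to zero
# and leaves the region-average signal unchanged.
PATTERN_A = [0.5 if v % 2 == 0 else -0.5 for v in range(N_VOXELS)]
PATTERN_B = [-p for p in PATTERN_A]


def simulate_trials(pattern, n_trials, rng):
    """Each trial is the condition pattern plus unit-variance Gaussian noise."""
    return [[p + rng.gauss(0, 1) for p in pattern] for _ in range(n_trials)]


def centroid(trials):
    """Voxel-wise mean across trials."""
    return [sum(col) / len(trials) for col in zip(*trials)]


def sq_dist(a, b):
    """Squared Euclidean distance between two voxel patterns."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


rng = random.Random(0)  # fixed seed so the run is repeatable
trials_a = simulate_trials(PATTERN_A, 100, rng)
trials_b = simulate_trials(PATTERN_B, 100, rng)

# Macro level: average over voxels, then compare the two conditions.
region_mean_a = sum(sum(t) / N_VOXELS for t in trials_a) / len(trials_a)
region_mean_b = sum(sum(t) / N_VOXELS for t in trials_b) / len(trials_b)
region_mean_diff = region_mean_a - region_mean_b

# Micro level: nearest-centroid classifier on the voxel pattern,
# trained on the first half of the trials and tested on the second half.
centroid_a = centroid(trials_a[:50])
centroid_b = centroid(trials_b[:50])
test_set = [(t, "A") for t in trials_a[50:]] + [(t, "B") for t in trials_b[50:]]
correct = sum(
    ("A" if sq_dist(t, centroid_a) < sq_dist(t, centroid_b) else "B") == label
    for t, label in test_set
)
accuracy = correct / len(test_set)

print(f"region-average difference: {region_mean_diff:+.3f}")
print(f"pattern classifier accuracy: {accuracy:.2f}")
```

In this toy setup the region average barely differs between conditions, while the voxel pattern separates them well: the information is only visible after "drilling down".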

5. Rules of sharing

•  When does “your” data become accessible by:
–  Your collaborators
–  Friends who ask
–  Strangers
–  Anyone
•  Who gets credit?
•  How should junior researchers “share”? Especially with senior labs that have $$$.


