SlideShare a Scribd company logo
1 of 28
Using data to improve
student research
EasyBib is an automatic
bibliography composer.
Students use it to cite
sources for their research.
We teach information
literacy.
18%
of all student papers include plagiarism1
Source: (1) TurnItIn; (2) Both Sides Now: Librarians Looking at Information Literacy from High School and College.
50%
likelihood of using a credible vs. non-
credible source1
4%
increase in the use of paper mills and
cheating sites1
~16%
of students are adequately prepared for
college.2
That’s how we felt too..
The problem is becoming
bigger.
Unprepared students
make for unprepared
adults.
It’s not just students who
plagiarize:
•Pal Schmitt, former president
of Hungary
•German education minister
•Jayson Blair (former New
York Times writer)
•Jonah Lehrer, journalist and
author
•Fareed Zakaria (reporter,
author, host)
We are in the right place
to figure it out.
Over half of all
students in the
US (40M)
Over half a billion
citations
We asked ourselves the
following questions:
•What are students using in their
research?
•How good are their sources?
•How can we help them?
We started with the
basics._gaq.push([
'citations._trackEvent',
citationTitle,
citationPublisher,
citationId
]);
Here’s what we found.
Top sources 2010
•Wikipedia
•Google
1.The New York Times
2.CIA World Factbook
3.Oracle Thinkquest
4.Buzzle
5.US BLS
6.Dictionary.com
7.CDC
8.PBS
9.eHow
Source: EasyBib Google Analytics Oct 2010-Nov 2010 data.
What could we do?
•Warn them when their source’s
credibility is in question
•Analyze the quality of their full
bibliography
•Make it easier to not plagiarize
•Suggest better sources
Define credibility.
Improve citation quality
Gave students access to
their own analytics
To combat plagiarism, we
built an audit trail for notes
So after all this...
Does it blend (tm) ?
1. Wikipedia
2. Bio.com
3. History.com
4. PBS
5. Mayo Clinic
6. CDC
7. The New York Times
8. BBC
9. CNN
10.WebMD
11.US BLS
• Wikipedia still on top,
but ...
• No content farms, no
Google..
• WebMD is questionable,
but its credibility can be
argued for.
Source: Apr-May 2013 Google Analytics data
We have to admit, it’s getting
better...
We have to admit, it’s getting
better...
Help students find better
sources
How does the Research
engine currently work?
Cloudant (CouchDB)
MySQL
Lucene/Solr
Slow, asynchronous, lots of moving
parts.
Starting to do a bit more
StatsD::increment($metrics);
$response = $rediska->publish(
array('realtime'),
$citation
);
There’s a lot more we can
do, and data will help us.
Cloudant Search
•Full-text search integrated into Cloudant
•Lucene syntax
•Indexing is easy
function(doc){
index("title", doc.title, {"store": "yes"});
}
•Grouping of sources via chained map-reduce
map: function(doc){
if (doc.title){ emit({"title": doc.title}, 1); }
}
reduce: _sum
dbcopy: citationGroup
------
map: function(doc){
if (doc.title && doc.key.title){ emit(doc.value, doc.key.title); }
}
Live data analysis.
Crowdsourcing.
•Use Cloudant Search to power
feedback on sources (# of times
cited in real time, quality of
bibliographies derived from)
•Allow users to submit their own
credibility evaluations and aggregate
results
SourceRank!
Credibility weighting + crowdsourcing
Synchronous & realtime via Cloudant Search
Value nodes based on nearest neighbors
And other things...
Driving growth
We have the largest UGC citation
set. Making this searchable
creates a “moat.”
The more people that use EasyBib,
the better the tool becomes.
What about other data
analytics tools?
Too stretched to learn more complex tools
(looking for easy answers)
Costs (GA is free!)
EMR, Hadoop, Redshift, Cloudant Search:
This is what’s next.
Questions?
Darshan Somashekar
@darshan
darshan@imagineeasy.com

More Related Content

Viewers also liked

Crossing the Chasm (Ikanow - Chicago Summit)
Crossing the Chasm (Ikanow - Chicago Summit)Crossing the Chasm (Ikanow - Chicago Summit)
Crossing the Chasm (Ikanow - Chicago Summit)Open Analytics
 
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...Open Analytics
 
CDM….Where do you start? (OA Cyber Summit)
CDM….Where do you start? (OA Cyber Summit)CDM….Where do you start? (OA Cyber Summit)
CDM….Where do you start? (OA Cyber Summit)Open Analytics
 
An Immigrant’s view of Cyberspace (OA Cyber Summit)
An Immigrant’s view of Cyberspace (OA Cyber Summit)An Immigrant’s view of Cyberspace (OA Cyber Summit)
An Immigrant’s view of Cyberspace (OA Cyber Summit)Open Analytics
 
Using Real-Time Data to Drive Optimization & Personalization
Using Real-Time Data to Drive Optimization & PersonalizationUsing Real-Time Data to Drive Optimization & Personalization
Using Real-Time Data to Drive Optimization & PersonalizationOpen Analytics
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Open Analytics
 
Piwik: An Analytics Alternative (Chicago Summit)
Piwik: An Analytics Alternative (Chicago Summit)Piwik: An Analytics Alternative (Chicago Summit)
Piwik: An Analytics Alternative (Chicago Summit)Open Analytics
 
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...Open Analytics
 
From Insight to Impact (Chicago Summit - Keynote)
From Insight to Impact (Chicago Summit - Keynote)From Insight to Impact (Chicago Summit - Keynote)
From Insight to Impact (Chicago Summit - Keynote)Open Analytics
 
Competing in the Digital Economy
Competing in the Digital EconomyCompeting in the Digital Economy
Competing in the Digital EconomyOpen Analytics
 
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)Open Analytics
 
M&A Trends in Telco Analytics
M&A Trends in Telco AnalyticsM&A Trends in Telco Analytics
M&A Trends in Telco AnalyticsOpen Analytics
 
Cyber after Snowden (OA Cyber Summit)
Cyber after Snowden (OA Cyber Summit)Cyber after Snowden (OA Cyber Summit)
Cyber after Snowden (OA Cyber Summit)Open Analytics
 
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)Open Analytics
 
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...Open Analytics
 

Viewers also liked (15)

Crossing the Chasm (Ikanow - Chicago Summit)
Crossing the Chasm (Ikanow - Chicago Summit)Crossing the Chasm (Ikanow - Chicago Summit)
Crossing the Chasm (Ikanow - Chicago Summit)
 
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
 
CDM….Where do you start? (OA Cyber Summit)
CDM….Where do you start? (OA Cyber Summit)CDM….Where do you start? (OA Cyber Summit)
CDM….Where do you start? (OA Cyber Summit)
 
An Immigrant’s view of Cyberspace (OA Cyber Summit)
An Immigrant’s view of Cyberspace (OA Cyber Summit)An Immigrant’s view of Cyberspace (OA Cyber Summit)
An Immigrant’s view of Cyberspace (OA Cyber Summit)
 
Using Real-Time Data to Drive Optimization & Personalization
Using Real-Time Data to Drive Optimization & PersonalizationUsing Real-Time Data to Drive Optimization & Personalization
Using Real-Time Data to Drive Optimization & Personalization
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Piwik: An Analytics Alternative (Chicago Summit)
Piwik: An Analytics Alternative (Chicago Summit)Piwik: An Analytics Alternative (Chicago Summit)
Piwik: An Analytics Alternative (Chicago Summit)
 
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
 
From Insight to Impact (Chicago Summit - Keynote)
From Insight to Impact (Chicago Summit - Keynote)From Insight to Impact (Chicago Summit - Keynote)
From Insight to Impact (Chicago Summit - Keynote)
 
Competing in the Digital Economy
Competing in the Digital EconomyCompeting in the Digital Economy
Competing in the Digital Economy
 
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
 
M&A Trends in Telco Analytics
M&A Trends in Telco AnalyticsM&A Trends in Telco Analytics
M&A Trends in Telco Analytics
 
Cyber after Snowden (OA Cyber Summit)
Cyber after Snowden (OA Cyber Summit)Cyber after Snowden (OA Cyber Summit)
Cyber after Snowden (OA Cyber Summit)
 
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
 
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
 

Similar to Easybib Open Analytics NYC

The Transition Years: Evaluating Info Lit Skills from High School to College-...
The Transition Years: Evaluating Info Lit Skills from High School to College-...The Transition Years: Evaluating Info Lit Skills from High School to College-...
The Transition Years: Evaluating Info Lit Skills from High School to College-...Imagine Easy Solutions
 
T carse ESOL_October_2013_3D_Research_presentation
T carse ESOL_October_2013_3D_Research_presentationT carse ESOL_October_2013_3D_Research_presentation
T carse ESOL_October_2013_3D_Research_presentationTimCarse
 
Trying to stop the kids using google greg sheaf hslg conference 2013
Trying to stop the kids using google greg sheaf hslg conference 2013Trying to stop the kids using google greg sheaf hslg conference 2013
Trying to stop the kids using google greg sheaf hslg conference 2013hslgcommittee
 
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...Colin Harrison
 
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...ABES
 
The Power of Open Data!
The Power of Open Data!The Power of Open Data!
The Power of Open Data!Renaine Julian
 
How Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesHow Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesJulie Coiro
 
Google & garbage lsta 2012
Google & garbage lsta 2012Google & garbage lsta 2012
Google & garbage lsta 2012Paige Jaeger
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataHamilton Public Library
 
Fight for your right!
Fight for your right!Fight for your right!
Fight for your right!Lynda Kellam
 
Teaching ten steps to better research
Teaching ten steps to better researchTeaching ten steps to better research
Teaching ten steps to better researchlibrarykate
 
@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015Michael Nelson
 
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & EvaluationFSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & EvaluationLorri Mon
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Paul Royster
 
Day 3: Introduction to Information Literacy
Day 3:  Introduction to Information LiteracyDay 3:  Introduction to Information Literacy
Day 3: Introduction to Information LiteracyBuffy Hamilton
 

Similar to Easybib Open Analytics NYC (20)

Perceptions of Libraries
Perceptions of LibrariesPerceptions of Libraries
Perceptions of Libraries
 
The Transition Years: Evaluating Info Lit Skills from High School to College-...
The Transition Years: Evaluating Info Lit Skills from High School to College-...The Transition Years: Evaluating Info Lit Skills from High School to College-...
The Transition Years: Evaluating Info Lit Skills from High School to College-...
 
T carse ESOL_October_2013_3D_Research_presentation
T carse ESOL_October_2013_3D_Research_presentationT carse ESOL_October_2013_3D_Research_presentation
T carse ESOL_October_2013_3D_Research_presentation
 
Trying to stop the kids using google greg sheaf hslg conference 2013
Trying to stop the kids using google greg sheaf hslg conference 2013Trying to stop the kids using google greg sheaf hslg conference 2013
Trying to stop the kids using google greg sheaf hslg conference 2013
 
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...
Nine Strategies for Enhancing Critical Internet Literacy. Colin Harrison ukla...
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...
Evaluer les nouvelles plates-formes de services web et leur impact sur les bi...
 
The Power of Open Data!
The Power of Open Data!The Power of Open Data!
The Power of Open Data!
 
How Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesHow Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New Literacies
 
Google & garbage lsta 2012
Google & garbage lsta 2012Google & garbage lsta 2012
Google & garbage lsta 2012
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with Data
 
Fight for your right!
Fight for your right!Fight for your right!
Fight for your right!
 
The Transition Years
The Transition YearsThe Transition Years
The Transition Years
 
Teaching ten steps to better research
Teaching ten steps to better researchTeaching ten steps to better research
Teaching ten steps to better research
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015
 
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & EvaluationFSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)
 
Introduction to open-data
Introduction to open-dataIntroduction to open-data
Introduction to open-data
 
Day 3: Introduction to Information Literacy
Day 3:  Introduction to Information LiteracyDay 3:  Introduction to Information Literacy
Day 3: Introduction to Information Literacy
 

More from Open Analytics

Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)Open Analytics
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupOpen Analytics
 
The caprate presentation_july2013_open analytics dc meetup
The caprate presentation_july2013_open analytics dc meetupThe caprate presentation_july2013_open analytics dc meetup
The caprate presentation_july2013_open analytics dc meetupOpen Analytics
 
Verifeed open analytics_3min deck_071713_final
Verifeed open analytics_3min deck_071713_finalVerifeed open analytics_3min deck_071713_final
Verifeed open analytics_3min deck_071713_finalOpen Analytics
 
Oas schwartz OA Summit
Oas schwartz OA SummitOas schwartz OA Summit
Oas schwartz OA SummitOpen Analytics
 
Luigi presentation OA Summit
Luigi presentation OA SummitLuigi presentation OA Summit
Luigi presentation OA SummitOpen Analytics
 
Intridea ajn-rttos OA NYC Summit
Intridea ajn-rttos OA NYC SummitIntridea ajn-rttos OA NYC Summit
Intridea ajn-rttos OA NYC SummitOpen Analytics
 
Open analytics summit nyc
Open analytics summit nycOpen analytics summit nyc
Open analytics summit nycOpen Analytics
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanycOpen Analytics
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics eventOpen Analytics
 
Candor - open analytics nyc
Candor  - open analytics nycCandor  - open analytics nyc
Candor - open analytics nycOpen Analytics
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summitOpen Analytics
 
No sql and sql - open analytics summit
No sql and sql - open analytics summitNo sql and sql - open analytics summit
No sql and sql - open analytics summitOpen Analytics
 

More from Open Analytics (15)

Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics Meetup
 
The caprate presentation_july2013_open analytics dc meetup
The caprate presentation_july2013_open analytics dc meetupThe caprate presentation_july2013_open analytics dc meetup
The caprate presentation_july2013_open analytics dc meetup
 
Verifeed open analytics_3min deck_071713_final
Verifeed open analytics_3min deck_071713_finalVerifeed open analytics_3min deck_071713_final
Verifeed open analytics_3min deck_071713_final
 
HDScores OA DC Pitch
HDScores OA DC PitchHDScores OA DC Pitch
HDScores OA DC Pitch
 
Oas schwartz 16
Oas schwartz 16Oas schwartz 16
Oas schwartz 16
 
Oas schwartz OA Summit
Oas schwartz OA SummitOas schwartz OA Summit
Oas schwartz OA Summit
 
Luigi presentation OA Summit
Luigi presentation OA SummitLuigi presentation OA Summit
Luigi presentation OA Summit
 
Intridea ajn-rttos OA NYC Summit
Intridea ajn-rttos OA NYC SummitIntridea ajn-rttos OA NYC Summit
Intridea ajn-rttos OA NYC Summit
 
Open analytics summit nyc
Open analytics summit nycOpen analytics summit nyc
Open analytics summit nyc
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics event
 
Candor - open analytics nyc
Candor  - open analytics nycCandor  - open analytics nyc
Candor - open analytics nyc
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summit
 
No sql and sql - open analytics summit
No sql and sql - open analytics summitNo sql and sql - open analytics summit
No sql and sql - open analytics summit
 

Recently uploaded

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 

Easybib Open Analytics NYC

  • 1. Using data to improve student research
  • 2. EasyBib is an automatic bibliography composer. Students use it to cite sources for their research.
  • 3. We teach information literacy. 18% of all student papers include plagiarism1 Source: (1) TurnItIn; (2) Both Sides Now: Librarians Looking at Information Literacy from High School and College. 50% likelihood of using a credible vs. non- credible source1 4% increase in the use of paper mills and cheating sites1 ~16% of students are adequately prepared for college.2
  • 4. That’s how we felt too..
  • 5. The problem is becoming bigger.
  • 6. Unprepared students make for unprepared adults. It’s not just students who plagiarize: •Pal Schmitt, former president of Hungary •German education minister •Jayson Blair (former New York Times writer) •Jonah Lehrer, journalist and author •Fareed Zakaria (reporter, author, host)
  • 7. We are in the right place to figure it out. Over half of all students in the US (40M) Over half a billion citations
  • 8. We asked ourselves the following questions: •What are students using in their research? •How good are their sources? •How can we help them?
  • 9. We started with the basics._gaq.push([ 'citations._trackEvent', citationTitle, citationPublisher, citationId ]);
  • 10. Here’s what we found. Top sources 2010 •Wikipedia •Google 1.The New York Times 2.CIA World Factbook 3.Oracle Thinkquest 4.Buzzle 5.US BLS 6.Dictionary.com 7.CDC 8.PBS 9.eHow Source: EasyBib Google Analytics Oct 2010-Nov 2010 data.
  • 11. What could we do? •Warn them when their source’s credibility is in question •Analyze the quality of their full bibliography •Make it easier to not plagiarize •Suggest better sources
  • 14. Gave students access to their own analytics
  • 15. To combat plagiarism, we built an audit trail for notes
  • 16. So after all this... Does it blend (tm) ? 1. Wikipedia 2. Bio.com 3. History.com 4. PBS 5. Mayo Clinic 6. CDC 7. The New York Times 8. BBC 9. CNN 10.WebMD 11.US BLS • Wikipedia still on top, but ... • No content farms, no Google.. • WebMD is questionable, but its credibility can be argued for. Source: Apr-May 2013 Google Analytics data
  • 17. We have to admit, it’s getting better... We have to admit, it’s getting better...
  • 18. Help students find better sources
  • 19. How does the Research engine currently work? Cloudant (CouchDB) MySQL Lucene/Solr Slow, asynchronous, lots of moving parts.
  • 20. Starting to do a bit more StatsD::increment($metrics); $response = $rediska->publish( array('realtime'), $citation );
  • 21. There’s a lot more we can do, and data will help us.
  • 22. Cloudant Search •Full-text search integrated into Cloudant •Lucene syntax •Indexing is easy function(doc){ index("title", doc.title, {"store": "yes"}); } •Grouping of sources via chained map-reduce map: function(doc){ if (doc.title){ emit({"title": doc.title}, 1); } } reduce: _sum dbcopy: citationGroup ------ map: function(doc){ if (doc.title && doc.key.title){ emit(doc.value, doc.key.title); } }
  • 23. Live data analysis. Crowdsourcing. •Use Cloudant Search to power feedback on sources (# of times cited in real time, quality of bibliographies derived from) •Allow users to submit their own credibility evaluations and aggregate results
  • 24. SourceRank! Credibility weighting + crowdsourcing Synchronous & realtime via Cloudant Search Value nodes based on nearest neighbors And other things...
  • 25. Driving growth We have the largest UGC citation set. Making this searchable creates a “moat.” The more people that use EasyBib, the better the tool becomes.
  • 26. What about other data analytics tools? Too stretched to learn more complex tools (looking for easy answers) Costs (GA is free!) EMR, Hadoop, Redshift, Cloudant Search: This is what’s next.