The Future of Text Analysis
Dr. Stuart Shulman
Texifter, LLC
Wednesday, June 15, 2011

Briefing Agenda
R&D in Annotation and Public Comments
“The Future of Text Analysis” – The vision
“What is DiscoverText?” – The software
The Features – The basics
Capturing social media and importing other text
Creating archives, buckets and datasets
Coding a dataset or training a classifier

Dr. Stuart W. Shulman
Founder & CEO, Texifter, LLC
Assistant Professor, Department of Political Science, University of Massachusetts Amherst
Director, Qualitative Data Analysis Program (QDAP)
Associate Director, National Center for Digital Government
Editor, Journal of Information Technology & Politics
413-545-5375
email@example.com
people.umass.edu/stu/

Crowdsourcing
Crowdsourcing will bring widely distributed wisdom to the process of text analysis
“This is really the biggest paradigm shift in innovation since the Industrial Revolution”
- MIT professor Eric von Hippel, specialist in innovation management

Active Machine Learning
By utilizing information and decisions previously captured, we can enhance future machine-based decisions
[Figure: Active Learning Loop]

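The loop on this slide can be illustrated with a minimal sketch. It assumes a toy Naive Bayes text classifier and an "uncertainty sampling" selection rule, neither of which the slides attribute to DiscoverText; the names `NaiveBayes`, `active_learning_loop`, and `ask_human` are hypothetical and exist only for this example. The idea shown is that each human coding decision is captured, fed back into the model, and used to choose which item a person should code next.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    return text.lower().split()


class NaiveBayes:
    """Toy multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of documents
        self.vocab = set()

    def train(self, text, label):
        self.doc_counts[label] += 1
        for word in tokenize(text):
            self.word_counts[label][word] += 1
            self.vocab.add(word)

    def predict_proba(self, text):
        total_docs = sum(self.doc_counts.values())
        log_scores = {}
        for label in self.doc_counts:
            total_words = sum(self.word_counts[label].values())
            score = math.log(self.doc_counts[label] / total_docs)
            for word in tokenize(text):
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            log_scores[label] = score
        # Convert log scores to probabilities that sum to 1.
        top = max(log_scores.values())
        exp = {label: math.exp(s - top) for label, s in log_scores.items()}
        norm = sum(exp.values())
        return {label: v / norm for label, v in exp.items()}


def active_learning_loop(seed, pool, ask_human, rounds):
    """Train on seed labels, then repeatedly ask a human to code the
    item the current model is least sure about (uncertainty sampling)."""
    model = NaiveBayes()
    for text, label in seed:
        model.train(text, label)
    remaining = list(pool)  # copy so the caller's pool is not modified
    for _ in range(rounds):
        if not remaining:
            break
        # Least confident item = lowest top-class probability.
        item = min(remaining, key=lambda t: max(model.predict_proba(t).values()))
        remaining.remove(item)
        model.train(item, ask_human(item))  # captured decision feeds the model
    return model


# Hypothetical data; the dictionary lookup stands in for a human coder.
seed = [("great product love it", "positive"),
        ("terrible service hate it", "negative")]
pool = ["love the great support", "hate the terrible delays",
        "awful delays again", "wonderful support team"]
human_answers = {
    "love the great support": "positive",
    "hate the terrible delays": "negative",
    "awful delays again": "negative",
    "wonderful support team": "positive",
}

model = active_learning_loop(seed, pool, human_answers.get, rounds=2)
print(model.predict_proba("wonderful support"))
```

In a real workflow `ask_human` would be a coding interface rather than a lookup table, and the stopping rule would more likely depend on measured classifier accuracy than on a fixed number of rounds.
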
What is DiscoverText?
DiscoverText is a:
personal or organizational archive in the cloud
search engine for eDiscovery
social media comment aggregator
de-duplication and near-duplicate clustering engine
FOIA redaction toolkit
coding, reporting and validation team workbench
repository of human annotation (text about text), and
customizable machine-learning classifier
(beta launched April 2011)

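As an illustration of what near-duplicate clustering can mean in practice, the sketch below groups texts whose word 3-grams ("shingles") overlap heavily, measured by Jaccard similarity. This is a common textbook approach and an assumption made for this example; the slides do not say how DiscoverText implements the feature. The function names, the threshold of 0.5, and the sample comments are all hypothetical.

```python
def shingles(text, k=3):
    """Return the set of k-word sequences in the text."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_near_duplicates(docs, threshold=0.5, k=3):
    """Group document indices whose shingle sets meet the similarity threshold.

    Compares every pair, so it suits small collections only.
    """
    sets = [shingles(d, k) for d in docs]
    parent = list(range(len(docs)))  # union-find forest over document indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(docs)):
        for j in range(i + 1, len(docs)):
            if jaccard(sets[i], sets[j]) >= threshold:
                parent[find(j)] = find(i)

    clusters = {}
    for i in range(len(docs)):
        clusters.setdefault(find(i), []).append(i)
    return list(clusters.values())


# Hypothetical public comments: two variants of a form letter and one unrelated.
comments = [
    "please protect the clean water rule for our community",
    "please protect the clean water rule for our town",
    "I oppose the new fee schedule entirely",
]
print(cluster_near_duplicates(comments))
```

Exact duplicates fall out as the special case where the similarity is 1.0. For large comment collections, the all-pairs comparison would normally be replaced by an approximate method such as MinHash.
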
A fall 2011 briefing for personnel at Forrester in Cambridge, MA.