SlideShare a Scribd company logo
1 of 38
meow ::06 David Newman Bill Landis, ex officio Kat Hagedorn Clustering, Classification,  and Metadata  Enhancement Techniques July 24, 2006
Clustering, Classification, and Metadata Enhancement Techniques on OAI Records ,[object Object],[object Object],[object Object]
Goals ,[object Object],[object Object],[object Object],[object Object]
What We Did Cluster Preprocessing & Topic Modeling >   vocab- ulary preprocess topic model (cluster/learn) topics OAI records
What We Did vocab- ulary preprocess topic model (cluster/learn) topics Cluster OAI records vocab -ulary preprocess topic model (classify) 1.  topics in records 2.  records in topics oai rec Classify Preprocessing & Topic Modeling >   OAI records
What We Did Cluster Classify Preprocessing & Topic Modeling >   clustering is  learning  the topics classification is  using  the learned topics vocab- ulary preprocess topic model (cluster/learn) topics OAI records vocab -ulary preprocess topic model (classify) 1.  topics in records 2.  records in topics oai rec OAI records
Repository Selection ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Preprocessing & Topic Modeling >
Selected Repositories * *Repositories harvested by UMich/OAIster, June 7, 2006. Preprocessing & Topic Modeling >   1 in 3 141,000 Research Papers in Economics repec 1 in 3 625,000 PubMed Central pubmed - 370,000 Publishing Network for Geoscientific and Environmental Data pangaea 1 in 3 131,000 Office of Science and Technology Information osti 1 in 2  33,000 The National Science Digital Library nsdl - 239,000 Library of Congress Digitized Historical Collections loc 1 in 3 212,000 Institute of Physics iop 1 in 2 29,000 Directory of Open Access Journals Articles doaj 1 in 3 717,000 CiteSeer Scientific Literature Digital Library citeseer 1 in 2 45,000 CERN Document Server cern - 3,000 Caltech Electronic Theses and Dissertations caltech 1 in 3 368,000 arXiv.org Eprint Archive arxiv Records used for clustering (learning) Records Description Short Name
Usage of Dublin Core Fields ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Preprocessing & Topic Modeling >
Preprocessing Example <ID=oai:CiteSeerPSU:44072> <title> Reinforcement Learning: A Survey <description> This paper surveys the field of reinforcement learning from a computer-science perspective.  It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized.  Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word &quot;reinforcement.&quot; … <subject> Leslie Pack Kaelbling, Michael Littman, Andrew Moore.  Reinforcement Learning: A Survey vocab -ulary preprocess <ID=oai:CiteSeerPSU:44072> reinforcement learning survey survey field reinforcement learning computer science perspective written accessible researcher familiar machine learning historical basis field broad selection current summarized reinforcement learning faced agent learn behavior trial error interaction dynamic environment resemblance psychology differ considerably detail word reinforcement … leslie pack kaelbling littman andrew moore reinforcement learning survey Preprocessing & Topic Modeling >
Stopwords and Stemming ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Preprocessing & Topic Modeling >
Building Vocabulary ,[object Object],[object Object],[object Object],[object Object],[object Object],Preprocessing & Topic Modeling >
Preprocessing & Topic Modeling >   ,[object Object],[object Object],[object Object]
Computation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Decision point: How many topics? Decision point: How many iterations? Preprocessing & Topic Modeling >
Broad Topical Categories ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Preprocessing & Topic Modeling >
Broad Topical Categories broad topical categories Preprocessing & Topic Modeling >   vocab- ulary preprocess topic model (cluster/learn) topics OAI records topic model (cluster/learn) Cluster Cluster the clusters
Broad Topical Categories Cluster broad topical categories Cluster the clusters Classify group of keywords vocab -ulary preprocess topic model (classify) topics organized under  broad topical categories group of keywords Preprocessing & Topic Modeling >   vocab- ulary preprocess topic model (cluster/learn) topics OAI records topic model (cluster/learn)
Clustering, Classification, and Metadata Enhancement Techniques on OAI Records ,[object Object],[object Object],[object Object]
The “Browser” ,[object Object],[object Object],[object Object],[object Object],[object Object],*Based on 750,000 sampled records from 9 repositories, 500 topics The Browser >
The “Browser”:  http://yarra.calit2.uci.edu/meow/ The Browser >
Selected Topics: Useful ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The Browser >
Selected Topics: Less Useful ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],and very useful to filter out French records The Browser >
Broad Topical Categories (BTCs) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The Browser >
BTCs: Clustering the clusters The Browser >
BTCs: Classifying group of keywords ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],domain expert specifies list of relevant keywords and (importance) The Browser >
BTCs: Classifying group of keywords ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],in review, would delete this topic from this BTC just found 1 topic relevant to transportation The Browser >
Browse Records in a Topic nice mix of repositories The Browser >   can navigate back to multiple BTCs
Browse Records in a Topic: From one repository The Browser >   display records just from Library of Congress
Sample Record ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The Browser >   link to actual OAI record topics for this record
Repository-specific Browsers ,[object Object],[object Object],[object Object],[object Object],[object Object],The Browser >
Clustering, Classification, and Metadata Enhancement Techniques on OAI Records ,[object Object],[object Object],[object Object]
Evaluation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Lessons Learned & Next Steps >
Further Evaluation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Lessons Learned & Next Steps >
Discussion Point: When to Re-cluster? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],cluster classify cluster cluster classify classify classify classify classify Lessons Learned & Next Steps >
Products and Services ,[object Object],[object Object],[object Object],[object Object],[object Object],Lessons Learned & Next Steps >
Archive of Topics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Lessons Learned & Next Steps >
Subject Search/Browse for OAIster ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Lessons Learned & Next Steps >
How To Reach Us ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

Viewers also liked

Whats New In Silverlight 3
Whats New In Silverlight 3Whats New In Silverlight 3
Whats New In Silverlight 3Bruce Johnson
 
Silverlight 4 and Expression Blend
Silverlight 4 and Expression BlendSilverlight 4 and Expression Blend
Silverlight 4 and Expression BlendBruce Johnson
 
VIReC Cyber Seminar Series 2006 VIReC Cyber Seminar Series 2006
VIReC Cyber Seminar Series 2006 	 VIReC Cyber Seminar Series 2006VIReC Cyber Seminar Series 2006 	 VIReC Cyber Seminar Series 2006
VIReC Cyber Seminar Series 2006 VIReC Cyber Seminar Series 2006MedicineAndDermatology
 
Minnesota HIPAA Collaborative Minnesota HIPAA Collaborative
Minnesota HIPAA Collaborative 	 Minnesota HIPAA CollaborativeMinnesota HIPAA Collaborative 	 Minnesota HIPAA Collaborative
Minnesota HIPAA Collaborative Minnesota HIPAA CollaborativeMedicineAndDermatology
 
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...MedicineAndDermatology
 

Viewers also liked (6)

Whats New In Silverlight 3
Whats New In Silverlight 3Whats New In Silverlight 3
Whats New In Silverlight 3
 
Silverlight 4 and Expression Blend
Silverlight 4 and Expression BlendSilverlight 4 and Expression Blend
Silverlight 4 and Expression Blend
 
VIReC Cyber Seminar Series 2006 VIReC Cyber Seminar Series 2006
VIReC Cyber Seminar Series 2006 	 VIReC Cyber Seminar Series 2006VIReC Cyber Seminar Series 2006 	 VIReC Cyber Seminar Series 2006
VIReC Cyber Seminar Series 2006 VIReC Cyber Seminar Series 2006
 
Minnesota HIPAA Collaborative Minnesota HIPAA Collaborative
Minnesota HIPAA Collaborative 	 Minnesota HIPAA CollaborativeMinnesota HIPAA Collaborative 	 Minnesota HIPAA Collaborative
Minnesota HIPAA Collaborative Minnesota HIPAA Collaborative
 
Teaching Using Portable Ultrasound
Teaching Using Portable UltrasoundTeaching Using Portable Ultrasound
Teaching Using Portable Ultrasound
 
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...
INTEGRATION OF LEPROSY REHABILITATION SERVICES INTO THE MAINSTREAM OF PHYSICA...
 

Similar to Meow Hagedorn

Семантический поиск - что это, как работает и чем отличается от просто поиска
Семантический поиск - что это, как работает и чем отличается от просто поискаСемантический поиск - что это, как работает и чем отличается от просто поиска
Семантический поиск - что это, как работает и чем отличается от просто поискаVitebsk Miniq
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research ObjectsDavid De Roure
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
Wikipedia Document Classification
Wikipedia Document Classification Wikipedia Document Classification
Wikipedia Document Classification Mohit Sharma
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD MicrothesauriMarcia Zeng
 
Using Computer as a Research Assistant in Qualitative Research
Using Computer as a Research Assistant in Qualitative ResearchUsing Computer as a Research Assistant in Qualitative Research
Using Computer as a Research Assistant in Qualitative ResearchJoshuaApolonio1
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesIchigaku Takigawa
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodDuncan Hull
 
Semi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesElsevier
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...Angelo Salatino
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsAnubhav Jain
 
Reverse engineering and theory building v3
Reverse engineering and theory building v3Reverse engineering and theory building v3
Reverse engineering and theory building v3ClarkTony
 
Research Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataResearch Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataRicard de la Vega
 
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...faflrt
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesMax Irwin
 
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersCarlos Toxtli
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoAshok Venkatesan
 

Similar to Meow Hagedorn (20)

Семантический поиск - что это, как работает и чем отличается от просто поиска
Семантический поиск - что это, как работает и чем отличается от просто поискаСемантический поиск - что это, как работает и чем отличается от просто поиска
Семантический поиск - что это, как работает и чем отличается от просто поиска
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research Objects
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
Wikipedia Document Classification
Wikipedia Document Classification Wikipedia Document Classification
Wikipedia Document Classification
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD Microthesauri
 
qualitative.ppt
qualitative.pptqualitative.ppt
qualitative.ppt
 
Using Computer as a Research Assistant in Qualitative Research
Using Computer as a Research Assistant in Qualitative ResearchUsing Computer as a Research Assistant in Qualitative Research
Using Computer as a Research Assistant in Qualitative Research
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree Ensembles
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific Method
 
A-Study_TopicModeling
A-Study_TopicModelingA-Study_TopicModeling
A-Study_TopicModeling
 
Semi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific Tables
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
 
Reverse engineering and theory building v3
Reverse engineering and theory building v3Reverse engineering and theory building v3
Reverse engineering and theory building v3
 
Research Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataResearch Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories Metadata
 
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and Dato
 
SISAP17
SISAP17SISAP17
SISAP17
 

More from MedicineAndDermatology

The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the Cervix
The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the CervixThe Role of Human Papillomavirus (HPV) Infection in Abnormalities of the Cervix
The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the CervixMedicineAndDermatology
 
Alzheimers disease genetics teaching resources
Alzheimers disease genetics teaching resourcesAlzheimers disease genetics teaching resources
Alzheimers disease genetics teaching resourcesMedicineAndDermatology
 
Multiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMultiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMedicineAndDermatology
 
Multiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMultiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMedicineAndDermatology
 

More from MedicineAndDermatology (20)

Fibromyalgia.Rc Burdine.Fmdrl
Fibromyalgia.Rc Burdine.FmdrlFibromyalgia.Rc Burdine.Fmdrl
Fibromyalgia.Rc Burdine.Fmdrl
 
Preterm Labor Prevention Watrin
Preterm Labor Prevention WatrinPreterm Labor Prevention Watrin
Preterm Labor Prevention Watrin
 
Dementia.2
Dementia.2Dementia.2
Dementia.2
 
Desirable Criteria For Cases
Desirable Criteria For CasesDesirable Criteria For Cases
Desirable Criteria For Cases
 
The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the Cervix
The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the CervixThe Role of Human Papillomavirus (HPV) Infection in Abnormalities of the Cervix
The Role of Human Papillomavirus (HPV) Infection in Abnormalities of the Cervix
 
Medical Management During Ramadan
Medical Management During RamadanMedical Management During Ramadan
Medical Management During Ramadan
 
HPV and Cervical Cancer Screening
HPV and Cervical Cancer ScreeningHPV and Cervical Cancer Screening
HPV and Cervical Cancer Screening
 
Suggested Teaching Approaches For Dd
Suggested Teaching Approaches For DdSuggested Teaching Approaches For Dd
Suggested Teaching Approaches For Dd
 
Cancer Screening
Cancer ScreeningCancer Screening
Cancer Screening
 
Acae Nicu Paper Final Subm Correction
Acae Nicu Paper Final Subm CorrectionAcae Nicu Paper Final Subm Correction
Acae Nicu Paper Final Subm Correction
 
Alzheimers disease genetics teaching resources
Alzheimers disease genetics teaching resourcesAlzheimers disease genetics teaching resources
Alzheimers disease genetics teaching resources
 
Hypertension
HypertensionHypertension
Hypertension
 
Anemia
AnemiaAnemia
Anemia
 
Urinary Tract Infection in Children
Urinary Tract Infection in ChildrenUrinary Tract Infection in Children
Urinary Tract Infection in Children
 
Community Acquired Pneumonia Fmdrl
Community Acquired Pneumonia FmdrlCommunity Acquired Pneumonia Fmdrl
Community Acquired Pneumonia Fmdrl
 
Community Acquired Pneumonia Fmdrl
Community Acquired Pneumonia FmdrlCommunity Acquired Pneumonia Fmdrl
Community Acquired Pneumonia Fmdrl
 
Multiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMultiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C Bourque
 
Multiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C BourqueMultiple Sclerosis Diagnostics Dr C Bourque
Multiple Sclerosis Diagnostics Dr C Bourque
 
Smoking Cessation Tutorial
Smoking Cessation TutorialSmoking Cessation Tutorial
Smoking Cessation Tutorial
 
Insomnia Presentation
Insomnia PresentationInsomnia Presentation
Insomnia Presentation
 

Recently uploaded

Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service AvailableCall Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service AvailableJanvi Singh
 
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...parulsinha
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...GENUINE ESCORT AGENCY
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...parulsinha
 
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...hotbabesbook
 
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Anamika Rawat
 
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...chennailover
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426jennyeacort
 
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Ishani Gupta
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableGENUINE ESCORT AGENCY
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls ServiceGENUINE ESCORT AGENCY
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeCall Girls Delhi
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...Arohi Goyal
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappInaaya Sharma
 
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Sheetaleventcompany
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...adilkhan87451
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Anamika Rawat
 
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...parulsinha
 

Recently uploaded (20)

Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service AvailableCall Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
 
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
 
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
 
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
 
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
 

Meow Hagedorn

  • 1. meow ::06 David Newman Bill Landis, ex officio Kat Hagedorn Clustering, Classification, and Metadata Enhancement Techniques July 24, 2006
  • 2.
  • 3.
  • 4. What We Did Cluster Preprocessing & Topic Modeling > vocab- ulary preprocess topic model (cluster/learn) topics OAI records
  • 5. What We Did vocab- ulary preprocess topic model (cluster/learn) topics Cluster OAI records vocab -ulary preprocess topic model (classify) 1. topics in records 2. records in topics oai rec Classify Preprocessing & Topic Modeling > OAI records
  • 6. What We Did Cluster Classify Preprocessing & Topic Modeling > clustering is learning the topics classification is using the learned topics vocab- ulary preprocess topic model (cluster/learn) topics OAI records vocab -ulary preprocess topic model (classify) 1. topics in records 2. records in topics oai rec OAI records
  • 7.
  • 8. Selected Repositories * *Repositories harvested by UMich/OAIster, June 7, 2006. Preprocessing & Topic Modeling > 1 in 3 141,000 Research Papers in Economics repec 1 in 3 625,000 PubMed Central pubmed - 370,000 Publishing Network for Geoscientific and Environmental Data pangaea 1 in 3 131,000 Office of Science and Technology Information osti 1 in 2 33,000 The National Science Digital Library nsdl - 239,000 Library of Congress Digitized Historical Collections loc 1 in 3 212,000 Institute of Physics iop 1 in 2 29,000 Directory of Open Access Journals Articles doaj 1 in 3 717,000 CiteSeer Scientific Literature Digital Library citeseer 1 in 2 45,000 CERN Document Server cern - 3,000 Caltech Electronic Theses and Dissertations caltech 1 in 3 368,000 arXiv.org Eprint Archive arxiv Records used for clustering (learning) Records Description Short Name
  • 9.
  • 10. Preprocessing Example <ID=oai:CiteSeerPSU:44072> <title> Reinforcement Learning: A Survey <description> This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word &quot;reinforcement.&quot; … <subject> Leslie Pack Kaelbling, Michael Littman, Andrew Moore. Reinforcement Learning: A Survey vocab -ulary preprocess <ID=oai:CiteSeerPSU:44072> reinforcement learning survey survey field reinforcement learning computer science perspective written accessible researcher familiar machine learning historical basis field broad selection current summarized reinforcement learning faced agent learn behavior trial error interaction dynamic environment resemblance psychology differ considerably detail word reinforcement … leslie pack kaelbling littman andrew moore reinforcement learning survey Preprocessing & Topic Modeling >
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Broad Topical Categories broad topical categories Preprocessing & Topic Modeling > vocab- ulary preprocess topic model (cluster/learn) topics OAI records topic model (cluster/learn) Cluster Cluster the clusters
  • 17. Broad Topical Categories Cluster broad topical categories Cluster the clusters Classify group of keywords vocab -ulary preprocess topic model (classify) topics organized under broad topical categories group of keywords Preprocessing & Topic Modeling > vocab- ulary preprocess topic model (cluster/learn) topics OAI records topic model (cluster/learn)
  • 18.
  • 19.
  • 20. The “Browser”: http://yarra.calit2.uci.edu/meow/ The Browser >
  • 21.
  • 22.
  • 23.
  • 24. BTCs: Clustering the clusters The Browser >
  • 25.
  • 26.
  • 27. Browse Records in a Topic nice mix of repositories The Browser > can navigate back to multiple BTCs
  • 28. Browse Records in a Topic: From one repository The Browser > display records just from Library of Congress
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.