How we did it...


ParsCit
REST API
Lessons learned

• gathering data from PDFs works acceptably for
  only some of the data
• a lot of cleanup work, plus the complexity of
  distributing the data cleanup
• future: more structured data as a starting
  point
What we want...

• clean citation data
• geographical data: author - affiliation links
• structured data
• ...
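
As a rough illustration of the wish list above, the sketch below shows one possible shape for structured paper data with explicit author - affiliation links. All class and field names (Paper, Author, Affiliation, country, and so on) are hypothetical and are not taken from the slides; the sample values are placeholders, not real data.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class Affiliation:
    name: str
    country: str  # hypothetical field standing in for the geographical data


@dataclass
class Author:
    name: str
    # Explicit links from an author to one or more affiliations.
    affiliations: List[Affiliation] = field(default_factory=list)


@dataclass
class Citation:
    title: str
    authors: List[str] = field(default_factory=list)


@dataclass
class Paper:
    title: str
    authors: List[Author] = field(default_factory=list)
    citations: List[Citation] = field(default_factory=list)


def author_affiliation_links(paper: Paper) -> List[Tuple[str, str, str]]:
    """Flatten a paper into (author, affiliation, country) triples."""
    return [
        (author.name, aff.name, aff.country)
        for author in paper.authors
        for aff in author.affiliations
    ]


# Placeholder values only, to show how the records fit together.
university = Affiliation(name="University X", country="Country Y")
paper = Paper(
    title="Example paper",
    authors=[Author(name="Author A", affiliations=[university])],
    citations=[Citation(title="Cited work", authors=["Author B"])],
)
links = author_affiliation_links(paper)
```

Starting from records like these, rather than from text pulled out of a PDF, the author - affiliation links are available directly and do not have to be reconstructed during cleanup.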
What might be helpful...

[Diagram labels: PDF, Author, Title]
Our work on the EC-TEL paper data extraction.