SlideShare a Scribd company logo
1 of 45
Download to read offline
Data-Driven Problem Solving in
Higher Education Institutions
Lighton Phiri <lighton.phiri@unza.zm>
Department of Library & Information Science
University of Zambia
http://lis.unza.zm/~lightonphiri
Discovering Insight from
Scholarly Research Output in
Higher Educational Institutions
2
October 27, 2021
About The DataLab Research Group at The
University of Zambia
● The DataLab research group
at The University of Zambia is
composed of faculty staff and
students—undergraduate
and postgraduate—working
in three main areas
○ Data Mining
○ Digital Libraries
○ Technology-Enhanced
Learning
http://datalab.unza.zm
3
October 27, 2021
Outline
● Part I. Data-Driven Problem Solving
● Part II. Insights from Scholarly Research Data
4
October 27, 2021
Outline
● Part I. Data-Driven Problem Solving
○ Introduction
○ Data Mining Pipelines
○ Data Mining Models
○ Past and Current Projects
● Part II. Insights from Scholarly Research Data
5
October 27, 2021
Machine Learning Instrumental in Deriving
Insights
https://commons.wikimedia.org/
● Artificial Intelligence encompases
a broad spectrum of sub-fields
○ Traditional machine learning
techniques and approaches
○ Deep Learning approaches
6
October 27, 2021
Machine Learning Instrumental in Deriving
Insights
https://commons.wikimedia.org/
● Artificial Intelligence encompases
a broad spectrum of sub-fields
○ Traditional machine learning
techniques and approaches
○ Deep Learning approaches
7
October 27, 2021
Past and Current Projects Leverage Machine
Learning With Data as a Key Ingredient
8
October 27, 2021
Project #1: Predicting Student Learning
Outcomes—Problem
● ICT 1110 performance is as issue. The poor performance
transcends all assessments: quizzes, tests and practical
programming questions.
9
October 27, 2021
Project #1: Predicting Student Learning
Outcomes—Current Work
● Potential solution: implement a prediction model aimed at
identifying at-risk students .
○ Initiate interventions on at-risk students.
Chaibela, M., Chisha, I., Pungwa, D., Siabbaba D. and Simukoko B. (2021)
“Performance Predictor: Machine Learning Tool for Student Performance Outcomes”.
Work-in-Progress
10
October 27, 2021
Project #2: Medical Imaging Workflows in
Zambia—Problem
https://mjz.co.zm/index.php/mjz/article/view/560
11
October 27, 2021
Project #2: Medical Imaging Workflows in
Zambia—Current Work
12
October 27, 2021
Project #3: Automatic Forecasting of
Seasonal Rainfall—Current Work
13
October 27, 2021
Data-Drive Problem Solving Pipelines are
Generic
● Fundamentally,
machine learning
aims to extract
knowledge from
data
○ Historical data is
used to
infer/predict
outcomes
associated with
new observations
14
October 27, 2021
Data-Drive Problem Solving Pipelines are
Generic
● Input features
identified during
feature engineering
are used to train
models
○ Features correlated
with outcome to be
identified
15
October 27, 2021
Data-Drive Problem Solving Pipelines are
Generic
● The ML inference
model is used to
predict future
patterns
○ Models can then be
deployed as Web
services and/or
standalone
applications
16
October 27, 2021
Established Data Mining Models Crucial to
Data-Driven Problem Solving (1/4)
https://doi.org/10.1017/S0269888910000032
● Numerous data
mining models and
frameworks have
been proposed
○ Most trace their
roots from the
KDD Process
proposed by
Fayyad et al.
17
October 27, 2021
Established Data Mining Models Crucial to
Data-Driven Problem Solving (2/4)
https://doi.org/10.1017/S0269888906000737
18
October 27, 2021
Established Data Mining Models Crucial to
Data-Driven Problem Solving (2/4)
https://doi.org/10.1017/S0269888906000737
19
October 27, 2021
Established Data Mining Models Crucial to
Data-Driven Problem Solving (3/4)
https://www.kdnuggets.com
● CRISP-DM model is one
of the most widely
used data mining
models
● Data understanding
and preparation are
the most time
consuming
20
October 27, 2021
Established Data Mining Models Crucial to
Data-Driven Problem Solving (4/4)
https://arxiv.org/abs/2003.05155
21
October 27, 2021
Outline
● Part I. Data-Driven Problem Solving
● Part II. Insights from Scholarly Research Data
○ Problem and Motivation
○ Data Sources, Preprocessing and Preparation
○ Scholarly Research Output Insights
22
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (1/3)
https://worldmapper.org
23
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (1/3)
https://worldmapper.org
24
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (2/3)
25
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (2/3)
26
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (3/3)
http://www.webometrics.info
27
October 27, 2021
Online Visibility of Scholarly Research in
Zambia—Problem (3/3)
http://www.webometrics.info
28
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (1/5)
https://ir.nust.na
29
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (2/5)
http://dspace.unza.zm
30
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (2/5)
http://dspace.unza.zm
http://journals.unza.zm
31
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (3/5)
● Textual content mined from PDF
manuscripts
○ Cover/title pages
○ Preliminary pages
● Textual content mined from
metadata for training
● PDF document metadata
● Curated datasets from external
repositories
32
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (3/5)
● Textual content mined
from PDF manuscripts
○ Cover/title pages
○ Preliminary pages
● Textual content mined
from metadata for
training
● PDF document metadata
● Curated datasets from
external repositories
33
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (3/5)
● Textual content mined
from PDF manuscripts
○ Cover/title pages
○ Preliminary pages
● Textual content mined
from metadata for
training
● PDF document metadata
● Curated datasets from
external repositories
34
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (3/5)
● Textual content mined
from PDF manuscripts
○ Cover/title pages
○ Preliminary pages
● Textual content mined
from metadata for
training
● PDF document metadata
● Curated datasets from
external repositories
35
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (4/5)
● OAI-PMH used to
harvest all ETD
descriptive metadata
elements
● OAI-ORE used to
harvest all ETD PDF
documents
36
October 27, 2021
Data Sources, Collection, Preprocessing and
Preparation (5/5)
● Text features extracted from a set of core
bitstream portions—ETD Title, ETD
Abstract, ETD Title Page and ETD pages—to
classify ETD manuscripts
ETD Type
ETD Subjects
IR Collection
37
October 27, 2021
Quantifying The Online Visibility of Research
in Zambia
Phiri, L. (2018). ICICT 2018
“Towards Increased Online Visibility of Scholarly Research Output in Zambia”.
URL: http://lis.unza.zm/archive/handle/123456789/227
38
October 27, 2021
Quantifying The Online Visibility of Research
in Zambia
Phiri, L. (2018). ICICT 2018
“Towards Increased Online Visibility of Scholarly Research Output in Zambia”.
URL: http://lis.unza.zm/archive/handle/123456789/227
39
October 27, 2021
Quantifying The Online Visibility of Research
in Zambia
Phiri, L. (2018). ICICT 2018
“Towards Increased Online Visibility of Scholarly Research Output in Zambia”.
URL: http://lis.unza.zm/archive/handle/123456789/227
40
October 27, 2021
Improved Visibility through Automatic
Classification of ETDs
● Implementation of classification models to
automatically classify IR digital objects
using the minimum possible input from
graduate students: “The ETD Manuscript”
○ The ETD manuscript bitstream is considered
the “single source of truth”
○ Metadata prepared by staff that work with IR
potentially have inconsistencies
Phiri, L. (2021). IJMSO Vol. 14, No. 3
“Automatic Classification of Digital Objects for Improved Metadata Quality of ETDs”
URL: https://doi.org/10.1504/IJMSO.2020.112804
41
October 27, 2021
Improved Discoverability of Digital Objects
in Institutional Repositories
Chipangila, B. et al. (2021). JCDL 2021
“Improved Discoverability of Digital Objects in IRs Using Controlled Vocabularies”
URL: https://doi.org/10.1109/JCDL52503.2021.00022
42
October 27, 2021
Beyond Insights into Scholarly Research
Landscape in Zambia
http://lis.unza.zm/portal
43
October 27, 2021
Q & A Session
● Comments, concerns and complaints?
[1] Phiri, L. (2018). Research Visibility in the Global South: Towards
Increased Online Visibility of Scholarly Research Output in
Zambia. IEEE International Conference in Information and
Communication Technologies.
[2] Chipangila, B. et al. (2021). Improved Discoverability of Digital
Objects in Institutional Repositories Using Controlled
Vocabularies
[3] Phiri, L. (2020). Automatic classification of digital objects for
improved metadata quality of electronic theses and dissertations
in institutional repositories. International Journal of Metadata,
Semantics and Ontologies, 14(3), 234-248.
Bibliography
lighton.phiri@unza.zm
http://datalab.unza.zm
http://lis.unza.zm/~lightonphiri

More Related Content

Similar to Data-Driven Insights in Higher Ed

Institutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthInstitutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthLighton Phiri
 
Improved Scholarly Communication Using Machine Learning
Improved Scholarly Communication Using Machine LearningImproved Scholarly Communication Using Machine Learning
Improved Scholarly Communication Using Machine LearningLighton Phiri
 
Web Archive Research Skills and Tools Survey (WARST)
 Web Archive Research Skills and Tools Survey (WARST) Web Archive Research Skills and Tools Survey (WARST)
Web Archive Research Skills and Tools Survey (WARST)WARCnet
 
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisResearcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisIJAEMSJORNAL
 
LiDIA: An integration architecture to query Linked Open Data from multiple da...
LiDIA: An integration architecture to query Linked Open Data from multiple da...LiDIA: An integration architecture to query Linked Open Data from multiple da...
LiDIA: An integration architecture to query Linked Open Data from multiple da...Cristian Rodríguez Enríquez
 
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)IJDKP
 
KAIST Web Engineering Lab Introduction (2017 ver.)
KAIST Web Engineering Lab Introduction (2017 ver.)KAIST Web Engineering Lab Introduction (2017 ver.)
KAIST Web Engineering Lab Introduction (2017 ver.)webeng-kaist
 
MOVING presentation at JSI
MOVING presentation at JSIMOVING presentation at JSI
MOVING presentation at JSIMOVING Project
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...LIBER Europe
 
Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Beat Signer
 
Adaptive Knowledge Portal for Education Domain
Adaptive Knowledge Portal for Education DomainAdaptive Knowledge Portal for Education Domain
Adaptive Knowledge Portal for Education DomainMikhail Navrotskii
 
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...Lighton Phiri
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Christophe Debruyne
 

Similar to Data-Driven Insights in Higher Ed (20)

Institutional Repository Single Sources of Truth
Institutional Repository Single Sources of TruthInstitutional Repository Single Sources of Truth
Institutional Repository Single Sources of Truth
 
Improved Scholarly Communication Using Machine Learning
Improved Scholarly Communication Using Machine LearningImproved Scholarly Communication Using Machine Learning
Improved Scholarly Communication Using Machine Learning
 
Web Archive Research Skills and Tools Survey (WARST)
 Web Archive Research Skills and Tools Survey (WARST) Web Archive Research Skills and Tools Survey (WARST)
Web Archive Research Skills and Tools Survey (WARST)
 
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisResearcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive Analysis
 
Seminario Sobre Datasets Consorcio Madrono
Seminario Sobre Datasets Consorcio Madrono Seminario Sobre Datasets Consorcio Madrono
Seminario Sobre Datasets Consorcio Madrono
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
Exploratory Analysis of User Data
Exploratory Analysis of User DataExploratory Analysis of User Data
Exploratory Analysis of User Data
 
LiDIA: An integration architecture to query Linked Open Data from multiple da...
LiDIA: An integration architecture to query Linked Open Data from multiple da...LiDIA: An integration architecture to query Linked Open Data from multiple da...
LiDIA: An integration architecture to query Linked Open Data from multiple da...
 
Data Mining and Its Application in Library and Information Science
Data Mining and Its Application in Library and Information ScienceData Mining and Its Application in Library and Information Science
Data Mining and Its Application in Library and Information Science
 
WebEng_202107
WebEng_202107WebEng_202107
WebEng_202107
 
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)
2nd International Conference on Big Data, Blockchain and Security (BDBS 2021)
 
KAIST Web Engineering Lab Introduction (2017 ver.)
KAIST Web Engineering Lab Introduction (2017 ver.)KAIST Web Engineering Lab Introduction (2017 ver.)
KAIST Web Engineering Lab Introduction (2017 ver.)
 
MOVING presentation at JSI
MOVING presentation at JSIMOVING presentation at JSI
MOVING presentation at JSI
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
 
Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...
 
bonino
boninobonino
bonino
 
Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"
 
Adaptive Knowledge Portal for Education Domain
Adaptive Knowledge Portal for Education DomainAdaptive Knowledge Portal for Education Domain
Adaptive Knowledge Portal for Education Domain
 
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
 

More from Lighton Phiri

Enterprise Medical Imaging for Streamlined Radiological Diagnosis in Zambian...
Enterprise Medical Imaging for Streamlined Radiological Diagnosis  in Zambian...Enterprise Medical Imaging for Streamlined Radiological Diagnosis  in Zambian...
Enterprise Medical Imaging for Streamlined Radiological Diagnosis in Zambian...Lighton Phiri
 
User Centred Design and Implementation of Useful Picture Archiving and Commun...
User Centred Design and Implementation of Useful Picture Archiving and Commun...User Centred Design and Implementation of Useful Picture Archiving and Commun...
User Centred Design and Implementation of Useful Picture Archiving and Commun...Lighton Phiri
 
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...Lighton Phiri
 
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...Lighton Phiri
 
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...Lighton Phiri
 
Enterprise Medical Imaging in the Global South: Challenges and Opportunities
Enterprise Medical Imaging in the Global South: Challenges and OpportunitiesEnterprise Medical Imaging in the Global South: Challenges and Opportunities
Enterprise Medical Imaging in the Global South: Challenges and OpportunitiesLighton Phiri
 
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...Lighton Phiri
 
DRGS OJS Training: Electronic Publishing Using Open Journal Systems
DRGS OJS Training: Electronic Publishing Using Open Journal SystemsDRGS OJS Training: Electronic Publishing Using Open Journal Systems
DRGS OJS Training: Electronic Publishing Using Open Journal SystemsLighton Phiri
 
OJS Training: Users and User Roles
OJS Training: Users and User RolesOJS Training: Users and User Roles
OJS Training: Users and User RolesLighton Phiri
 
OJS Training: Journal Settings and Configuration
OJS Training: Journal Settings and ConfigurationOJS Training: Journal Settings and Configuration
OJS Training: Journal Settings and ConfigurationLighton Phiri
 
OJS Training: Managing The Submission Process
OJS Training: Managing The Submission ProcessOJS Training: Managing The Submission Process
OJS Training: Managing The Submission ProcessLighton Phiri
 
OJS Training: Creating and Managing Journal Issues
OJS Training: Creating and Managing Journal IssuesOJS Training: Creating and Managing Journal Issues
OJS Training: Creating and Managing Journal IssuesLighton Phiri
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...Lighton Phiri
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...Lighton Phiri
 
Post PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and ChallengesPost PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and ChallengesLighton Phiri
 
Technology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and LearningTechnology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and LearningLighton Phiri
 
Research Visibility in the Global South: Towards Increased Online Visibility...
Research Visibility  in the Global South: Towards Increased Online Visibility...Research Visibility  in the Global South: Towards Increased Online Visibility...
Research Visibility in the Global South: Towards Increased Online Visibility...Lighton Phiri
 
Ph.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for OrchestrationPh.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for OrchestrationLighton Phiri
 
Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...Lighton Phiri
 
Towards Open Access Institutional Repositories in Zambian HEIs
Towards Open Access Institutional Repositories in Zambian HEIsTowards Open Access Institutional Repositories in Zambian HEIs
Towards Open Access Institutional Repositories in Zambian HEIsLighton Phiri
 

More from Lighton Phiri (20)

Enterprise Medical Imaging for Streamlined Radiological Diagnosis in Zambian...
Enterprise Medical Imaging for Streamlined Radiological Diagnosis  in Zambian...Enterprise Medical Imaging for Streamlined Radiological Diagnosis  in Zambian...
Enterprise Medical Imaging for Streamlined Radiological Diagnosis in Zambian...
 
User Centred Design and Implementation of Useful Picture Archiving and Commun...
User Centred Design and Implementation of Useful Picture Archiving and Commun...User Centred Design and Implementation of Useful Picture Archiving and Commun...
User Centred Design and Implementation of Useful Picture Archiving and Commun...
 
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...
Enterprise Medical Imaging for Improved Radiological Workflows in Zambian Pub...
 
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...
Empirical Evaluation of ETD-ms Compliance for ETDs Harvested by the NDLTD Uni...
 
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...
Enterprise Medical Imaging in Public Health Facilities in Zambia: Towards a U...
 
Enterprise Medical Imaging in the Global South: Challenges and Opportunities
Enterprise Medical Imaging in the Global South: Challenges and OpportunitiesEnterprise Medical Imaging in the Global South: Challenges and Opportunities
Enterprise Medical Imaging in the Global South: Challenges and Opportunities
 
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...
Factors Influencing Co-Creation of Open Education Resources Using Learning Ob...
 
DRGS OJS Training: Electronic Publishing Using Open Journal Systems
DRGS OJS Training: Electronic Publishing Using Open Journal SystemsDRGS OJS Training: Electronic Publishing Using Open Journal Systems
DRGS OJS Training: Electronic Publishing Using Open Journal Systems
 
OJS Training: Users and User Roles
OJS Training: Users and User RolesOJS Training: Users and User Roles
OJS Training: Users and User Roles
 
OJS Training: Journal Settings and Configuration
OJS Training: Journal Settings and ConfigurationOJS Training: Journal Settings and Configuration
OJS Training: Journal Settings and Configuration
 
OJS Training: Managing The Submission Process
OJS Training: Managing The Submission ProcessOJS Training: Managing The Submission Process
OJS Training: Managing The Submission Process
 
OJS Training: Creating and Managing Journal Issues
OJS Training: Creating and Managing Journal IssuesOJS Training: Creating and Managing Journal Issues
OJS Training: Creating and Managing Journal Issues
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
 
Post PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and ChallengesPost PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and Challenges
 
Technology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and LearningTechnology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and Learning
 
Research Visibility in the Global South: Towards Increased Online Visibility...
Research Visibility  in the Global South: Towards Increased Online Visibility...Research Visibility  in the Global South: Towards Increased Online Visibility...
Research Visibility in the Global South: Towards Increased Online Visibility...
 
Ph.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for OrchestrationPh.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for Orchestration
 
Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...
 
Towards Open Access Institutional Repositories in Zambian HEIs
Towards Open Access Institutional Repositories in Zambian HEIsTowards Open Access Institutional Repositories in Zambian HEIs
Towards Open Access Institutional Repositories in Zambian HEIs
 

Recently uploaded

Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 

Recently uploaded (20)

Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 

Data-Driven Insights in Higher Ed

  • 1. Data-Driven Problem Solving in Higher Education Institutions Lighton Phiri <lighton.phiri@unza.zm> Department of Library & Information Science University of Zambia http://lis.unza.zm/~lightonphiri Discovering Insight from Scholarly Research Output in Higher Educational Institutions
  • 2. 2 October 27, 2021 About The DataLab Research Group at The University of Zambia ● The DataLab research group at The University of Zambia is composed of faculty staff and students—undergraduate and postgraduate—working in three main areas ○ Data Mining ○ Digital Libraries ○ Technology-Enhanced Learning http://datalab.unza.zm
  • 3. 3 October 27, 2021 Outline ● Part I. Data-Driven Problem Solving ● Part II. Insights from Scholarly Research Data
  • 4. 4 October 27, 2021 Outline ● Part I. Data-Driven Problem Solving ○ Introduction ○ Data Mining Pipelines ○ Data Mining Models ○ Past and Current Projects ● Part II. Insights from Scholarly Research Data
  • 5. 5 October 27, 2021 Machine Learning Instrumental in Deriving Insights https://commons.wikimedia.org/ ● Artificial Intelligence encompases a broad spectrum of sub-fields ○ Traditional machine learning techniques and approaches ○ Deep Learning approaches
  • 6. 6 October 27, 2021 Machine Learning Instrumental in Deriving Insights https://commons.wikimedia.org/ ● Artificial Intelligence encompases a broad spectrum of sub-fields ○ Traditional machine learning techniques and approaches ○ Deep Learning approaches
  • 7. 7 October 27, 2021 Past and Current Projects Leverage Machine Learning With Data as a Key Ingredient
  • 8. 8 October 27, 2021 Project #1: Predicting Student Learning Outcomes—Problem ● ICT 1110 performance is as issue. The poor performance transcends all assessments: quizzes, tests and practical programming questions.
  • 9. 9 October 27, 2021 Project #1: Predicting Student Learning Outcomes—Current Work ● Potential solution: implement a prediction model aimed at identifying at-risk students . ○ Initiate interventions on at-risk students. Chaibela, M., Chisha, I., Pungwa, D., Siabbaba D. and Simukoko B. (2021) “Performance Predictor: Machine Learning Tool for Student Performance Outcomes”. Work-in-Progress
  • 10. 10 October 27, 2021 Project #2: Medical Imaging Workflows in Zambia—Problem https://mjz.co.zm/index.php/mjz/article/view/560
  • 11. 11 October 27, 2021 Project #2: Medical Imaging Workflows in Zambia—Current Work
  • 12. 12 October 27, 2021 Project #3: Automatic Forecasting of Seasonal Rainfall—Current Work
  • 13. 13 October 27, 2021 Data-Drive Problem Solving Pipelines are Generic ● Fundamentally, machine learning aims to extract knowledge from data ○ Historical data is used to infer/predict outcomes associated with new observations
  • 14. 14 October 27, 2021 Data-Drive Problem Solving Pipelines are Generic ● Input features identified during feature engineering are used to train models ○ Features correlated with outcome to be identified
  • 15. 15 October 27, 2021 Data-Drive Problem Solving Pipelines are Generic ● The ML inference model is used to predict future patterns ○ Models can then be deployed as Web services and/or standalone applications
  • 16. 16 October 27, 2021 Established Data Mining Models Crucial to Data-Driven Problem Solving (1/4) https://doi.org/10.1017/S0269888910000032 ● Numerous data mining models and frameworks have been proposed ○ Most trace their roots from the KDD Process proposed by Fayyad et al.
  • 17. 17 October 27, 2021 Established Data Mining Models Crucial to Data-Driven Problem Solving (2/4) https://doi.org/10.1017/S0269888906000737
  • 18. 18 October 27, 2021 Established Data Mining Models Crucial to Data-Driven Problem Solving (2/4) https://doi.org/10.1017/S0269888906000737
  • 19. 19 October 27, 2021 Established Data Mining Models Crucial to Data-Driven Problem Solving (3/4) https://www.kdnuggets.com ● CRISP-DM model is one of the most widely used data mining models ● Data understanding and preparation are the most time consuming
  • 20. 20 October 27, 2021 Established Data Mining Models Crucial to Data-Driven Problem Solving (4/4) https://arxiv.org/abs/2003.05155
  • 21. 21 October 27, 2021 Outline ● Part I. Data-Driven Problem Solving ● Part II. Insights from Scholarly Research Data ○ Problem and Motivation ○ Data Sources, Preprocessing and Preparation ○ Scholarly Research Output Insights
  • 22. 22 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (1/3) https://worldmapper.org
  • 23. 23 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (1/3) https://worldmapper.org
  • 24. 24 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (2/3)
  • 25. 25 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (2/3)
  • 26. 26 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (3/3) http://www.webometrics.info
  • 27. 27 October 27, 2021 Online Visibility of Scholarly Research in Zambia—Problem (3/3) http://www.webometrics.info
  • 28. 28 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (1/5) https://ir.nust.na
  • 29. 29 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (2/5) http://dspace.unza.zm
  • 30. 30 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (2/5) http://dspace.unza.zm http://journals.unza.zm
  • 31. 31 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (3/5) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 32. 32 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (3/5) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 33. 33 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (3/5) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 34. 34 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (3/5) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 35. 35 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (4/5) ● OAI-PMH used to harvest all ETD descriptive metadata elements ● OAI-ORE used to harvest all ETD PDF documents
  • 36. 36 October 27, 2021 Data Sources, Collection, Preprocessing and Preparation (5/5) ● Text features extracted from a set of core bitstream portions—ETD Title, ETD Abstract, ETD Title Page and ETD pages—to classify ETD manuscripts ETD Type ETD Subjects IR Collection
  • 37. 37 October 27, 2021 Quantifying The Online Visibility of Research in Zambia Phiri, L. (2018). ICICT 2018 “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 38. 38 October 27, 2021 Quantifying The Online Visibility of Research in Zambia Phiri, L. (2018). ICICT 2018 “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 39. 39 October 27, 2021 Quantifying The Online Visibility of Research in Zambia Phiri, L. (2018). ICICT 2018 “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 40. 40 October 27, 2021 Improved Visibility through Automatic Classification of ETDs ● Implementation of classification models to automatically classify IR digital objects using the minimum possible input from graduate students: “The ETD Manuscript” ○ The ETD manuscript bitstream is considered the “single source of truth” ○ Metadata prepared by staff that work with IR potentially have inconsistencies Phiri, L. (2021). IJMSO Vol. 14, No. 3 “Automatic Classification of Digital Objects for Improved Metadata Quality of ETDs” URL: https://doi.org/10.1504/IJMSO.2020.112804
  • 41. 41 October 27, 2021 Improved Discoverability of Digital Objects in Institutional Repositories Chipangila, B. et al. (2021). JCDL 2021 “Improved Discoverability of Digital Objects in IRs Using Controlled Vocabularies” URL: https://doi.org/10.1109/JCDL52503.2021.00022
  • 42. 42 October 27, 2021 Beyond Insights into Scholarly Research Landscape in Zambia http://lis.unza.zm/portal
  • 43. 43 October 27, 2021 Q & A Session ● Comments, concerns and complaints?
  • 44. [1] Phiri, L. (2018). Research Visibility in the Global South: Towards Increased Online Visibility of Scholarly Research Output in Zambia. IEEE International Conference in Information and Communication Technologies. [2] Chipangila, B. et al. (2021). Improved Discoverability of Digital Objects in Institutional Repositories Using Controlled Vocabularies [3] Phiri, L. (2020). Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories. International Journal of Metadata, Semantics and Ontologies, 14(3), 234-248. Bibliography