Using Machine Learning Techniques for Solving Locally Relevant Problems
Deep Learning Indaba X - Zambia 2021
Lighton Phiri <lighton.phiri@unza.zm>
Department of Library & Information Science
University of Zambia
http://lis.unza.zm/~lightonphiri
May 25, 2021
About The DataLab Research Group at The
University of Zambia
● The DataLab research group at The University of Zambia is composed of faculty staff and students (undergraduate and postgraduate) working in three main areas
○ Data Mining
○ Digital Libraries
○ Technology-Enhanced Learning
http://datalab.unza.zm
Outline
● Part I. Data-Driven Problem Solving
● Part II. Past and Current Projects
● Part III. Potential Problems
Outline
● Part I. Data-Driven Problem Solving
○ Introduction
○ Data Mining Pipelines
○ Data Mining Models
● Part II. Past and Current Projects
● Part III. Potential Problems
Machine Learning 101 [...]
https://commons.wikimedia.org/
● Artificial Intelligence encompasses a broad spectrum of sub-fields
○ Traditional machine learning techniques and approaches
○ Deep Learning approaches
Data is Key to ML-Centric Problem Solving
Data Mining Pipelines
● Fundamentally, machine learning aims to extract knowledge from data
○ Historical data is used to infer/predict outcomes associated with new observations
Data Mining Pipelines
● Input features identified during feature engineering are used to train models
○ Features correlated with the outcome need to be identified
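One simple way to look for features correlated with the outcome is to rank candidate features by their Pearson correlation with it. The sketch below is only an illustration of that idea: the feature names and numbers are invented and do not come from any DataLab dataset, and correlation is one of several possible selection criteria.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)


def rank_features(features, outcome):
    """Return (name, correlation) pairs, strongest absolute correlation first."""
    scores = {name: pearson(values, outcome) for name, values in features.items()}
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Hypothetical toy data: one column per candidate feature, one row per student.
features = {
    "logins_per_week": [1, 2, 3, 4, 5],
    "shoe_size": [42, 42, 42, 42, 42],
    "missed_quizzes": [5, 4, 4, 2, 1],
}
outcome = [35, 48, 55, 70, 82]  # hypothetical final marks

ranking = rank_features(features, outcome)
for name, r in ranking:
    print(f"{name}: {r:+.2f}")
```

Features with a correlation near zero (here the constant column) are candidates for dropping before training.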
Data Mining Pipelines
● The ML inference model is used to predict future patterns
○ Models can then be deployed as Web services and/or standalone applications
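The train-then-predict flow of the pipeline can be sketched with a deliberately simple model. The example below uses a nearest-centroid classifier written in plain Python; the model choice, the feature columns and the data are all assumptions made for illustration, not the models used in the projects that follow.

```python
from collections import defaultdict


def train(observations, labels):
    """Fit a nearest-centroid model: one mean feature vector per outcome label."""
    sums = {}
    counts = defaultdict(int)
    for features, label in zip(observations, labels):
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def predict(model, features):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    def distance(centroid):
        return sum((c - f) ** 2 for c, f in zip(centroid, features))
    return min(model, key=lambda label: distance(model[label]))


# Hypothetical historical data: [hours_studied, classes_missed] -> outcome
historical = [[9.0, 0.0], [8.0, 1.0], [2.0, 6.0], [1.0, 7.0]]
outcomes = ["pass", "pass", "fail", "fail"]

model = train(historical, outcomes)        # learn from historical data
print(predict(model, [7.5, 1.0]))          # infer the outcome of a new observation
print(predict(model, [1.5, 5.0]))
```

In a deployment, `predict` is the function a Web service or standalone application would wrap, with `model` loaded from storage rather than retrained on every request.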
Data Mining Models (1/5)
https://doi.org/10.1017/S0269888910000032
● Numerous data mining models and frameworks have been proposed
○ Most trace their roots to the KDD Process proposed by Fayyad et al.
Data Mining Models (2/5)
https://doi.org/10.1017/S0269888910000032
Data Mining Models (3/5)
https://doi.org/10.1017/S0269888906000737
Data Mining Models (4/5)
https://www.kdnuggets.com
● The CRISP-DM model is one of the most widely used data mining models
● Data understanding and preparation are the most time-consuming phases
Data Mining Models (5/5)
https://arxiv.org/abs/2003.05155
Outline
● Part I. Data-Driven Problem Solving
● Part II. Past and Current Projects
○ Scholarly Research Output in Zambia
○ Predicting Learning Outcome at UNZA
○ Medical Imaging Workflows in Zambia
○ Automatic Weather Prediction in Zambia
● Part III. Potential Problems
Project #1: Online Visibility of Research in
Zambia—Problem (1/4)
https://worldmapper.org
Project #1: Online Visibility of Research in
Zambia—Problem (2/4)
Project #1: Online Visibility of Research in
Zambia—Problem (3/4)
http://www.webometrics.info
Project #1: Online Visibility of Research in
Zambia—Problem (4/4)
Phiri, L. (2018)
“Towards Increased Online Visibility of Scholarly Research Output in Zambia”.
URL: http://lis.unza.zm/archive/handle/123456789/227
Project #1: Online Visibility of Research in
Zambia—Multipronged Approach
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (1/7)
● Implementation of classification models to automatically classify IR digital objects using the minimum possible input from graduate students: “The ETD Manuscript”
○ The ETD manuscript bitstream is considered the “single source of truth”
○ Metadata prepared by staff who work with the IR potentially contains inconsistencies
Phiri, L. (2021)
“Automatic Classification of Digital Objects for Improved Metadata Quality of ETDs”
URL: https://doi.org/10.1504/IJMSO.2020.112804
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (2/7)
● Text features extracted from a set of core bitstream portions (ETD Title, ETD Abstract, ETD Title Page and ETD pages) are used to classify ETD manuscripts by
○ ETD Type
○ ETD Subjects
○ IR Collection
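To make the idea of classifying a manuscript from its text concrete, here is a minimal bag-of-words sketch using multinomial Naive Bayes with add-one smoothing. The algorithm, the label names and the training snippets are assumptions chosen for illustration; the slides do not state which classifier or features the actual models use.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, documents, labels):
        self.word_counts = defaultdict(Counter)  # label -> token frequencies
        self.doc_counts = Counter(labels)        # label -> number of documents
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def predict(self, text):
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.doc_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(doc_count / total_docs)  # log prior
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical title-page snippets labelled with a hypothetical ETD Type.
title_pages = [
    "A dissertation submitted in partial fulfilment of the requirements for the degree of Master of Science",
    "A thesis submitted for the degree of Doctor of Philosophy",
    "Dissertation submitted for the Master of Education degree",
    "Thesis submitted in fulfilment of the requirements for the Doctor of Philosophy degree",
]
etd_types = ["Masters", "Doctoral", "Masters", "Doctoral"]

classifier = NaiveBayesTextClassifier().fit(title_pages, etd_types)
print(classifier.predict("Submitted for the degree of Master of Arts"))
print(classifier.predict("Thesis for the Doctor of Philosophy"))
```

The same structure would apply to the other two targets (subjects and collection) by swapping the labels, although subject assignment is typically multi-label and would need a different setup.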
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (3/7)
● Textual content mined from PDF
manuscripts
○ Cover/title pages
○ Preliminary pages
● Textual content mined from
metadata for training
● PDF document metadata
● Curated datasets from external
repositories
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (4/7)
● OAI-PMH used to harvest all ETD descriptive metadata elements
● OAI-ORE used to harvest all ETD PDF documents
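The metadata half of this harvesting step can be sketched as follows: build an OAI-PMH `ListRecords` request and pull Dublin Core fields out of the response. The base URL and the sample response are hypothetical, and the sketch parses a canned string instead of calling a live repository; the OAI-ORE side (fetching the PDF bitstreams) is not covered.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
OAI_NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def parse_records(xml_text):
    """Extract identifier, title and subjects from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", OAI_NS):
        records.append({
            "identifier": record.findtext(
                "oai:header/oai:identifier", default="", namespaces=OAI_NS),
            "title": record.findtext(".//dc:title", default="", namespaces=OAI_NS),
            "subjects": [s.text for s in record.iterfind(".//dc:subject", OAI_NS)],
        })
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=OAI_NS)
    return records, token or None


# Hypothetical response, trimmed to the elements the parser reads.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repository.example.org:123456789/1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A hypothetical thesis title</dc:title>
          <dc:subject>Data mining</dc:subject>
          <dc:subject>Digital libraries</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

url = list_records_url("https://repository.example.org/oai/request")
records, next_token = parse_records(SAMPLE_RESPONSE)
print(url)
print(records, next_token)
```

When a response carries a non-empty `resumptionToken`, the harvester repeats the request with only `verb` and `resumptionToken` as arguments until no token is returned.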
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (5/7)
● ETD Type: 98.1%
● ETD Collection: 81.1%
● ETD Subjects: 81.7%
● The models would still need to be incorporated into an application that requires “some” human intervention
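One possible shape for that human intervention is to accept confident predictions automatically and queue the rest for review. The sketch below assumes the percentages above are classification accuracies (the slide does not name the metric) and uses an arbitrary 0.8 confidence threshold; the item identifiers and scores are invented.

```python
def accuracy(predicted, actual):
    """Fraction of predictions that match the ground-truth labels."""
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


def route(predictions, threshold=0.8):
    """Split (item, label, confidence) triples into auto-accepted and review lists."""
    accepted, review = [], []
    for item, label, confidence in predictions:
        target = accepted if confidence >= threshold else review
        target.append((item, label))
    return accepted, review


# Hypothetical model output: (ETD identifier, predicted collection, confidence)
predictions = [
    ("etd-001", "Education", 0.95),
    ("etd-002", "Medicine", 0.55),
    ("etd-003", "Engineering", 0.81),
]

accepted, review = route(predictions)
print("auto-accepted:", accepted)
print("needs human review:", review)
print(accuracy(["a", "b", "a", "a"], ["a", "b", "b", "a"]))
```

Raising the threshold sends more items to staff but lowers the risk of a wrong label reaching the repository.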
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (6/7)
https://github.com/lightonphiri/etd_autoclassifier
Project #1: Online Visibility of Research in
Zambia—ETDs Automatic Classification (7/7)
https://datalab-apis.herokuapp.com/api/collection
Project #1: Online Visibility of Research in
Zambia—Current Work (1/3)
M’sendo R. (2019—Present)
MSc Computer Science, University of Zambia
“Multi-Faceted Automatic Classification of Institutional Repository Objects”
Project #1: Online Visibility of Research in
Zambia—Current Work (2/3)
Chisale A. (2021—Present)
MLIS, University of Zambia
“Automatic Generation of Electronic Theses and Dissertations Metadata”
Project #1: Online Visibility of Research in
Zambia—Current Work (3/3)
http://lis.unza.zm/portal
Project #2: Predicting Student Learning
Outcomes—Problem (1/2)
● Performance in ICT 1110 is an issue. The poor performance cuts across all assessments: quizzes, tests and practical programming questions.
Project #2: Predicting Student Learning
Outcomes—Problem (2/2)
● Potential solution: implement a prediction model aimed at identifying at-risk students.
○ Initiate interventions for at-risk students.
Project #2: Predicting Student Learning
Outcomes—Data Sources (1/5)
● Demographics information
● LMS interaction logs
● Course workload
● Subject responses
Project #2: Predicting Student Learning
Outcomes—Data Sources (2/5)
● Assessment results
broken down by question
○ Concepts associated with
question
○ Topics associated with
question
Project #2: Predicting Student Learning
Outcomes—Data Sources (3/5)
● Assessment results broken
down by question
○ Concepts associated with
question
○ Topics associated with
question
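Breaking assessment results down by question makes it possible to score each student per concept. The sketch below shows one way to do that aggregation; the question-to-concept mapping, the concept names and the marks are hypothetical.

```python
from collections import defaultdict


def concept_scores(responses, question_concepts):
    """Fraction of available marks earned per (student, concept) pair."""
    totals = defaultdict(lambda: [0.0, 0.0])  # (student, concept) -> [earned, available]
    for student, question, earned, available in responses:
        for concept in question_concepts.get(question, []):
            totals[(student, concept)][0] += earned
            totals[(student, concept)][1] += available
    return {
        key: earned / available
        for key, (earned, available) in totals.items()
        if available  # skip concepts with no marks on offer
    }


# Hypothetical mapping of questions to the concepts they assess.
question_concepts = {
    "Q1": ["loops"],
    "Q2": ["loops", "arrays"],
    "Q3": ["arrays"],
}

# Hypothetical results: (student, question, marks earned, marks available)
responses = [
    ("s1", "Q1", 4, 5),
    ("s1", "Q2", 2, 10),
    ("s1", "Q3", 1, 5),
]

scores = concept_scores(responses, question_concepts)
for (student, concept), score in sorted(scores.items()):
    print(student, concept, round(score, 2))
```

The resulting per-concept fractions can feed a prediction model as features, or be read directly to see which topics a student is struggling with.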
Project #2: Predicting Student Learning
Outcomes—Data Sources (4/5)
● LMS interaction logs
○ How often do students access Moodle (login attempts)
○ Which Moodle features are being accessed (GradeBook, Messaging)
○ Time spent on Moodle
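The three log-derived measures above can be computed from raw interaction events roughly as follows. The event tuple format, the component names and the 30-minute session timeout are assumptions made for this sketch; they do not reflect Moodle's actual log schema.

```python
from collections import Counter, defaultdict
from datetime import datetime


def summarise_logs(events, session_gap_minutes=30):
    """Aggregate (student, ISO timestamp, component) events into per-student features."""
    by_student = defaultdict(list)
    for student, timestamp, component in events:
        by_student[student].append((datetime.fromisoformat(timestamp), component))

    features = {}
    for student, rows in by_student.items():
        rows.sort()
        components = Counter(component for _, component in rows)
        # Approximate time on the LMS: sum the gaps between consecutive events,
        # ignoring gaps longer than the assumed session timeout.
        minutes = 0.0
        for (earlier, _), (later, _) in zip(rows, rows[1:]):
            gap = (later - earlier).total_seconds() / 60
            if gap <= session_gap_minutes:
                minutes += gap
        features[student] = {
            "logins": components.get("login", 0),
            "component_counts": dict(components),
            "active_minutes": minutes,
        }
    return features


# Hypothetical events
events = [
    ("s1", "2021-03-01T08:00:00", "login"),
    ("s1", "2021-03-01T08:10:00", "gradebook"),
    ("s1", "2021-03-01T08:25:00", "messaging"),
    ("s1", "2021-03-02T09:00:00", "login"),
    ("s2", "2021-03-01T10:00:00", "login"),
]

lms_features = summarise_logs(events)
for student, row in sorted(lms_features.items()):
    print(student, row)
```

Time spent is only an estimate here, since the gap after the last event in a session is unknown.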
Project #2: Predicting Student Learning
Outcomes—Data Sources (5/5)
● ICT 1110 information survey to
capture information not available
in SIS
○ Experience with computers
○ Motivation for taking the course
○ Specific location where student lives
(although this can be inferred from
next of kin address perhaps?)
Project #2: Predicting Student Learning
Outcomes—Current Work
Chaibela, M., Chisha, I., Pungwa, D., Siabbaba D. and Simukoko B. (2021)
“Performance Predictor: Machine Learning Tool for Student Performance Outcomes”.
Work-in-Progress
Project #3: Medical Imaging Workflows in
Zambia—Problem
https://mjz.co.zm/index.php/mjz/article/view/560
Project #3: Medical Imaging Workflows in
Zambia—Current Work (1/2)
Project #3: Medical Imaging Workflows in
Zambia—Current Work (2/2)
Project #4: Automatic Forecasting of
Seasonal Rainfall—Current Work
Outline
● Part I. Data-Driven Problem Solving
● Part II. Past and Current Projects
● Part III. Potential Problems
○ Exemplar Projects in Zambia
○ Potential Locally Relevant Problems
Agriculture: Automatic Identification and
Early Warning of Fall Armyworms
http://dspace.unza.zm/handle/123456789/7141
Telecommunications: Automatic Customer
Segmentation
http://dspace.unza.zm/handle/123456789/7069
Banking: Automatic Data Mining for Fraud
Detection
https://bit.ly/3wxJICk
Potential Locally Relevant Problems in
Zambia (1/6)
● Impact-driven
research/studies
○ Education
○ Health
○ So-called ICT for
development perhaps?
Potential Locally Relevant Problems in
Zambia (2/6)
● Impact-driven
research/studies
○ Education
○ Health
○ So-called ICT for
development perhaps?
Zambia Daily Mail | August 18, 2019 | Volume 22 No. 033
Potential Locally Relevant Problems in
Zambia (3/6)
● Impact-driven
research/studies
○ Education
○ Health
○ So-called ICT for
development perhaps?
Potential Locally Relevant Problems in
Zambia (4/6)
● Impact-driven
research/studies
○ Education
○ Health
○ So-called ICT for
development perhaps?
Potential Locally Relevant Problems in
Zambia (5/6)
● Impact-driven
research/studies
○ Education
○ Health
○ So-called ICT for
development
perhaps?
Potential Locally Relevant Problems in
Zambia (6/6)
● Education
● Health
● So-called ICT for
development
perhaps?
Q & A Session
● Comments, concerns and complaints?
Bibliography
[1] Phiri, L. (2018). Research Visibility in the Global South: Towards Increased Online Visibility of Scholarly Research Output in Zambia. IEEE International Conference in Information and Communication Technologies.
[2] Phiri, L. (2020). A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs in Zambia. Cadernos BAD, (1). https://doi.org/10.1017/S0269888910000032
[3] Phiri, L. (2020). Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories. International Journal of Metadata, Semantics and Ontologies, 14(3), 234-248.
lighton.phiri@unza.zm
http://datalab.unza.zm
http://lis.unza.zm/~lightonphiri

Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
Open Access Electronic Publishing for Increased Online Visibility: Tooling Ch...
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
 
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs i...
 
Post PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and ChallengesPost PhD Transition Experience: Successes and Challenges
Post PhD Transition Experience: Successes and Challenges
 
Technology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and LearningTechnology-Enhanced Learning for Improved Quality of Teaching and Learning
Technology-Enhanced Learning for Improved Quality of Teaching and Learning
 
Research Visibility in the Global South: Towards Increased Online Visibility...
Research Visibility  in the Global South: Towards Increased Online Visibility...Research Visibility  in the Global South: Towards Increased Online Visibility...
Research Visibility in the Global South: Towards Increased Online Visibility...
 
Ph.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for OrchestrationPh.D Research Proposal: Software Tools for Orchestration
Ph.D Research Proposal: Software Tools for Orchestration
 
Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...Research Visibility in the Global South: Towards Increased Online Visibility ...
Research Visibility in the Global South: Towards Increased Online Visibility ...
 

Recently uploaded

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 

Recently uploaded (20)

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 

Using Machine Learning Techniques for Solving Locally Relevant Problems

  • 1. Deep Learning Indaba X - Zambia 2021 Lighton Phiri <lighton.phiri@unza.zm> Department of Library & Information Science University of Zambia http://lis.unza.zm/~lightonphiri Using Machine Learning Techniques for Solving Locally Relevant Problems
  • 2. 2 May 25, 2021 About The DataLab Research Group at The University of Zambia ● The DataLab research group at The University of Zambia is composed of faculty staff and students—undergraduate and postgraduate—working in three main areas ○ Data Mining ○ Digital Libraries ○ Technology-Enhanced Learning http://datalab.unza.zm
  • 3. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ● Part III. Potential Problems
  • 4. Outline ● Part I. Data-Driven Problem Solving ○ Introduction ○ Data Mining Pipelines ○ Data Mining Models ● Part II. Past and Current Projects ● Part III. Potential Problems
  • 5. Machine Learning 101 [...] https://commons.wikimedia.org/ ● Artificial Intelligence encompasses a broad spectrum of sub-fields ○ Traditional machine learning techniques and approaches ○ Deep Learning approaches
  • 6. Machine Learning 101 [...] https://commons.wikimedia.org/ ● Artificial Intelligence encompasses a broad spectrum of sub-fields ○ Traditional machine learning techniques and approaches ○ Deep Learning approaches
  • 7. Data is Key to ML-Centric Problem Solving
  • 8. Data Mining Pipelines ● Fundamentally, machine learning aims to extract knowledge from data ○ Historical data is used to infer/predict outcomes associated with new observations
  • 9. Data Mining Pipelines ● Input features identified during feature engineering are used to train models ○ Features correlated with outcome to be identified
  • 10. Data Mining Pipelines ● The ML inference model is used to predict future patterns ○ Models can then be deployed as Web services and/or standalone applications
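The three pipeline slides above (historical data, feature-based training, prediction for new observations) can be illustrated with a minimal sketch. It assumes a nearest-centroid classifier and made-up numeric features; neither the model choice nor the data comes from the slides, and the names `train` and `predict` are hypothetical.

```python
from math import dist
from statistics import mean


def train(rows, labels):
    """Fit a nearest-centroid model: one mean feature vector per outcome."""
    grouped = {}
    for features, label in zip(rows, labels):
        grouped.setdefault(label, []).append(features)
    return {
        label: tuple(mean(column) for column in zip(*members))
        for label, members in grouped.items()
    }


def predict(model, features):
    """Return the outcome whose centroid is closest to the new observation."""
    return min(model, key=lambda label: dist(model[label], features))


# Hypothetical historical data: two engineered features per observation.
historical_rows = [(1.0, 1.0), (1.5, 2.0), (8.0, 9.0), (9.0, 8.5)]
historical_labels = ["low", "low", "high", "high"]

model = train(historical_rows, historical_labels)   # training step
prediction = predict(model, (8.5, 9.5))             # inference on a new observation
print(prediction)
```

In a deployed setting, `predict` is the part that would typically sit behind a Web service or standalone application, while `train` runs offline on the historical data.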
  • 11. Data Mining Models (1/5) https://doi.org/10.1017/S0269888910000032 ● Numerous data mining models and frameworks have been proposed ○ Most trace their roots to the KDD Process proposed by Fayyad et al.
  • 12. Data Mining Models (2/5) https://doi.org/10.1017/S0269888910000032
  • 13. Data Mining Models (2/5) https://doi.org/10.1017/S0269888910000032
  • 14. Data Mining Models (2/5) https://doi.org/10.1017/S0269888910000032
  • 15. Data Mining Models (3/5) https://doi.org/10.1017/S0269888906000737
  • 16. Data Mining Models (3/5) https://doi.org/10.1017/S0269888906000737
  • 17. Data Mining Models (4/5) https://www.kdnuggets.com ● The CRISP-DM model is one of the most widely used data mining models ● Data understanding and preparation are the most time-consuming phases
  • 18. Data Mining Models (5/5) https://arxiv.org/abs/2003.05155
  • 19. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ○ Scholarly Research Output in Zambia ○ Predicting Learning Outcome at UNZA ○ Medical Imaging Workflows in Zambia ○ Automatic Weather Prediction in Zambia ● Part III. Potential Problems
  • 20. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ○ Scholarly Research Output in Zambia ○ Predicting Learning Outcome at UNZA ○ Medical Imaging Workflows in Zambia ○ Automatic Weather Prediction in Zambia ● Part III. Potential Problems
  • 21. Project #1: Online Visibility of Research in Zambia—Problem (1/4) https://worldmapper.org
  • 22. Project #1: Online Visibility of Research in Zambia—Problem (1/4) https://worldmapper.org
  • 23. Project #1: Online Visibility of Research in Zambia—Problem (2/4)
  • 24. Project #1: Online Visibility of Research in Zambia—Problem (2/4)
  • 25. Project #1: Online Visibility of Research in Zambia—Problem (3/4) http://www.webometrics.info
  • 26. Project #1: Online Visibility of Research in Zambia—Problem (3/4) http://www.webometrics.info
  • 27. Project #1: Online Visibility of Research in Zambia—Problem (4/4) Phiri, L. (2018) “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 28. Project #1: Online Visibility of Research in Zambia—Problem (4/4) Phiri, L. (2018) “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 29. Project #1: Online Visibility of Research in Zambia—Problem (4/4) Phiri, L. (2018) “Towards Increased Online Visibility of Scholarly Research Output in Zambia”. URL: http://lis.unza.zm/archive/handle/123456789/227
  • 30. Project #1: Online Visibility of Research in Zambia—Multipronged Approach
  • 31. Project #1: Online Visibility of Research in Zambia—Multipronged Approach
  • 32. Project #1: Online Visibility of Research in Zambia—Multipronged Approach
  • 33. Project #1: Online Visibility of Research in Zambia—Multipronged Approach
  • 34. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (1/7) ● Implementation of classification models to automatically classify IR digital objects using the minimum possible input from graduate students: “The ETD Manuscript” ○ The ETD manuscript bitstream is considered the “single source of truth” ○ Metadata prepared by staff who work with the IR potentially has inconsistencies Phiri, L. (2021) “Automatic Classification of Digital Objects for Improved Metadata Quality of ETDs”. URL: https://doi.org/10.1504/IJMSO.2020.112804
  • 35. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (2/7) ● Text features extracted from a set of core bitstream portions—ETD Title, ETD Abstract, ETD Title Page and ETD pages—are used to classify ETD manuscripts by ETD Type, ETD Subjects and IR Collection
  • 36. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (3/7) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 37. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (3/7) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 38. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (3/7) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 39. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (3/7) ● Textual content mined from PDF manuscripts ○ Cover/title pages ○ Preliminary pages ● Textual content mined from metadata for training ● PDF document metadata ● Curated datasets from external repositories
  • 40. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (4/7) ● OAI-PMH used to harvest all ETD descriptive metadata elements ● OAI-ORE used to harvest all ETD PDF documents
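As a rough illustration of the OAI-PMH harvesting step mentioned on the slide above, the sketch below builds a `ListRecords` request URL and parses Dublin Core fields from a response. The base URL, the sample XML and the function names are hypothetical; only the `ListRecords` verb, the `metadataPrefix` and `resumptionToken` arguments and the XML namespaces come from the OAI-PMH 2.0 protocol. No network request is made here.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords URL; a resumption token replaces all other arguments."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return f"{base_url}?{urlencode(params)}"


def parse_records(xml_text):
    """Return (records, next_token) extracted from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": record.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS),
            "title": record.findtext(".//dc:title", default="", namespaces=NS),
            "subjects": [s.text for s in record.iterfind(".//dc:subject", NS)],
        })
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS)
    return records, (token or None)


# Hypothetical response, trimmed to the elements used above.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:123456789/1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A Hypothetical Thesis Title</dc:title>
          <dc:subject>Data Mining</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

url = list_records_url("https://repository.example.org/oai/request")
records, next_token = parse_records(SAMPLE)
```

A full harvester would fetch `url`, parse the response, and repeat with `resumption_token=next_token` until no token is returned.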
  • 41. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (5/7) ● ETD Type—98.1% ● ETD Collection—81.1% ● ETD Subjects—81.7% ● The models would still need to be incorporated into an application that requires “some” human intervention
  • 42. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (6/7) https://github.com/lightonphiri/etd_autoclassifier
  • 43. Project #1: Online Visibility of Research in Zambia—ETDs Automatic Classification (7/7) https://datalab-apis.herokuapp.com/api/collection
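To make the idea of classifying ETDs from manuscript text concrete, here is a minimal sketch of a multinomial Naive Bayes classifier that predicts ETD type from title-page text. The slides do not say which algorithm the project used, so Naive Bayes is an assumption chosen for brevity, and the training sentences and labels are invented for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def train_naive_bayes(documents):
    """documents: iterable of (text, label) pairs."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in documents:
        tokens = tokenize(text)
        class_docs[label] += 1
        word_counts[label].update(tokens)
        vocabulary.update(tokens)
    return class_docs, word_counts, vocabulary


def classify(model, text):
    """Return the label with the highest log-probability for the text."""
    class_docs, word_counts, vocabulary = model
    total_docs = sum(class_docs.values())
    tokens = [t for t in tokenize(text) if t in vocabulary]  # skip unseen words
    best_label, best_score = None, -math.inf
    for label, n_docs in class_docs.items():
        score = math.log(n_docs / total_docs)  # class prior
        total_words = sum(word_counts[label].values())
        for token in tokens:
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][token] + 1) / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical title-page snippets with hypothetical ETD type labels.
TRAINING = [
    ("a dissertation submitted in partial fulfilment of the degree of master of science", "Masters"),
    ("thesis submitted for the degree of doctor of philosophy", "Doctoral"),
    ("report submitted for the master of education degree", "Masters"),
    ("a thesis presented for the award of doctor of philosophy in computer science", "Doctoral"),
]

model = train_naive_bayes(TRAINING)
etd_type = classify(model, "submitted for the degree of doctor of philosophy in public health")
```

The same structure would apply to subject or collection labels; only the training pairs change.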
  • 44. Project #1: Online Visibility of Research in Zambia—Current Work (1/3) M’sendo R. (2019—Present) MSc Computer Science, University of Zambia “Multi-Faceted Automatic Classification of Institutional Repository Objects”
  • 45. Project #1: Online Visibility of Research in Zambia—Current Work (2/3) Chisale A. (2021—Present) MLIS, University of Zambia “Automatic Generation of Electronic Theses and Dissertations Metadata”
  • 46. Project #1: Online Visibility of Research in Zambia—Current Work (3/3) http://lis.unza.zm/portal
  • 47. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ○ Scholarly Research Output in Zambia ○ Predicting Learning Outcome at UNZA ○ Medical Imaging Workflows in Zambia ○ Automatic Weather Prediction in Zambia ● Part III. Potential Problems
  • 48. Project #2: Predicting Student Learning Outcomes—Problem (1/2) ● ICT 1110 performance is an issue. The poor performance transcends all assessments: quizzes, tests and practical programming questions.
  • 49. Project #2: Predicting Student Learning Outcomes—Problem (1/2) ● ICT 1110 performance is an issue. The poor performance transcends all assessments: quizzes, tests and practical programming questions.
  • 50. Project #2: Predicting Student Learning Outcomes—Problem (2/2) ● Potential solution: implement a prediction model aimed at identifying at-risk students. ○ Initiate interventions on at-risk students.
  • 51. Project #2: Predicting Student Learning Outcomes—Data Sources (1/5) ● Demographics information ● LMS interaction logs ● Course workload ● Subject responses
  • 52. Project #2: Predicting Student Learning Outcomes—Data Sources (2/5) ● Assessment results broken down by question ○ Concepts associated with question ○ Topics associated with question
  • 53. Project #2: Predicting Student Learning Outcomes—Data Sources (3/5) ● Assessment results broken down by question ○ Concepts associated with question ○ Topics associated with question
  • 54. Project #2: Predicting Student Learning Outcomes—Data Sources (4/5) ● LMS interaction logs ○ How often students access Moodle (login attempts) ○ Which Moodle features are being accessed (GradeBook, Messaging) ○ Time spent on Moodle
  • 55. Project #2: Predicting Student Learning Outcomes—Data Sources (5/5) ● ICT 1110 information survey to capture information not available in SIS ○ Experience with computers ○ Motivation for taking the course ○ Specific location where student lives (although this can be inferred from next of kin address perhaps?)
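The LMS interaction signals listed in the data-source slides above (login frequency, features accessed, time spent) could be turned into per-student model features along the following lines. This is a sketch under stated assumptions: the log row layout, the event names and the student identifiers are hypothetical and do not reflect Moodle's actual log schema.

```python
from collections import defaultdict

# Hypothetical LMS log rows: (student_id, event, minutes_spent).
LOG_ROWS = [
    ("s001", "login", 0), ("s001", "gradebook", 12), ("s001", "messaging", 5),
    ("s001", "login", 0), ("s001", "gradebook", 20),
    ("s002", "login", 0), ("s002", "messaging", 3),
]


def build_features(rows):
    """Aggregate raw log rows into one feature dictionary per student."""
    features = defaultdict(
        lambda: {"logins": 0, "gradebook": 0, "messaging": 0, "minutes": 0})
    for student_id, event, minutes in rows:
        student = features[student_id]
        if event == "login":
            student["logins"] += 1        # how often the student accesses the LMS
        elif event in ("gradebook", "messaging"):
            student[event] += 1           # which LMS features are being accessed
        student["minutes"] += minutes     # total time spent on the LMS
    return dict(features)


features = build_features(LOG_ROWS)
for student_id, row in sorted(features.items()):
    print(student_id, row)
```

Feature rows like these would then be joined with demographics, survey responses and assessment results before training a prediction model for at-risk students.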
  • 56. Project #2: Predicting Student Learning Outcomes—Current Work Chaibela, M., Chisha, I., Pungwa, D., Siabbaba D. and Simukoko B. (2021) “Performance Predictor: Machine Learning Tool for Student Performance Outcomes”. Work-in-Progress
  • 57. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ○ Scholarly Research Output in Zambia ○ Predicting Learning Outcome at UNZA ○ Medical Imaging Workflows in Zambia ○ Automatic Weather Prediction in Zambia ● Part III. Potential Problems
  • 58. Project #3: Medical Imaging Workflows in Zambia—Problem https://mjz.co.zm/index.php/mjz/article/view/560
  • 59. Project #3: Medical Imaging Workflows in Zambia—Current Work (1/2)
  • 60. Project #3: Medical Imaging Workflows in Zambia—Current Work (1/2)
  • 61. Project #3: Medical Imaging Workflows in Zambia—Current Work (2/2)
  • 62. Project #3: Medical Imaging Workflows in Zambia—Current Work (2/2)
  • 63. Project #3: Medical Imaging Workflows in Zambia—Current Work (2/2)
  • 64. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ○ Scholarly Research Output in Zambia ○ Predicting Learning Outcome at UNZA ○ Medical Imaging Workflows in Zambia ○ Automatic Weather Prediction in Zambia ● Part III. Potential Problems
  • 65. Project #4: Automatic Forecasting of Seasonal Rainfall—Current Work
  • 66. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ● Part III. Potential Problems ○ Exemplar Projects in Zambia ○ Potential Locally Relevant Problems
  • 67. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ● Part III. Potential Problems ○ Exemplar Projects in Zambia ○ Potential Locally Relevant Problems
  • 68. Agriculture: Automatic Identification and Early Warning of Fall Armyworms http://dspace.unza.zm/handle/123456789/7141
  • 69. Telecommunications: Automatic Customer Segmentation http://dspace.unza.zm/handle/123456789/7069
  • 70. Banking: Automatic Data Mining for Fraud Detection https://bit.ly/3wxJICk
  • 71. Outline ● Part I. Data-Driven Problem Solving ● Part II. Past and Current Projects ● Part III. Potential Problems ○ Exemplar Projects in Zambia ○ Potential Locally Relevant Problems
  • 72. Potential Locally Relevant Problems in Zambia (1/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 73. Potential Locally Relevant Problems in Zambia (2/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps? Zambia Daily Mail | August 18, 2019 | Volume 22 No. 033
  • 74. Potential Locally Relevant Problems in Zambia (3/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 75. Potential Locally Relevant Problems in Zambia (3/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 76. Potential Locally Relevant Problems in Zambia (4/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 77. Potential Locally Relevant Problems in Zambia (4/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 78. Potential Locally Relevant Problems in Zambia (4/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 79. Potential Locally Relevant Problems in Zambia (4/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 80. Potential Locally Relevant Problems in Zambia (5/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 81. Potential Locally Relevant Problems in Zambia (5/6) ● Impact-driven research/studies ○ Education ○ Health ○ So-called ICT for development perhaps?
  • 82. Potential Locally Relevant Problems in Zambia (6/6) ● Education ● Health ● So-called ICT for development perhaps?
  • 83. Potential Locally Relevant Problems in Zambia (6/6) ● Education ● Health ● So-called ICT for development perhaps?
  • 84. Q & A Session ● Comments, concerns and complaints?
  • 85. Bibliography [1] Phiri, L. (2018). Research Visibility in the Global South: Towards Increased Online Visibility of Scholarly Research Output in Zambia. IEEE International Conference in Information and Communication Technologies. [2] Phiri, L. (2020). A Multi-Faceted Multi-Stakeholder Approach for Increased Visibility of ETDs in Zambia. Cadernos BAD, (1). [3] Phiri, L. (2020). Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories. International Journal of Metadata, Semantics and Ontologies, 14(3), 234-248.