SlideShare a Scribd company logo
User Modeling of Skills and Expertise from Resumes
Hua Li, Daniel J. T. Powell, Mark Clark, Tifani O'Brien and Rafael Alonso
Leidos, Inc., Virginia, U.S.A.
Keywords: User Modeling, Expertise Modeling, Resume, Profile, Skill.
Abstract: Job applicants describe their skills and expertise in resumes and curriculum vitaes (CVs). These biographic
data are often evaluated by human resource personnel or a search committee. This manual approach works
well when the number of resumes is small. However, in this information age, the volume of available
resumes can be overwhelming and there is a need for automatic evaluation of applicant skills and expertise.
In this paper, we describe a user modeling algorithm to quantitatively identify skills and expertise from
biographic data. This algorithm is called REMA (Resume Expertise Modeling Algorithm). REMA takes
data from a resume document as input and produces an expertise model. The expertise model details the
expertise topics for which the resume owner has claimed competency. Each topic carries a weight indicating
the level of competency. There are two key insights for this algorithm. First, one's expertise is the
cumulative result of the various “learning events” in one's career. These learning events are mentioned in
various sections of the resume, such as earning a degree, writing a paper, or getting a patent. Second, one's
knowledge and skills can become outdated or forgotten over time if not reinforced by learning. We have
developed a prototype resume evaluation system based on REMA and are in the process of evaluating
REMA’s performance.
1 INTRODUCTION
A resume is a written summary of one’s education,
work experience, skills, credentials, and
accomplishments that is often used to apply for jobs.
One key function of a resume is to provide
information regarding one’s skills and expertise. The
resume evaluator is required to manually judge the
competence of a skill, i.e., the level of mastery of
that skill, and/or the expertise in a skill domain, i.e.,
the level of mastery of all skills in that domain.
Expertise modeling is used to automatically produce
a quantitative assessment of the competence of
various skills and the expertise of relevant skill
domains from a resume.
Expertise modeling is valuable for evaluators
when faced with a large number of resumes to
review. Given a position description, expertise
modeling can be used to find suitable candidates by
matching a set of skills between a position and a
candidate’s resume. There are many other
applications of expertise modeling, some of which
are listed below.
 Expertise finder: given a skill, find experts with
that skill, i.e., find resumes with a high expertise
level for that skill.
 Virtual expertise group finder: given a set of
resumes, cluster them based on skill set
similarities.
 Job finder: given a candidate resume, find
matching positions within a career market.
 Next skill to learn: suggest skills for career
development. Given a user resume, find career-
conducive skills (CCS) that the user should learn.
The CCS are skills that frequently co-occur with
a user’s skill set in expert resumes.
Even though there’s extensive literature on expertise
recommendation, to our knowledge, expertise
modeling from resumes is still an open research
area. Our model is called Resume Expertise
Modeling Algorithm (REMA). It takes a resume, CV
or biographical document as input and produces an
expertise model. The algorithm focuses on the
concept of “expertise mention” in the resume, e.g.,
earning a Ph.D. in psychology in 2000 or publishing
a paper on psychology in 2001. Such mentions
imply a certain level of education on a given topic
(i.e., psychology) at a particular point in time (i.e.,
2000 and 2001, respectively). The greater the
number of learning events on a specific topic, the
more competent that person is with the topic. This
process is referred to as “reinforcement”.
Li, H., Powell, D., Clark, M., O’Brien, T. and Alonso, R..
User Modeling of Skills and Expertise from Resumes.
In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - Volume 3: KMIS, pages 229-233
ISBN: 978-989-758-158-8
Copyright c 2015 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
229
Conversely, skills can get rusty over time if a person
stops learning. This process is referred to as
“forgetting”. The expertise modeling algorithm
processes the expertise mentions incrementally, in a
chronological order. New expertise topics are
inserted into the expertise model and their weights
are increased by reinforcement and decreased by
forgetting.
2 RELATED WORK
The related work is mainly in the area of expertise
recommendation or expert finding which attempts to
find the right person with the appropriate skills and
knowledge. This is useful for many purposes
including problem solving, question answering, and
collaboration. A significant amount of research has
been generated in the Information Retrieval
community (Smirnova1 and Balog, 2011, Balog et
al. 2007; Balog et al. 2009; Liebregts and Bogers,
2009). This line of research focuses on content-
based algorithms, similar to document search. These
algorithms identify experts based on the content of
documents that they are associated with (Liebregts
and Bogers, 2009; Serdyukov et al. 2007). While
these approaches have been very effective in finding
the most knowledgeable people on a given topic
based on a large collection of documents from an
enterprise or the internet, it’s not clear how they can
be used to assess the expertise on multiple topics
based on a single resume.
3 THE REMAALGORITHM
The REMA algorithm is shown in Figure 1. An
input resume is first parsed into expertise mentions.
The mentions are evaluated by Natural Language
Processing (NLP) tools to extract expertise topics.
Figure 1: REMA algorithm diagram.
These topics are then processed by REMA’s
expertise model adaptation component to generate
the expertise model. This algorithm is an extension
of our user modelling algorithm RAMA
(Reinforcement and Aging Modeling Algorithm)
described in detail elsewhere (Li and Alonso, 2014;
Li and Alonso, 2012; Alonso et al., 2010).
3.1 Expertise Mentions
Expertise mentions are phrases or statements in the
resume that indicate significant learning events. For
example, a resume may mention a paper on
databases in a certain year in the publication section.
When parsing the expertise mentions, the associated
resume section and the date of the event are captured
because they are important indicators of level of
expertise. REMA uses a source relevance parameter
to register the fact that expertise mentions in
different parts of a resume carry different
significance. For example, a mention in a patent and
publication section should indicate more expertise
than one in an experience and education section.
Even within the same section, mentions originated
from different sources may carry different
significances. For example, within the publication
section, mentions of a book or journal paper are
more indicative of expertise than those of a
conference paper. The date of event mentioned
reflects the recency of the learning. In other words,
skills or expertise acquired more recently are more
up-to-date and less likely to be forgotten. We use
regular expression and GATE to extract the date and
time information from the expertise mention.
3.2 Expertise Topics
Expertise topics are terms indicating skills or
expertise such as database or machine learning.
They are extracted from expertise mentions using
NLP tools. In particular, Apache Lucene®1
is used
to extract simple terms from text. WordNet®2
is
used to identify noun words. GATE is used to
extract noun chunks and named entities. OpenCalais
web service is used to extract expertise related tags
including "Industry Term", "Technology", and
"Programming Language". Relationships between
1
Apache Lucene is a registered trademark of the Apache
Software Foundation within the United States and/or
other countries.
2
WordNet is a registered trademark of the Trustees of
Princeton University within the United States and/or
other countries.
KMIS 2015 - 7th International Conference on Knowledge Management and Information Sharing
230
Figure 2: Left: Effects of Reinforcement Factor (Learning Rate) on Weight; Right: Effects of decay half-life on time
(Forgetting Factor) weight.
expertise topics are also extracted from expertise
mentions. For example, WordNet’s semantic relation
“hyponym” is useful to map a broader expertise to a
narrower one. We can also use Wikipedia’s®3
categories to establish a similar kind of relationship
between expertise topics.
3.3 Expertise Model Adaptation
This component has two subcomponents: content
adaptation and weight adaptation. The former refers
to the addition of new expertise topics in the
expertise model. As expertise mentions are being
processed in chronological order, unseen expertise
topics will be automatically inserted into the
expertise model with a default weight. The size of
the default weight is equal to the weight change
from the reinforcement of one expertise mention
(see below).
Weight adaptation refers to the dynamic
adjustment of the weights of expertise topics in the
model by reinforcement and forgetting mechanisms.
Reinforcement increases the topic weight. The more
expertise mentions on a given topic, the more life
learning events the person has on that topic. As a
result, the more competent that person is with the
topic. This process is termed “reinforcement”. The
parameter that controls the rate of learning is called
the reinforcement factor. The effects of this factor on
the learning rate in simulation experiments are
shown in the left panel of Figure 2.
Conversely, naturally forgetting will decrease the
topic weight. In general, a person’s skills will
stagnate over time if the person stops learning. There
3
Wikipedia is a registered trademark of the Wikimedia
Foundation, Inc., in the United States and/or other
countries.
are at least two contributing factors to this
stagnation. The first is our memory decay as
described in decay theory4. This theory proposes that
memory fades and information becomes less
available for later retrieval as time passes. The
second factor is the technological knowledge
depreciation, or decay (Nemet, 2012, Park, Shin, and
Park, 2006). The average decay rate is estimated at
13.3%, which corresponds to a half-life of 4.86 years
(Park, Shin, and Park, 2006). The effects of five
different decay half-life values over time are shown
in the right panel of Figure 2.
3.4 Expertise Model
The expertise model consists of weighted expertise
topics and their relationships. Each topic carries an
authority label that denotes the origin of the
expertise term, such as a WordNet, Wikipedia, or a
web service like OpenCalais. The collection of
expertise topics represents the breadth of the
person’s skills and expertise. The topic weights
indicate the level of expertise and have a range from
zero to one. The larger the weight, the more
competent the person is with that skill or expertise.
4 REMA RESULTS
We implemented a prototype REMA system in Java.
It has a simple Graphical User Interface (GUI) that
allows the user to process a directory of resumes and
then display the resulting expertise models. This
prototype will be used to conduct evaluation
experiments in the near future to assess the
4
http://en.wikipedia.org/wiki/Decay_theory
User Modeling of Skills and Expertise from Resumes
231
Figure 3: Table view of an example expertise model.
performance of the REMA algorithm. In this section
we show some preliminary results.
4.1 Topic Cloud View for Expertise
Model
An expertise model is shown as a topic cloud where
the larger the font size and the redder the color
indicate higher expertise levels (Figure 3).
Figure 4: Topic Cloud view of an example expertise
model.
4.2 Table View for Expertise Model
An example expertise model is shown as a table with
the following six columns (Figure 4):
 ID – the sequence number of each model
element (expertise topic)
 ELEMENT – the expertise topic
 WEIGHT – the level of expertise
 TYPE – the type of topic, XENTITY denotes
an entity topic, XRELATION denotes a synset
(set of synonyms) -> hypernym relationship
defined in WordNet.
 NLP – Natural language processing tool used
for extracting this element
 Features – features associated with this
element.
5 CONCLUSIONS
We developed REMA, a user modeling algorithm
that quantitatively identifies skills and expertise
from biographic data. REMA takes data from a
resume document as input and produces an expertise
model. There are two key concepts for this
algorithm. First, one's expertise is the cumulative
result of the various “learning events” in one's
career. Second, one's knowledge and skills can
become outdated or forgotten over time if not
reinforced by learning. We have developed a
prototype resume evaluation system based on
REMA and are in the process of evaluating REMA’s
performance.
REFERENCES
Alonso, R., P. Bramsen, and H. Li, 2010. Incremental user
KMIS 2015 - 7th International Conference on Knowledge Management and Information Sharing
232
modeling with heterogeneous user behaviors,
International Conference on Knowledge Management
and Information Sharing, International Conference on
Knowledge Management and Information Sharing
(KMIS 2010).
Li, H. and R. Alonso. User Modeling for Contextual
Suggestion. In Proceedings of 21st Text REtrieval
Conference (TREC 2014). NIST, 2014.
http://trec.nist.gov/pubs/trec23/papers/pro-
RAMA_cs.pdf
Li, H. and R. Alonso, Managing Analysis Context,
ESAIR'12: Fifth International Workshop on Exploiting
Semantic Annotations in Information Retrieval, 2012.
Nemet, G., 2012. Historical Case Studies of Energy
Technology Innovation, 1–11.
Park, G., Shin, J., & Park, Y., 2006. Measurement of
depreciation rate of technological knowledge :
Technology cycle time approach. Journal of scientific
and industrial research 65 (2), 121–127.
Balog, K., Bogers, T., Azzopardi, L., de Rijke, M., van
den Bosch, A.: Broad expertise retrieval in sparse data
environments. In: SIGIR 2007, pp. 551–558 (2007)
Balog, K., Soboroff, I., Thomas, P., Craswell, N., de
Vries, A.P., Bailey, P. Overview of the TREC 2008
enterprise track. In: TREC 2008 (2009).
Smirnova1 E. and K. Balog. A User-Oriented Model for
Expert Finding. P. Clough et al. (Eds.): ECIR 2011,
LNCS 6611, pp. 580–592, Springer-Verlag Berlin
Heidelberg, 2011.
Liebregts, R. and Bogers, T.: Design and evaluation of a
university-wide expert search engine. In: Boughanem,
M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.)
ECIR 2009. LNCS, vol. 5478, pp. 587–594. Springer,
Heidelberg (2009).
Serdyukov, P., Hiemstra, D., Fokkinga, M.M., Apers,
P.M.G.: Generative modelling of persons and
documents for expert search. In: SIGIR 2007, pp. 827–
828 (2007).
User Modeling of Skills and Expertise from Resumes
233

More Related Content

What's hot

SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITIONSEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
kevig
 
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational ModelB.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
Assistant Professor, Shri Shivaji Science College, Amravati
 
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
pharmaindexing
 
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL DevelopmentProposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Jorge Barreto
 
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
Marius Corici
 
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGYINFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
IJwest
 
A study on the approaches of developing a named entity recognition tool
A study on the approaches of developing a named entity recognition toolA study on the approaches of developing a named entity recognition tool
A study on the approaches of developing a named entity recognition tool
eSAT Publishing House
 
Automatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product ReviewsAutomatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product Reviews
TELKOMNIKA JOURNAL
 
ER Diagram
ER DiagramER Diagram
ER Diagram
Robby Firmansyah
 
Me101
Me101Me101
Me101
abzshie
 
QER : query entity recognition
QER : query entity recognitionQER : query entity recognition
QER : query entity recognition
Dhwaj Raj
 
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETSCUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
ijaia
 
Improvement of telugu ocr by segmentation of touching characters
Improvement of telugu ocr by segmentation of touching charactersImprovement of telugu ocr by segmentation of touching characters
Improvement of telugu ocr by segmentation of touching characters
eSAT Publishing House
 
D017422528
D017422528D017422528
D017422528
IOSR Journals
 
A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...
Randa Elanwar
 
Semantic extraction of arabic
Semantic extraction of arabicSemantic extraction of arabic
Semantic extraction of arabic
csandit
 
Arabic tweets categorization
Arabic tweets categorizationArabic tweets categorization
Arabic tweets categorization
csandit
 

What's hot (17)

SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITIONSEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
 
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational ModelB.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
B.Sc. II (IV Sem) RDBMS & PL/SQL Unit-2 Relational Model
 
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
FEATURE LEVEL FUSION USING FACE AND PALMPRINT BIOMETRICS FOR SECURED AUTHENTI...
 
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL DevelopmentProposal of an Ontology Applied to Technical Debt on PL/SQL Development
Proposal of an Ontology Applied to Technical Debt on PL/SQL Development
 
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
58903230-SentiMatrix-Named-Entity-Recognition-for-Romanian-Language
 
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGYINFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
INFERENCE BASED INTERPRETATION OF KEYWORD QUERIES FOR OWL ONTOLOGY
 
A study on the approaches of developing a named entity recognition tool
A study on the approaches of developing a named entity recognition toolA study on the approaches of developing a named entity recognition tool
A study on the approaches of developing a named entity recognition tool
 
Automatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product ReviewsAutomatic Summarization in Chinese Product Reviews
Automatic Summarization in Chinese Product Reviews
 
ER Diagram
ER DiagramER Diagram
ER Diagram
 
Me101
Me101Me101
Me101
 
QER : query entity recognition
QER : query entity recognitionQER : query entity recognition
QER : query entity recognition
 
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETSCUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS
 
Improvement of telugu ocr by segmentation of touching characters
Improvement of telugu ocr by segmentation of touching charactersImprovement of telugu ocr by segmentation of touching characters
Improvement of telugu ocr by segmentation of touching characters
 
D017422528
D017422528D017422528
D017422528
 
A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...
 
Semantic extraction of arabic
Semantic extraction of arabicSemantic extraction of arabic
Semantic extraction of arabic
 
Arabic tweets categorization
Arabic tweets categorizationArabic tweets categorization
Arabic tweets categorization
 

Viewers also liked

Středověká literatura
Středověká literaturaStředověká literatura
Středověká literatura
eccehugo
 
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-306282010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
Hua Li, PhD
 
Xeografía dunha novela
Xeografía dunha novelaXeografía dunha novela
Xeografía dunha novela
IES Laxeiro
 
Escenarios para el Subsistema de Provision de RRHH
Escenarios para el Subsistema de Provision de RRHHEscenarios para el Subsistema de Provision de RRHH
Escenarios para el Subsistema de Provision de RRHH
Isaac Lopez
 
Davis Pauline - resume
Davis Pauline - resumeDavis Pauline - resume
Davis Pauline - resume
Pauline Davis
 
Manual usuario promob_5_plus
Manual usuario promob_5_plusManual usuario promob_5_plus
Manual usuario promob_5_plus
Savio Fernandes
 
2014-User Modeling for Contextual Suggestion-TREC
2014-User Modeling for Contextual Suggestion-TREC2014-User Modeling for Contextual Suggestion-TREC
2014-User Modeling for Contextual Suggestion-TREC
Hua Li, PhD
 
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
Hua Li, PhD
 
danh sach khach hang mien phi
danh sach khach hang mien phidanh sach khach hang mien phi
danh sach khach hang mien phi
thongtinkhachhang
 
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
Hua Li, PhD
 
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊKHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
thongtinkhachhang
 
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
edsbaba
 
Consecuencias crack y new deal
Consecuencias crack y new dealConsecuencias crack y new deal
Consecuencias crack y new deal
Fernando Alvarez Fernández
 
2009-Spatial Event Prediction by Combining Value Function Approximation and C...
2009-Spatial Event Prediction by Combining Value Function Approximation and C...2009-Spatial Event Prediction by Combining Value Function Approximation and C...
2009-Spatial Event Prediction by Combining Value Function Approximation and C...
Hua Li, PhD
 
машинные коды
машинные кодымашинные коды
машинные коды
18MILAN99
 
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
Hua Li, PhD
 
Mohamed Faisal_CV
Mohamed Faisal_CVMohamed Faisal_CV
Mohamed Faisal_CV
Mohamed Amer
 
DANH SÁCH KHÁCH HÀNG MIỄN PHÍ
DANH SÁCH KHÁCH HÀNG MIỄN PHÍDANH SÁCH KHÁCH HÀNG MIỄN PHÍ
DANH SÁCH KHÁCH HÀNG MIỄN PHÍ
thongtinkhachhang
 

Viewers also liked (18)

Středověká literatura
Středověká literaturaStředověká literatura
Středověká literatura
 
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-306282010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
2010-INCREMENTAL USER MODELING WITH HETEROGENEOUS USER BEHAVIORS-30628
 
Xeografía dunha novela
Xeografía dunha novelaXeografía dunha novela
Xeografía dunha novela
 
Escenarios para el Subsistema de Provision de RRHH
Escenarios para el Subsistema de Provision de RRHHEscenarios para el Subsistema de Provision de RRHH
Escenarios para el Subsistema de Provision de RRHH
 
Davis Pauline - resume
Davis Pauline - resumeDavis Pauline - resume
Davis Pauline - resume
 
Manual usuario promob_5_plus
Manual usuario promob_5_plusManual usuario promob_5_plus
Manual usuario promob_5_plus
 
2014-User Modeling for Contextual Suggestion-TREC
2014-User Modeling for Contextual Suggestion-TREC2014-User Modeling for Contextual Suggestion-TREC
2014-User Modeling for Contextual Suggestion-TREC
 
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
2014-Adaptive Interest Modeling Improves Content Services at the Network Edge...
 
danh sach khach hang mien phi
danh sach khach hang mien phidanh sach khach hang mien phi
danh sach khach hang mien phi
 
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
2003-An adaptive nearest neighbor search for a parts acquisition ePortal-p693...
 
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊKHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
KHÁCH HÀNG CUNG CẤP TRANG THIẾT BỊ
 
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
DENTAL CASE HISTORY (DEMOGRAPHIC DATA ,CHIEF COMPLAINT,HOPI)
 
Consecuencias crack y new deal
Consecuencias crack y new dealConsecuencias crack y new deal
Consecuencias crack y new deal
 
2009-Spatial Event Prediction by Combining Value Function Approximation and C...
2009-Spatial Event Prediction by Combining Value Function Approximation and C...2009-Spatial Event Prediction by Combining Value Function Approximation and C...
2009-Spatial Event Prediction by Combining Value Function Approximation and C...
 
машинные коды
машинные кодымашинные коды
машинные коды
 
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
2012-Discovering Virtual Interest Groups across Chat Rooms-KMIS
 
Mohamed Faisal_CV
Mohamed Faisal_CVMohamed Faisal_CV
Mohamed Faisal_CV
 
DANH SÁCH KHÁCH HÀNG MIỄN PHÍ
DANH SÁCH KHÁCH HÀNG MIỄN PHÍDANH SÁCH KHÁCH HÀNG MIỄN PHÍ
DANH SÁCH KHÁCH HÀNG MIỄN PHÍ
 

Similar to 2015-User Modeling of Skills and Expertise from Resumes-KMIS

Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
Industrial Organizational Psychology Understanding the Workplace 5th Edition ...Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
AmeryValenzuela
 
Text Document Classification System
Text Document Classification SystemText Document Classification System
Text Document Classification System
IRJET Journal
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
ijaia
 
Measurement And Validation
Measurement And ValidationMeasurement And Validation
Measurement And Validation
Jennifer Campbell
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
Editor IJCATR
 
Resume and the Future of Internet Recruiting
Resume and the Future of Internet RecruitingResume and the Future of Internet Recruiting
Resume and the Future of Internet Recruiting
Andrew Cunsolo
 
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
cscpconf
 
Ooa 1 Post
Ooa 1 PostOoa 1 Post
Ooa 1 Post
Rajesh Kumar
 
Modern Resume Analyser For Students And Organisations
Modern Resume Analyser For Students And OrganisationsModern Resume Analyser For Students And Organisations
Modern Resume Analyser For Students And Organisations
IRJET Journal
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
IJERA Editor
 
Resume Parser with Natural Language Processing
Resume Parser with Natural Language ProcessingResume Parser with Natural Language Processing
Resume Parser with Natural Language Processing
IRJET Journal
 
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
csandit
 
Enterprise Architecture Roles And Competencies V9
Enterprise Architecture Roles And Competencies V9Enterprise Architecture Roles And Competencies V9
Enterprise Architecture Roles And Competencies V9
Paul W. Johnson
 
NLP Ecosystem
NLP EcosystemNLP Ecosystem
Identifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text AnalyticsIdentifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text Analytics
Enterprise Knowledge
 
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
Nikita Sharma
 
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word EmbeddingIRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
IRJET Journal
 
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCEJOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
ijscai
 
International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...
ijscai
 
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCEJOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
ijscai
 

Similar to 2015-User Modeling of Skills and Expertise from Resumes-KMIS (20)

Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
Industrial Organizational Psychology Understanding the Workplace 5th Edition ...Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
Industrial Organizational Psychology Understanding the Workplace 5th Edition ...
 
Text Document Classification System
Text Document Classification SystemText Document Classification System
Text Document Classification System
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
 
Measurement And Validation
Measurement And ValidationMeasurement And Validation
Measurement And Validation
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
 
Resume and the Future of Internet Recruiting
Resume and the Future of Internet RecruitingResume and the Future of Internet Recruiting
Resume and the Future of Internet Recruiting
 
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
 
Ooa 1 Post
Ooa 1 PostOoa 1 Post
Ooa 1 Post
 
Modern Resume Analyser For Students And Organisations
Modern Resume Analyser For Students And OrganisationsModern Resume Analyser For Students And Organisations
Modern Resume Analyser For Students And Organisations
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
 
Resume Parser with Natural Language Processing
Resume Parser with Natural Language ProcessingResume Parser with Natural Language Processing
Resume Parser with Natural Language Processing
 
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
A SIMILARITY MEASURE FOR CATEGORIZING THE DEVELOPERS PROFILE IN A SOFTWARE PR...
 
Enterprise Architecture Roles And Competencies V9
Enterprise Architecture Roles And Competencies V9Enterprise Architecture Roles And Competencies V9
Enterprise Architecture Roles And Competencies V9
 
NLP Ecosystem
NLP EcosystemNLP Ecosystem
NLP Ecosystem
 
Identifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text AnalyticsIdentifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text Analytics
 
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
White paper - Job skills extraction with LSTM and Word embeddings - Nikita Sh...
 
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word EmbeddingIRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
IRJET- Cross-Domain Sentiment Encoding through Stochastic Word Embedding
 
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCEJOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
 
International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...
 
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCEJOB MATCHING USING ARTIFICIAL INTELLIGENCE
JOB MATCHING USING ARTIFICIAL INTELLIGENCE
 

2015-User Modeling of Skills and Expertise from Resumes-KMIS

  • 1. User Modeling of Skills and Expertise from Resumes Hua Li, Daniel J. T. Powell, Mark Clark, Tifani O'Brien and Rafael Alonso Leidos, Inc., Virginia, U.S.A. Keywords: User Modeling, Expertise Modeling, Resume, Profile, Skill. Abstract: Job applicants describe their skills and expertise in resumes and curriculum vitaes (CVs). These biographic data are often evaluated by human resource personnel or a search committee. This manual approach works well when the number of resumes is small. However, in this information age, the volume of available resumes can be overwhelming and there is a need for automatic evaluation of applicant skills and expertise. In this paper, we describe a user modeling algorithm to quantitatively identify skills and expertise from biographic data. This algorithm is called REMA (Resume Expertise Modeling Algorithm). REMA takes data from a resume document as input and produces an expertise model. The expertise model details the expertise topics for which the resume owner has claimed competency. Each topic carries a weight indicating the level of competency. There are two key insights for this algorithm. First, one's expertise is the cumulative result of the various “learning events” in one's career. These learning events are mentioned in various sections of the resume, such as earning a degree, writing a paper, or getting a patent. Second, one's knowledge and skills can become outdated or forgotten over time if not reinforced by learning. We have developed a prototype resume evaluation system based on REMA and are in the process of evaluating REMA’s performance. 1 INTRODUCTION A resume is a written summary of one’s education, work experience, skills, credentials, and accomplishments that is often used to apply for jobs. One key function of a resume is to provide information regarding one’s skills and expertise. The resume evaluator is required to manually judge the competence of a skill, i.e., the level of mastery of that skill, and/or the expertise in a skill domain, i.e., the level of mastery of all skills in that domain. Expertise modeling is used to automatically produce a quantitative assessment of the competence of various skills and the expertise of relevant skill domains from a resume. Expertise modeling is valuable for evaluators when faced with a large number of resumes to review. Given a position description, expertise modeling can be used to find suitable candidates by matching a set of skills between a position and a candidate’s resume. There are many other applications of expertise modeling, some of which are listed below.  Expertise finder: given a skill, find experts with that skill, i.e., find resumes with a high expertise level for that skill.  Virtual expertise group finder: given a set of resumes, cluster them based on skill set similarities.  Job finder: given a candidate resume, find matching positions within a career market.  Next skill to learn: suggest skills for career development. Given a user resume, find career- conducive skills (CCS) that the user should learn. The CCS are skills that frequently co-occur with a user’s skill set in expert resumes. Even though there’s extensive literature on expertise recommendation, to our knowledge, expertise modeling from resumes is still an open research area. Our model is called Resume Expertise Modeling Algorithm (REMA). It takes a resume, CV or biographical document as input and produces an expertise model. The algorithm focuses on the concept of “expertise mention” in the resume, e.g., earning a Ph.D. in psychology in 2000 or publishing a paper on psychology in 2001. Such mentions imply a certain level of education on a given topic (i.e., psychology) at a particular point in time (i.e., 2000 and 2001, respectively). The greater the number of learning events on a specific topic, the more competent that person is with the topic. This process is referred to as “reinforcement”. Li, H., Powell, D., Clark, M., O’Brien, T. and Alonso, R.. User Modeling of Skills and Expertise from Resumes. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - Volume 3: KMIS, pages 229-233 ISBN: 978-989-758-158-8 Copyright c 2015 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved 229
  • 2. Conversely, skills can get rusty over time if a person stops learning. This process is referred to as “forgetting”. The expertise modeling algorithm processes the expertise mentions incrementally, in a chronological order. New expertise topics are inserted into the expertise model and their weights are increased by reinforcement and decreased by forgetting. 2 RELATED WORK The related work is mainly in the area of expertise recommendation or expert finding which attempts to find the right person with the appropriate skills and knowledge. This is useful for many purposes including problem solving, question answering, and collaboration. A significant amount of research has been generated in the Information Retrieval community (Smirnova1 and Balog, 2011, Balog et al. 2007; Balog et al. 2009; Liebregts and Bogers, 2009). This line of research focuses on content- based algorithms, similar to document search. These algorithms identify experts based on the content of documents that they are associated with (Liebregts and Bogers, 2009; Serdyukov et al. 2007). While these approaches have been very effective in finding the most knowledgeable people on a given topic based on a large collection of documents from an enterprise or the internet, it’s not clear how they can be used to assess the expertise on multiple topics based on a single resume. 3 THE REMAALGORITHM The REMA algorithm is shown in Figure 1. An input resume is first parsed into expertise mentions. The mentions are evaluated by Natural Language Processing (NLP) tools to extract expertise topics. Figure 1: REMA algorithm diagram. These topics are then processed by REMA’s expertise model adaptation component to generate the expertise model. This algorithm is an extension of our user modelling algorithm RAMA (Reinforcement and Aging Modeling Algorithm) described in detail elsewhere (Li and Alonso, 2014; Li and Alonso, 2012; Alonso et al., 2010). 3.1 Expertise Mentions Expertise mentions are phrases or statements in the resume that indicate significant learning events. For example, a resume may mention a paper on databases in a certain year in the publication section. When parsing the expertise mentions, the associated resume section and the date of the event are captured because they are important indicators of level of expertise. REMA uses a source relevance parameter to register the fact that expertise mentions in different parts of a resume carry different significance. For example, a mention in a patent and publication section should indicate more expertise than one in an experience and education section. Even within the same section, mentions originated from different sources may carry different significances. For example, within the publication section, mentions of a book or journal paper are more indicative of expertise than those of a conference paper. The date of event mentioned reflects the recency of the learning. In other words, skills or expertise acquired more recently are more up-to-date and less likely to be forgotten. We use regular expression and GATE to extract the date and time information from the expertise mention. 3.2 Expertise Topics Expertise topics are terms indicating skills or expertise such as database or machine learning. They are extracted from expertise mentions using NLP tools. In particular, Apache Lucene®1 is used to extract simple terms from text. WordNet®2 is used to identify noun words. GATE is used to extract noun chunks and named entities. OpenCalais web service is used to extract expertise related tags including "Industry Term", "Technology", and "Programming Language". Relationships between 1 Apache Lucene is a registered trademark of the Apache Software Foundation within the United States and/or other countries. 2 WordNet is a registered trademark of the Trustees of Princeton University within the United States and/or other countries. KMIS 2015 - 7th International Conference on Knowledge Management and Information Sharing 230
  • 3. Figure 2: Left: Effects of Reinforcement Factor (Learning Rate) on Weight; Right: Effects of decay half-life on time (Forgetting Factor) weight. expertise topics are also extracted from expertise mentions. For example, WordNet’s semantic relation “hyponym” is useful to map a broader expertise to a narrower one. We can also use Wikipedia’s®3 categories to establish a similar kind of relationship between expertise topics. 3.3 Expertise Model Adaptation This component has two subcomponents: content adaptation and weight adaptation. The former refers to the addition of new expertise topics in the expertise model. As expertise mentions are being processed in chronological order, unseen expertise topics will be automatically inserted into the expertise model with a default weight. The size of the default weight is equal to the weight change from the reinforcement of one expertise mention (see below). Weight adaptation refers to the dynamic adjustment of the weights of expertise topics in the model by reinforcement and forgetting mechanisms. Reinforcement increases the topic weight. The more expertise mentions on a given topic, the more life learning events the person has on that topic. As a result, the more competent that person is with the topic. This process is termed “reinforcement”. The parameter that controls the rate of learning is called the reinforcement factor. The effects of this factor on the learning rate in simulation experiments are shown in the left panel of Figure 2. Conversely, naturally forgetting will decrease the topic weight. In general, a person’s skills will stagnate over time if the person stops learning. There 3 Wikipedia is a registered trademark of the Wikimedia Foundation, Inc., in the United States and/or other countries. are at least two contributing factors to this stagnation. The first is our memory decay as described in decay theory4. This theory proposes that memory fades and information becomes less available for later retrieval as time passes. The second factor is the technological knowledge depreciation, or decay (Nemet, 2012, Park, Shin, and Park, 2006). The average decay rate is estimated at 13.3%, which corresponds to a half-life of 4.86 years (Park, Shin, and Park, 2006). The effects of five different decay half-life values over time are shown in the right panel of Figure 2. 3.4 Expertise Model The expertise model consists of weighted expertise topics and their relationships. Each topic carries an authority label that denotes the origin of the expertise term, such as a WordNet, Wikipedia, or a web service like OpenCalais. The collection of expertise topics represents the breadth of the person’s skills and expertise. The topic weights indicate the level of expertise and have a range from zero to one. The larger the weight, the more competent the person is with that skill or expertise. 4 REMA RESULTS We implemented a prototype REMA system in Java. It has a simple Graphical User Interface (GUI) that allows the user to process a directory of resumes and then display the resulting expertise models. This prototype will be used to conduct evaluation experiments in the near future to assess the 4 http://en.wikipedia.org/wiki/Decay_theory User Modeling of Skills and Expertise from Resumes 231
  • 4. Figure 3: Table view of an example expertise model. performance of the REMA algorithm. In this section we show some preliminary results. 4.1 Topic Cloud View for Expertise Model An expertise model is shown as a topic cloud where the larger the font size and the redder the color indicate higher expertise levels (Figure 3). Figure 4: Topic Cloud view of an example expertise model. 4.2 Table View for Expertise Model An example expertise model is shown as a table with the following six columns (Figure 4):  ID – the sequence number of each model element (expertise topic)  ELEMENT – the expertise topic  WEIGHT – the level of expertise  TYPE – the type of topic, XENTITY denotes an entity topic, XRELATION denotes a synset (set of synonyms) -> hypernym relationship defined in WordNet.  NLP – Natural language processing tool used for extracting this element  Features – features associated with this element. 5 CONCLUSIONS We developed REMA, a user modeling algorithm that quantitatively identifies skills and expertise from biographic data. REMA takes data from a resume document as input and produces an expertise model. There are two key concepts for this algorithm. First, one's expertise is the cumulative result of the various “learning events” in one's career. Second, one's knowledge and skills can become outdated or forgotten over time if not reinforced by learning. We have developed a prototype resume evaluation system based on REMA and are in the process of evaluating REMA’s performance. REFERENCES Alonso, R., P. Bramsen, and H. Li, 2010. Incremental user KMIS 2015 - 7th International Conference on Knowledge Management and Information Sharing 232
  • 5. modeling with heterogeneous user behaviors, International Conference on Knowledge Management and Information Sharing, International Conference on Knowledge Management and Information Sharing (KMIS 2010). Li, H. and R. Alonso. User Modeling for Contextual Suggestion. In Proceedings of 21st Text REtrieval Conference (TREC 2014). NIST, 2014. http://trec.nist.gov/pubs/trec23/papers/pro- RAMA_cs.pdf Li, H. and R. Alonso, Managing Analysis Context, ESAIR'12: Fifth International Workshop on Exploiting Semantic Annotations in Information Retrieval, 2012. Nemet, G., 2012. Historical Case Studies of Energy Technology Innovation, 1–11. Park, G., Shin, J., & Park, Y., 2006. Measurement of depreciation rate of technological knowledge : Technology cycle time approach. Journal of scientific and industrial research 65 (2), 121–127. Balog, K., Bogers, T., Azzopardi, L., de Rijke, M., van den Bosch, A.: Broad expertise retrieval in sparse data environments. In: SIGIR 2007, pp. 551–558 (2007) Balog, K., Soboroff, I., Thomas, P., Craswell, N., de Vries, A.P., Bailey, P. Overview of the TREC 2008 enterprise track. In: TREC 2008 (2009). Smirnova1 E. and K. Balog. A User-Oriented Model for Expert Finding. P. Clough et al. (Eds.): ECIR 2011, LNCS 6611, pp. 580–592, Springer-Verlag Berlin Heidelberg, 2011. Liebregts, R. and Bogers, T.: Design and evaluation of a university-wide expert search engine. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 587–594. Springer, Heidelberg (2009). Serdyukov, P., Hiemstra, D., Fokkinga, M.M., Apers, P.M.G.: Generative modelling of persons and documents for expert search. In: SIGIR 2007, pp. 827– 828 (2007). User Modeling of Skills and Expertise from Resumes 233