SlideShare a Scribd company logo
1 of 15
Resume Summary
Submitted by: Team 8
Krishna Chouhan, 201405635
Ala Praveen, 201405617
Kalpit Thakkar, 201201071
Anubhav Shrivastava, 201201105
Guide: Prof. Vasudeva Varma
Mentor : Ashish Kumar
1
Content
Abstract
Architecture
Technology
Working
Future work
References
Project Links
2
Abstract
Analyzing a resume manually is plausible but analyzing a huge collection of
resume manually is not possible. Moreover, finding a particular set of
information from collection of resume is not practical.
Storing the resumes in certain format into a storage and accessing the required
information is easier if the resume are parsed in and refined.
There are a number of document formats available for resume and it is not only
to store this unstructured data into a structured format for better storage but
also for fast extraction of that information.
3
Architecture
Resume summarizer is divided into two parts:
- Resume Parser;
- Search Resume
Resume Parser:
Parsing resume and converting unstructured resume files into a
structured collection of resume.
Search Resume:
This part of resume summarizer is used to search the required
information from collection of structured resume, and displaying the required4
5
Technology Used
- Java : Parsing the resume into a simple format that could be used by
hadoop for mapping and reducing the data is done in java.
- Hadoop : System uses hadoop mapping and reducing approach for
operating on resume. There are a number of mappers used one for each
file format but only one single reducer is implemented.
- MySql : For storing the structured data locally MySql DB is used.
6
Working
The basic working of this project starts with two types of input.
First input could be a resume, this resume is then processed to find the
information from the unstructured file and converted into structured format. The
resume provided as input could be a single resume provided as a live input by
user or we can provide collection of resume.
Second input is the search query, that is fired against the collection of
structured format of resume from the storage.
7
Input:
An interface is provided where user can upload the resume and can specify
the format (format is optional).
If user specify the format then we need to check the uploaded resume format
with specified format if it matches then process the resume using resume
parser and store the results in Database, otherwise check the uploaded resume
format with available formats we are able to parse, if uploaded resume format
matches with available formats then parse it using resume parser and store in
Database in some Structured format.
But the problem is resumes need not be in structured format, so processing
and storing the resumes in Database which are not in structured format is
difficult task.
8
Working
Resume are processed from either kind of input.
Firstly, the resume is parsed for the information according to the format of the
resume file .
Secondly, the data obtained from parsing is given as input to mapper. There
are multiple mappers according to the supported file formats and these
mappers maps the data accordingly.
Lastly, the data after mapping is passed to reducer and reduces reduces the
data into the desired format and stored into the storage.
Processing Resume:
9
Working
In the same interface we are providing another interface in such a way that user
can query the database (i.e. search query) by using some keywords (or) data
elements which are supposed to be existing in resumes.
After the user entered a query to search , then we perform certain operations
on Database by using conditions and filters which are in user’s query.
Then we refine the above results and rank them and then dispaly as a result to
user’s query.
The working flow is looks as follows:
Resume Search:
10
Working
11
Data is stored in a tuple based structured after processing the resumes.
Each tuple stores a resume and the fields of resume are stored in form of comma
separated columns.
Each field or column of a tuple stores certain information and thus each field
from the resume is matched to a column of a tuple. Also, the data that could not
be parsed successfully and the low priority information is stored in the last
column of the tuple.
While on displaying the data from a search query the data is searched from the
storage and resume are displayed according to the rank obtained by each resume
when searched for the appropriate fields. 12
WorkingStorage & Display:
13
Future Work
Resume Summarizer operates on the collection of resume that are in certain
formats and stores the resume in structured format. This project parses the
resume on a certain number of file formats and there is a future scope of
increasing these number of format to even more complicated file structures like
images.
Also the process of identifying the elements can be improved by implementing
machine learning into the summarizer after parsing.
14
References
http://stackoverflow.com/questions/2036236/tips-on-how-to-parse-custom-file-
format
https://thomaslevine.com/!/parsing-pdfs
http://stackoverflow.com/questions/4015477/read-pdf-files-using-java
https://blogs.oracle.com/prasanna/entry/openoffice_parser_extracting_text_from
http://javabeginnerstutorial.com/code-base/read-doc-file-in-java-using-poi/
http://stackoverflow.com/questions/16476711/how-to-read-docx-file-content-in-
java-api-using-poi-jar
https://en.wikipedia.org/wiki/MapReduce
https://docs.mongodb.org/manual/core/map-reduce/
15

More Related Content

What's hot

127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections
Amit Sharma
 
Mis chapter5
Mis chapter5Mis chapter5
Mis chapter5
Poleak
 
Degrees of data abstraction
Degrees of data abstractionDegrees of data abstraction
Degrees of data abstraction
Mary May Porto
 
Informatica data warehousing_job_interview_preparation_guide
Informatica data warehousing_job_interview_preparation_guideInformatica data warehousing_job_interview_preparation_guide
Informatica data warehousing_job_interview_preparation_guide
Dhanasekar T
 

What's hot (20)

127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections
 
T2
T2T2
T2
 
Module03
Module03Module03
Module03
 
Introduction to database
Introduction to databaseIntroduction to database
Introduction to database
 
Dbms
DbmsDbms
Dbms
 
Sql interview questions and answers
Sql interview questions and  answersSql interview questions and  answers
Sql interview questions and answers
 
Mis chapter5
Mis chapter5Mis chapter5
Mis chapter5
 
primary
primaryprimary
primary
 
Bt0066 database management system1
Bt0066 database management system1Bt0066 database management system1
Bt0066 database management system1
 
The design and implementation of database on library
The design and implementation of database on libraryThe design and implementation of database on library
The design and implementation of database on library
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
 
Oracle dba interview
Oracle dba interviewOracle dba interview
Oracle dba interview
 
Ebook5
Ebook5Ebook5
Ebook5
 
Database Systems - introduction
Database Systems - introductionDatabase Systems - introduction
Database Systems - introduction
 
T3
T3T3
T3
 
Databse management system
Databse management systemDatabse management system
Databse management system
 
Degrees of data abstraction
Degrees of data abstractionDegrees of data abstraction
Degrees of data abstraction
 
Informatica data warehousing_job_interview_preparation_guide
Informatica data warehousing_job_interview_preparation_guideInformatica data warehousing_job_interview_preparation_guide
Informatica data warehousing_job_interview_preparation_guide
 
Data models
Data modelsData models
Data models
 
Slide 3 data abstraction & 3 schema
Slide 3 data abstraction & 3 schemaSlide 3 data abstraction & 3 schema
Slide 3 data abstraction & 3 schema
 

Viewers also liked

Jordan Janiak - Resume
Jordan Janiak - ResumeJordan Janiak - Resume
Jordan Janiak - Resume
Jordan Janiak
 
Santhosh_Informatica_Administrator(2)
Santhosh_Informatica_Administrator(2)Santhosh_Informatica_Administrator(2)
Santhosh_Informatica_Administrator(2)
santhosh kumar
 
DVO_PolCOm_Apps and Ops Summary_14DEC2015
DVO_PolCOm_Apps and Ops Summary_14DEC2015DVO_PolCOm_Apps and Ops Summary_14DEC2015
DVO_PolCOm_Apps and Ops Summary_14DEC2015
Wm. Andrew Osterman
 
Informatica data quality online training
Informatica data quality online trainingInformatica data quality online training
Informatica data quality online training
Divya Shree
 
Payment Gateway Live hadoop project
Payment Gateway Live hadoop projectPayment Gateway Live hadoop project
Payment Gateway Live hadoop project
Kamal A
 

Viewers also liked (20)

DVO user generated content
DVO user generated contentDVO user generated content
DVO user generated content
 
Jordan Janiak - Resume
Jordan Janiak - ResumeJordan Janiak - Resume
Jordan Janiak - Resume
 
Beeldkwaliteit in een DVO: Paul Verbakel (Atlant-groep) en Martijn van Duuren...
Beeldkwaliteit in een DVO: Paul Verbakel (Atlant-groep) en Martijn van Duuren...Beeldkwaliteit in een DVO: Paul Verbakel (Atlant-groep) en Martijn van Duuren...
Beeldkwaliteit in een DVO: Paul Verbakel (Atlant-groep) en Martijn van Duuren...
 
Resume summary(cloud project)
Resume summary(cloud project)Resume summary(cloud project)
Resume summary(cloud project)
 
Table29 Data Validation 95
Table29 Data Validation 95Table29 Data Validation 95
Table29 Data Validation 95
 
Santhosh_Informatica_Administrator(2)
Santhosh_Informatica_Administrator(2)Santhosh_Informatica_Administrator(2)
Santhosh_Informatica_Administrator(2)
 
Undraleu ETL Code Review Tool for Informatica PowerCenter, Data Sheet
Undraleu ETL Code Review Tool for Informatica PowerCenter, Data SheetUndraleu ETL Code Review Tool for Informatica PowerCenter, Data Sheet
Undraleu ETL Code Review Tool for Informatica PowerCenter, Data Sheet
 
BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
Data validation infrastructure: the validate package
Data validation infrastructure: the validate packageData validation infrastructure: the validate package
Data validation infrastructure: the validate package
 
ETL Validator: Testing for Referential Integrity
ETL Validator: Testing for Referential IntegrityETL Validator: Testing for Referential Integrity
ETL Validator: Testing for Referential Integrity
 
DVO_PolCOm_Apps and Ops Summary_14DEC2015
DVO_PolCOm_Apps and Ops Summary_14DEC2015DVO_PolCOm_Apps and Ops Summary_14DEC2015
DVO_PolCOm_Apps and Ops Summary_14DEC2015
 
BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
Informatica data quality online training
Informatica data quality online trainingInformatica data quality online training
Informatica data quality online training
 
10 Benefits of a Clean Email List
10 Benefits of a Clean Email List10 Benefits of a Clean Email List
10 Benefits of a Clean Email List
 
Payment Gateway Live hadoop project
Payment Gateway Live hadoop projectPayment Gateway Live hadoop project
Payment Gateway Live hadoop project
 
BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
Data validation
Data validationData validation
Data validation
 
Bigdata Hadoop project payment gateway domain
Bigdata Hadoop project payment gateway domainBigdata Hadoop project payment gateway domain
Bigdata Hadoop project payment gateway domain
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 
Data validation option
Data validation optionData validation option
Data validation option
 

Similar to Resume summary

Resume Parsing And Processing Using Hadoop (1)
Resume Parsing And Processing Using Hadoop (1)Resume Parsing And Processing Using Hadoop (1)
Resume Parsing And Processing Using Hadoop (1)
Sourav Madhesiya
 

Similar to Resume summary (20)

IRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction FrameworkIRJET- Resume Information Extraction Framework
IRJET- Resume Information Extraction Framework
 
A Review of Data Access Optimization Techniques in a Distributed Database Man...
A Review of Data Access Optimization Techniques in a Distributed Database Man...A Review of Data Access Optimization Techniques in a Distributed Database Man...
A Review of Data Access Optimization Techniques in a Distributed Database Man...
 
A Review of Data Access Optimization Techniques in a Distributed Database Man...
A Review of Data Access Optimization Techniques in a Distributed Database Man...A Review of Data Access Optimization Techniques in a Distributed Database Man...
A Review of Data Access Optimization Techniques in a Distributed Database Man...
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
 
F04302053057
F04302053057F04302053057
F04302053057
 
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
 
DBA book sql rdbms 4rth Complete book Database systems Handbook dbms rdbms by...
DBA book sql rdbms 4rth Complete book Database systems Handbook dbms rdbms by...DBA book sql rdbms 4rth Complete book Database systems Handbook dbms rdbms by...
DBA book sql rdbms 4rth Complete book Database systems Handbook dbms rdbms by...
 
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdfDatabase system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
 
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
4rth Complete book Database systems Handbook dbms rdbms by Muhammad Sharif.pdf
 
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdfDatabase system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
 
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdfDatabase system Handbook 3rd DONE Complete DBMS book Full book.pdf
Database system Handbook 3rd DONE Complete DBMS book Full book.pdf
 
Notes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database DesignNotes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database Design
 
Resume Parsing And Processing Using Hadoop (1)
Resume Parsing And Processing Using Hadoop (1)Resume Parsing And Processing Using Hadoop (1)
Resume Parsing And Processing Using Hadoop (1)
 
industrial manpower resource manager
industrial manpower resource managerindustrial manpower resource manager
industrial manpower resource manager
 
Normalisation in Database management System (DBMS)
Normalisation in Database management System (DBMS)Normalisation in Database management System (DBMS)
Normalisation in Database management System (DBMS)
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
Database systems Handbook dbms.pdf
Database systems Handbook dbms.pdfDatabase systems Handbook dbms.pdf
Database systems Handbook dbms.pdf
 
Database systems Handbook.pdf
Database systems Handbook.pdfDatabase systems Handbook.pdf
Database systems Handbook.pdf
 
Database systems Handbook.pdf
Database systems Handbook.pdfDatabase systems Handbook.pdf
Database systems Handbook.pdf
 
Database systems Handbook dbms.pdf
Database systems Handbook dbms.pdfDatabase systems Handbook dbms.pdf
Database systems Handbook dbms.pdf
 

Recently uploaded

會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
中 央社
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
CaitlinCummins3
 

Recently uploaded (20)

Trauma-Informed Leadership - Five Practical Principles
Trauma-Informed Leadership - Five Practical PrinciplesTrauma-Informed Leadership - Five Practical Principles
Trauma-Informed Leadership - Five Practical Principles
 
PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptx
 
How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of TransportBasic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
Climbers and Creepers used in landscaping
Climbers and Creepers used in landscapingClimbers and Creepers used in landscaping
Climbers and Creepers used in landscaping
 
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptxAnalyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUMDEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
 
diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....
 
demyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptxdemyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptx
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
 

Resume summary

  • 1. Resume Summary Submitted by: Team 8 Krishna Chouhan, 201405635 Ala Praveen, 201405617 Kalpit Thakkar, 201201071 Anubhav Shrivastava, 201201105 Guide: Prof. Vasudeva Varma Mentor : Ashish Kumar 1
  • 3. Abstract Analyzing a resume manually is plausible but analyzing a huge collection of resume manually is not possible. Moreover, finding a particular set of information from collection of resume is not practical. Storing the resumes in certain format into a storage and accessing the required information is easier if the resume are parsed in and refined. There are a number of document formats available for resume and it is not only to store this unstructured data into a structured format for better storage but also for fast extraction of that information. 3
  • 4. Architecture Resume summarizer is divided into two parts: - Resume Parser; - Search Resume Resume Parser: Parsing resume and converting unstructured resume files into a structured collection of resume. Search Resume: This part of resume summarizer is used to search the required information from collection of structured resume, and displaying the required4
  • 5. 5
  • 6. Technology Used - Java : Parsing the resume into a simple format that could be used by hadoop for mapping and reducing the data is done in java. - Hadoop : System uses hadoop mapping and reducing approach for operating on resume. There are a number of mappers used one for each file format but only one single reducer is implemented. - MySql : For storing the structured data locally MySql DB is used. 6
  • 7. Working The basic working of this project starts with two types of input. First input could be a resume, this resume is then processed to find the information from the unstructured file and converted into structured format. The resume provided as input could be a single resume provided as a live input by user or we can provide collection of resume. Second input is the search query, that is fired against the collection of structured format of resume from the storage. 7
  • 8. Input: An interface is provided where user can upload the resume and can specify the format (format is optional). If user specify the format then we need to check the uploaded resume format with specified format if it matches then process the resume using resume parser and store the results in Database, otherwise check the uploaded resume format with available formats we are able to parse, if uploaded resume format matches with available formats then parse it using resume parser and store in Database in some Structured format. But the problem is resumes need not be in structured format, so processing and storing the resumes in Database which are not in structured format is difficult task. 8 Working
  • 9. Resume are processed from either kind of input. Firstly, the resume is parsed for the information according to the format of the resume file . Secondly, the data obtained from parsing is given as input to mapper. There are multiple mappers according to the supported file formats and these mappers maps the data accordingly. Lastly, the data after mapping is passed to reducer and reduces reduces the data into the desired format and stored into the storage. Processing Resume: 9 Working
  • 10. In the same interface we are providing another interface in such a way that user can query the database (i.e. search query) by using some keywords (or) data elements which are supposed to be existing in resumes. After the user entered a query to search , then we perform certain operations on Database by using conditions and filters which are in user’s query. Then we refine the above results and rank them and then dispaly as a result to user’s query. The working flow is looks as follows: Resume Search: 10 Working
  • 11. 11
  • 12. Data is stored in a tuple based structured after processing the resumes. Each tuple stores a resume and the fields of resume are stored in form of comma separated columns. Each field or column of a tuple stores certain information and thus each field from the resume is matched to a column of a tuple. Also, the data that could not be parsed successfully and the low priority information is stored in the last column of the tuple. While on displaying the data from a search query the data is searched from the storage and resume are displayed according to the rank obtained by each resume when searched for the appropriate fields. 12 WorkingStorage & Display:
  • 13. 13
  • 14. Future Work Resume Summarizer operates on the collection of resume that are in certain formats and stores the resume in structured format. This project parses the resume on a certain number of file formats and there is a future scope of increasing these number of format to even more complicated file structures like images. Also the process of identifying the elements can be improved by implementing machine learning into the summarizer after parsing. 14