1. FINAL REPORT 1
FINAL REPORT
Student Name: BHATIA HIMANSHU VINODKUMAR
Enrollment Number: 1011517034
Specialization: DECISION SCIENCE
Report Duration: 4th
April, 2016 to 19th
June, 2016
Schemopedia.org
Company Name: SCHEMOPEDIA.ORG
Mentor Name: MR. ABHISHEK RATHI Submitted to,
Prof. Chinmay Baxi
2. FINAL REPORT 2
ACKNOWLEDGEMENT
I would like to express my gratitude to all. I want to thank SBS Ahmedabad
for granting me the permission to commence this project.
I am deeply indebted to my supervisors Prof. Chinmay Baxi (Faculty
guide) and Mr. Abhishek Rathi and Mr. Anand Trivedi (Company Guide) for
their help, stimulating suggestions and encouragement in all the time of
internship.
Especially, I would like to thank my parents and friends for extended me
their moral support.
3. FINAL REPORT 3
TABLE OF CONTENT
TITLE
PAGE
NO.
Introduction to company 04
Vision & Potential Revenue streams 05
Problems 06
Project Detail & Objectives 07
Task Assigned & Strategy 08
Mid-Course Correction 13
Limitation 15
Learnings 16
Conclusion 17
Annexure-1- Revised BIBLE OF CODE 18
Annexure-2- PREPARED GUIDELINES &
CHECKLIST FOR SCHEME LISTING & CLEANING
20
4. FINAL REPORT 4
INTRODUCTION TO COMPANY
Schemopedia.org
Name of the Organization: - Schemopedia.org
Main activity of the Organization: Building Government to Citizen (G2C)
portal for scheme information.
Name & Designation of Head of the Organization:
Mr. Anand Trivedi and Mr. Abhishek Rathi – Co-founders.
Email ID:
anandtrivedi92@gmail.com & abishek.rathi@gmail.com
“Schemopedia” is a start-up company which is preparing a portal for the Indian
citizens for create awareness about the schemes of the government as well as
Non-Government.
Schemopedia is a one-stop platform for facilitating simplified information access
and convenient application to all schemes / benefits extended to citizens in their
capacity as an individual or an enterprise owner.
Schemopedia, within the umbrella concept of Governity, is the first step towards
bridging this information gap. Governity, as a whole is the concept through which
we envision advanced engagement levels between Government and Citizens.
Schemopedia is collecting data of schemes, so that they can cover each and every
scheme which are beneficial for the Indian citizens.
5. FINAL REPORT 5
VISION
To constantly innovate and build open source technology platforms that facilitate
interaction between Governments, Citizens and Development Agencies to
improve the flow of information, entitlements and services as well as duties and
obligations.
POTENTIAL REVENUE STREAMS
Benefit Access / Scheme application fees - different rates for individuals /
enterprises
Subscription fees from Government / Policy makers for analytics use
Paid campaigns & surveys for the Government
Paid Participatory scheme design / budgeting exercises for elected
representatives
6. FINAL REPORT 6
CURRENT SCENARIO BETWEEN GOVERNMENT & CITIZEN
Current Scenario between Government & Citizen
PROBLEMS
Very limited technology used in enabling above interaction on the part of
the Government
Information spread across 500+ Govt. websites and multiple offline
sources and local language support, non-friendly interface, etc.
Low citizen awareness leading to even lower benefit access leading to
hundreds of underutilized schemes / programs
Process complexity etc.
All in all, lack of Citizens’ “pull” causing complacency in Govt. reach out
7. FINAL REPORT 7
PROJECT DETAILS
Data mining, marketing planning and Implementation - Enabling the
Schemopedia launch
OBJECTIVES
To build the complete database of schemes from various government
Ministries and Non-government websites.
To check and confirm no single scheme missing from those websites.
To confirm all listed schemes are in required format of Schemopedia.org
To compile all listed schemes database in single sheet.
To build complete product database of Schemopedia.org by completing all
above objectives
8. FINAL REPORT 8
TASK ASSIGNED AND STRATEGY
To achieve all above mentioned objectives we have taken four steps
below
Database Creation (Scheme listing)
Data Verification (Data Validation)
Data Cleaning
Data Compiling
Analysis of the Bill Order Quantity data of Gubbi Group
Unique ID generation and cross-mapping for scheme listing
Phase 1 - Database Creation
In the initial stage of my internship I have to analyze the portal as user interface.
In that procedure I have done the analysis of the portal by checking scope of
improvement of the portal and to list down the advantages & disadvantages of the
portal. I have to list down the schemes from various ministries of Central
Government. As to create the database for schemopedia.org, the scheme listing is
the base for the product (Schemopedia.org)
9. FINAL REPORT 9
Process Followed for Finding Schemes
GOI Directory (goidirectory.nic.in) which includes various ministries select
among them
For e.g. Ministry of New and Renewable Energy is elaborated below
GOI Directory – Ministries - Ministry of New and Renewable Energy includes
various departments like- Ministry, Autonomous / Apex Institutions, PSUs / Joint
Ventures, Companies & Schemes/Programs & Missions.
The Scheme listing procedure to entry into the structured format which is similar
to the portal of Schemopedia.org
The procedure follows with profile filter, purpose, domain and scheme details.
For entering profile filter the bible of code is prepared as reference.
Revised Bible of Code (given in Annexure 1)
Guidelines, Set of Instructions & Checklist for Scheme listing &
cleaning (given in Annexure 2)
Government of
India Web
Directory
Union
Government
Autonomus/
Apex Institutions
PSUs / Joint
ventures
10. FINAL REPORT 10
SCHEME LISTING PROCEDURE
Read & understand the Scheme
Go through Guidelines, checklist & set of instructions
Find out the purpose of the scheme?
Who is beneficiary?
What is the eligibility of the beneficiary?
What benefit is availed to the beneficiary?
Scheme listing from various Ministries of Central Government -
done by me are as follows:
Ministry Name Entries
1. Ministry of Civil Aviation 150
2. Ministry of Culture 215
3. Ministry of Environment, forest and climate change 085
4. Ministry of Power 050
5. Ministry of Tourism 015
6. Ministry of Rural Development 050
Scheme listing of various Ministry from Central government.
Analysing the portal as User Interface
Prepared Guidelines & Checklist for the interns and employees
Data Cleaning
Data analysis
11. FINAL REPORT 11
Training the interns for Scheme listing & Cleaning
Prepared exhaustive list of schemes for uploading
Phase 2 - Data Validation
1. DATA VALIDATION: Data Validation is the process of ensuring that a
programme operates on correct and useful data. Data validation ensure
checking for correctness, meaningfulness and security of data that are input
to the system. Data validation checks that data are valid, sensible, secure,
and reasonable before they are processed and also check the errors like
flow / logical errors.
IMPORTANCE OF DATA VALIDATION
Incorrect data validation can lead to data corruption or security
exposure.
Meaningless, incorrect data give wrong output in analysis.
Data validation helpful in minimizing the data cleaning process.
2. DATA VERIFICATION: Data Verification is the process of ensuring
that the data collected from the right source or location from where we
really need and check the listed schemes are really correct or not. It makes
data correction which is misunderstood by the time of data listing and helps
to resolve the errors.
IMPORTANCE OF DATA VERIFICATION
Data correction
Helpful in getting Missing data from the source.
12. FINAL REPORT 12
PHASE 3: - DATA CLEANING AND COMPILING
Data cleaning here refers to the final cleaning of created database, as to upload
on the portal of Schemopedia.org.
Data Cleaning refers to detecting and correcting the errors and to validate it for
further verification according to the rules and regulation.
In this, Phase-1 have to carry out the following works:
Compilation of schemes done by all interns
To check mapping of schemes is done or not
To clean the Scheme according to the checklist prepared
To fill up the empty cell
To check each filter according to revised bible of code
I have trained the other interns of Schemopedia.org for the Scheme listing and
cleaning.
I have prepared the exhaustive list of Schemes from 3025 entries – All 3025
entries are cleaned and ready for uploading on portal of Schemopedia.org
OTHER WORKS AS FOLLOWS:
Exercise conducted for Learning data analysis using excel tool: Data set from
a survey conducted among newspaper readers on the perception and feedback
of a daily supplement of a Gujarati newspaper - City Bhaskar. It is a
supplement of Divya Bhaskar newspaper.
13. FINAL REPORT 13
MID-COURSE CORRECTION
PHASE 1 - DATABASE CREATION: -
In this phase we were surfing on internet for various schemes and listing it after
analyze the scheme form various government and Non-government websites.
That listed file than checked by superior. While checking that file they found
some basic problems like grammatically errors and also some missing content so
they give it back to us to resolve those errors.
In the initial period frequency of errors was quite more but then after day by day
with having concentrated mind and because of practice it is minimized in such a
way that they just checked and make it confirm instantly.
PHASE 2 - DATA VALIDATION: -
In this phase the work done in 1st
phase by individual is altered with colleagues
so that cross check can be done by this missing content and schemes can be find
out easily.
It is found that some work debate origin in this phase due to having some pointing
out errors but it was obvious and thus we could realize that is actual errors in
some case and make it correct at that time.
14. FINAL REPORT 14
PHASE 3 - DATA CLEANING AND COMPILING
In this phase there was maximum work concentration required because it was the
period when all work assembled and cleaning process took place on that. Thus
we found that though after giving 100% work efficiency we couldn’t found
accuracy required by superior and always superior give that work back to us for
doing again.
But having very good coordination among team and having good team effort we
make it possible. So, finally we made whole product ready in the last week of our
internship.
15. FINAL REPORT 15
LIMITATION
There were few limitations also there during work period of internship which is
mentioned below in brief.
Internet speed issues
Data transferring issues
Data missing issues
Lack of inbuilt Software
Lack of supervisor assistants
Non availability of PC’s
16. FINAL REPORT 16
LEARNING’S
Learnt to do Data mining from unstructured data.
Cultivated habit of Critical thinking while preparing guidelines, checklist
& questionnaire.
Good grip on various functions of Microsoft excel.
Time and Stress management: - Learnt to prioritize the task according to
time (deadline)
Learnt to remove the errors while cleaning the data
How to co-ordinate with other interns to complete the task
17. FINAL REPORT 17
CONCLUSION
Internship at Schemopedia for 11 weeks started from 04th
April 2016 to 19th
June
2016 has been a good learning experience for me. During this period, I came
across a number of good people, in network who helped me and also knowledge
sharing with them would be treasure for me.
Exposure to corporate sector during my internship had helped me to understand
the art of balancing the task and time. So that I could manage with target and
mind stress during my internship.
Experience in a corporate sector, is a good exposure about what really the
corporate life is about. The person who can manage these target and stress can
survive in these corporate world. The only thing required in corporate world is
the right attitude or a positive attitude towards attaining one’s goal.
18. FINAL REPORT 18
ANNEXURE 1 – REVISED BIBLE OF CODE
Revised Bible of Code
1st Profile 2nd Profile Filter 3rd profile filter 4t 5th
Filter
Enterprise
Area of business Type of Implementing Depends
upon the
profile
Manufacturing Co-operative Society
Miscellaneous
s
Services SHG
Infrastructure Large
Institute/ Research Labs Medium
Trading Small
Agriculture Micro
Association Start up
NGO Professional
No restriction Private developer
SPV of Industries
R&D Institutes
Association Bodies
Academic Institutes
Joint Liability Organization
Family
Caste / Category type Family Characteristics
APL Family Income less
BPL Family Size
EBC No restriction
SEBC
No restriction
Schedule Caste
Schedule Tribe
Neo middle class families
(Tribal APL families with
income limit of Rs. 2 (Non
Income Tax paying) Other
Backward Castes
(Notified OBC) - DE
notified
Individual
classesCaste/ Category
type
Specification
APL Gender (Male / Female)
BPL Age
19. FINAL REPORT 19
EBC No restriction
SEBC
No restriction
Schedule Caste
Schedule Tribe
Minorities
Physically & Mentally
Notified - DE notified
Other Backward Castes
Please note that in case of Individual, "(OBC) Specification" only means Age OR
Gender. If an
entry says "Students", it will be covered in 4th / 5th filter
20. FINAL REPORT 20
ANNEXURE 2 - PREPARED GUIDELINES & CHECKLIST FOR SCHEME LISTING & CLEANING
Guidelines
Read the scheme thoroughly
Understand the scheme – Find out the following from the scheme - What is the
objective of the scheme?
From whom the scheme is for?
What are the eligibility (required criteria for getting benefit of the scheme)? -
What benefit is given to the eligible applicant?
Profile filters – 1st
– read the scheme and decide who is the beneficiary from the three options
below: - for e.g. - Sample file – Scheme no 1
Individual
Family
Enterprise
Profile filters – 2nd
& 3rd
– depends upon the scheme and must be relevant to 1st
profile filter
– for e.g. Sample file – Scheme no 1
Take reference of Bible of code
In case the scheme includes the detail of BPL / APL and caste in this scenario the
multiple entries will take place
Reviewing ministry website along with all its departments / corporations
Estimate the number of schemes to be listed
Make daily plan for entire ministry and estimate no. of days of work
Start Scheme listing as per the guidelines / checklist given
Spell Check / Grammar Check for the entire entry / set of entries
Cross verification of the entries with the link for missing columns / data
Final mapping with the guidelines for corrections
21. FINAL REPORT 21
Profile filters – 4th
& 5th
– This is named as miscellaneous and depend upon the scheme
How to enter “By Purpose” - for e.g. Scheme no 1 to 5 of the Sample file.
22. FINAL REPORT 22
The description of “By Purpose” column must include in a single sentence, the following things about the
scheme – o What Benefit is being given (e.g. Financial Assistance),
In what form (e.g. in the form of Grant),
for what purpose / reason (e.g. for purchase of hand weaving machinery) - Who is
the beneficiary (e.g. to BPL artisans)?
So, “Financial assistance in the form of Grant for the purchase of hand weaving machinery to BPL artisans”
Keep by purpose in short and simple language.
In case of multiple entries of single schemes with same set of initial filters and similar
looking “By Purpose” and “Scheme Title, use * for highlighting
Only the differentiated word must be coded between * for highlighting the difference of
the multiple entries
How to enter Domain – For e.g. Scheme no 4 in the sample file
Domain must be from the list of domains as visible on the Portal’s main page
All possible options after going through the scheme need to be mention in domain
separated by comma e.g. Education, Labour and Employment
Master UID
Give the name as follows: - Department Name_Guj / Eng_001_State name
Unique number Sr.no – 1,2,3,4…….
Central / State
Sponsors of the scheme (Any one of the given either central or state)
State - for e.g. Scheme no 4 of Sample file
State name to be entered if you are listing Schemes of State Government
In Central Government Schemes, if the states / region are mentioned then those states /
region to be mention in state column otherwise enter “All States”
Department Name to be showcased in the portal
Department name must be the one from where you got the scheme
Sub department name_1 & 2 - for e.g. Sample file
If sub-department is given, then enter it in column sub – department _1
If there is further classification of sub – department_1, then enter that sub- department
in column named sub- department _2
Scheme title to be displayed - For e.g. – Scheme no 2 & 3 of the sample file.
Scheme title must be similar to the scheme purpose
23. FINAL REPORT 23
In case the scheme name is different & unable to understand easily – we are supposed to
enter the scheme name & then the purpose of it to give it a proper title - Keep the Scheme
Title in short and simple language.
The “Scheme Title” column must include in a single sentence, the following things about
the scheme –
What Benefit is being given (e.g. Financial Assistance),
In what form (e.g. in the form of Grant),
For what purpose / reason (e.g. for purchase of hand weaving machinery)
Who is the beneficiary (e.g. to BPL artisan)
So, “Financial assistance in the form of Grant for the purchase of hand weaving machinery to BPL artisans”
In case of multiple entries of single schemes with same set of initial filters and similar
looking “By Purpose” and “Scheme Title, use * for highlighting
Only the differentiated word must be coded between * for highlighting the difference of
the multiple entries
Eligibility - For e.g. Sample file
Eligibility criteria of the beneficiary to be entered
The eligibility must be as per profile filters also if NO eligibility criteria is given in scheme.
(for e.g. if the profile filter has said SC and eligibility on the website does not mention SC,
still we must write that “Beneficiary should be from SC category.)
In case the eligibility is mapped with benefits in tabular format, then form them in
sentence for listing and keep the order of entries same as those mentioned in table - In
case of multiple points # to be used.
Use “Alt + Enter” for moving to next line and keeping the entries well formatted - Enter
required eligibility criteria only
Beneficiary
The one who is getting the benefit of the scheme
Benefit type - the benefit type must be any of the following
Financial Assistance
Products
Services
Awards & Recognition
Description of Benefit - For e.g. Sample file – Scheme no 3
24. FINAL REPORT 24
Benefit of the scheme as given to be entered
Mention the main benefit right at the beginning and then the details later.
In case the benefit is given in tabular form then form them in sentence for listing and keep
the order of entries same as those mentioned in table while entering eligibility
In case of multiple points # to be used.
Use “Alt + Enter” for moving to next line Link – for e.g. Sample file
Link of the scheme to be entered in link column
If the link of scheme is in pdf form, then page number of the scheme to be entered
Conditionality
Conditionality is to be written which are related ONLY to the benefit and not regarding
any other aspect like application process, eligibility, etc. - In case of multiple points # to
be used.
Use “Alt + Enter” for moving to next line Annexure link
If the annexure is provided in the scheme, then enter the link of annexure with page number.
Possible Duration
As mentioned in the scheme the operative period of the scheme has to be entered
How to Apply
Enter the link of the application form and details
In case the scheme is in pdf form and having application form enter the page number of
application form and link with other details
In case of multiple points use # as bullet
Use “Alt + Enter” for moving to next line just before entering the link
Sponsoring Agency
Government / NGO - Any one of them to be entered.
Set of Instructions
When the beneficiary of the scheme is one which is government body, then NO LISTING
of that scheme is required.
When the scheme includes bidding process or tender based selection of the beneficiary,
then NO LISTING of that scheme is required.
Profile filters - In profile filters 1st
three are mandatory and to be entered with reference
of bible of code.
25. FINAL REPORT 25
“By Purpose” - DO NOT simply copy paste the “By Purpose” description from the scheme
but think once if the same is relevant. In most cases, the same will be relevant but in some
cases, additions / deletions might be necessary. Use your discretion.
“By Domain” - If the scheme is leading to multiple domains, enter all in main domain
column separating by comma (,)
“State” - If no states are given in scheme of central government – enter All states - Scheme
title - DO NOT simply copy paste the “By Purpose” description to “Scheme Title to be
displayed” and think once if the same is relevant. In most cases, the same will be relevant
but in some cases, additions / deletions might be necessary. Use your discretion.
Eligibility – Cover the required points of the profile filters and ensure all the points start
with the bullet #
Description of benefit – Mention the main benefit right at the beginning and then the
details later. And ensure all the points to be bulleted with using # only
Page number of the link is must when the scheme is in pdf form
In cases wherever the benefit is “Training” or “Scholarship”, the duration or standard for
which it is applicable needs to be mentioned in the “Benefit
Description”
In some cases, the Website structure is such that, navigating back to the original scheme
becomes difficult even after clicking on the link. E.g.
http://rubberboard.org.in/Scheme.asp?Id=4
In such cases, kindly also mention in the link column – the process to reach the
appropriate scheme e.g. Click on _____ the below link AND THEN, Click on ______ to reach
the Scheme description.
Regarding Scheme Duration, if a scheme is mentioned to have been expired – (a) Search
Google to check if the scheme is currently running?
(b) If not found, then ENTER the scheme and we shall verify it during validation exercises to
the Government offices
Checklist for Cleaning of Data entries
Mapping of possible entries
Profile filter – reference bible of code
Easy understandable language and in short for by purpose, Scheme title, eligibility & description
of benefit.
26. FINAL REPORT 26
Scheme title: only specific schemes name to be added
In case of multiple entries of single schemes with same set of initial filters and similar looking “By
Purpose” and “Scheme Title, use * for highlighting” and # for bullet points in eligibility, description
of benefit & how to apply
Unnecessary space needs to be removed
Benefit type: - Financial assistance, Product, Services & Awards / recognition.
Page number of scheme to be provided in the link column (if file is in pdf form - too large with
multiple schemes)
NO cells are to be left EMPTY
Cross verification of entries with link
Spell check & Grammar Check the entire data sheet at the end of every day before saving the
sheet