Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Challenges
• Not easy to extract data from PDF.
• Fonts are not available in computer.
• Software
Open Data
Why Open Data?
• Open data, especially open government data, is
a terrific resource that is as yet largely
untouched.
• Th...
What is data wrangler?
• A data wrangler is the person performing the
wrangling.
• Data wrangling is loosely the process o...
Why to put Data in C.S.V.
Software
•ABBYY FineReader
•Cometdocs
•Tabula
Data Wrangling
Upcoming SlideShare
Loading in …5
×

Data Wrangling

866 views

Published on

Challenges faced during extracting data from websites and tools to do the extraction. Presentation by Manish Dangol

Published in: Technology
  • Be the first to comment

  • Be the first to like this

Data Wrangling

  1. 1. Challenges • Not easy to extract data from PDF. • Fonts are not available in computer. • Software
  2. 2. Open Data
  3. 3. Why Open Data? • Open data, especially open government data, is a terrific resource that is as yet largely untouched. • There are many areas where we can expect open data to be of value. • www.wheredoesmymoneygo.com • Tux tree
  4. 4. What is data wrangler? • A data wrangler is the person performing the wrangling. • Data wrangling is loosely the process of manually converting or mapping data from one "raw" form into another format which includes further munging, data visualization.
  5. 5. Why to put Data in C.S.V.
  6. 6. Software •ABBYY FineReader •Cometdocs •Tabula

×