Challenges
• Extracting data from PDFs is not easy.
• Fonts used in a document may not be installed on the computer.
• Software
Open Data
Why Open Data?
• Open data, especially open government data, is
a terrific resource that is as yet largely
untouched.
• There are many areas where we can expect
open data to be of value.
• www.wheredoesmymoneygo.com
• Tux tree
What is a data wrangler?
• A data wrangler is the person performing the
wrangling.
• Data wrangling is, loosely, the process of
manually converting or mapping data from
one "raw" form into another format. It can
include further munging and data visualization.
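As a rough illustration of converting a "raw" form into another format, the sketch below turns whitespace-aligned text (the kind often copied out of a PDF table) into CSV. The sample data, the function names `wrangle` and `to_csv`, and the rule that columns are separated by two or more spaces are all assumptions made for this example, not part of the original slides.

```python
import csv
import io
import re

# Hypothetical raw text, e.g. lines copied out of a PDF table.
RAW = """\
Department   Budget 2012   Budget 2013
Health       1,200         1,350
Education    980           1,010
"""


def wrangle(raw):
    """Split whitespace-aligned text into rows of cleaned cells."""
    rows = []
    for line in raw.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Assumption: columns are separated by two or more spaces.
        cells = re.split(r"\s{2,}", line.strip())
        cleaned = []
        for cell in cells:
            # Drop thousands separators from purely numeric cells.
            digits = cell.replace(",", "")
            cleaned.append(digits if digits.isdigit() else cell)
        rows.append(cleaned)
    return rows


def to_csv(rows):
    """Serialize rows to a CSV string using the standard csv module."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(wrangle(RAW)))
```

Real PDF extracts are usually messier (merged cells, wrapped lines, missing fonts), so a splitting rule like this one normally needs adjusting by hand for each source.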
Why put data in CSV?
Software
• ABBYY FineReader
• Cometdocs
• Tabula
Data Wrangling
