Amu Prabhjot Singh 10BM60011 Divya Hamirwasia 10BM60025
   an interactive data transformation tool    developed by the Stanford Visualization    Group.   allows direct manipula...
   When the user selects any data, applicable transformations are    suggested by the tool based on the current context o...
   underlying declarative data transformation language   language consists of 8 classes of transformations    ◦ Map     ...
   This is the example data available with data    wrangler.   House crime data from the U.S. Bureau of    Justice Stati...
User interactions                                        Inferring transform Current working                            pa...
   GETTING STARTED    ◦ Browser based tool: http://vis.stanford.edu/wrangler/   DATA ENTRY    ◦ copy and paste the data ...
   helps to speed up the process of data    manipulation   helps managers to spend more time analyzing    and learning f...
Upcoming SlideShare
Loading in …5
×

DataWrangler @VGSOM

458 views

Published on

Published in: Education, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
458
On SlideShare
0
From Embeds
0
Number of Embeds
27
Actions
Shares
0
Downloads
9
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

DataWrangler @VGSOM

  1. 1. Amu Prabhjot Singh 10BM60011 Divya Hamirwasia 10BM60025
  2. 2.  an interactive data transformation tool developed by the Stanford Visualization Group. allows direct manipulation of visual data provides automatic suggestions for relevant transformations used in activities like reformatting data values and formats, integrating data from multiple sources, missing values etc use of Wrangler reduces the specification time significantly
  3. 3.  When the user selects any data, applicable transformations are suggested by the tool based on the current context of interaction Data wrangler uses a modeling technique to enumerate and rate the possible transformations This model combines users inputs with diversity, frequency and specification difficulty of applicable transform types Wrangler provides short natural language descriptions of the transforms and also provides the visual previews of the transform results This helps analysts to assess the viable transforms quickly Wranglers interactive history viewer records and shows the step of transforms applied on the data set so as to facilitate reuse. Wrangler scripts can be run in a web browser using JavaScript or Python code
  4. 4.  underlying declarative data transformation language language consists of 8 classes of transformations ◦ Map  One to zero  One to One  One to Many ◦ Look ups and Joins ◦ Reshape  Fold  unfold ◦ Positional  Fill  Lag ◦ Sorting ◦ Aggregation ◦ Key Generation ◦ Schema Transforms
  5. 5.  This is the example data available with data wrangler. House crime data from the U.S. Bureau of Justice Statistics Csv format data
  6. 6. User interactions Inferring transform Current working parameters transform Generating candidate DATA WRANGLER transforms Data descriptions Ranking the resultsCorpus of historical usage statistics
  7. 7.  GETTING STARTED ◦ Browser based tool: http://vis.stanford.edu/wrangler/ DATA ENTRY ◦ copy and paste the data to be wrangled into the input window. ◦ Input format : csv files, tsv files and manual entry TRANSFORMS • Cut • Merge • Delete • Promote • Drop • Split • Edit • Translate • Extract • Transpose • Fill • Unfold • Fold OUTPUT Two types of outputs: ◦ Data Output.xlsx  Csv, tsv, row oriented JSON, column oriented JSON, look up tables ◦ Script  Python, java script
  8. 8.  helps to speed up the process of data manipulation helps managers to spend more time analyzing and learning from their data rather than spending much of the time just rearranging it allows interactive transformation of messy, real- world data and export data for use in Excel, R, Tableau, Protovis etc LIMITATION: data containing more than 40 columns and 1000 rows cannot be wrangled

×