View stunning SlideShares in full-screen with the new iOS app!Introducing SlideShare for AndroidExplore all your favorite topics in the SlideShare appGet the SlideShare app to Save for Later — even offline
View stunning SlideShares in full-screen with the new Android app!View stunning SlideShares in full-screen with the new iOS app!
an interactive data transformation tool developed by the Stanford Visualization Group. allows direct manipulation of visual data provides automatic suggestions for relevant transformations used in activities like reformatting data values and formats, integrating data from multiple sources, missing values etc use of Wrangler reduces the specification time significantly
underlying declarative data transformation language language consists of 8 classes of transformations ◦ Map One to zero One to One One to Many ◦ Look ups and Joins ◦ Reshape Fold unfold ◦ Positional Fill Lag ◦ Sorting ◦ Aggregation ◦ Key Generation ◦ Schema Transforms
This is the example data available with data wrangler. House crime data from the U.S. Bureau of Justice Statistics Csv format data
User interactions Inferring transform Current working parameters transform Generating candidate DATA WRANGLER transforms Data descriptions Ranking the resultsCorpus of historical usage statistics
GETTING STARTED ◦ Browser based tool: http://vis.stanford.edu/wrangler/ DATA ENTRY ◦ copy and paste the data to be wrangled into the input window. ◦ Input format : csv files, tsv files and manual entry TRANSFORMS • Cut • Merge • Delete • Promote • Drop • Split • Edit • Translate • Extract • Transpose • Fill • Unfold • Fold OUTPUT Two types of outputs: ◦ Data Output.xlsx Csv, tsv, row oriented JSON, column oriented JSON, look up tables ◦ Script Python, java script
helps to speed up the process of data manipulation helps managers to spend more time analyzing and learning from their data rather than spending much of the time just rearranging it allows interactive transformation of messy, real- world data and export data for use in Excel, R, Tableau, Protovis etc LIMITATION: data containing more than 40 columns and 1000 rows cannot be wrangled