Exploring Public APIs with MongoDB and Analytica         Nosh Petigara      nosh@analytica.com           @noshp        @an...
Today• MongoDB and public APIs• What is Analytica?• Demo  – Analytica shell (twitter data)  – Analytica for Excel (StackOv...
Some data sets to explore• Twitter API (JSON)   – https://dev.twitter.com/• Crunchbase API (JSON)   – http://developer.cru...
MongoDB and public APIs• Most APIs talk JSON   – MongoDB’s native JSON import• APIs vary wildly (internally and between on...
Importing data sets• Streaming JSON directly into MongoDB  – curl    https://stream.twitter.com/1/statuses/sample.jso    n...
Analytica• Analytics & reporting platform for MongoDB  – Natively understands JSON/document hierarchy  – Tailored for anal...
What can you do with Analytica?• Inspect and extract data• Augment your data model• Calculate & aggregate• Filter and tran...
Demos• Today  – Twitter stats [using the Analytica Shell]  – Stackoverflow community analysis [using    Analytica for Exce...
Demo 1: Twitter data
Some other examplesTweets vs. retweets   count(select(twitter.tweets.where(retweet_count <> 0)))vs. repliesFollower count...
Demo 2: StackOverflow User Profiles
Next steps• Beta  – http://analytica.com• Get in touch  – nosh@analytica.com or info@analytica.com• @analytica_inc on twit...
Upcoming SlideShare
Loading in …5
×

Exploring Public Datasets and APIs with MongoDB and Analytica

1,915 views

Published on

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,915
On SlideShare
0
From Embeds
0
Number of Embeds
474
Actions
Shares
0
Downloads
11
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Exploring Public Datasets and APIs with MongoDB and Analytica

  1. 1. Exploring Public APIs with MongoDB and Analytica Nosh Petigara nosh@analytica.com @noshp @analytica_inc
  2. 2. Today• MongoDB and public APIs• What is Analytica?• Demo – Analytica shell (twitter data) – Analytica for Excel (StackOverflow data)
  3. 3. Some data sets to explore• Twitter API (JSON) – https://dev.twitter.com/• Crunchbase API (JSON) – http://developer.crunchbase.com/• Stackoverflow (JSON and CSV) – http://data.stackexchange.com/• NYTimes (JSON, XML) – http://developer.nytimes.com/docs
  4. 4. MongoDB and public APIs• Most APIs talk JSON – MongoDB’s native JSON import• APIs vary wildly (internally and between one another) – MongoDB is schema-free• Data import is only half the battle – MongoDB’s query language and aggregation framework
  5. 5. Importing data sets• Streaming JSON directly into MongoDB – curl https://stream.twitter.com/1/statuses/sample.jso n –uUSERNAME:PASSword| ./mongoimport –d twitter –c tweets• Importing JSON files – ./mongoimport –d mydb –c mycoll file.json• CSV – ./mongoimport –d db –c coll --type csv -- headerline myfile.csv
  6. 6. Analytica• Analytics & reporting platform for MongoDB – Natively understands JSON/document hierarchy – Tailored for analytics – Works directly on MongoDB• Discovery, analysis, visualization cycle• In beta [http://analytica.com]
  7. 7. What can you do with Analytica?• Inspect and extract data• Augment your data model• Calculate & aggregate• Filter and transform data• Join collections
  8. 8. Demos• Today – Twitter stats [using the Analytica Shell] – Stackoverflow community analysis [using Analytica for Excel]• Not shown – REST API – Analytica web (Coming soon)
  9. 9. Demo 1: Twitter data
  10. 10. Some other examplesTweets vs. retweets count(select(twitter.tweets.where(retweet_count <> 0)))vs. repliesFollower counts max(twitter.tweets.user.followers_count)Popular hashtags set twitter.byhashtag = group(tweets.by(entities.hashtags.text)) set twitter.byhashtag.tweetcount = count(tweets) set twitter.populartags = orderdesc(byhashtag.by(tweetcount)) get twitter.populartags.text
  11. 11. Demo 2: StackOverflow User Profiles
  12. 12. Next steps• Beta – http://analytica.com• Get in touch – nosh@analytica.com or info@analytica.com• @analytica_inc on twitter

×