Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Report
Ziyue Jin
February 8, 2016
Character Count vs Pageview
I first export top 100 pageviews post from thebacklabel. Then...
0
10000
20000
30000
20 40 60
count
Pageviews
0
4000
8000
20 40 60
count
Pageviews
2
Character Count vs Page Entrance
This part follows exactly same process as previous part. We still cannot see a clear patt...
0
10000
20000
10 20 30 40 50 60
count
Entrances
0
2000
4000
6000
8000
10 20 30 40 50 60
count
Entrances
4
Word Cloud of Page Title
When there is no clear pattern in previous plot, I turn to focus on how the content of the page t...
200
400
600
800
0 300 600 900
number
total
0 200 400 600 800 1000
200400600800
word count
averagereadingtime
6
Upcoming SlideShare
Loading in …5
×

report1

132 views

Published on

  • Be the first to comment

  • Be the first to like this

report1

  1. 1. Report Ziyue Jin February 8, 2016 Character Count vs Pageview I first export top 100 pageviews post from thebacklabel. Then, I count number of characters in a post title. The reason I did this is because we care about length of our title on the webpage. Number of characters is better here. From the histogram, most of our titles have length from 20 to 40. ## [1] "/Users/jinziyue/Documents/Intern" Number of characters in Page title character count Frequency 0 10 20 30 40 50 60 70 051015202530 We plot the graph of character counts against pageviews. There is a peak in the graph, but it is caused by two outliers. After we remove outliers, we barely see a pattern in the graph. 1
  2. 2. 0 10000 20000 30000 20 40 60 count Pageviews 0 4000 8000 20 40 60 count Pageviews 2
  3. 3. Character Count vs Page Entrance This part follows exactly same process as previous part. We still cannot see a clear pattern in the graph of character counts vs page entrance. Number of Characters in Page title character count Frequency 10 20 30 40 50 60 70 051015 3
  4. 4. 0 10000 20000 10 20 30 40 50 60 count Entrances 0 2000 4000 6000 8000 10 20 30 40 50 60 count Entrances 4
  5. 5. Word Cloud of Page Title When there is no clear pattern in previous plot, I turn to focus on how the content of the page title affects page view. Suggested by Dale, I only focused on posts which has pageviews more than 20 and average reading time less than 10 minutes. I extract the top 200 posts and see what are most common words in their titles. (visualizing by word cloud) wine best backlabelrecipe recipes author justin hot ways bottle cocktails diy food lowdown reasons red warners will wines better champagne newpage table world airbnb beach chicken drink make perfect steak tasting tinder 2013 archives art bars beef beer blend candy cider corn dont edition facts know life like many must pairing pan pork quotes rosé sheet suppers tips top warnerworlds Of course, wine is the largest one in the plot. :) From the word cloud, we can see people like reading posts contain “best”, “recipe(s)” in their titles. Article Length vs Page Reading Time Suggested by Dale, I only care about article which has reading time less than 10 minutes here. Both plots here are about word counts in a post vs its average reading time. Even though the patter is not so clear here, but we can say that a post has word counts between 300 to 600 has longer average reading time (exclude the outlier). 5
  6. 6. 200 400 600 800 0 300 600 900 number total 0 200 400 600 800 1000 200400600800 word count averagereadingtime 6

×