A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging

A User Oriented
Modeling Analysis
of Cultural Backgrounds in Microblogging

Elena@Ilina.nl

Best Paper Award

http://www.asesite.org/awards/awards/164.html
2

Outline
1.
2.
3.
4.
5.

Introduction
Lewis Model of Cultures
Approach
Experimental Setup
Results

3

The Lewis Model of Cultures
Richard Lewis (2000) “When cultures collide:
Managing successfully across cultures”
Hispanic
America

MULTIACTIVE

Italy,
Portugal,
Spain

Argentina,
Brazil,
Chile,
Sub-Saharan
Mexico
Africa

USA

China

LINEARACTIVE

REACTIVE

UK

Germany,
Switzerland

Japan

Vietnam
4

Personality Traits
Multi-active

Linear-active
Talks half the time
Does one thing at a time
Plans ahead step by step
Polite but direct
Partly conceals feelings

Talks most of the time
Does several things at once
Plans grand outline only
Emotional
Displays feelings

Reactive

Listens most of the time
Reacts to partner’s action
Looks at general principles
Polite, indirect
Conceals feelings

5

Personalizing E-commerce
• Customized product descriptions
• User preferences and previous purchase history
- may not be directly available or not up to date
• Targeted advertisements
Web Site

Advertisiment Platform

http://google.com

Search Results

http://amazon.com

Web shop

http://groupon.com

Web site and e-mail

http://triggit.com

Facebook

6

Culture-oriented User Modeling
•
•
•
•

Adapting Applications to Cultural Origins
Using Social Web Data
Finding Microblogging Patterns
Creating Culture-oriented User Profiles
describing specific user preferences

When cultural background is not known,
can we find cultural cues from microblogs ?
7

Inferring User Cultural Traits
Culture-specific
User Traits
Differences in Behaviour

Microblogging
Patters

Differences in Microblogging

Adaptation
Employing User Profiles

Culture-oriented
User Modeling
Creating User Profiles

8

Content
Activity

• Tweeting Mobility (geo-locations)
• Posting on Weekends
• Friends and Followers
• User Mentions

Conversation

• URLs and Hashtags
• Automatically-detected Languages

Social

Twitter-specific Features

• Retweets and Replies
9

Example: German User
• User A from Berlin, German language specified in
Twitter Profile
• URLs and Hashtags: 49 and 4
• Automatically-detected Languages -2
• Tweeting Mobility (geo-locations) -7
• Posting on Weekends -23 out of 100
• User Mentions - 75
• Friends and Followers: 50 and 96
• Retweets and Replies: 1 and 28

Example: German User
Gute Nacht! #TWoff
Weekend’s Tweet
Language: de
tweet place: Berlin
I'm at Laroy w/ @username http://t.co/Ct0ObmPz
URLs: 0
Workday’s Tweet
Tags: 1
Language: de
Mentions: 0
tweet place: Sweden 1 w/ @username) http://t.co/8c
Tschüss Madrid :) (@ Terminal (Stockholm)
URLs:
Detected Languages:
Workday’s Tweet 1
Language: Tags: 0
es
Mentions: 1
English: 43
tweet place: Spain
German: 23
URLs: 1
Other: 34
Tags: 0
Mentions: 1

Twitter User Profile Information
Location: Germany (Berlin)
Language: German

Experimental Setup
1. Select Users
2. Collect Tweets
3. Create user profiles
4. Create a classifier
5. Evaluate performance
12

Select Users

Twitter
API

Retrieve
Users(CURL)

MySQL

13

Crawling & Data Processing
Twitter
API

Retrieve Streams
(CURL)

Performance
Report
Tests
(Matlab)

Store JSON
(java)

MySQL
(Tweets)

MySQL
(User Profiles)

Select and Store
Features (java)

Country Total Number Users Posted 100
Of Users
or More Tweets
Japan

4885

2984

Spain

4906

3119

Brazil

4910

2935

USA

1714

1316

Germany 2823

1644

1 199 800 tweets

Microblogging Patterns
Cool, factual,
planners

Hashtags
URLs
Mobility
Networking
LINEARACTIVE
(Germany,
USA)

MULTIACTIVE
(Brazil, Spain)

Courteous,
accommodating,
listeners

Weekends
Replies

Warm, emotional,
loquacious

Mentions
Retweets
Languages

REACTIVE
(Japan)

16

8
Germany
Japan
Spain
USA
Brazil

6

DE
JP
ES
US
BR

c1

4

2

0

−2

−4

−8

−6

−4

−2

0
c2

2

4

6

17

Linearactive
Reactive

Reactiv
e
2.5 (C)

MultiLinear
active
Reactive
1.1 (B)

Multi

4.1 (A)

B
A
C

18

Classification Models
1

Language Codes

Number of

LANG

DEF

3

DEF+LANG
19

• URLs
• Hashtags
• Automatically-detected
Languages
• Geo-locations Detected
• Posts on Weekends
• Friends
• Followers
• User Mentions
• Retweets
• Replies

2

Decision Tree (LANG Feature)
Language Code
>= 4.5
< 4.5
JP
>= 3.5
< 3.5
>= 2.5

< 2.5
< 1.5
< 0.5

>= 1.5
>= 0.5

BR
DE

DE

BR

ES

Language
Japanese
Spanish
Portuguese
German
English
Other

Code
5
4
3
2
1
0
20

Languages in User Profiles

Languages: Native, English, Other
21

Decision Tree (DEF Features)
Languages
>= 0.5
< 0.5
Tags

Tags
< 6.5
JP

<54.5
JP

< 13.5

>= 6.5

Mentions
>= 54.5
< 69.5
URLs
>= 26.5 BR
< 26.5
ES

>= 13.5
Mentions
>= 69.5
ES

US

22

Classification Results
Country-level
Model
1
2
3

Features
LANG
DEF
L.+D.

Resubst.Err.
0.22
0.17
0.02

Cross-valid. Err.
0.22
0.42
0.06

Culture-level
Model
1
2
3

Features
LANG
DEF
L.+D.

Resubst.Err.
0.17
0.10
0.01

Cross-valid. Err.
0.17
0.29
0.04
23

Lang. Code
Lang. Code
Languages

< 2.5
< 0.5

Linear-active
Tags
< 8.5

>= 0.5
Lang. Code

< 1.5
Languages

Multi-active

Reactive

Multi-active

>= 1.5

>=8.5

Linear-active
Multi-active
Replies

< 38.5
Tags

>=48.5
Tags
< 15

Reactive

>= 2.5

>= 1.5

< 1.5

Replies
< 48.5

>= 4.5

< 4.5

>=15
Multi-active

< 36.5

>=38.5

>=36.5
Mentions

Multi-active

< 84.5
Weekends

< 20.5
Multi-active

>=20.5

>=84.5
Multi-active

Linear-active

Linear-active

Key Findings (Cultural Groups)
• Linear-active Users prefer sharing URLs and Hashtags,
and have larger social networks.
• Reactive users do not share so many Hashtags, they,
however, tend to Reply more than Multi-active users.
They employ the least of foreign languages, have lowest
tweeting mobility and tweet mostly on Weekends.
• Multi-active users generally employ more foreign
languages in their content.
25

Key Findings (Country Groups)
• German users share the most of Hashtags and tend to
reply;
• Users from the USA share the most of URLs, have
largest social networks than others and tweeting
mobility;
• Spanish users tend to retweet and mention other users;
• Brazilian users reply the least;
• Users from Japan tweet the most on weekends and
share the least of hashtags and user mentions, employ
the least of foreign languages and have lowest tweeting
26
mobility.

Adaptation Options
When appropriate, creating adaptive apps such as ecommerce or social network web sites to fit user
preferences for:
• sharing content;
• employing foreign languages;
• changing locality;
• communicating with other users.

27

Further Work
• Employ larger data set;
• Include more countries and add features;
• Extend our platform for other social networking
web sites;
• Recommending products/content in accord to user
cultural origings

28

Conclusions
Culture-oriented User Modeling
• Found microblogging patterns for cultural groups
• Employed them for identifying cultural origins
• Got insights on culture-oriented user modeling and
adaptation

29

Thank You
Full-text

Elena Daehnhardt

Google Scholar

Supplementary
Material

Elena@Ilina.nl
www.daehnhardt.com

30

A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging

Recommended

Recommended

More Related Content

Similar to A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging

Similar to A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging (20)

Recently uploaded

Recently uploaded (20)

A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging

Editor's Notes