optmeth-presentation

DOMAIN
• Sina Weibo is a Chinese microblogging
website
• One of the most popular sites in China
• Over 600 mln. registered users
(well over 30% of the Internet)
• 86.6% of the Chinese microblogging
market
• ~100 mln. messages posted each day

OBJECTIVE
• Investigate information and
inﬂuence spread in the network
• Find the most inﬂuential users
and companies in the IT,  
Science &Technology sphere
• Find those who will spread the
word about Skoltech with
minimum cost and maximum
effectiveness

• The normal way to do that is to use the API
EXPECTATIONS

ROADBLOCKS
• 中国的语⾔言是很难理解
• API is essentially non-functional and the
documentation is misleading and confusing
• trafﬁc is severely limited (150 calls/hour)
• connection is unstable

SOLUTION STRUCTURE
To overcome the difﬁculties we came up with the following
solution:
• refer to : whiteboard
• state of the art parser/grabber to capture data
• API is used to get user statistics
• data is interpreted in the facility location framework

APPROACH
• Analyze most popular posts with tags like
#innovation, #education, #science, #technology
• Create a ranked list of their authors (clients) 
(higher relevance = higher rank)
• Find out, whom they follow (facilities)
• Optimize: open the facilities which provide
maximum information spread

CLIENTS VS FACILITIES
Kai-Fu Lee

POTENTIAL IMPROVEMENTS
• better cost assignment estimation (based on
facility posts ranks)
• better source clients (more tags)
• handling of inﬂuence of posts from multiple
facilities to the same client

CREDITS
• Kalan Abe: parser core, pagerank, graph visualization
• Nikita Pestrov: initial concept, raw data processing
for optimization via CVX, Chinese language
understanding
• Denis Antyukhov:Weibo API, parser, data grabbing,
infographics and presentation

Our project is available on GitHub:
https://github.com/pestrov/SkolWeng
where you can get the code, screenshots, raw data
and witness the history of our struggle
ThankYou

optmeth-presentation

Recommended

Recommended

More Related Content

Similar to optmeth-presentation

Similar to optmeth-presentation (20)

More from aphex34

More from aphex34 (6)

optmeth-presentation