SlideShare a Scribd company logo
1 of 41
위치공유 SNS와 소셜 어노테이션 서비스를 이용한
     소셜 검색 엔진의 설계 및 구현




              발 표 자 : 이 동 균
              지도교수 : 권 준 희
Aardwolf
Enhanced Aardvark, social search engine
Contents
 Related work
 Our work
 Scenario
 Conclusion
Related Work
Aardvark
               Aardvark is social                          search engine


                                      $50Million
Reference :
 - http://techcrunch.com/2010/02/11/google-acquires-aardvark-for-50-million
What is Social Search?
           the incorporation of information about social
           networks and relationships into the information
           retrieval process

           “Social search” is an umbrella term used to
           describe search acts that make use of social
           interactions with others.

Reference :
 - Sebastian Marius Kirsch, “Social Information Retrieval”, Rheinische Friedrich-Wilhelms-Universität Bonn, 2005
 - Einat Amitay et al, “Social Search and Discover Using Unified Approach”, ACM, 2009
 - David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009
 - Meredith Ringel Morris et al, “What Do People Ask Their Social Networks, and Why?”, CHI, 2010
 - Brynn M Evans, Ed H. Chi, “Towards a Model of Understanding Social Search”, ACM, 2010
What is Social Network?
           A social structure made of individual called
           ‘nodes’, which are connected by one or more
           specific types of interdependency, such as
           friendship, common interest, financial exchange,
           dislike or prestige.




Reference :
 - http://en.wikipedia.org/wiki/Social_network
 - http://en.wikipedia.org/wiki/Social_networking
Goal of Social Search
           To use the social information to improve
           the user’s search experience over regular
           full text search.


           Social search(a.k.a social information retrieval)
           = social networks + information search



Reference :
 - Sebastian Marius Kirsch, “Social Information Retrieval”, Rheinische Friedrich-Wilhelms-Universität Bonn, 2005
 - Einat Amitay et al, “Social Search and Discover Using Unified Approach”, ACM, 2009
Previous Studies
           Social Network Based Search
            - PeopleRank, SocialRank, SNDocRank, Aardvark

           Social Annotation Based Search
           - FolkRank, SocialBookmarkRank, SocialSimRank,
           SocialPageRank



Reference :
- Andreas Hotho et al, “Information Retrieval in Folksonomies”, The Semantic Web, 2006
- Yusuke Yanbe et al, “Can Social Bookmarking Enhance Search in the Web?”, ACM, 2007
- Shenghua Bao et, al, “Optimizing Web Search Using Social Annotations”, WWW2007, 2007
- David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009
- Liang Gou et al, “SNDocRank: a social network-based video search ranking framework”, ACM, 2010
- Abderrahmen Mtibaa et al, “PeopleRank: Social Opportunistic Forwarding”, IEEE, 2010
Aardvark
Aardvark is social search engine

- Social Network Based Search
- Deployed and validated in large-scale
is
Social Question-Answering Service
Which one is better?
                                        Aardvark




                     vs
 <Anonymous group>        <My Friends (of the friends)>
Social network rocks!

     • More Reliable

     • More Satisfactory

     • Even More Responsible



Reference :
- Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
- Meredith Ringel Morris et al, “What Do People Ask Their Social Networks, and Why?”, ACM, 2010
- David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009
Our work
Aardwolf
Enhanced Aardvark, social search engine
Structure of Aardvark
                        Sign-up time
                                                                  DB
                            Importer                                                    Index
                                                            User, Topic,
                                                          Graph, Content




                        at query time

                                                           Conversation                Routing
                            Gateway
                                                             Manager                   Engine



Reference :
- Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
First solution :
     We deal with user’s up-to-date topic



                                                                   Your
                                                                   up-to-date topic
                                                                   does matter

Reference :
- Xuerui Wang, Andrew McCallum, “Topics over Time”, ACM, 2006
- Andreas Hotho et al, “Trend Detection in Folksonomies”, SAMT, 2006
- http://burak-arikan.com/growth-of-a-twitter-graph, “Growth of a twitter graph”, 2010
Structure of Aardwolf

                         Up-to-date Topics          Up-to-date
                                                      Topic
                                                    Aggregator
Social Annotation Service


            at query time                    Up-to-date Topics


                                     Conversation                Routing
               Gateway
                                       Manager                   Engine
Why                                                                          ?
                           Social Annotation Service


     • Tag(Annotation) as
     topic

     • Easiest way to get
     user’s up-to-date
     topic
Reference :
- Andreas Hotho et al, “Information Retrieval in Folksonomies”, The Semantic Web, 2006
- Yusuke Yanbe et al, “Can Social Bookmarking Enhance Search in the Web?”, ACM, 2007
- Shenghua Bao et, al, “Optimizing Web Search Using Social Annotations”, WWW2007, 2007
Ranking process of Aardvark
    Gateway

          Query : “what would be perfect for my girlfriend’s present?”




    Routing
    Engine


                 Rank users by algorithm of Aardvark



          Users : Ranked by their social information


  Conversation
    Manager
Ranking Algorithm

     User’s answerable probability



            Intimacy            Expertise

                                                 Topic Assignment


                                Topic Expertise

Reference :
- Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
Ranking process of Aardwolf
    Gateway

          Query : “what would be perfect for my girlfriend’s present?”



                                      Up-to-date Topic
                                        Aggregator
                  Up-to-date Topics                           Social Annotation Service
    Routing
    Engine


                 Rank users by algorithm of Aardwolf



          Users : Ranked by their social information


  Conversation
    Manager
Ranking Algorithm
    Aardvark                       Topic Expertise




    Aardwolf


                                            Topic                              Up-to-date Topic




Reference :
- Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
Second solution :
We deal with user’s knowledge of location

    • User’s knowledge of location
Structure of Aardwolf
                    Location History      Location
                                          Manager
Location sharing SNS


           at query time

                           Conversation     Routing   Up-to-date
         Gateway                                        Topic
                             Manager        Engine    Aggregator
What is the                   ?
              Location Sharing SNS



User’s
knowledge
of Location
Ranking process of Aardvark
    Gateway

          Query : “where is the best restaurant in Kang-Nam, Seoul ?”




    Routing
    Engine
                 1. Select users if their profile mentions Seoul
                 2. Rank users by algorithm of Aardvark




          Users : Ranked by their social information


  Conversation
    Manager
Re-ranking process of Aardwolf
    Gateway

          Query : “where is the best restaurant in Kang-nam, Seoul ?”



                                        Location
                   Location history     Manager
    Routing
    Engine
                 1. Select users if they have been Kang-nam recetly.
                 2. Rank users by algorithm of Aardwolf
                 3. Re-rank users by their location score


          Users : re-ranked by their location score


  Conversation
    Manager
Re-ranking Algorithm
Re-rank score(RRs)
=       Rs(Ranking Score) + (1-            ) Ls(Location Score)
    e.g :
    Rs = {A: 0.2, B: 0.5, C:0.3}
    Ls = {A: 0.4, B: 0.4, C:0.2}
    RRs = {A: 0.3, B: 0.45, C: 0.25}



Ls
= LUC / Sum of LUC(Location Use Count)
    e.g :
    LUC = {Seoul: 20, Incheon: 30, Suwon: 50}
    Ls(Seoul) = 20 / 100 = 0.2
Scenario
Scenario
                      and his girlfriend have been seeing
B       A        A
                      each other for 3 years.

        Me            has been restaurant reviewer for several
                 C
                      years.


    D        C   Me   a big fan of Aardwolf.



“where is the best restaurant for Japanese
 Ramen in Kang-nam, Seoul?”
Ranking process of Aardwolf
    Gateway

          Query : “where is the best Japanese Ramen restaurant in Kang-nam, Seoul ?”



                                       Location
                   Location history    Manager
    Routing
    Engine
                 1. Select users if they have been Kang-nam recetly.




          Users : re-ranked by their location history


  Conversation
    Manager
Select users
by their location, Kang-nam, Seoul
                              Profile                Foursquare
 B           A

                          A   Seoul
             Me
                          B   Suwon
                                                Kang-nam, Seoul
                          C   Seoul
     D                C
                          D   Seoul



Aardvark                      Aardwolf
 A       C        D           A         B   C    D
Ranking process of Aardwolf
    Gateway

          Query : “where is the best Japanese Ramen restaurant in Kang-Nam, Seoul ?”



                 1. Select users if they have been kang-nam recetly.

    Routing                            Up-to-date Topic
    Engine                               Aggregator
                 Recent User Topics                            Social Annotation Service


                 2. Rank users by algorithm of Aardwolf


          Users : re-ranked by their location history


  Conversation
    Manager
Rank :
Consider Change of Topic , Ramen
                                          Up-to-date   Aardvark   Aardwolf
 B           A                All Topic
                                            Topic       Score      Score

                          A     26            8          0.5       0.30
             Me

                          B      5            2          N/A       0.06

     D                C   C     17           26         0.33       0.39

                          D      8           20         0.17       0.25


Aardvark                             Aardwolf
 A       C        D                   C      A     D     B
Re-ranking process of Aardwolf
    Gateway

          Query : “where is the best Japanese Ramen restaurant in Kang-Nam, Seoul ?”



                                        Location
                   Location history     Manager
    Routing
    Engine
                 1. Select users if they have been kang-nam recetly.
                 2. Rank users by algorithm of Aardwolf
                 3. Re-rank users by their location use count


          Users : re-ranked by their location history


  Conversation
    Manager
Re-Rank :
Consider Location History, Kang-nam
                                    Location Use
                                                    Rank score    Re-rank score
 B           A                    Count, Kang-nam

                              A         5               0.3          0.18
             Me
                              B         20              0.6          0.13

                              C         50              0.39         0.44
     D                C
                              D        120              0.25         0.74


Ranked                                 Re-ranked
 C       A        D       B              D     C    A         B
Aardwolf rocks!
                                    Up-to-date          Aardvark   Aardwolf
     Profile   Foursquare   Topic                 LUC
                                      Topic              score      score

A    Seoul     Kang-nam     26            8       5         0.5     0.18

B    Suwon     Kang-nam      5            2       20        N/A     0.13

C    Seoul     Kang-nam     17         26         50        0.33    0.44

D    Seoul     Kang-nam      8         20         120       0.17    0.74


Aardvark                            Aardwolf
 A    C        D                      D       C   A     B
Conclusion
Work in Progress
 • Modified ranking algorithm to consider user’s
 up-to-date topic

 • Added re-ranking algorithm to deal with user’s
 knowledge of location


Future work
 • Implement the Aardwolf social search engine
 • Experiment with the real data of Twitter,
 Delicious and Foursquare
@haandol




 Discussion

           Everything is connected to everything else.

More Related Content

Viewers also liked

Baratie - 사회적식당 만들기
Baratie - 사회적식당 만들기Baratie - 사회적식당 만들기
Baratie - 사회적식당 만들기
DongKyun Lee
 
Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)
Søren Lund
 
Beyond Unit Testing
Beyond Unit TestingBeyond Unit Testing
Beyond Unit Testing
Søren Lund
 
[090723]Web2.0to Sns
[090723]Web2.0to Sns[090723]Web2.0to Sns
[090723]Web2.0to Sns
DongKyun Lee
 

Viewers also liked (15)

[100129]나눔문화와 소셜네트워크
[100129]나눔문화와 소셜네트워크[100129]나눔문화와 소셜네트워크
[100129]나눔문화와 소셜네트워크
 
Baratie - 사회적식당 만들기
Baratie - 사회적식당 만들기Baratie - 사회적식당 만들기
Baratie - 사회적식당 만들기
 
Ignite D Blab 1st (윤태현)
Ignite D Blab 1st (윤태현)Ignite D Blab 1st (윤태현)
Ignite D Blab 1st (윤태현)
 
투자하기 좋은 기업 고르는 법
투자하기 좋은 기업 고르는 법투자하기 좋은 기업 고르는 법
투자하기 좋은 기업 고르는 법
 
연인 관계를 오래 지속하는 법
연인 관계를 오래 지속하는 법연인 관계를 오래 지속하는 법
연인 관계를 오래 지속하는 법
 
Web to sns
Web to snsWeb to sns
Web to sns
 
2010 Dblab신년회 안명환
2010 Dblab신년회 안명환2010 Dblab신년회 안명환
2010 Dblab신년회 안명환
 
Te dx kgu소개
Te dx kgu소개Te dx kgu소개
Te dx kgu소개
 
世界一周のしおり
世界一周のしおり世界一周のしおり
世界一周のしおり
 
Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)
 
Beyond Unit Testing
Beyond Unit TestingBeyond Unit Testing
Beyond Unit Testing
 
발표자료 해성
발표자료 해성발표자료 해성
발표자료 해성
 
introduce myself
introduce myselfintroduce myself
introduce myself
 
Kwon성격유형
Kwon성격유형Kwon성격유형
Kwon성격유형
 
[090723]Web2.0to Sns
[090723]Web2.0to Sns[090723]Web2.0to Sns
[090723]Web2.0to Sns
 

Similar to [100621]제안발표

Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
BookStoreLib
 
Aardvark Final Www2010
Aardvark Final Www2010Aardvark Final Www2010
Aardvark Final Www2010
guestcc519e
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched data
Dhaval Thakker
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to users
jobinwilson
 

Similar to [100621]제안발표 (20)

Exploratory Search upon Semantically Described Web Data Sources: Service regi...
Exploratory Search upon Semantically Described Web Data Sources: Service regi...Exploratory Search upon Semantically Described Web Data Sources: Service regi...
Exploratory Search upon Semantically Described Web Data Sources: Service regi...
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
 
Aardvark Final Www2010
Aardvark Final Www2010Aardvark Final Www2010
Aardvark Final Www2010
 
Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)
 
Wimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity ReportWimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity Report
 
Design the Search Experience
Design the Search ExperienceDesign the Search Experience
Design the Search Experience
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched data
 
Gic2011 aula10-ingles
Gic2011 aula10-inglesGic2011 aula10-ingles
Gic2011 aula10-ingles
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to users
 
Recommendation engines matching items to users
Recommendation engines matching items to usersRecommendation engines matching items to users
Recommendation engines matching items to users
 
UProRevs-User Profile Relevant Results
UProRevs-User Profile Relevant ResultsUProRevs-User Profile Relevant Results
UProRevs-User Profile Relevant Results
 
Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012
 
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
 
Linked services: Connecting services to the Web of Data
Linked services: Connecting services to the Web of DataLinked services: Connecting services to the Web of Data
Linked services: Connecting services to the Web of Data
 
Semantic web and Linked Data
Semantic web and Linked DataSemantic web and Linked Data
Semantic web and Linked Data
 
K1803057782
K1803057782K1803057782
K1803057782
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
 
Comparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering TechniqueComparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering Technique
 
8 Information Architecture Better Practices
8 Information Architecture Better Practices8 Information Architecture Better Practices
8 Information Architecture Better Practices
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMM
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

[100621]제안발표

  • 1. 위치공유 SNS와 소셜 어노테이션 서비스를 이용한 소셜 검색 엔진의 설계 및 구현 발 표 자 : 이 동 균 지도교수 : 권 준 희
  • 3. Contents Related work Our work Scenario Conclusion
  • 5. Aardvark Aardvark is social search engine $50Million Reference : - http://techcrunch.com/2010/02/11/google-acquires-aardvark-for-50-million
  • 6. What is Social Search? the incorporation of information about social networks and relationships into the information retrieval process “Social search” is an umbrella term used to describe search acts that make use of social interactions with others. Reference : - Sebastian Marius Kirsch, “Social Information Retrieval”, Rheinische Friedrich-Wilhelms-Universität Bonn, 2005 - Einat Amitay et al, “Social Search and Discover Using Unified Approach”, ACM, 2009 - David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009 - Meredith Ringel Morris et al, “What Do People Ask Their Social Networks, and Why?”, CHI, 2010 - Brynn M Evans, Ed H. Chi, “Towards a Model of Understanding Social Search”, ACM, 2010
  • 7. What is Social Network? A social structure made of individual called ‘nodes’, which are connected by one or more specific types of interdependency, such as friendship, common interest, financial exchange, dislike or prestige. Reference : - http://en.wikipedia.org/wiki/Social_network - http://en.wikipedia.org/wiki/Social_networking
  • 8. Goal of Social Search To use the social information to improve the user’s search experience over regular full text search. Social search(a.k.a social information retrieval) = social networks + information search Reference : - Sebastian Marius Kirsch, “Social Information Retrieval”, Rheinische Friedrich-Wilhelms-Universität Bonn, 2005 - Einat Amitay et al, “Social Search and Discover Using Unified Approach”, ACM, 2009
  • 9. Previous Studies Social Network Based Search - PeopleRank, SocialRank, SNDocRank, Aardvark Social Annotation Based Search - FolkRank, SocialBookmarkRank, SocialSimRank, SocialPageRank Reference : - Andreas Hotho et al, “Information Retrieval in Folksonomies”, The Semantic Web, 2006 - Yusuke Yanbe et al, “Can Social Bookmarking Enhance Search in the Web?”, ACM, 2007 - Shenghua Bao et, al, “Optimizing Web Search Using Social Annotations”, WWW2007, 2007 - David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009 - Liang Gou et al, “SNDocRank: a social network-based video search ranking framework”, ACM, 2010 - Abderrahmen Mtibaa et al, “PeopleRank: Social Opportunistic Forwarding”, IEEE, 2010
  • 10. Aardvark Aardvark is social search engine - Social Network Based Search - Deployed and validated in large-scale
  • 12. Which one is better? Aardvark vs <Anonymous group> <My Friends (of the friends)>
  • 13. Social network rocks! • More Reliable • More Satisfactory • Even More Responsible Reference : - Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010 - Meredith Ringel Morris et al, “What Do People Ask Their Social Networks, and Why?”, ACM, 2010 - David Carmel et al, “Personalized Social Search Based on the User’s Social Network”, ACM, 2009
  • 16. Structure of Aardvark Sign-up time DB Importer Index User, Topic, Graph, Content at query time Conversation Routing Gateway Manager Engine Reference : - Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
  • 17. First solution : We deal with user’s up-to-date topic Your up-to-date topic does matter Reference : - Xuerui Wang, Andrew McCallum, “Topics over Time”, ACM, 2006 - Andreas Hotho et al, “Trend Detection in Folksonomies”, SAMT, 2006 - http://burak-arikan.com/growth-of-a-twitter-graph, “Growth of a twitter graph”, 2010
  • 18. Structure of Aardwolf Up-to-date Topics Up-to-date Topic Aggregator Social Annotation Service at query time Up-to-date Topics Conversation Routing Gateway Manager Engine
  • 19. Why ? Social Annotation Service • Tag(Annotation) as topic • Easiest way to get user’s up-to-date topic Reference : - Andreas Hotho et al, “Information Retrieval in Folksonomies”, The Semantic Web, 2006 - Yusuke Yanbe et al, “Can Social Bookmarking Enhance Search in the Web?”, ACM, 2007 - Shenghua Bao et, al, “Optimizing Web Search Using Social Annotations”, WWW2007, 2007
  • 20. Ranking process of Aardvark Gateway Query : “what would be perfect for my girlfriend’s present?” Routing Engine Rank users by algorithm of Aardvark Users : Ranked by their social information Conversation Manager
  • 21. Ranking Algorithm User’s answerable probability Intimacy Expertise Topic Assignment Topic Expertise Reference : - Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
  • 22. Ranking process of Aardwolf Gateway Query : “what would be perfect for my girlfriend’s present?” Up-to-date Topic Aggregator Up-to-date Topics Social Annotation Service Routing Engine Rank users by algorithm of Aardwolf Users : Ranked by their social information Conversation Manager
  • 23. Ranking Algorithm Aardvark Topic Expertise Aardwolf Topic Up-to-date Topic Reference : - Damon Horowitz, “The Anatomy of a Large-Scale Social Search Engine”, WWW2010, 2010
  • 24. Second solution : We deal with user’s knowledge of location • User’s knowledge of location
  • 25. Structure of Aardwolf Location History Location Manager Location sharing SNS at query time Conversation Routing Up-to-date Gateway Topic Manager Engine Aggregator
  • 26. What is the ? Location Sharing SNS User’s knowledge of Location
  • 27. Ranking process of Aardvark Gateway Query : “where is the best restaurant in Kang-Nam, Seoul ?” Routing Engine 1. Select users if their profile mentions Seoul 2. Rank users by algorithm of Aardvark Users : Ranked by their social information Conversation Manager
  • 28. Re-ranking process of Aardwolf Gateway Query : “where is the best restaurant in Kang-nam, Seoul ?” Location Location history Manager Routing Engine 1. Select users if they have been Kang-nam recetly. 2. Rank users by algorithm of Aardwolf 3. Re-rank users by their location score Users : re-ranked by their location score Conversation Manager
  • 29. Re-ranking Algorithm Re-rank score(RRs) = Rs(Ranking Score) + (1- ) Ls(Location Score) e.g : Rs = {A: 0.2, B: 0.5, C:0.3} Ls = {A: 0.4, B: 0.4, C:0.2} RRs = {A: 0.3, B: 0.45, C: 0.25} Ls = LUC / Sum of LUC(Location Use Count) e.g : LUC = {Seoul: 20, Incheon: 30, Suwon: 50} Ls(Seoul) = 20 / 100 = 0.2
  • 31. Scenario and his girlfriend have been seeing B A A each other for 3 years. Me has been restaurant reviewer for several C years. D C Me a big fan of Aardwolf. “where is the best restaurant for Japanese Ramen in Kang-nam, Seoul?”
  • 32. Ranking process of Aardwolf Gateway Query : “where is the best Japanese Ramen restaurant in Kang-nam, Seoul ?” Location Location history Manager Routing Engine 1. Select users if they have been Kang-nam recetly. Users : re-ranked by their location history Conversation Manager
  • 33. Select users by their location, Kang-nam, Seoul Profile Foursquare B A A Seoul Me B Suwon Kang-nam, Seoul C Seoul D C D Seoul Aardvark Aardwolf A C D A B C D
  • 34. Ranking process of Aardwolf Gateway Query : “where is the best Japanese Ramen restaurant in Kang-Nam, Seoul ?” 1. Select users if they have been kang-nam recetly. Routing Up-to-date Topic Engine Aggregator Recent User Topics Social Annotation Service 2. Rank users by algorithm of Aardwolf Users : re-ranked by their location history Conversation Manager
  • 35. Rank : Consider Change of Topic , Ramen Up-to-date Aardvark Aardwolf B A All Topic Topic Score Score A 26 8 0.5 0.30 Me B 5 2 N/A 0.06 D C C 17 26 0.33 0.39 D 8 20 0.17 0.25 Aardvark Aardwolf A C D C A D B
  • 36. Re-ranking process of Aardwolf Gateway Query : “where is the best Japanese Ramen restaurant in Kang-Nam, Seoul ?” Location Location history Manager Routing Engine 1. Select users if they have been kang-nam recetly. 2. Rank users by algorithm of Aardwolf 3. Re-rank users by their location use count Users : re-ranked by their location history Conversation Manager
  • 37. Re-Rank : Consider Location History, Kang-nam Location Use Rank score Re-rank score B A Count, Kang-nam A 5 0.3 0.18 Me B 20 0.6 0.13 C 50 0.39 0.44 D C D 120 0.25 0.74 Ranked Re-ranked C A D B D C A B
  • 38. Aardwolf rocks! Up-to-date Aardvark Aardwolf Profile Foursquare Topic LUC Topic score score A Seoul Kang-nam 26 8 5 0.5 0.18 B Suwon Kang-nam 5 2 20 N/A 0.13 C Seoul Kang-nam 17 26 50 0.33 0.44 D Seoul Kang-nam 8 20 120 0.17 0.74 Aardvark Aardwolf A C D D C A B
  • 40. Work in Progress • Modified ranking algorithm to consider user’s up-to-date topic • Added re-ranking algorithm to deal with user’s knowledge of location Future work • Implement the Aardwolf social search engine • Experiment with the real data of Twitter, Delicious and Foursquare
  • 41. @haandol Discussion Everything is connected to everything else.