The Future of Search: How to Stay Relevant in Sourcing
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

The Future of Search: How to Stay Relevant in Sourcing

  • 1,190 views
Uploaded on

Presentation from Sourcecon 2011 Fall, presented by Greg Lindahl.

Presentation from Sourcecon 2011 Fall, presented by Greg Lindahl.

More in: Business , Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
1,190
On Slideshare
1,093
From Embeds
97
Number of Embeds
1

Actions

Shares
Downloads
33
Comments
0
Likes
2

Embeds 97

http://www.sourcecon.com 97

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. The  future  of  search:  How  to  stay  relevant  in  Sourcing   Greg  Lindahl   CTO,  Blekko   October 13, 2011 - SourceCon
  • 2. A  li@le  about  me  •  Technologist,  not  a  sourcer  •  I  did  get  sourced  once,  in  1995  •  I m  proud  to  have  the  ugliest  slides  at  the   conference!  
  • 3. A  li@le  about  you  •  You  guys  are  heavy,  sophisJcated  users  of   Google  &  specialized  engines  like  Topsy  •  You re  eager  to  learn  about  new  things  •  You  quickly  form  opinions  about  what s   useful    
  • 4. Challenges  in  Sourcing  •  The  order  of  search  results  is  based  on   incoming  links  &  Page  Rank  •  You  guys  are  heavy  users  of  advanced  features   such  as  boolean  search  •  Social  networks  are   walled  gardens  
  • 5. PageRank Today
  • 6. Advanced  interfaces  •  No  one  uses  the  advanced  search  interface  of   Google  •  You re  lucky  that  it  hasn t  disappeared!  •  New  Google  algorithms  trying  to  guess  your   intent  are  probably  a  net  minus  for  Sourcers  
  • 7. Social  Networks  •  Facebook  is  mostly  a   walled  garden  -­‐  many   users  don t  want  their  personal  info  to  be   public  •  Facebook  mixes  work  and  play  •  LinkedIn  is  a  pure  play,  but  younger  people   don t  use  it  •  Twi@er  is  open,  but  a  mixture  of  work  and   play  
  • 8. Search  implicaJons  of  social  •  General  search  engines  won t  be  good   enough,  now  or  in  the  future  •  It  will  remain  hard  to  try  to  match  up   candidates  with  several  social  accounts  plus  a   web  presence  
  • 9. Things  you  should  know:  BiGrams  •  Google  counts:   –  java  programmer  OR  developer:  80  million   –  java  programmer  OR   java  devloper :  23   million  •  Why?   –  Everyone  indexes  pairs  and  triples  of  words  which   are  thought  to  be  related   •  Names,  job  Jtles,  common  word  pairs  
  • 10. Future  search  direcJons  •  semanJc  search  •  2  parts:   –  understanding  the  source  documents  be@er   •  bigrams  of  names  just  a  start   –  understanding  your  query  intent  be@er  •  This  will  hurt  advanced  search  and  boolean   queries!  
  • 11. Future  search  direcJons  •  real-­‐Jme  search:  twi@er  and  non-­‐twi@er  •  Non-­‐twi@er  real-­‐Jme  incorporated  directly   into  the  major  search  engines  •  Twi@er  search  best  in  specialized  engines  
  • 12. New  market  entrants  •  blekko  •  yandex  •  duckduckgo  
  • 13. slash the web "
  • 14. Silicon  Valley  handshake  •  blekko  was  founded  in  2007  •  $55mm  in  financing,  29  employees  •  Backers:  USVP,  CMEA  Ventures,  Yandex   (strategic),  Ron  Conway,  Marc  Andreesen,  …,   Ashton  Kutcher  
  • 15. Curated  Search  –  No  Spam,  High  Quality  
  • 16. Algorithms + People = Better Search"ü Wikipedia  model:  Users  idenJfy  top  sites  for   every  category  ü Technology:  blekko  uses  social  data  plus   algorithms  to  make  more  relevant,  spam-­‐free   search  results  
  • 17. Slashtag  basics  •  Both  algorthmic  and  human-­‐curated  •  Curated  slashtags  developed  in  conjuncJon   with  outside  partners,  such  as  Stack  Overflow  •  Type   em  directly  into  the  search  box:                                                Greg  Lindahl  /date  
  • 18. Slashtags  •  Sort  order:  /date,  /relevance  •  Narrow  your  search   –  Algorthmic:  /forum  /blog  /gov  /edu   –  Site:  /foxnews.com   –  Curated  list  of  websites:  /health  •  Human-­‐edited  by  groups  (think  dmoz  or   wikipedia)  •  Every  user  has  their  own  namespace,  plus   there s  a  namespace  for  /blekko/  tags  
  • 19. Tips:  •  Use  /findslashtags  to  find  tags:                        python  /findslashags  •  Use  /web  to  get  rid  of  any  unwanted   autoslashtags  •  Watch  the  suggesJons  for  slashtag   suggesJons  as  you  type  •  We  have  some  API  outcalls  that  might  save   you  Jme:  /twi@er  =  Twi@er  search  API,  /video   =  YouTube,  /imges  =  bing,  /bing,  /google  
  • 20. Advanced  Slashtags  •  Can  be  negated:  -­‐/foxnews.com  •  Implicit  -­‐/spam  on  every  searce   –  and  you  can  add  any  result  to  it  with  1  click   –  use  this  to  get  rid  of  all  those   people  finders  •  Can  use  mulJple  tags  to  intersect   /linux  /blogs  /date  •  Can  include  tags  in  other  tags  to   OR  them  
  • 21. Sourcer  Slashtags  •  /people  -­‐-­‐  algorithmic  a@empt  to  find   resumes,  bios,  etc  •  /blogs  and  /forums,  -­‐/blogs  and  -­‐/forums  •  Develop  a  /spam  slashtag,  or  maybe  even   several  of  them  that  you  manually  add  to   various  searches  
  • 22. Programming  slashtags  •  /open-­‐source  -­‐-­‐  aliases  /oss  /foss  /floss  •  /linux,  /lkml,  /bsd,  /windows  •  /repo  -­‐-­‐  open  source  public  repositories  •  /apache,  /fsf  -­‐-­‐  umbrella  organizaJons  •  /perl,  /cpan,  /php,  /javascript,  /python,  / django,  /ruby,  /rails,  /java,  /erlang,  /scheme  -­‐-­‐   languages  •  /hpc,  /make,  /hacker,  /hakerspaces  
  • 23. A  few  examples  •  Java  programers  with  high  performance  compuJng   experience:   java  hpc  /people  •  Followup  on  a  candidate   marcus  wa@s  java  /blogs   marcus  wa@s  java  -­‐/blogs   marcus  wa@s  java  -­‐/blogs  /date   marcus  wa@s  java  -­‐/blogs  /date=2008-­‐2009  •  Ok,  more   marcus  wa@s  /twi@er   marcus  wa@s  /youtube  -­‐-­‐  oops,  basketball  guy  
  • 24. The  bad  stuff  •  blekko s  crawl  is  fairly  small,  2  billion  pages   today   –  increasing  this  Fall  •  Some  of  the  programming  slashtags  aren t  as   good  as  others   –  this  will  improve  over  Jme  
  • 25. For  more  info  •  help.blekko.com   –  Add  blekko  to  the  list  on  the  upper  right  search  box  •  blekko  toolbar  
  • 26. To  Sum  Up  •  Let  me  explain.  No,  there  is  too  much.  Let  me   sum  up.  •  Search  is  evolving  in  good  and  bad  ways  •  New  tools  pop  up  every  year  •  I  would  love  to  hear  feedback:   greg@blekko.com