The Future of Search: How to Stay Relevant in Sourcing

Presentation from Sourcecon 2011 Fall, presented by Greg Lindahl.

Published in: Business, Technology

  1. The future of search: How to stay relevant in Sourcing
     Greg Lindahl, CTO, Blekko
     October 13, 2011 - SourceCon
  2. A little about me
     • Technologist, not a sourcer
     • I did get sourced once, in 1995
     • I'm proud to have the ugliest slides at the conference!
  3. A little about you
     • You guys are heavy, sophisticated users of Google & specialized engines like Topsy
     • You're eager to learn about new things
     • You quickly form opinions about what's useful
  4. Challenges in Sourcing
     • The order of search results is based on incoming links & PageRank
     • You guys are heavy users of advanced features such as boolean search
     • Social networks are "walled gardens"
  5. PageRank Today
  6. Advanced interfaces
     • No one uses the advanced search interface of Google
     • You're lucky that it hasn't disappeared!
     • New Google algorithms trying to guess your intent are probably a net minus for Sourcers
  7. Social Networks
     • Facebook is mostly a "walled garden" - many users don't want their personal info to be public
     • Facebook mixes work and play
     • LinkedIn is a pure play, but younger people don't use it
     • Twitter is open, but a mixture of work and play
  8. Search implications of social
     • General search engines won't be good enough, now or in the future
     • It will remain hard to try to match up candidates with several social accounts plus a web presence
  9. Things you should know: BiGrams
     • Google counts:
       – java programmer OR developer: 80 million
       – java programmer OR "java developer": 23 million
     • Why?
       – Everyone indexes pairs and triples of words which are thought to be related
         • Names, job titles, common word pairs
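
The bigram point on slide 9 can be illustrated with a toy index. This is only a sketch under stated assumptions: the four documents are invented, and treating `java programmer` as a single indexed bigram is a guess at the behaviour the slide describes, not a description of how Google actually parses the query.

```python
from collections import defaultdict


def tokens(text):
    """Return the single words and adjacent-word pairs (bigrams) of a text."""
    words = text.lower().split()
    bigrams = [" ".join(pair) for pair in zip(words, words[1:])]
    return words + bigrams


def build_index(docs):
    """Map every word and bigram to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for tok in tokens(text):
            index[tok].add(doc_id)
    return index


# Hypothetical documents, made up for this illustration.
docs = {
    1: "senior java programmer wanted",
    2: "java developer with hpc experience",
    3: "web developer who likes coffee",
    4: "programmer fluent in java and perl",
}
index = build_index(docs)

# java programmer OR developer
# Assumed reading: (bigram "java programmer") OR (any page with "developer").
loose = index["java programmer"] | index["developer"]

# java programmer OR "java developer"
# Both sides are bigrams, so pages about other kinds of developer drop out.
tight = index["java programmer"] | index["java developer"]

print(sorted(loose), sorted(tight))
```

In this toy corpus the first query also matches the unrelated web developer page, while the second does not, which is the same direction as the 80 million versus 23 million counts on the slide.
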
  10. Future search directions
     • "semantic search"
     • 2 parts:
       – understanding the source documents better
         • bigrams of names just a start
       – understanding your query intent better
     • This will hurt advanced search and boolean queries!
  11. Future search directions
     • "real-time" search: Twitter and non-Twitter
     • Non-Twitter real-time incorporated directly into the major search engines
     • Twitter search best in specialized engines
  12. New market entrants
     • blekko
     • yandex
     • duckduckgo
  13. slash the web
  14. Silicon Valley handshake
     • blekko was founded in 2007
     • $55mm in financing, 29 employees
     • Backers: USVP, CMEA Ventures, Yandex (strategic), Ron Conway, Marc Andreessen, …, Ashton Kutcher
  15. Curated Search – No Spam, High Quality
  16. Algorithms + People = Better Search
     ✓ Wikipedia model: Users identify top sites for every category
     ✓ Technology: blekko uses social data plus algorithms to make more relevant, spam-free search results
  17. Slashtag basics
     • Both algorithmic and human-curated
     • Curated slashtags developed in conjunction with outside partners, such as Stack Overflow
     • Type 'em directly into the search box:
         Greg Lindahl /date
  18. Slashtags
     • Sort order: /date, /relevance
     • Narrow your search
       – Algorithmic: /forum /blog /gov /edu
       – Site: /foxnews.com
       – Curated list of websites: /health
     • Human-edited by groups (think dmoz or wikipedia)
     • Every user has their own namespace, plus there's a namespace for /blekko/ tags
  19. Tips:
     • Use /findslashtags to find tags:
         python /findslashtags
     • Use /web to get rid of any unwanted autoslashtags
     • Watch for slashtag suggestions as you type
     • We have some API outcalls that might save you time: /twitter = Twitter search API, /video = YouTube, /images = bing, /bing, /google
  20. Advanced Slashtags
     • Can be negated: -/foxnews.com
     • Implicit -/spam on every search
       – and you can add any result to it with 1 click
       – use this to get rid of all those "people finders"
     • Can use multiple tags to intersect
         /linux /blogs /date
     • Can include tags in other tags to "OR" them
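
The three combining rules on slide 20 (negation, intersection of multiple tags, and including tags in other tags to OR them) can be modelled with plain sets. This is a minimal sketch, not blekko's implementation: the functions `expand` and `apply_slashtags`, the `/mytech` tag, and every `.example` host are hypothetical names invented for illustration.

```python
def expand(tag, tags, seen=None):
    """Return all sites in a tag, following any tags it includes (an OR)."""
    seen = set() if seen is None else seen
    if tag in seen:  # guard against tags that include each other
        return set()
    seen.add(tag)
    sites = set()
    for member in tags.get(tag, ()):
        if member.startswith("/"):
            sites |= expand(member, tags, seen)  # included tag: union its sites
        else:
            sites.add(member)
    return sites


def apply_slashtags(hosts, include, exclude, tags):
    """Keep hosts that are in every included tag and in no excluded tag."""
    kept = []
    for host in hosts:
        in_all = all(host in expand(t, tags) for t in include)
        in_excluded = any(host in expand(t, tags) for t in exclude)
        if in_all and not in_excluded:
            kept.append(host)
    return kept


# Hypothetical tag contents; real slashtags are curated lists of websites.
TAGS = {
    "/linux": {"kernel.example", "penguinblog.example"},
    "/blogs": {"penguinblog.example", "catblog.example"},
    "/spam": {"peoplefinder.example"},
    "/mytech": {"/linux", "perl.example"},  # includes another tag
}
hosts = [
    "kernel.example",
    "penguinblog.example",
    "catblog.example",
    "peoplefinder.example",
    "perl.example",
]

# /linux /blogs with the implicit -/spam: intersection of both tags.
linux_blogs = apply_slashtags(hosts, ["/linux", "/blogs"], ["/spam"], TAGS)
# No tags at all: only the implicit -/spam applies.
no_spam = apply_slashtags(hosts, [], ["/spam"], TAGS)
# /mytech: its own site plus everything in the included /linux tag.
mytech = apply_slashtags(hosts, ["/mytech"], ["/spam"], TAGS)

print(linux_blogs, no_spam, mytech)
```

Adding a "people finder" host to the `/spam` set removes it from every later search, which mirrors the one-click spam marking described on the slide.
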
  21. Sourcer Slashtags
     • /people -- algorithmic attempt to find resumes, bios, etc.
     • /blogs and /forums, -/blogs and -/forums
     • Develop a /spam slashtag, or maybe even several of them that you manually add to various searches
  22. Programming slashtags
     • /open-source -- aliases /oss /foss /floss
     • /linux, /lkml, /bsd, /windows
     • /repo -- open source public repositories
     • /apache, /fsf -- umbrella organizations
     • /perl, /cpan, /php, /javascript, /python, /django, /ruby, /rails, /java, /erlang, /scheme -- languages
     • /hpc, /make, /hacker, /hackerspaces
  23. A few examples
     • Java programmers with high performance computing experience:
         java hpc /people
     • Followup on a candidate
         marcus watts java /blogs
         marcus watts java -/blogs
         marcus watts java -/blogs /date
         marcus watts java -/blogs /date=2008-2009
     • Ok, more
         marcus watts /twitter
         marcus watts /youtube -- oops, basketball guy
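
The follow-up queries on slide 23 differ only in which slashtags are appended, so a sourcer who runs the same sequence for every candidate could generate them with a small helper. This is a sketch only: `build_query` is a hypothetical function that assembles the query text shown on the slide, and it does not call any blekko API.

```python
def build_query(terms, include=(), exclude=(), sort=None):
    """Assemble a slashtag query string from search terms and tag names."""
    parts = [terms.strip()]
    parts += ["/" + tag.lstrip("/") for tag in include]   # e.g. /blogs
    parts += ["-/" + tag.lstrip("/") for tag in exclude]  # e.g. -/blogs
    if sort:
        parts.append("/" + sort.lstrip("/"))              # e.g. /date
    return " ".join(parts)


candidate = "marcus watts java"
followups = [
    build_query(candidate, include=["blogs"]),
    build_query(candidate, exclude=["blogs"]),
    build_query(candidate, exclude=["blogs"], sort="date"),
    build_query(candidate, exclude=["blogs"], sort="date=2008-2009"),
]
for query in followups:
    print(query)
```

Each string can be pasted straight into the search box; the date range form is copied from the slide rather than from any documented syntax.
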
  24. The bad stuff
     • blekko's crawl is fairly small, 2 billion pages today
       – increasing this Fall
     • Some of the programming slashtags aren't as good as others
       – this will improve over time
  25. For more info
     • help.blekko.com
       – Add blekko to the list on the upper right search box
     • blekko toolbar
  26. To Sum Up
     • "Let me explain. No, there is too much. Let me sum up."
     • Search is evolving in good and bad ways
     • New tools pop up every year
     • I would love to hear feedback: greg@blekko.com