chenxiaoming@baidu.com
LOG


LOG




  LSP


  DISQL
•   46.70.93.94 - - [11/Nov/2011:11:11:11 -1100] "GET /book/1984.html HTTP/1.1" 404 2326 "http://www.baidu.com/s?wd=1984&rsv_bp=0&rsv_spt=3&inputT=947" "Mozilla/5.0(iPad; U; CPU iPhone OS 3_2 like Mac OS X; en-us) AppleWebKit/531.21.10 (KHTML, like Gecko) Version/4.0.4 Mobile/7B314 Safari/531.21.10"
——



•   46.70.93.94 - -
•   [11/Nov/2011:11:11:11 -1100]
•   "GET /book/1984.html HTTP/1.1"
•   404
•   2326
•   "http://www.baidu.com/s?wd=1984&rsv_bp=0&rsv_spt=3&inputT=947"
•   "Mozilla/5.0(iPad; U; CPU iPhone OS 3_2 like Mac OS X; en-us) AppleWebKit/531.21.10 (KHTML, like Gecko) Version/4.0.4 Mobile/7B314 Safari/531.21.10"
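
The field breakdown above can be sketched as a small parser. This is only a minimal illustration, assuming the line follows the common "combined" access-log layout; the regular expression, the group names, and the names `LOG_PATTERN`, `SAMPLE`, and `parse_line` are hypothetical and do not come from the slides.

```python
import re

# Assumed layout: host ident user [time] "request" status size "referer" "agent"
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# The sample line from the slide, with plain double quotes.
SAMPLE = (
    '46.70.93.94 - - [11/Nov/2011:11:11:11 -1100] '
    '"GET /book/1984.html HTTP/1.1" 404 2326 '
    '"http://www.baidu.com/s?wd=1984&rsv_bp=0&rsv_spt=3&inputT=947" '
    '"Mozilla/5.0(iPad; U; CPU iPhone OS 3_2 like Mac OS X; en-us) '
    'AppleWebKit/531.21.10 (KHTML, like Gecko) Version/4.0.4 '
    'Mobile/7B314 Safari/531.21.10"'
)


def parse_line(line):
    """Split one log line into named fields, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["status"] = int(fields["status"])
    # A size of "-" means no body was sent; treat it as zero bytes.
    fields["size"] = 0 if fields["size"] == "-" else int(fields["size"])
    return fields


fields = parse_line(SAMPLE)
print(fields["host"], fields["status"], fields["size"], fields["request"])
```

Returning `None` for a non-matching line, rather than raising, lets a caller count and skip malformed lines, which tends to matter when logs are large and not perfectly clean.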
•  Ad hoc
•  ……
•  …
•  B*S
•  C++
•  C++
•  SQL
•  PHP + C
•  Schema
•  PHP .so
LSP


UI
DQuery
DISQL


DQuery
•  _Url   _Res( )
•  _Url   _Site
•  JSON
PHP-Callback
C-callback
•  PHP   SQL ( )
     •  SQL   M/R
•  DAG
     •  MapReduce
•  SQL
•  PHP
•  C++ + C-Runtime NEW!
     •  RAII +
     •  Copy On Write
     •  schema
     •  C++   PHP

•  parser
•  JSON
[
     {
         "cmd": "load",
         "path": null,
         "using": "SchemaReader",
         "from": 17,
         "options": {"max_item_in_mem": 100000},
         "include": [25]
     }
      , {"cmd":"filter"……}, {"cmd":"join"……},…… ……
]
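
One way to read a plan like the one above is as an ordered list of steps, each dispatched on its "cmd" value. The sketch below is a hedged illustration only: the `load` step copies the fields shown on the slide, the `filter` and `join` steps carry nothing but their "cmd" because the slide elides their fields, and `PLAN_TEXT`, `HANDLERS`, and `run_plan` are hypothetical names, not DISQL's actual interface.

```python
import json

# The "load" step mirrors the slide; "filter" and "join" are left bare because
# their fields are elided in the original.
PLAN_TEXT = """
[
    {"cmd": "load", "path": null, "using": "SchemaReader", "from": 17,
     "options": {"max_item_in_mem": 100000}, "include": [25]},
    {"cmd": "filter"},
    {"cmd": "join"}
]
"""

# Hypothetical handlers: each one only reports what it would do.
HANDLERS = {
    "load": lambda step: f"load using {step['using']}",
    "filter": lambda step: "filter",
    "join": lambda step: "join",
}


def run_plan(plan_text, handlers):
    """Parse the JSON plan and call one handler per step, in order."""
    trace = []
    for step in json.loads(plan_text):
        cmd = step["cmd"]
        if cmd not in handlers:
            raise ValueError(f"unknown command: {cmd}")
        trace.append(handlers[cmd](step))
    return trace


trace = run_plan(PLAN_TEXT, HANDLERS)
print(trace)
```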
SQL
     •  ( )
          •  MapReduce
     •  Schema
          •  schema
     •  C++ PHP DOT
[Figure: operator graph spanning the Map Phase, Shuffle, and Reduce Phase. Node labels: Group, Unique, Limit 1, Combine, Count, Sum, Reduce.]
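
The Group, Combine, Shuffle, Count, and Sum labels in the figure suggest a grouped count with a map-side combine. The following is a rough single-process sketch of that general technique, not the system's actual execution plan; the function names and the sample `splits` data are assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(records, key):
    """Emit (group key, 1) for every record in one input split."""
    for record in records:
        yield record[key], 1


def combine(pairs):
    """Map-side combine: pre-sum counts per key within a single split."""
    partial = defaultdict(int)
    for k, v in pairs:
        partial[k] += v
    return list(partial.items())


def shuffle(partials):
    """Bring all partial counts for the same key together."""
    grouped = defaultdict(list)
    for pairs in partials:
        for k, v in pairs:
            grouped[k].append(v)
    return grouped


def reduce_phase(grouped):
    """Sum the partial counts for each key."""
    return {k: sum(vs) for k, vs in grouped.items()}


# Two hypothetical input splits of already-parsed log records.
splits = [
    [{"status": 404}, {"status": 200}, {"status": 404}],
    [{"status": 200}, {"status": 404}],
]
partials = [combine(map_phase(split, "status")) for split in splits]
counts = reduce_phase(shuffle(partials))
print(counts)
```

Combining before the shuffle means each split sends one pair per distinct key instead of one pair per record, which is the usual reason to add a combiner.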
Schema

field    ID       name     age
type     uint64   string   int32
index    2        5        9

field    ID       score
type     uint64   double
index    0        1

join

field    ID       name     age      score
type     uint64   string   int32    double
index    2        5        9        10
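
The joined schema above keeps the first input's field indexes (2, 5, 9) and gives the second input's non-key field a new index (10). A minimal sketch of one rule that reproduces this is shown below; the rule "append at the next index after the left side's largest" is an assumption inferred from this single example, and `Field` and `join_schemas` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    name: str
    type: str
    index: int


def join_schemas(left, right, key):
    """Keep the left fields unchanged and append the right side's non-key fields."""
    left_names = {f.name for f in left}
    right_names = {f.name for f in right}
    if key not in left_names or key not in right_names:
        raise ValueError(f"join key {key!r} must exist in both schemas")
    # Assumed rule: new fields start right after the largest left index.
    next_index = max(f.index for f in left) + 1
    joined = list(left)
    for f in right:
        if f.name == key:
            continue  # the key column is already present from the left side
        if f.name in left_names:
            raise ValueError(f"field name collision: {f.name!r}")
        joined.append(Field(f.name, f.type, next_index))
        next_index += 1
    return joined


users = [Field("ID", "uint64", 2), Field("name", "string", 5), Field("age", "int32", 9)]
scores = [Field("ID", "uint64", 0), Field("score", "double", 1)]
joined = join_schemas(users, scores, key="ID")
for f in joined:
    print(f.name, f.type, f.index)
```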
•  Combiner
•  Cached Combiner
•  key Join
     •  I/O

•  PHP
•  C++
•  DOT
•  / MapReduce
Processor: Pipes & Filter

class Processor
    init()
    process( )
    fini()

Subclasses: Selector, Filter, Counter, UserProcessor
(each provides init(), process( ), fini())
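
The class diagram above follows the Pipes and Filters pattern: every stage exposes the same init/process/fini life cycle, so stages can be chained freely. A minimal sketch of that pattern follows. The arguments of `process` are lost in the slide, so the record-in, record-out signature, the "return None to drop a record" convention, and the `run_pipeline` helper are all assumptions.

```python
class Processor:
    """Base stage: init() once, process() per record, fini() once."""

    def init(self):
        pass

    def process(self, record):
        return record  # assumed convention: return None to drop the record

    def fini(self):
        pass


class Selector(Processor):
    """Keep only the named fields of each record."""

    def __init__(self, fields):
        self.fields = fields

    def process(self, record):
        return {name: record[name] for name in self.fields}


class Filter(Processor):
    """Drop records for which the predicate is false."""

    def __init__(self, predicate):
        self.predicate = predicate

    def process(self, record):
        return record if self.predicate(record) else None


class Counter(Processor):
    """Count the records that reach this stage."""

    def init(self):
        self.count = 0

    def process(self, record):
        self.count += 1
        return record


def run_pipeline(processors, records):
    """Push every record through the stages in order and collect survivors."""
    for p in processors:
        p.init()
    out = []
    for record in records:
        for p in processors:
            record = p.process(record)
            if record is None:
                break
        else:
            out.append(record)
    for p in processors:
        p.fini()
    return out


records = [
    {"url": "/book/1984.html", "status": 404, "size": 2326},
    {"url": "/", "status": 200, "size": 10},
]
counter = Counter()
stages = [Filter(lambda r: r["status"] == 404), Selector(["url"]), counter]
result = run_pipeline(stages, records)
print(result, counter.count)
```

A user-defined stage (the UserProcessor of the diagram) would subclass `Processor` in the same way and could be placed anywhere in the list.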
              4 1       10 27
              3540      4761      1221      +34.5%
 DQuery       1153      3359      2206      +191%
              1569      2963      1394      +88.9%

 LSP          24%  }
 DQuery       43%  }  67%
              33%

 PM           1352      47.4%
 RD           1174      41.2%
 OP           190       6.66%
              136       4.77%
              2852      100%
•  LSP
     •  UI
•  DISQL
     •  (@ ) (chenxiaoming@baidu.com)
     •  Hadoop in China 12 2 2 20 DISQL2.0
……
                  ……
chenxiaoming@baidu.com
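Follow us: t.baidu-tech.com

          Downloads and details: infoq.com/cn/zones/baidu-salon
"Imagine, exchange, debate, gather" is the motto of the Baidu Technology Salon. The Baidu Technology Salon is an offline technical exchange event organized regularly by Baidu and InfoQ China. It aims to give mid-level and senior engineers a relatively free platform for exchanging ideas and making contacts. Each session has two key parts, a speaker presentation and an OpenSpace, and focuses on a single topic.

The speaker presentation and live Q&A introduce attendees to advanced practical experience in the technology behind Baidu and other well-known websites. The OpenSpace part extends and develops the salon's theme and provides a platform for free discussion: any participant can raise a topic related to the session's theme and open a discussion.



                  InfoQ: planning, organization, execution
                  Follow us: weibo.com/infoqchina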

20th. Chen Xiaoming (陈晓鸣): Baidu's Massive Log Analysis Architecture and Processing Experience
