SlideShare a Scribd company logo
1 of 4
Download to read offline
AN OPTIMIZED H.264-BASED VIDEO CONFERENCING SOFTWARE
                                     FOR MOBILE DEVICES

            Hans L. Cycon∗ , Thomas C. Schmidt, Gabriel Hege† , Matthias W¨ hlisch‡ , Mark Palkow§
                                                                          a

            {h.cycon,hege}@fhtw-berlin.de, {t.schmidt,waehlisch}@ieee.org, palkow@daviko.com


                              ABSTRACT                            one hand, high consumer demands on visual quality on the
Mobile phones and related gadgets in networks are on the spot     other, advise applications to take advantage of the latest stan-
to deliver sufficient performance for rich multimedia applica-     dard for video coding H.264/AVC [1].
tions and communication. In this report we introduce a video          H.264/AVC provides gains in compression efficiency of
conferencing software, which seamlessly integrates mobile         up to 50 % over a wide range of bit rates and video resolutions
with stationary users into fully distributed multi-party conver-  compared to previous standards. While H.264/AVC decoding
sations. Innovations related to this work are twofold. At first    software has been successfully deployed on handhelds, high
we report on a highly optimized realisation of a H.264 codec      computational complexity still prevented pure software en-
and our implementation experiences of the video conference        coders in current mobile systems. There are however also fast
software on a consumer mobile. Within the tight bounds of         hardware implementations available, which give rise to an in-
real-time requirements on mobiles, the coding software out-       creasing offer of device- and operator-bound video services.
performs compatible H.264 realizations. At second we present          In this work we first introduce a pure software solution
an integrated peer-to-peer group communication solution, which    for real-time video communication on standard smartphones
scales well for medium-size conferences and accounts for the      in section 2. These mobile clients extend a lightweight, fea-
heterogeneous nature of mobile and stationary participants.       ture rich conferencing application developed for an infras-
                                                                  tructure compliant use on standard PCs. In the second part
    Index Terms— Mobile video coding, H.264/MPEG-4 AVC we present the underlying peer-to-peer group communication
software codec, mobile conferencing, peer-to-peer group com- scheme, which performs well for medium-size conferences
munication, distributed SIP conference management                 and accounts for the heterogeneous nature of mobile and sta-
                                                                  tionary participants, cf. section 3. This includes on the one
                    1. INTRODUCTION                               hand SIP [2] standard compliant session signalling with re-
                                                                  spect to group communication, and on the other hand effi-
The idea of augmenting voice calls by video has been around       cient, serverless media distribution, self-adjusting to the ac-
for several decades, but only the flexibility of the Internet gen- tual network infrastructure support. Conclusions and an out-
erated a noticeable deployment. As compared to audio, video       look follow in the final section.
processing places significantly higher demands on end system
and network transmission capabilities. The rapid evolution               2. THE DAVIKO VIDEOCONFERENCING
of networks and processors have paved the way for realistic                                SOFTWARE
group conferences conducted at standard personal computers,
combining about a dozen visual streams of Half-QVGA (240          In this section we give an overview of our reference imple-
x 160 pixel @ 15-30 fps) resolution.                              mentation, a digital audio-visual conferencing system, realised
    Mobile phones and networked consumer portables are now        as a serverless multipoint video conferencing software with-
on the spot to deliver sufficient performance for rich multi-      out MCU developed by the authors [3]. It has been designed
media applications and communication, as well. Videocon-          in a peer-to-peer model as a lightweight Internet conferenc-
ferencing though, which requires simultaneous decoding and        ing tool aimed at email-like friendliness of use. The system
encoding in real-time, poses still a grand challenge to the mo-   is built upon a fast H.264/MPEG-4 AVC standard conformal
bile world. Limited and expensive wireless channels on the        video codec implementation [4] called DAVC. By control-
   ∗ The author is with FHTW Berlin, 10318 Berlin, Germany.                     ling the coding parameters appropriately, the software permits
   † Thomas and Gabriel are with HAW Hamburg, Dept.      Informatik, Berliner   scaling in bit rate from 48 to 1440 kbit/s on the fly.
Tor 7, 20099 Hamburg, Germany.                                                      Audio data is compressed using a 16 kHz speech-optimized
   ‡ Matthias is with link-lab, H¨ nower Str. 35, 10318 Berlin, Germany and
                                 o
also with HAW Hamburg.
                                                                                variable bit rate codec [5] with extremely short latencies of
   § Mark is with daViKo GmbH, Am Borsigturm 40, 13507 Berlin, Ger-             40 ms (plus network packet delay). All streams can be trans-
many.                                                                           mitted by unicast as well as by multicast protocols. Within
4 3
the application, audio streams are prioritized over video since
                                                                                           4 2
user experience is usually more sensitive to losses in audio
                                                                                           4 1
packets than those of video packets, which both may result
                                                                                           4 0                      < 3 fp s
from transmission errors or network congestions.
                                                                                           3 9
    An application-sharing facility is included for collabora-                                            2 8 4 fp s




                                                                         P S N R [d B ]
                                                                                           3 8
tion and teleteaching. It enables participants to share or broad-
                                                                                           3 7
cast not only static documents, but also any selected dynamic                                                                                                                                                 D A V C
                                                                                                                                                                                                               x 2 6 4
PC actions like animations. All audio/video - streams includ-                              3 6
                                                                                                                                                   2 1 0 fp s                                                   J M
                                                                                                                                                                                                                 M P E G -4
ing dynamic application sharing actions can be recorded on                                 3 5
                                                                                                                                       < 5 fp s
any site. This system is equally well suited to intranet and                               3 4

wireless video conferencing on a best effort basis, since the                              3 3
                                                                                                     0                  5 0                   1 0 0                      1 5 0               2 0 0              2 5 0              3 0 0
audio/video quality can be controlled to adapt the data stream                                                                                                  B itr a te [k b it/s ]

to the available bandwidth.                                                                                              (a) Akiyo (cif, 300 frames at 30 Hz)
    The daViKo conferencing system is available for desktop                               4 3
computers running MS–Windows or Linux and on handhelds                                    4 2

with Windows Mobile operating system.                                                     4 1
                                                                                          4 0
                                                                                                                       < 3 fp s
                                                                                          3 9
                                                                                          3 8                                                                                                                       1 0 6 fp s
2.1. The DAVC Codec


                                                                       P S N R [d B ]
                                                                                          3 7            8 8 fp s
                                                                                          3 6
DAVC, the core of the videoconferencing system, is a fast,                                3 5                                                                                                                D A V C
highly optimized H.264/MPEG-4 AVC standard implementa-                                    3 4                                                                                                                 x 2 6 4
                                                                                                                                                                                                               J M
tion. It realizes a Baseline profile, optimized for real-time en-                          3 3                                                                                                                   M P E G -4
                                                                                          3 2
coding (as well as real-time decoding) by means of a fast mo-                                                                                           < 5 fp s
                                                                                          3 1
tion estimation strategy including integer-pel diamond search                             3 0
as well as a fast subpel refinement strategy up to 1 pel motion
                                                   4
                                                                                                 0        2 0 0               4 0 0   6 0 0            8 0 0           1 0 0 0     1 2 0 0
                                                                                                                                                               B itr a te [k b it/s ]
                                                                                                                                                                                                1 4 0 0   1 6 0 0       1 8 0 0   2 0 0 0

accuracy. Motion estimation includes the choice of several
                                                                                                                       (b) Foreman (cif, 300 frames at 30 Hz)
different macroblock (MB) partitions and multiple reference
frames, as permitted by the H.264/MPEG-4 AVC standard.
                                                                    Fig. 1. RD plot for test sequences in CIF resolution compar-
For choosing between different MB partitions for motion-
                                                                    ing three different H.264/MPEG-4 AVC encoder implementa-
compensated (i.e. temporal) prediction and MB-based intra
                                                                    tions as well as a RD-optimized MPEG-4 (Part◦ 2) Advanced
(i.e. spatial) prediction modes, a fast rate-distortion (RD)
                                                                    Simple Profile implementation.
based mode decision algorithm with early termination con-
ditions has been employed.
     In comparison to the well-known open source H.264/MPEG-            Note that the DAVC codec along with the H.264/AVC de-
4 AVC encoder implementation of x264 [6], our DAVC en-              sign also includes some suitable mechanisms to quickly re-
coder implementation achieves up to 0.5 dB PSNR better RD           cover from video packet loss.
performance and a considerable increase in encoding speed
when using comparable encoder settings. For selected RD
                                                                    2.2. Mobile Video Codec Performance
points we measured 284 encoded frames per second (fps) as
compared to 210 fps for x264. In Figure 1, typical examples         In ongoing work, the DAVC codec has been adapted to sustain
of such a comparison between x264 and DAVC are shown.               real-time performance on mobile devices. The mobile codec
In addition to the RD-performance of those two real-time en-        version operates at reduced complexity for motion compen-
coder implementations, this plot also shows the RD behav-           sation with a highly optimized code base for the target plat-
ior of two non real-time encoder implementations, as given          form. Motion compensation has been limited to work on 16 x
by the H.264/MPEG-4 AVC Joint Model (JM) reference soft-            16 pixel blocks, only. The code tuning includes the efficient
ware (with Baseline profile settings) and a MPEG-4 (Part◦ 2)         use of the wireless MMX instruction set available at the tar-
Advanced Simple Profile implementation. The latter two en-           get system. Portability is sustained by an ANSI compliant C
coders were operated using a high-complexity RD-based mode          version, to be augmented incrementally by platform specific
decision strategy for demonstrating the capabilities of both        injections.
video coding standards when neglecting any real-time con-               The application was tested on a 520 MHz Xscale proces-
straints. Figures 1(a) and 1(b) also contain the number of          sor built in an Asus P735 system. Thereon it can reliably
encoded frames per second (fps) for selected RD points as a         encode and decode a QCIF video stream in parallel at 5/10
measure for maximum encoding speed. Similar results were            fps, without CPU exhaustion or frame dropping. Real-time
also achieved for other test sequences.                             encoding rate increases up to 10 fps for moderate video com-
4 3
                                                                                                                        quires group session management and media distribution at
                    4 2
                    4 1
                                                                                                                        peers, which for the sake of standard compliance we realize
                    4 0                                                                                                 with group conferencing functions in SIP, cf. [7, 8, 9]. Im-
                    3 9                                                                                                 plemented as pure software on standard personal devices, user
                    3 8
                                                                                                                        agent peers are exposed to severe restrictions in real-world de-
   P S N R [d B ]




                    3 7
                    3 6                                                                                                 ployments: Often they are located behind NATs and firewalls
                    3 5                                                                    D A V C F u ll               with network capacities confined to asymmetric DSL or wire-
                                                                                            D A V C M o b ile
                    3 4                                                                                                 less links. Capacity constraints and resilience to node failures
                    3 3
                    3 2
                                                                                                                        require peer-managed ad-hoc conferences to organize in a dis-
                    3 1                                                                                                 tributed multi-party model. As a key component, the hetero-
                    3 0                                                                                                 geneity of clients must be accounted for, whereas the range of
                          0   2 0   4 0   6 0   8 0            1 0 0       1 2 0   1 4 0   1 6 0      1 8 0     2 0 0
                                                      B itr a te [k b it/s ]                                            scalability is limited to about a dozen parties in videoconfer-
                                                                                                                        ences.
Fig. 2. RD plot for test sequence “Akiyo” in QCIF resolution
at 10 fps, comparing the DAVC mobile encoder to the DAVC
                                                                                                                        3.1. P2P Adaptive Architecture
fully optimized implementation.
                                                                                                                        A peer-to-peer conferencing system faces the grand challenge
                                                                                                                        to be robust w.r.t. the infrastructure. The role a user agent is
                                                                                                                        able to attain in a distributed scenario needs to be adaptively
                                                                                                                        determined according to constraints of its device and current
                                                                                                                        network attachment. In a simplified scenario, clients may be
                                                                                                                        divided into two groups, distinguished by their ability to act
                                                                                                                        as a SIP conference focus or not. A focus must be globally
                                                                                                                        addressable and have access to necessary processing and net-
                                                                                                                        work resources.
                                                                                                                            This elementary adaptation scheme can be based on indi-
                                                                                                                        vidual decisions of user agents and gives rise to a hybrid ar-
                                                                                                                        chitecture of super peers, chosen from potential focus nodes,
                                                                                                                        and remaining leaf nodes. To decide on its potential role of
                                                                                                                        building a focus, a client at first needs to determine NATs
                                                                                                                        and firewalls. Aside from address evaluation, this is done by
                                                                                                                        a simple probe packet exchange. As the implementation is
                                                                                                                        CPU-type aware, processing restrictions are easily evaluated,
                              Fig. 3. The mobile video application.
                                                                                                                        as well. However, an a priori judgement on available network
                                                                                                                        bandwidth is not easily obtained. An evaluation of the lo-
plexity. QCIF @15 fps is the maximal image feed that can be                                                             cal link capacity is frequently misleading, as wireless devices
obtained from the front camera in our test equipment. Perfor-                                                           may be located behind wired transmitters of lower, asymmet-
mance values for the mobile encoder are displayed in figure 2                                                            ric capacity such as in ADSL. Current experiments to quickly
and compared to the results for the full DAVC.                                                                          retrieve reasonable estimates of up- and downstream capacity
    Reduced coding complexity results in an enhanced data                                                               are ongoing on the basis of variable packet size, nonintrusive
rate send by the mobile, but the gross total rate for a bidi-                                                           estimators, cf. [10]. Note that network capacity detection is
rectional video exchange at 10 fps complies to 3GPP/UMTS                                                                of vital use for temporal adaptation of the video codecs, as
bandwidths constraints. Note that experimental conditions                                                               well.
are not fully compatible: The image sequence obtained from                                                                  Leaf nodes attach to super peers in subordinate position,
the front camera of the mobile is significantly more noisy than                                                          whereas potential focus nodes may be assigned to be super
our standard USB cameras connected to the desktop, which                                                                peers or leaves. Super peers provide global connectivity among
increases the image complexity and thereby the data rate.                                                               each other and NAT traversal assistance to leaves, while leaf
                                                                                                                        nodes experience super peers in different roles: A leaf nodes
                    3. DISTRIBUTED POINT-TO-MULTIPOINT                                                                  sees its next hop super peer as the conference focus, while the
                               CONFERENCING                                                                             remote super peers act as proxies on the path to the leaves be-
                                                                                                                        hind.1 This set-up corresponds to the well known architecture
Our application aims for simple, flexible, and cost-efficient                                                                 1 This architecture relies on the presence of at least one globally address-
ad-hoc conferencing functions, which scale appropriately well,                                                          able, sufficiently powerful peer. As there are many scenarios, where this is
but avoid any infrastructure assistance. Such a solution re-                                                            likely to fail, we advise for and offer a permanently deployed ‘silent’ relay-
of Gnutella 0.6 and successive hybrid unstructured peer-to-         [4] J. Ostermann, J. Bormans, P. List, D. Marpe, N. Nar-
peer systems, cf. [11]. Despite its architectural analogy, our          roschke, F. Pereira, T. Stockhammer, and T. Wedi,
routing layer for real-time group applications follows a differ-        “Video Coding with H.264/AVC: Tools, Performance
ent, next-hop design.                                                   and Complexity,” IEEE Circuits and Systems Magazine,
                                                                        vol. 4, no. 1, pp. 7–28, April 2004.
              4. CONCLUSIONS & OUTLOOK                              [5] “The Speex projectpage,” http://www.speex.org, 2007.
We have presented a peer-to-peer software for high-quality          [6] “VideoLan:    x264 - a free h264/avc encoder,”
videoconferencing on mobiles, admitting utmost flexibility               http://www.videolan.org/developers/x264.html, 2007.
with respect to end systems, operators and network provi-
sioning. To the best of our knowledge, this is the first soft-       [7] A. Johnston and O. Levin, “Session Initiation Proto-
ware implementation of an H.264 video encoder that operates             col (SIP) Call Control - Conferencing for User Agents,”
in real-time on mobile phones. An adaptive, fully distributed           IETF, RFC 4579, August 2006.
conference management scheme with SIP has been developed
                                                                    [8] R. Mahy, R. Sparks, J. Rosenberg, D. Petrie, and
as part of the multi-party scenario. This hybrid peer-to-peer
                                                                        A. Johnston, “A Call Control and Multi-party usage
model accounts for client capabilities as well as network at-
                                                                        framework for the Session Initiation Protocol (SIP),”
tachment, and does scale well beyond standard use.
                                                                        IETF, Internet Draft - work in progress 9, November
    In future work we will concentrate on further optimiza-
                                                                        2007.
tion and generalization of the video coding software to make
it available for a wider variety of platforms. Network adap-        [9] T. C. Schmidt and M. W¨ hlisch, “Group Conference
                                                                                                  a
tation and capacity evaluation will require further work to ar-         Management with SIP,” in SIP Handbook: Services,
rive at estimates that reliably serve the needs in real world           Technologies, and Security, S. Ahson and M. Ilyas, Eds.
environments, as well.                                                  Boca Raton, FL, USA: CRC Press, November 2008, to
    Additional research will target at benefits possibly inher-          appear, on invitation.
ited from key-based routing. As common application layer
multicast schemes, which rely on dedicated shared or source        [10] R. Prasad, C. Dovrolis, M. Murray, and kc claffy,
specific trees, are significantly sensitive to client departure           “Bandwidth Estimation: Metrics, Measurement Tech-
and of insufficient performance in medium size groups, and               niques, and Tools,” IEEE Network, vol. 17, no. 6, pp.
as conference routing actually can be seen as an application            27–35, November–December 2003.
layer broadcasting problem, new and highly optimized struc-
                                                                   [11] R. Steinmetz and K. Wehrle, Eds., Peer–to–Peer Sys-
tured broadcast algorithms are desirable. The bidirectional
                                                                        tems and Applications, ser. LNCS. Berlin Heidelberg:
shared tree approach introduced in [12] may be a promising
                                                                        Springer–Verlag, 2005, vol. 3485.
point to start at.
                                                                   [12] M. W¨ hlisch and T. C. Schmidt, “Between Underlay and
                                                                              a
                                                                        Overlay: On Deployable, Efficient, Mobility-agnostic
Acknowledgement                                                         Group Communication Services,” Internet Research,
                                                                        vol. 17, no. 5, pp. 519–534, November 2007.
This work is supported by the German Bundesministerium f¨ r
                                                        u
Bildung und Forschung within the project Moviecast
(http://moviecast.realmv6.org).

                         5. REFERENCES

 [1] ITU-T Recommendation H.264 & ISO/IEC 14496-10
     AVC, “Advanced Video Coding for Generic Audiovisual
     Services,” ITU, Tech. Rep., 2005, draft Version 3.
 [2] J. Rosenberg, H. Schulzrinne, G. Camarillo, A. John-
     ston, J. Peterson, R. Sparks, M. Handley, and
     E. Schooler, “SIP: Session Initiation Protocol,” IETF,
     RFC 3261, June 2002.
 [3] M. Palkow, “The daViKo               homepage,”      2008,
     http://www.daviko.com.
peer at some unrestricted place.

More Related Content

What's hot

un bon lien en anglais .
un bon lien en anglais .un bon lien en anglais .
un bon lien en anglais .Videoguy
 
The H.264/AVC Advanced Video Coding Standard: Overview and ...
The H.264/AVC Advanced Video Coding Standard: Overview and ...The H.264/AVC Advanced Video Coding Standard: Overview and ...
The H.264/AVC Advanced Video Coding Standard: Overview and ...Videoguy
 
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...csandit
 
IBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docIBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docVideoguy
 
Practical Fundamentals of Voice over IP (VoIP) for Engineers and Technicians
Practical Fundamentals of Voice over IP (VoIP) for Engineers and TechniciansPractical Fundamentals of Voice over IP (VoIP) for Engineers and Technicians
Practical Fundamentals of Voice over IP (VoIP) for Engineers and TechniciansLiving Online
 
Videoconferencing Technology
Videoconferencing TechnologyVideoconferencing Technology
Videoconferencing TechnologyVideoguy
 
Overview of Internet.ppt
Overview of Internet.pptOverview of Internet.ppt
Overview of Internet.pptVideoguy
 
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...IJMER
 
Lec40 41 42_43_44_45 video conferencing
Lec40 41 42_43_44_45 video conferencingLec40 41 42_43_44_45 video conferencing
Lec40 41 42_43_44_45 video conferencingShona Hira
 
Lec40 45 video conferencing
Lec40 45 video conferencingLec40 45 video conferencing
Lec40 45 video conferencingDom Mike
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPVideoguy
 
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)Thomas Siegers
 

What's hot (16)

Telex Radio Dispatch: brochure
Telex Radio Dispatch: brochureTelex Radio Dispatch: brochure
Telex Radio Dispatch: brochure
 
un bon lien en anglais .
un bon lien en anglais .un bon lien en anglais .
un bon lien en anglais .
 
The H.264/AVC Advanced Video Coding Standard: Overview and ...
The H.264/AVC Advanced Video Coding Standard: Overview and ...The H.264/AVC Advanced Video Coding Standard: Overview and ...
The H.264/AVC Advanced Video Coding Standard: Overview and ...
 
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...
A NEW SYSTEM ON CHIP RECONFIGURABLE GATEWAY ARCHITECTURE FOR VOICE OVER INTER...
 
IBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docIBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.doc
 
Report
ReportReport
Report
 
Practical Fundamentals of Voice over IP (VoIP) for Engineers and Technicians
Practical Fundamentals of Voice over IP (VoIP) for Engineers and TechniciansPractical Fundamentals of Voice over IP (VoIP) for Engineers and Technicians
Practical Fundamentals of Voice over IP (VoIP) for Engineers and Technicians
 
Videoconferencing Technology
Videoconferencing TechnologyVideoconferencing Technology
Videoconferencing Technology
 
Overview of Internet.ppt
Overview of Internet.pptOverview of Internet.ppt
Overview of Internet.ppt
 
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...
 
Lec40 41 42_43_44_45 video conferencing
Lec40 41 42_43_44_45 video conferencingLec40 41 42_43_44_45 video conferencing
Lec40 41 42_43_44_45 video conferencing
 
Lec40 45 video conferencing
Lec40 45 video conferencingLec40 45 video conferencing
Lec40 45 video conferencing
 
Netup dvb-tc-ci
Netup dvb-tc-ciNetup dvb-tc-ci
Netup dvb-tc-ci
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIP
 
A glance-at-voip
A glance-at-voipA glance-at-voip
A glance-at-voip
 
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)
Efficient Telecommunication Infrastructure with Internet Telephony (VoIP)
 

Similar to AN OPTIMIZED H.264-BASED VIDEO CONFERENCING SOFTWARE FOR ...

Deploying Hybrid Local Area and Wide Area Video Networks
Deploying Hybrid Local Area and Wide Area Video NetworksDeploying Hybrid Local Area and Wide Area Video Networks
Deploying Hybrid Local Area and Wide Area Video NetworksRonald Bartels
 
Monitoring whole mpeg transport stream
Monitoring whole mpeg transport streamMonitoring whole mpeg transport stream
Monitoring whole mpeg transport streamVolicon
 
MD Bridge in a Box Concept Explained
MD Bridge in a Box Concept ExplainedMD Bridge in a Box Concept Explained
MD Bridge in a Box Concept ExplainedMelissa Dingman
 
VIDEO CONFERENCING SYSTEM USING WEBRTC
VIDEO CONFERENCING SYSTEM USING WEBRTCVIDEO CONFERENCING SYSTEM USING WEBRTC
VIDEO CONFERENCING SYSTEM USING WEBRTCIRJET Journal
 
Evaluating Wavelet Tranforms for Video Conferencing Applications
Evaluating Wavelet Tranforms for Video Conferencing ApplicationsEvaluating Wavelet Tranforms for Video Conferencing Applications
Evaluating Wavelet Tranforms for Video Conferencing ApplicationsVideoguy
 
Sini online assignment
Sini online assignmentSini online assignment
Sini online assignmentsiniajay
 
10.1.1.184.6612
10.1.1.184.661210.1.1.184.6612
10.1.1.184.6612NITC
 
Java Abs Implementation Of Vvoip
Java Abs   Implementation Of VvoipJava Abs   Implementation Of Vvoip
Java Abs Implementation Of Vvoipncct
 
Ultra-Videoconferencing
Ultra-VideoconferencingUltra-Videoconferencing
Ultra-VideoconferencingVideoguy
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPVideoguy
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPVideoguy
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPVideoguy
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPVideoguy
 
Video Conferencing Standards
Video Conferencing StandardsVideo Conferencing Standards
Video Conferencing StandardsVideoguy
 
Video Conferencing Standards
Video Conferencing StandardsVideo Conferencing Standards
Video Conferencing StandardsVideoguy
 

Similar to AN OPTIMIZED H.264-BASED VIDEO CONFERENCING SOFTWARE FOR ... (20)

H264 final
H264 finalH264 final
H264 final
 
Deploying Hybrid Local Area and Wide Area Video Networks
Deploying Hybrid Local Area and Wide Area Video NetworksDeploying Hybrid Local Area and Wide Area Video Networks
Deploying Hybrid Local Area and Wide Area Video Networks
 
Video QoS
Video QoSVideo QoS
Video QoS
 
Monitoring whole mpeg transport stream
Monitoring whole mpeg transport streamMonitoring whole mpeg transport stream
Monitoring whole mpeg transport stream
 
MD Bridge in a Box Concept Explained
MD Bridge in a Box Concept ExplainedMD Bridge in a Box Concept Explained
MD Bridge in a Box Concept Explained
 
VIDEO CONFERENCING SYSTEM USING WEBRTC
VIDEO CONFERENCING SYSTEM USING WEBRTCVIDEO CONFERENCING SYSTEM USING WEBRTC
VIDEO CONFERENCING SYSTEM USING WEBRTC
 
Evaluating Wavelet Tranforms for Video Conferencing Applications
Evaluating Wavelet Tranforms for Video Conferencing ApplicationsEvaluating Wavelet Tranforms for Video Conferencing Applications
Evaluating Wavelet Tranforms for Video Conferencing Applications
 
Sini online assignment
Sini online assignmentSini online assignment
Sini online assignment
 
Live Streaming over P2PSIP
Live Streaming over P2PSIPLive Streaming over P2PSIP
Live Streaming over P2PSIP
 
[32]
[32][32]
[32]
 
10.1.1.184.6612
10.1.1.184.661210.1.1.184.6612
10.1.1.184.6612
 
Java Abs Implementation Of Vvoip
Java Abs   Implementation Of VvoipJava Abs   Implementation Of Vvoip
Java Abs Implementation Of Vvoip
 
Ultra-Videoconferencing
Ultra-VideoconferencingUltra-Videoconferencing
Ultra-Videoconferencing
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIP
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIP
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIP
 
Migrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIPMigrating Visual Communications from H.323 to SIP
Migrating Visual Communications from H.323 to SIP
 
Iptv
IptvIptv
Iptv
 
Video Conferencing Standards
Video Conferencing StandardsVideo Conferencing Standards
Video Conferencing Standards
 
Video Conferencing Standards
Video Conferencing StandardsVideo Conferencing Standards
Video Conferencing Standards
 

More from Videoguy

Energy-Aware Wireless Video Streaming
Energy-Aware Wireless Video StreamingEnergy-Aware Wireless Video Streaming
Energy-Aware Wireless Video StreamingVideoguy
 
Microsoft PowerPoint - WirelessCluster_Pres
Microsoft PowerPoint - WirelessCluster_PresMicrosoft PowerPoint - WirelessCluster_Pres
Microsoft PowerPoint - WirelessCluster_PresVideoguy
 
Proxy Cache Management for Fine-Grained Scalable Video Streaming
Proxy Cache Management for Fine-Grained Scalable Video StreamingProxy Cache Management for Fine-Grained Scalable Video Streaming
Proxy Cache Management for Fine-Grained Scalable Video StreamingVideoguy
 
Free-riding Resilient Video Streaming in Peer-to-Peer Networks
Free-riding Resilient Video Streaming in Peer-to-Peer NetworksFree-riding Resilient Video Streaming in Peer-to-Peer Networks
Free-riding Resilient Video Streaming in Peer-to-Peer NetworksVideoguy
 
Instant video streaming
Instant video streamingInstant video streaming
Instant video streamingVideoguy
 
Video Streaming over Bluetooth: A Survey
Video Streaming over Bluetooth: A SurveyVideo Streaming over Bluetooth: A Survey
Video Streaming over Bluetooth: A SurveyVideoguy
 
Video Streaming
Video StreamingVideo Streaming
Video StreamingVideoguy
 
Reaching a Broader Audience
Reaching a Broader AudienceReaching a Broader Audience
Reaching a Broader AudienceVideoguy
 
Considerations for Creating Streamed Video Content over 3G ...
Considerations for Creating Streamed Video Content over 3G ...Considerations for Creating Streamed Video Content over 3G ...
Considerations for Creating Streamed Video Content over 3G ...Videoguy
 
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMINGADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMINGVideoguy
 
Impact of FEC Overhead on Scalable Video Streaming
Impact of FEC Overhead on Scalable Video StreamingImpact of FEC Overhead on Scalable Video Streaming
Impact of FEC Overhead on Scalable Video StreamingVideoguy
 
Application Brief
Application BriefApplication Brief
Application BriefVideoguy
 
Video Streaming Services – Stage 1
Video Streaming Services – Stage 1Video Streaming Services – Stage 1
Video Streaming Services – Stage 1Videoguy
 
Streaming Video into Second Life
Streaming Video into Second LifeStreaming Video into Second Life
Streaming Video into Second LifeVideoguy
 
Flash Live Video Streaming Software
Flash Live Video Streaming SoftwareFlash Live Video Streaming Software
Flash Live Video Streaming SoftwareVideoguy
 
Videoconference Streaming Solutions Cookbook
Videoconference Streaming Solutions CookbookVideoconference Streaming Solutions Cookbook
Videoconference Streaming Solutions CookbookVideoguy
 
Streaming Video Formaten
Streaming Video FormatenStreaming Video Formaten
Streaming Video FormatenVideoguy
 
iPhone Live Video Streaming Software
iPhone Live Video Streaming SoftwareiPhone Live Video Streaming Software
iPhone Live Video Streaming SoftwareVideoguy
 
Glow: Video streaming training guide - Firefox
Glow: Video streaming training guide - FirefoxGlow: Video streaming training guide - Firefox
Glow: Video streaming training guide - FirefoxVideoguy
 

More from Videoguy (20)

Energy-Aware Wireless Video Streaming
Energy-Aware Wireless Video StreamingEnergy-Aware Wireless Video Streaming
Energy-Aware Wireless Video Streaming
 
Microsoft PowerPoint - WirelessCluster_Pres
Microsoft PowerPoint - WirelessCluster_PresMicrosoft PowerPoint - WirelessCluster_Pres
Microsoft PowerPoint - WirelessCluster_Pres
 
Proxy Cache Management for Fine-Grained Scalable Video Streaming
Proxy Cache Management for Fine-Grained Scalable Video StreamingProxy Cache Management for Fine-Grained Scalable Video Streaming
Proxy Cache Management for Fine-Grained Scalable Video Streaming
 
Adobe
AdobeAdobe
Adobe
 
Free-riding Resilient Video Streaming in Peer-to-Peer Networks
Free-riding Resilient Video Streaming in Peer-to-Peer NetworksFree-riding Resilient Video Streaming in Peer-to-Peer Networks
Free-riding Resilient Video Streaming in Peer-to-Peer Networks
 
Instant video streaming
Instant video streamingInstant video streaming
Instant video streaming
 
Video Streaming over Bluetooth: A Survey
Video Streaming over Bluetooth: A SurveyVideo Streaming over Bluetooth: A Survey
Video Streaming over Bluetooth: A Survey
 
Video Streaming
Video StreamingVideo Streaming
Video Streaming
 
Reaching a Broader Audience
Reaching a Broader AudienceReaching a Broader Audience
Reaching a Broader Audience
 
Considerations for Creating Streamed Video Content over 3G ...
Considerations for Creating Streamed Video Content over 3G ...Considerations for Creating Streamed Video Content over 3G ...
Considerations for Creating Streamed Video Content over 3G ...
 
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMINGADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
ADVANCES IN CHANNEL-ADAPTIVE VIDEO STREAMING
 
Impact of FEC Overhead on Scalable Video Streaming
Impact of FEC Overhead on Scalable Video StreamingImpact of FEC Overhead on Scalable Video Streaming
Impact of FEC Overhead on Scalable Video Streaming
 
Application Brief
Application BriefApplication Brief
Application Brief
 
Video Streaming Services – Stage 1
Video Streaming Services – Stage 1Video Streaming Services – Stage 1
Video Streaming Services – Stage 1
 
Streaming Video into Second Life
Streaming Video into Second LifeStreaming Video into Second Life
Streaming Video into Second Life
 
Flash Live Video Streaming Software
Flash Live Video Streaming SoftwareFlash Live Video Streaming Software
Flash Live Video Streaming Software
 
Videoconference Streaming Solutions Cookbook
Videoconference Streaming Solutions CookbookVideoconference Streaming Solutions Cookbook
Videoconference Streaming Solutions Cookbook
 
Streaming Video Formaten
Streaming Video FormatenStreaming Video Formaten
Streaming Video Formaten
 
iPhone Live Video Streaming Software
iPhone Live Video Streaming SoftwareiPhone Live Video Streaming Software
iPhone Live Video Streaming Software
 
Glow: Video streaming training guide - Firefox
Glow: Video streaming training guide - FirefoxGlow: Video streaming training guide - Firefox
Glow: Video streaming training guide - Firefox
 

AN OPTIMIZED H.264-BASED VIDEO CONFERENCING SOFTWARE FOR ...

  • 1. AN OPTIMIZED H.264-BASED VIDEO CONFERENCING SOFTWARE FOR MOBILE DEVICES Hans L. Cycon∗ , Thomas C. Schmidt, Gabriel Hege† , Matthias W¨ hlisch‡ , Mark Palkow§ a {h.cycon,hege}@fhtw-berlin.de, {t.schmidt,waehlisch}@ieee.org, palkow@daviko.com ABSTRACT one hand, high consumer demands on visual quality on the Mobile phones and related gadgets in networks are on the spot other, advise applications to take advantage of the latest stan- to deliver sufficient performance for rich multimedia applica- dard for video coding H.264/AVC [1]. tions and communication. In this report we introduce a video H.264/AVC provides gains in compression efficiency of conferencing software, which seamlessly integrates mobile up to 50 % over a wide range of bit rates and video resolutions with stationary users into fully distributed multi-party conver- compared to previous standards. While H.264/AVC decoding sations. Innovations related to this work are twofold. At first software has been successfully deployed on handhelds, high we report on a highly optimized realisation of a H.264 codec computational complexity still prevented pure software en- and our implementation experiences of the video conference coders in current mobile systems. There are however also fast software on a consumer mobile. Within the tight bounds of hardware implementations available, which give rise to an in- real-time requirements on mobiles, the coding software out- creasing offer of device- and operator-bound video services. performs compatible H.264 realizations. At second we present In this work we first introduce a pure software solution an integrated peer-to-peer group communication solution, which for real-time video communication on standard smartphones scales well for medium-size conferences and accounts for the in section 2. These mobile clients extend a lightweight, fea- heterogeneous nature of mobile and stationary participants. ture rich conferencing application developed for an infras- tructure compliant use on standard PCs. In the second part Index Terms— Mobile video coding, H.264/MPEG-4 AVC we present the underlying peer-to-peer group communication software codec, mobile conferencing, peer-to-peer group com- scheme, which performs well for medium-size conferences munication, distributed SIP conference management and accounts for the heterogeneous nature of mobile and sta- tionary participants, cf. section 3. This includes on the one 1. INTRODUCTION hand SIP [2] standard compliant session signalling with re- spect to group communication, and on the other hand effi- The idea of augmenting voice calls by video has been around cient, serverless media distribution, self-adjusting to the ac- for several decades, but only the flexibility of the Internet gen- tual network infrastructure support. Conclusions and an out- erated a noticeable deployment. As compared to audio, video look follow in the final section. processing places significantly higher demands on end system and network transmission capabilities. The rapid evolution 2. THE DAVIKO VIDEOCONFERENCING of networks and processors have paved the way for realistic SOFTWARE group conferences conducted at standard personal computers, combining about a dozen visual streams of Half-QVGA (240 In this section we give an overview of our reference imple- x 160 pixel @ 15-30 fps) resolution. mentation, a digital audio-visual conferencing system, realised Mobile phones and networked consumer portables are now as a serverless multipoint video conferencing software with- on the spot to deliver sufficient performance for rich multi- out MCU developed by the authors [3]. It has been designed media applications and communication, as well. Videocon- in a peer-to-peer model as a lightweight Internet conferenc- ferencing though, which requires simultaneous decoding and ing tool aimed at email-like friendliness of use. The system encoding in real-time, poses still a grand challenge to the mo- is built upon a fast H.264/MPEG-4 AVC standard conformal bile world. Limited and expensive wireless channels on the video codec implementation [4] called DAVC. By control- ∗ The author is with FHTW Berlin, 10318 Berlin, Germany. ling the coding parameters appropriately, the software permits † Thomas and Gabriel are with HAW Hamburg, Dept. Informatik, Berliner scaling in bit rate from 48 to 1440 kbit/s on the fly. Tor 7, 20099 Hamburg, Germany. Audio data is compressed using a 16 kHz speech-optimized ‡ Matthias is with link-lab, H¨ nower Str. 35, 10318 Berlin, Germany and o also with HAW Hamburg. variable bit rate codec [5] with extremely short latencies of § Mark is with daViKo GmbH, Am Borsigturm 40, 13507 Berlin, Ger- 40 ms (plus network packet delay). All streams can be trans- many. mitted by unicast as well as by multicast protocols. Within
  • 2. 4 3 the application, audio streams are prioritized over video since 4 2 user experience is usually more sensitive to losses in audio 4 1 packets than those of video packets, which both may result 4 0 < 3 fp s from transmission errors or network congestions. 3 9 An application-sharing facility is included for collabora- 2 8 4 fp s P S N R [d B ] 3 8 tion and teleteaching. It enables participants to share or broad- 3 7 cast not only static documents, but also any selected dynamic D A V C x 2 6 4 PC actions like animations. All audio/video - streams includ- 3 6 2 1 0 fp s J M M P E G -4 ing dynamic application sharing actions can be recorded on 3 5 < 5 fp s any site. This system is equally well suited to intranet and 3 4 wireless video conferencing on a best effort basis, since the 3 3 0 5 0 1 0 0 1 5 0 2 0 0 2 5 0 3 0 0 audio/video quality can be controlled to adapt the data stream B itr a te [k b it/s ] to the available bandwidth. (a) Akiyo (cif, 300 frames at 30 Hz) The daViKo conferencing system is available for desktop 4 3 computers running MS–Windows or Linux and on handhelds 4 2 with Windows Mobile operating system. 4 1 4 0 < 3 fp s 3 9 3 8 1 0 6 fp s 2.1. The DAVC Codec P S N R [d B ] 3 7 8 8 fp s 3 6 DAVC, the core of the videoconferencing system, is a fast, 3 5 D A V C highly optimized H.264/MPEG-4 AVC standard implementa- 3 4 x 2 6 4 J M tion. It realizes a Baseline profile, optimized for real-time en- 3 3 M P E G -4 3 2 coding (as well as real-time decoding) by means of a fast mo- < 5 fp s 3 1 tion estimation strategy including integer-pel diamond search 3 0 as well as a fast subpel refinement strategy up to 1 pel motion 4 0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0 1 2 0 0 B itr a te [k b it/s ] 1 4 0 0 1 6 0 0 1 8 0 0 2 0 0 0 accuracy. Motion estimation includes the choice of several (b) Foreman (cif, 300 frames at 30 Hz) different macroblock (MB) partitions and multiple reference frames, as permitted by the H.264/MPEG-4 AVC standard. Fig. 1. RD plot for test sequences in CIF resolution compar- For choosing between different MB partitions for motion- ing three different H.264/MPEG-4 AVC encoder implementa- compensated (i.e. temporal) prediction and MB-based intra tions as well as a RD-optimized MPEG-4 (Part◦ 2) Advanced (i.e. spatial) prediction modes, a fast rate-distortion (RD) Simple Profile implementation. based mode decision algorithm with early termination con- ditions has been employed. In comparison to the well-known open source H.264/MPEG- Note that the DAVC codec along with the H.264/AVC de- 4 AVC encoder implementation of x264 [6], our DAVC en- sign also includes some suitable mechanisms to quickly re- coder implementation achieves up to 0.5 dB PSNR better RD cover from video packet loss. performance and a considerable increase in encoding speed when using comparable encoder settings. For selected RD 2.2. Mobile Video Codec Performance points we measured 284 encoded frames per second (fps) as compared to 210 fps for x264. In Figure 1, typical examples In ongoing work, the DAVC codec has been adapted to sustain of such a comparison between x264 and DAVC are shown. real-time performance on mobile devices. The mobile codec In addition to the RD-performance of those two real-time en- version operates at reduced complexity for motion compen- coder implementations, this plot also shows the RD behav- sation with a highly optimized code base for the target plat- ior of two non real-time encoder implementations, as given form. Motion compensation has been limited to work on 16 x by the H.264/MPEG-4 AVC Joint Model (JM) reference soft- 16 pixel blocks, only. The code tuning includes the efficient ware (with Baseline profile settings) and a MPEG-4 (Part◦ 2) use of the wireless MMX instruction set available at the tar- Advanced Simple Profile implementation. The latter two en- get system. Portability is sustained by an ANSI compliant C coders were operated using a high-complexity RD-based mode version, to be augmented incrementally by platform specific decision strategy for demonstrating the capabilities of both injections. video coding standards when neglecting any real-time con- The application was tested on a 520 MHz Xscale proces- straints. Figures 1(a) and 1(b) also contain the number of sor built in an Asus P735 system. Thereon it can reliably encoded frames per second (fps) for selected RD points as a encode and decode a QCIF video stream in parallel at 5/10 measure for maximum encoding speed. Similar results were fps, without CPU exhaustion or frame dropping. Real-time also achieved for other test sequences. encoding rate increases up to 10 fps for moderate video com-
  • 3. 4 3 quires group session management and media distribution at 4 2 4 1 peers, which for the sake of standard compliance we realize 4 0 with group conferencing functions in SIP, cf. [7, 8, 9]. Im- 3 9 plemented as pure software on standard personal devices, user 3 8 agent peers are exposed to severe restrictions in real-world de- P S N R [d B ] 3 7 3 6 ployments: Often they are located behind NATs and firewalls 3 5 D A V C F u ll with network capacities confined to asymmetric DSL or wire- D A V C M o b ile 3 4 less links. Capacity constraints and resilience to node failures 3 3 3 2 require peer-managed ad-hoc conferences to organize in a dis- 3 1 tributed multi-party model. As a key component, the hetero- 3 0 geneity of clients must be accounted for, whereas the range of 0 2 0 4 0 6 0 8 0 1 0 0 1 2 0 1 4 0 1 6 0 1 8 0 2 0 0 B itr a te [k b it/s ] scalability is limited to about a dozen parties in videoconfer- ences. Fig. 2. RD plot for test sequence “Akiyo” in QCIF resolution at 10 fps, comparing the DAVC mobile encoder to the DAVC 3.1. P2P Adaptive Architecture fully optimized implementation. A peer-to-peer conferencing system faces the grand challenge to be robust w.r.t. the infrastructure. The role a user agent is able to attain in a distributed scenario needs to be adaptively determined according to constraints of its device and current network attachment. In a simplified scenario, clients may be divided into two groups, distinguished by their ability to act as a SIP conference focus or not. A focus must be globally addressable and have access to necessary processing and net- work resources. This elementary adaptation scheme can be based on indi- vidual decisions of user agents and gives rise to a hybrid ar- chitecture of super peers, chosen from potential focus nodes, and remaining leaf nodes. To decide on its potential role of building a focus, a client at first needs to determine NATs and firewalls. Aside from address evaluation, this is done by a simple probe packet exchange. As the implementation is CPU-type aware, processing restrictions are easily evaluated, Fig. 3. The mobile video application. as well. However, an a priori judgement on available network bandwidth is not easily obtained. An evaluation of the lo- plexity. QCIF @15 fps is the maximal image feed that can be cal link capacity is frequently misleading, as wireless devices obtained from the front camera in our test equipment. Perfor- may be located behind wired transmitters of lower, asymmet- mance values for the mobile encoder are displayed in figure 2 ric capacity such as in ADSL. Current experiments to quickly and compared to the results for the full DAVC. retrieve reasonable estimates of up- and downstream capacity Reduced coding complexity results in an enhanced data are ongoing on the basis of variable packet size, nonintrusive rate send by the mobile, but the gross total rate for a bidi- estimators, cf. [10]. Note that network capacity detection is rectional video exchange at 10 fps complies to 3GPP/UMTS of vital use for temporal adaptation of the video codecs, as bandwidths constraints. Note that experimental conditions well. are not fully compatible: The image sequence obtained from Leaf nodes attach to super peers in subordinate position, the front camera of the mobile is significantly more noisy than whereas potential focus nodes may be assigned to be super our standard USB cameras connected to the desktop, which peers or leaves. Super peers provide global connectivity among increases the image complexity and thereby the data rate. each other and NAT traversal assistance to leaves, while leaf nodes experience super peers in different roles: A leaf nodes 3. DISTRIBUTED POINT-TO-MULTIPOINT sees its next hop super peer as the conference focus, while the CONFERENCING remote super peers act as proxies on the path to the leaves be- hind.1 This set-up corresponds to the well known architecture Our application aims for simple, flexible, and cost-efficient 1 This architecture relies on the presence of at least one globally address- ad-hoc conferencing functions, which scale appropriately well, able, sufficiently powerful peer. As there are many scenarios, where this is but avoid any infrastructure assistance. Such a solution re- likely to fail, we advise for and offer a permanently deployed ‘silent’ relay-
  • 4. of Gnutella 0.6 and successive hybrid unstructured peer-to- [4] J. Ostermann, J. Bormans, P. List, D. Marpe, N. Nar- peer systems, cf. [11]. Despite its architectural analogy, our roschke, F. Pereira, T. Stockhammer, and T. Wedi, routing layer for real-time group applications follows a differ- “Video Coding with H.264/AVC: Tools, Performance ent, next-hop design. and Complexity,” IEEE Circuits and Systems Magazine, vol. 4, no. 1, pp. 7–28, April 2004. 4. CONCLUSIONS & OUTLOOK [5] “The Speex projectpage,” http://www.speex.org, 2007. We have presented a peer-to-peer software for high-quality [6] “VideoLan: x264 - a free h264/avc encoder,” videoconferencing on mobiles, admitting utmost flexibility http://www.videolan.org/developers/x264.html, 2007. with respect to end systems, operators and network provi- sioning. To the best of our knowledge, this is the first soft- [7] A. Johnston and O. Levin, “Session Initiation Proto- ware implementation of an H.264 video encoder that operates col (SIP) Call Control - Conferencing for User Agents,” in real-time on mobile phones. An adaptive, fully distributed IETF, RFC 4579, August 2006. conference management scheme with SIP has been developed [8] R. Mahy, R. Sparks, J. Rosenberg, D. Petrie, and as part of the multi-party scenario. This hybrid peer-to-peer A. Johnston, “A Call Control and Multi-party usage model accounts for client capabilities as well as network at- framework for the Session Initiation Protocol (SIP),” tachment, and does scale well beyond standard use. IETF, Internet Draft - work in progress 9, November In future work we will concentrate on further optimiza- 2007. tion and generalization of the video coding software to make it available for a wider variety of platforms. Network adap- [9] T. C. Schmidt and M. W¨ hlisch, “Group Conference a tation and capacity evaluation will require further work to ar- Management with SIP,” in SIP Handbook: Services, rive at estimates that reliably serve the needs in real world Technologies, and Security, S. Ahson and M. Ilyas, Eds. environments, as well. Boca Raton, FL, USA: CRC Press, November 2008, to Additional research will target at benefits possibly inher- appear, on invitation. ited from key-based routing. As common application layer multicast schemes, which rely on dedicated shared or source [10] R. Prasad, C. Dovrolis, M. Murray, and kc claffy, specific trees, are significantly sensitive to client departure “Bandwidth Estimation: Metrics, Measurement Tech- and of insufficient performance in medium size groups, and niques, and Tools,” IEEE Network, vol. 17, no. 6, pp. as conference routing actually can be seen as an application 27–35, November–December 2003. layer broadcasting problem, new and highly optimized struc- [11] R. Steinmetz and K. Wehrle, Eds., Peer–to–Peer Sys- tured broadcast algorithms are desirable. The bidirectional tems and Applications, ser. LNCS. Berlin Heidelberg: shared tree approach introduced in [12] may be a promising Springer–Verlag, 2005, vol. 3485. point to start at. [12] M. W¨ hlisch and T. C. Schmidt, “Between Underlay and a Overlay: On Deployable, Efficient, Mobility-agnostic Acknowledgement Group Communication Services,” Internet Research, vol. 17, no. 5, pp. 519–534, November 2007. This work is supported by the German Bundesministerium f¨ r u Bildung und Forschung within the project Moviecast (http://moviecast.realmv6.org). 5. REFERENCES [1] ITU-T Recommendation H.264 & ISO/IEC 14496-10 AVC, “Advanced Video Coding for Generic Audiovisual Services,” ITU, Tech. Rep., 2005, draft Version 3. [2] J. Rosenberg, H. Schulzrinne, G. Camarillo, A. John- ston, J. Peterson, R. Sparks, M. Handley, and E. Schooler, “SIP: Session Initiation Protocol,” IETF, RFC 3261, June 2002. [3] M. Palkow, “The daViKo homepage,” 2008, http://www.daviko.com. peer at some unrestricted place.