Network Working Group W. Chimiak
Request for Comments: 1453 BGSM
April 1993
A Comment on Packet Video Remote Conferencing and the
Transport/Network Layers
Status of this Memo
This memo provides information for the Internet community. It does
not specify an Internet standard. Distribution of this memo is
unlimited.
Abstract
The new generation of multimedia applications demands new features
and new mechanisms for proper performance. ATM technology has moved
from concept to reality, delivering very high bandwidths and new
capabilities to the data link layer user. In an effort to anticipate
the high bandwidth-delay data link layer, Delta-t [Delta-t], NETBLT
[RFC 988], and VMTP [RFC 1045] were developed. The excellent
insights and mechanisms pioneered by the creators of these
experimental Internet protocols were used in the design of Xpress
Transfer Protocol (XTP) [XTP92] with the goal of eventually
delivering ATM bandwidths to a user process. This RFC is a vehicle
to inform the Internet community about XTP as it benefits from past
Internet activity and targets general-purpose applications and
multimedia applications with the emerging ATM networks in mind.
Networking is no longer synonymous with analog telephony. High-
performance lower-layer networks have made possible exciting new
applications: collaboratory environments, distributed client/server
computing, remote conferencing, teleclassrooms, and distributed
life-sciences imaging. These applications normally demand a great
deal of bandwidth and often create operating system bottlenecks.
Enabling these new multimedia applications entails delivering
bandwidth to the applications, not just having bandwidth available on
the network. This statement may appear obvious, but often solutions
at the transport layer are satisfied by having bandwidth at that
layer without sufficient sensitivity to higher-layer access to the
bandwidth. The unavailability of bandwidth at upper layers is
becoming the real issue as the networks are becoming a high-
performance virtual backplane without concomitant high-performance
control schemes. It appears that new services are needed that
require communication with all layers. The ATM architecture calls
Chimiak [Page 1]
RFC 1453 Comments on Video Conferencing April 1993
for such an integrated control scheme.
The challenges of remote conferencing is an application whose
challenges may be met at the data link layer by the emerging
broadband networks. If so, important medical applications such as
medical imaging for diagnosis and treatment planning would be
possible [CHIM92]. Remote conferencing would permit imaging
applications for life sciences through the use of national resource
centers. Collaboratory conferences in molecular modeling, design
efforts, and visualization of data in numerous disciplines could
become possible.
At the Second Packet Video Workshop, held December, 1992, at MCNC in
the Research Triangle Park, North Carolina, a recurrent theme was the
use of multimedia in remote conferencing. Its applications included
the use of interactive, synchronized voice and video transmission,
multicast transmission, data transfer, graphics transmission,
noninteractive video and audio transmission, and data base query
within a virtually shared workspace. A few participants doubted the
ability of current computer networks to handle these multimedia
applications and preferred only connection-oriented, circuit-switched
services. Most participants, however, looked forward to using an
integrated network approach.
Remote conferencing as seen at the workshop requires a set of
functions. It must provide session scheduling that deals with
initiating a session, joining in-progress sessions, leaving a session
without tearing it down if there are multiple participants, and
terminating a session.
The remote-conferencing session needs a control subsystem that is
either tightly controlled for an n-to-n connection for two to 15
participants, or loosely controlled for a 1-to-n connection for any
number of participants. The Multipeer-Multicast Consortium is
working on defining the control requirements and the mechanisms for
control. At the Packet Video Workshop, one participant presented a
conference control protocol (CCP) shown in Figure 1 [CCP92]. In this
architecture the CCP controls the Network Voice Protocol (NVP)
[RFC741] and the Packet Video Protocol (PVP) [PVP81] over the
experimental Internet Stream Protocol, Version 2 (ST-II) [RFC1190]
rather than IP.
Latency and intramedia synchronization and intermedia synchronization
(lip-sync) are critical for the interactive voice and video streams
Chimiak [Page 2]
RFC 1453 Comments on Video Conferencing April 1993
of remote conferencing. Client/server applications including data
base operations are equally important. The ability to display
noninteractive video and high-resolution graphics is necessary.
As with most applications, security will eventually be an issue. At
the very least, there must be a mechanism to determine who can find
out what about conference and who can join a conference. This
determination will be part of an upper-layer protocol.
+--------------+ +--------+ +-----+ +------------+
|Teleconference| | File | |Email| | Domain |
| (CCP) | |Transfer| | | |Name Service|
+----+-------+-+ +-----+--+ +-+---+ +-----+------+
| | |__ __| |
| | || |
+-----+--+ +--+-----+ +-++-+ +----+---+
|Network | | Packet | | T | | U |
| Voice | | Video | | C | | D |
|Protocol| |Protocol| | P | | P |
+---+----+ +--+-----+ +-+--+ +--+-----+
|__ __| |__ __|
| | | |
+-+---+--+ +-+-------+-+
| Stream | | I |
|Protocol| | P |
+---+----+ +---+-------+
| |
+-----+----------------------+----+
|IEEE_802.X,FDDI,DARTnet,ATOMIC...|
+---------------------------------+
Figure 1: The Connection Control Protocol Architecture
The solutions must range in geography from single machines through
LAN, CAN, MAN, WAN conferences, as well as in data content from PCs
to high-end workstations. Implicit in the scaling is the notion of
product and application interoperability.
Remote conferencing applications should take advantage of future
network enhancements, as well as the functions provided by ATM, FDDI,
and SMDS. Doing so should provide function versus resource trade-
offs such as cost versus error control and cost versus rate control.
As a result, the transport layer should be able to provide the
services offered by the data link layer.
Most of the presentation on remote conferencing indicated a need for
some degree of multicast functionality, ranging from the 1-to-n, with
conference membership completely known, to conferences for which
Chimiak [Page 3]
RFC 1453 Comments on Video Conferencing April 1993
existence of a group is known, but exact membership is not.
In remote conferencing, it is preferable to use one network for all
information traffic. This network should have an offered quality of
service (QOS) criteria to accommodate tradeoff metrics, which would
include guaranteed throughput, connection reliability, high call-
completion rate, few dropped calls, tolerable error rate, varying
degrees of compression on the video and audio streams, and tolerable
motion artifacts, flow control, and latency. The QOS should be an
input to the transport layer from the application or transport
service user.
The compression/coding function should provide time-stamping and
packetizing information, as well as real-time echo cancellation.
These functions are usually at the presentation and session layer of
the Open System Interconnection (OSI) model or the at the application
in some Internet models, but not the transport layer.
RFC 1193 deals with the requirements of real-time communications,
which include some of the same capabilities [RFC1193]. But the
requirements articulated create the necessity for new
transport/network protocols. The new protocols under development by
the Audio Visual Transport [SCHU92] (RTC, RTCP), and other protocols
in this area (ST-II) suggest an architecture like that shown in
Figure 2.
These approaches may work. However, they encourage a discipline that
creates a protocol for each new class of application. Another
approach might be to unify the protocols. It is felt that this is
one of the main goals of XTP (see Figure 3).
Other design considerations of XTP include the following:
Chimiak [Page 4]
RFC 1453 Comments on Video Conferencing April 1993
+----------------------+
| media |
| application |
+--------+----+-+------+
| |RTCP| | |
| +----+ | T |
| RTP | C |
+-----+-----+ | P |
|ST-II| UDP | | |
+ +-----+---+------|
| | IP |
+-----+-------+--------+
| Data Link Layer |
+----------------------+
Figure 2: One emerging multimedia architecture
+--------------+ +--------+ +-----+ +------------++-----------+
|Teleconference| | File | |Email| | Domain || media |
| | |Transfer| | | |Name Service||application|
+------+-------+ +----+---+ +--+--+ +-----+------++-----+-----+
| | | | |
+---------------+--------+----------+-------------+
|
+-------+--------+
|Unified Protocol|
+----------------+
|Data Link Layer |
+----------------+
Figure 3: Another integrated multimedia architecture
(1) Developing a protocol based on the work and experience of
the protocol groups such as IETF, which produced NETBLT,
VMTP, TCP, IP, and UDP.
(2) Incorporating lessons from Delta-t.
(3) Observing the design paradigm shift of ATM, SONET, and VMTP
in the header, trailer, and information segment construction.
(4) Targeting the real-time requirements articulated by the
Department of Defense SAFENET committee and the French
Ministry of Defense military real-time specification [GAM-T-103].
Mechanisms in XTP allow an application to approach the performance
desired. A session-scheduling mechanism for joining and leaving a
Chimiak [Page 5]
RFC 1453 Comments on Video Conferencing April 1993
multicast conference exists. The XTP mechanism for multicast
satisfies the loosely controlled session requirements. The tightly
controlled session would require the use of multiple XTP
associations.
The priority mechanism that uses the 32-bit SORT field permits an
application to prioritize data. Because XTP is a transport layer,
this priority mechanism follows through every node tranversed. There
is also an out-of-band delivery mechanism. However, XTP does not
offer latency control by itself; it just provides a priority
mechanism.
The selective acknowledgement, fast negative acknowledgement, and
selective retransmission permit an application to choose an
appropriate level of error control. The combination of the priority
mechanism and these error-control mechanisms is likely to approach
the latency and synchronization requirements of remote conferencing.
Noninteractive audio and video, as well as graphics presentation, can
be accommodated in many ways by the application. It is important
that the transport layer provides adequate mechanisms to deliver the
appropriate data streams in a manner compatible with the
applications. These applications can probably be accomplished by
means of extant protocols, as well as XTP.
The scalability of the solution will be a function of the standards
used. At the Packet Video Workshop, some of the applications
sacrificed computer network standards in favor of desired
performance. This approach usually impedes scalability. From the
explanation of the applications taking this approach, it appeared
that using XTP would have provided an adequate transport service for
the applications.
XTP was designed to provide mechanisms to accommodate application
requirements, that is, the ability to respond to QOS requests. For
example, guaranteed throughput may be accomplished by using XTP's
rate and burst control together with flow control or no flow control.
Tolerable error rate can be accomplished with partially error
controlled connections (PECC), a service which can be placed in the
application or just above the transport layer [PECC92]. Motion
artifacts and varying degrees of compression should be done above the
transport layer in coordination with the transport layer or possibly
in a network management function.
To produce an affordable solution, the hardware fabrication process
should be a design consideration. Technologies are evolving too
Chimiak [Page 6]
RFC 1453 Comments on Video Conferencing April 1993
rapidly to assume that a generic protocol design will anticipate all
fabrication advances, but this fact should not impede use of the
features of advanced hardware fabrication processes.
System interface problems and VLSI techniques should be considered in
the specification of the protocol. An examination of the ATM and
SONET standards appears to support this philosophy. Similarly,
NETBLT and VMTP design efforts seem to support this approach. XTP
does use it.
It is very helpful to break down the protocol into parallel-state
machines for execution on more inexpensive hardware. This procedure
reduces the context switching and interrupt handling requirements of
the hardware, thereby decreasing production costs while producing a
scalable protocol machine.
In parallel with the IETF efforts to enable multimedia applications
such as remote conferencing, the XTP forum members have been
experimenting with major elements of these applications.
(1) At the University of Virginia, more than 100 simulated voice
channels were run on an FDDI network [UVAVOICE92]. The
purpose of this experiment was to test the limits of FDDI
and a software version of XTP in a simulated interactive
voice environment. Multicasted, noninteractive video has
been supported there for several years.
UVa also has a video-mail demo over XTP/FDDI that uses
Fluent multimedia interface and standard JPEG compression.
This PC-based demo delivers full frame, full color, 30
frames/sec video from any network disk to a remote VGA
screen. It is important that users could not discern any
difference in playback between a local disk and a remote
disk. An Xpress File System (XFS) is used on server and
client systems.
(2) The Technical University of Berlin, Germany, reports that
the coordination, implementation, and operation of
multimedia services (CIO) of the R&D in Advanced
Communications Technologies in Europe (RACE) is using XTP as
a starting point for design [XTPRACE].
(3) At the Naval Command, Control, and Ocean Surveillance Center
Research, Development, Test and Evaluation Division NRaD
(formerly the Naval Ocean Systems Command (NOSC)), voice is
Chimiak [Page 7]
RFC 1453 Comments on Video Conferencing April 1993
multicasted over XTP/FDDI. A simple multicast is
distributed to a group with a latency of around 25 ms, where
the latency represents delay from the voice signal from the
microphone to the audio signal to the speaker. This group
is currently doing research on an n-party multicast of voice
(telephone conference-call paradigm [n x n multicast]).
(4) Commercially, Starlight Networks Inc., migrated a subset of
XTP into the transport layer of its video application
server. By using XTP rate control, full-motion, full-screen
compressed video is delivered at a constant 1.2 Mbps, over
switched-hub Ethernet to viewstations. This network
delivers at least 10 simultaneous video streams.
Therefore, XTP has been used in applications that were previously
placed over IP or even a data link layer.
Separating protocol policies and mechanisms [XTPbk] permits adoption
of new policies without altering offered mechanisms. An excellent
example is UVa's Partially Error Controlled Connections (PECC). This
control system maximizes error control in such a way that receiving
FIFOs are never starved i.e., the application, driver, or operating
system buffer control, and not the transport layer becomes the
bottleneck.
Because XTP is mechanism-rich and policy-tolerant, this very dynamic
error control policy mechanism is possible. Separating policy and
mechanism permits an error-control or flow-control policy to adapt to
the data link layer conditions without shutting down a connection and
rebuilding (or multiplexing) a new one on a different protocol stack.
This may also provide an easier way for a network management
subsystem to maintain a desired QOS.
Remote conferencing presents new opportunities for research,
business, and administration. Although some are proposing that only
classical circuit-switched mechanisms be used, most network engineers
are searching for ways to use the new features of FDDI, SMDS, and ATM
in multimedia applications such as remote conferencing. Some new
applications have been placed directly on a data link layer. New
Transport/Network layer combinations have been proposed and are being
tested. It is believed that consideration should be given to XTP as
a possible solution because its forum membership includes commercial,
government, and research institutions, some of which have implemented
various applications that contribute to an overall remote-
Chimiak [Page 8]
RFC 1453 Comments on Video Conferencing April 1993
conferencing application.
[CCP92] Schooler, E., "An Architecture for Multimedia Connection
Management", in Proceedings of the 4th IEEE ComSoc
International Workshop on Multimedia Communications,
Monterey, CA, April 1992.
[CHIM92] Chimiak, W., "The Digital Radiology Environment", IEEE
JSAC, Vol. 10, No. 7, pp. 1133-1144, September 1992.
[Delta-t] Watson, R. W., "Delta-t Protocols Specification",
Lawrence Livermore Laboratory, April 15, 1983.
[GAM-T-103] French Ministry of Defense, "GAM-T-103 Military
Real-Time Local Area Network Reference Model
(Transfer Layer)", February 7, 1987.
[PECC92] Dempsey, B., Strayer, T. and Weaver A., "Adaptive Error
Control for Multimedia Data Transfer", in Proceedings
of the IWACA 92, Munich, Germany, pp. 279-288, March
1992.
[PVP81] Cole, R., "PVP - A Packet Video Protocol", W-Note 28,
Information Sciences institute, University of
California, Los Angeles, CA, August 1981.
[RFC1045] Cheriton, D., "VMTP: Versatile Message Transaction
Protocol Specification", RFC 1045, Stanford
University, February 1988.
[RFC998] Clark, D., Lambert, M., and L. Zhang, "NETBLT: A Bulk
Data Transfer Protocol", RFC 998, MIT, March 1987.
[RFC1193] Ferrari, D., "Client Requirements For Real-Time
Communication Services", RFC 1193, UC Berkeley,
November 1990.
[RFC1190] Topolcic, C., Editor, "Experimental Internet Stream
Protocol: Version 2 (ST-II)", RFC 1190, CIP Working
Group, October 1990.
[SCHU92] Schulzrinne, H., "A Transport Protocol for Audio and
Video Conferences and other Multiparticipant
Real-Time Applications", Internet Engineering Task
Force, Internet-Draft, October 1992.
Chimiak [Page 9]
RFC 1453 Comments on Video Conferencing April 1993
[UVAVOICE92] Weaver, A. C. and McNabb, J.F., "Digitized Voice
Distribution Using XTP and FDDI", Transfer, Vol. 5,
No. 6, pp. 2-7 (November/December 1992).
[XTP92] Xpress Transfer Protocol, version 3.6, XTP Forum,
1900 State Street, Suite D, Santa Barbara, California
93101 USA, January 11, 1992.
[XTPbk] Strayer, W.T., Dempsey, B.J., and Weaver, A.C., "XTP:
The Xpress Transfer Protocol", Addison-Wesley
Publishing Company, Inc., 1992.
[XTPRACE] Rebensburg, K. and Miloucheva, I., "The Use of XTP in a
Large European Communication Project", XTP Forum
Research Affiliate Annual Report, Document 92-183,
pp. 105-112, 1992.
Security Considerations
Security issues are discussed in section 2.1.
Author's Address
William J. Chimiak
Department of Radiology
Bowman Gray School of Medicine
Medical Center Boulevard
Winston-Salem, NC 27157-1022
Phone: 919-716-2815
EMail: chim@relito.medeng.wfu.edu
Chimiak [Page 10]