![]() |
| |
UNDERSTANDING VOICE OVER IP BY DR. ROBERT M. KUHN, Senior Consultant COMPASS CONSULTING INTERNATIONAL, INC.
In telecommunications (beyond the level of two cans and a piece of string) there are 2 bi-directional streams (possibly over the same channel. In fact, It was common for a number of years to have the control stream riding on the same medium as the voice stream. This is known as "in-band" control or signaling. However with the advent of digital transport systems, most in-band signaling has been replaced by a separate control channel which is known, appropriately enough, as "out-of-band" signaling.), a "voice" stream (which includes voice, music, fax squeals, modem shrieks, etc.) and a control stream. "Voice over IP" (VoIP) refers to the transport of a telecommunications voice stream over a data network using the data transport mechanisms associated with the Internet, called the Transmission Control Protocol/Internet Protocol (TCP/IP) suite.( You might think that the restriction to the voice stream was self evident. However, one vendor presents the following ingenious and useful application as part of their VoIP strategy. It is a solution primarily for remote workers. The worker needs an IP connected computer and a Plain Old Telephone System (POTS) line. The worker uses a browser to connect to a web-site which, in turn, communicates with the PBX. The user then identifies and authenticates, giving the phone number of the POTS phone. The PBX connects to the telephone, and provides all the sophisticated features of the PBX like hold, conference calling, etc. by means of the phone line plus feature access through the computer interface.) Discussing Voice over IP is often difficult since voice over IP means different things to different people. To some it means voice over the Internet; to others it means voice over a point-to-point wide area network (WAN); to still others it means voice over the local area network (LAN). Complicating matters, various authors attempt to differentiate between the meanings by coining terms like "IP Telephony," "Internet Telephony," "Telephony over IP" or "Telecommunications over IP." For our purposes all these terms are synonymous. For the purposes of this white paper, we will be concentrating on VoIP in the LAN and WAN environments only, largely ignoring the issue of voice over the public Internet. Traditional telephony, is based on circuit switching and time-division multiplexing and is so reliable that the industry "standard" calls for "five nines reliability" or availability 99.999% of the time. So then why is anyone considering voice over IP? In his article, IP Telephony, the End of the World as We Know It? (Cause/Effect, Vol 22 No 2, 1999), Scott Street of Compass Consulting International, Inc. discusses the change drivers for IP telephony. They fall into 4 categories: economic, functional/technical, management, and hype. Scott's conclusion in 1999 was that none of the factors (except hype) truly favored IP telephony at that time. In an analysis for the August, 2000 ACUTA National Conference, Scott Street and Robert Kuhn (also of Compass) showed that, while there has been movement in all four areas, the earlier analysis remains fundamentally true. However, under exceptional circumstances, it is already possible to make the economic and technical case for VoIP. These exceptional cases require economically a short amortization period, lack of adequate building-to-building copper, a strong IT-Telecomm synergy, and a robust in-place data infrastructure. Typically early VoIP adoptions are administrative implementations without stringent e-911 or disabled-access requirements and a minimal need for advanced features. One way to arrive at the core arguments in favor of VoIP is to compare voice and data technologies on a "bit-by-bit" basis. Take a typical large university as an example. The typical data network in a university today delivers 100 Megabits per second (108 bps) to the desktop, with the core switching devices approaching Terabit-per-second (1012 bps) speeds. Compare this to a typical telecommunications network for a similar institution. The connections to the desktop are 64 kilobits-per-second (6.4x104 bps) and the telephone switch capacity would not reach 15,000 simultaneous calls (109 bps). This gives the data switches a thousand-fold edge in switching capacity. So, while telephones are much cheaper than computers, and telephone networks cheaper than data networks, if you divide by the capacity, the data network equipment is much, much cheaper bit-for-bit. To put it another way, given the explosion in bandwidth requirements for data and video, voice can "ride for free" on the data networks in terms of both economics and management. In the long run this is leading vendors to favor VoIP strategies for the future of telephony, but the steps taken by different vendors vary widely in technical approach. To understand the current state of VoIP it is necessary to look more closely at what is actually going on in any particular implementation.
One area of continued confusion in VoIP discussions is the question of where in a given network does the voice get converted to IP. There are several differing approaches to this issue and given that each has its pros and cons, the discussion below serves to identify the options. A voice comes out of our mouths as movement of air which goes into the telephone handset and gets transformed into an electrical signal. Initially the amplitude and frequency of the electrical signal vary in proportion to the voice's amplitude and frequency (an analog signal). An analog phone sends that signal out along its wires. A digital phone on the other hand takes that analog signal, samples it at frequent intervals, describes it numerically, converts the results into a sequence of binary numbers (1s and 0s), and sends that binary sequence as a series of electrical "pulses." In the case of Voice over IP, rather than just sending the digital sequence out along the wires as pulses, the digital sequence is transmitted using the (TCP/IP) protocol suite. This involves chopping the data into packets with addressing and labeling headers. There is considerable variation on where this "packetization" occurs:
If the phone isn't an IP phone, then the signal must be carried over a circuit to another device.
The situation with connection to circuit-switched networks is somewhat analogous. The trunk lines (either single lines or multiplexed lines) would connect to either a trunk gateway or an (IP-enabled) PBX.
If we look at diagrams (d) and (f) the phones and the trunks are connected to the IP-enabled PBX exactly the way they would be connected to a traditional PBX. So the question naturally arises, what "IP-enables" a traditional PBX? A traditional PBX has its phones connected as in diagram (d). But theoretically a PBX could also work with phones connected to the IP network via mechanisms (a), (b), or (c). In this instance an IP connection between the PBX (or its IP line interface) and an IP Phone (or converter or line gateway) is treated by the PBX as a special sort of telephone line. Many of the PBX vendors have released line-side VoIP implementations for their PBXs.
Another use of the IP network by a PBX is to carry communications to another PBX or its own remote nodes, replacing PBX-to-PBX trunks or the direct fiber of a fiber remote with an IP trunk interface. Most of the major PBX vendors, like Nortel, Lucent, and NEC to name a few, have released trunk-side implementations of VoIP.
Finally, PBX vendors are eyeing the enormous capacities (in the terabit-per-second (1012 bps) range of the new generation of IP switches (or packet switches in general). A digital telephone produces a 64 kilobits-per-second (6.4x104 bps) stream. Even with the overhead of packetization, the IP stream is less than 100 kilobits-per-second (105 bps),and terabit switching corresponds to 107 or 10 million separate voice connections! So packet switching is much more efficient/cheaper than traditional circuit switching. It is therefore not surprising that the vendors are showing in their "road-maps" IP switching replacing circuit switching inside PBXs. So a PBX can be IP-enabled, even without IP-connected lines or trunks, using an IP switching matrix. NEC has on its roadmap the use of a Cisco router for IP switching internal to its PBXs in the future.
In the earliest IP-enablements of traditional PBXs, the PBX treated each IP line or trunk pretty much the way it treats any other line or trunk, and each conversation was switched through the PBX switching matrix. The diagram below shows 2 such IP PBXs, connected using IP trunking via a Wide Area Network (WAN) Link between the two IP clouds. A telephone call from an IP Phone on one of the clouds to an IP Phone on the other cloud uses a lot of system and network resources: involving the switching matrix in each PBX and 2 IP streams for each PBX.
This is to be contrasted with the "pure IP" situation. For example, with a Cisco AVVID implementation, a Call Manager is involved in the set-up of a call between 2 IP Phones, but once the call is established, the 2 phones exchange packets directly.
There is no theoretical reason why an IP-PBX couldn't behave just like the pure IP system and just set up the call between 2 IP Phones and then have them communicate directly, and increasingly implementations of IP-PBXs are moving to this model. In practice the features we have come to expect from a modern phone system are provided at the PBX and so by passing the calls through the PBX the (PBX) vendors found it easier to provide those features to the IP Phones. But this approach means that any IP-Phone-to-IP Phone conversation involves 2 packet streams and burdens the PBX, making it difficult to scale to large numbers of phones and/or simultaneous calls. In the "pure IP" situation, it would be possible to pass all calls through the controlling device (e.g. H.323 Gatekeeper or Cisco Call Manager) but the vendors have typically chosen the scaleable approach at the expense of features.
The H.323 family of standards under the aegis of the International Multimedia Teleconferencing Consortium (IMTC) and the International Telecommunications Union (ITU) is central to VoIP. This set of standards includes standards for voice (G.711, G.722, G.723, G.728, G.729) and video (H.261, H.263) compression which are accepted industry-wide. On the other hand, the accompanying signaling/control standards (e.g., H.225) are less widely implemented. In fact, the IETF has a competing standard, called Session Initiation Protocol (SIP). Moreover, Cisco has introduced its own Skinny family of "open" protocols. Which standards a particular device supports effects interoperability, but, at this stage in the development of VoIP most of the implementations are single-vendor trials. The issue of the success of competing standards is more important in that a device which fails to support the standards which eventually prevail will have a limited lifetime.
We have already referred the reader to our discussions of the current short-comings and future prospects for VoIP technologies. The purpose of this paper has been to present the various current technologies combined under that rubric.
Home |