We normally think in telephony of phones as having numeric addresses. In SIP, an end station has a SIP URI (a form of URL) that identifies it and is used in the SIP protocol. Because phones generally have numeric keypads, the phone is responsible for translating what you dial (such as extension 4094) into a SIP URL (such as sip:4094@sip.ilabs.interop.net). You can learn more about how SIP URIs, traditional telephone numbers, DNS, and IP addresses all interact in our white paper on “ENUM.”
The diagram below shows a SIP dialog involving two parties (Alice and Bob) and their SIP proxy servers, the Atlanta and Biloxi. In this case, the SIP messages have been heavily abbreviated to show the flow of traffic.
Although the diagram here shows that the proxies do not participate in the SIP protocol once Alice acknowledges that Bob has picked up the phone, not every call will work that way. A proxy may elect to “stay in the middle” of the conversation even after the call is connected to provide some mid-call features, such as conferencing services, or accounting. Note that even if the proxy is in the middle of the call, we’re still only talking about the SIP part of the call—the voice traffic will generally go directly from one phone to another once the call is set up.
Another common operation in SIP is called Registration. In our example call, this might be how the Biloxi proxy learned where Bob was located. The registration capability is especially useful in an environment where phones do not have static IP addresses (such as a DHCP environment or when a phone travels with its owner). In SIP, the registration server can be co-located with the proxy server or they could be different systems. Bob is also not limited to registering from a single location. He could have SIP phones at home and at the office that both register with the SIP server. Then, it is the responsibility of the proxy server to decide which phones to “ring” when a call comes in for Bob. With SIP, that could mean selecting a single phone to ring, or just ringing all the phones at once.
Because SIP is used for call control, features such as voice mail and auto-attendant are not part of the SIP protocol itself. Instead, they are provided by end points that send and receive calls themsleves. This means that a VoIP network based on SIP has no real parallel to the “PBX” in traditional telephony. You may hear the term “SIP Server” or “SIP PBX” used to describe the SIP proxy server, but the functionality is quite different. However, it is possible to integrate some traditional PBX features, such conferencing into a SIP proxy server. For example, the Asterisk SIP proxy server tested as part of our iLabs demonstration includes both voice mail and auto-attendant. In other cases, such as a conferencing server with its heavy digital signal processing requirements, you might want a separate dedicated device.
