Skip to Content
 
Logo of Marquette University BIEN 167 Module 3 Telerehabilitation

Telecommunication Technologies of Telerehab

Outline Universal Access Models of Tele-Encounters Tele-Technologies Telerehab Apps

 

Tele-Technologies of Telerehabilitation

This section provides an overview of the telecommunication technologies that relate to telerehabilitation applications, the a primary focus on conferencing standards.

ITU's Videoconferencing Standards

The impact of teleconferencing standards by the International Telecommunications Union (ITU) has been considerable. Since introduced in roughly 1996, costs for products have come down dramatically while quality has improved. Systems have become considerably easier to use, and designed more an consumer products.

These standards span both the classic phone line infrastructure (H.320 for moderate and higher bandwidth, H.324 for lower bandwidth videophones) and the packet-based Internet Protocol (IP) infrastructure (H.323, SIP). Each of these standards is really a specification of a collection of protocols, especially the H-series for audiovisual and multimedia systems (e.g. H.32x for videoconferencing, H.26x for video codecs, H.28x for remote device control, H.233-5 for security/confidentiality/encryption, H.350.x for director services architecture, H.450.x for call service features) and the G-series for audio codecs (e.g. G.72x). For a sample of conferencing solutions see the telemedical site.

  • Audio codecs range from (see also Module 2, Section 3):
    • the more basic G.711 (pulse code modulation requiring 48-64 Kbps of the connection), very low time delay (under 1 ms)
    • the newer higher quality G.722 (7 KHz voice with 64/32/16 Kbps) that focus on quality of speech and low time delays (under 2 ms)
    • the clever codecs with more aggressive compression such as G.723.1, (multimedia speech coder at 5.3 or 6.3 Kbps, well within the cell phone range of 8-11 Kbps), but with more time delay (60-100 ms)
    • G.728 (coding at 16 Kbps using low-delay linear prediction), under 2 ms time delay.
  • Video codecs include (see also Module 2, Section 2):
    • H.261 DCT (discrete cosine transform) algorithm
    • H.263 Improved DCT algorithm
    • H.264 Even more improved algorithm, still with DCT base

Each overall videoconferencing standard defines messaging protocols that govern transmission of audio, video and data between two (or more) systems, and specifies additional protocol standards for video and voice codecs, security, privacy, and multiplexing and data control. Often they share some features, such as RTP/RTCP) for real-time messaging control and CIF (352 x 288 pixels) or QCIF (176 x 144) or the standard TV resolution of 4CIF (704 x 576).

Each is periodically updated. An example of this is the recent H.264/MPEG-4 advanced video coding standard, an outgrowth of a concerted joint effort by two key international bodies developing video coding standards: the H.26x video coding standards group through ITU and the MPEG-4 computer multimedia transmission/storage standards group through ISO/IEC. It provides higher quality video at lower bit rates (same video quality at roughly half the bandwidth) and better error resilience. This has significantly improved quality-of-service, especially for broadband (e.g., LAN, cable-modem, DSL) but also for phone-based connections, and tips the scales to IP-based solutions. All of the top videoconferencing manufacturers, such as Polycom, VCON and Tandberg, are evolving to products that support both the H.323 and SIP and use the H.264 protocol. Similarly, the top digital video products used for streaming multimedia, multicasting or playing DVD movies, have or are migrating to the very flexible new MPEG-4 standard. As an example of this impact, the Mobile Usability Lab (MU-Lab) of the RERC-AMI abandoned the need to maintain protocols for “lower quality” and “higher quality” digital video storage (related to length of trials and concerns of file size), simply because the new MPEG-4 standard provides high quality storage even with relatively smaller files.

Advantages of conferencing/multimedia standards:

  • gathers expertise to reach consensus, discuss and lend support to strong technologies,
  • provides core interoperability between products (e.g., communication across platforms and networks),
  • provides compatibility of components and applications of varying sophistication,
  • provides a degree of quality control and performance management (depending on whether performance measures are involved),
  • provides improved reliability,
  • may provide customer confidence and a form of consumer empowerment.

Possible disadvantages:

  • consensus standards require a"common ground" and typically provide a set of minimal specifications that may be below the best in the field,
  • may discourage innovation,
  • requires an infrastructure to periodically update,
  • it can be difficult to remove "obsolete" standards.

It is common for companies to try to add value to their product that is over-and-above the standard. This helps explain why video and sound quality is often worse when products for different manufacturers connect during a teleconference - they connect so as to met the minimal standard, without value-added features.

Our Telerehab and Performance Assessment Lab has multiple examples of each of these standards, and you have an opportunity to experience systems supporting each of these standards.

See also our Glossary of Conferencing Terms.

H.320 Videoconferencing

The International Telecommunication Union's (ITU's) H.320 standard is applied mostly to dedicated circuit-based switched network (point-to-point) connections of moderate or high bandwidth, such as through the medium-bandwidth ISDN digital phone protocol or a fractionated high bandwidth T1 lines.

  • Bandwidth. The most common media is ISDN. Each 128Kbps ISDN line requires use of 2 phone lines (each 64Kbps) in parallel. The industry norm is to use 3 ISDN lines (i.e. connect at 3x128Kbps = 384 Kbps), which is also a quarter of a 1.44 Gbps T1 line. This normally gives near-TV level quality (30 fps, CIF or 4CIF image). We have 4 ISDN lines in one two rooms with Polycom systems, and for the demo you saw how clear both sound and video are at this bandwidth. There is thus no reason to go above 512Kbps for videoconferencing, or for H.320, below 128Kbps.
  • H.320 video. Support for the ITU H.261 standard is required, and H.263 is optional. In addition to these rather conservative but effective (for higher bandwidth) codecs, the default for the standard in image quality is CIF (354x288 pixels with color, though "high definition" 4CIF of 708x576 is also supported), and for frame rate is 30 fps (but can be 15 fps or values between these extremes such as 20 fps). The lower frame rates are more relevant if the connection is only 128 Kbps or 256 Kbps. The standard also support multiple cameras, and control of zoom/pan/tilt for both local and remote cameras.
  • H.320 audio. Targets targets voice (e.g., 50 Hz to 4 KHz). Codecs include the more basic G.711 (pulse code modulation requiring 48-64 Kbps of the connection) and the newer higher quality G.722 (7 KHz voice with 64/32/16 Kbps). Clever lower-bandwidth codecs with more aggressive compression (such as G.723.1 at 5.3 or 6.3 Kbps and G.728 (coding at 16 Kbps using low-delay linear prediction)) are more relevant for H.324 systems.
  • H.320 shared data. The optional T.120 standard includes a shared whiteboard, chat, and file transfer. See the H.323 discussion for more details.
  • H.320 control features. In addition to zoom/pan/tilt, there are connection protocols, prioritization for bandwidth distribution between audio, video and data (audio has highest priority), and specifications for the structure of multipoint connections.

Multipoint connections distribute equal bandwidth. For instance, with our 512Kpbs capacity, we can coordinate a five site conference in which each connection is at 128Kbps, or a three-site conference at 256Kbps (as long as each of the two other sites can handle 256Kbps).

Throughout the 1990s, federal grants programs helped support the implementation of T1 hub-spoke networks to rural communities, primarily for civic, educational and health needs. These were initially very expensive, and because of lack of interoperability between major vendors each state tended to pick one company. As systems H.320 emerged, this all changed. The H.320 protocol now sees widespread use for conferencing of all types, ranging from telemedicine consultations between a tertiary and rural hospital to our routine use of H.320 for our RERC meetings. It is the pillar for most hub-spoke networks.

For telehealth applications, an especially useful feature is both local and remote control of cameras -- remote control by specialists is part of the guidelines for obtaining reimbursement from the United States’ Medicare program.

Virtually all of the newer H.320 products are also H.323 compliant, and can facilitate transmission of one or more channels of data (e.g., signals, images, records, presentations). This is true, for example, with our Viewstation MP systems (Polycom).

H.324 Videophones

The ITU’s H.324 protocol, intended for “videophones” that use POTS (plain old telephone service) phone lines (available in 97% of US households). Products meeting this standard are not much more difficult to use than a high-end phone or a VCR. The main user-controlled feature being interactive control of the tradeoff between refresh rate (e.g., typical range of 1 to 15 fps) and image quality.

Here is a list of features:

  • Most function as either a stand-alone appliance with an integrated simple set-top, use TV or PC as monitor, remote control digital pan/tilt/zoom.
  • Control operations are through either a telephone keypad or remote control.
  • Most have an interactive on-screen display for control over settings.
  • The normal actual bandwidth for such applications is low, typically 31-33 Kbps (with 28.8 or 33.6 Kbps internal "V.80" modem). In this range, there is no way for high-quality video whenever motion is occurring.
  • Most use RCA video-in for remote camera, support both NTSC and PAL formats and auto-answer features, and have a built-in 33.6 Kbps modem.
  • The maximum frame rate is 15 fps, typically with a selectable range from about 1-15 fps.
  • Cameras are CCD (1/5", 1/4", 1/3" or 1/4" CMOS).
  • Typically the number of pixels is CIF (354x288) or QCIF (176x144), but there is some variation (e.g., 480x234 is also common).
  • Most systems enable users to see both feeds (via picture-in-picture or a partitioned screen).
  • Most have electronic pan/tilt/zoom (i.e., "fake" digital zooming of the pixel base).
  • Most use either an active matrix TFT LCD display (typically 3.5" or 4" screens) that are integrated with a phone, or set-top system that use a TV monitor.
  • Most employ low-delay full duplex audio with echo cancellation (using G.723.1, a fairly aggressive speech codec at 5.3 or 6.3 Kbps)
  • Most employ the H.263 video codec.
  • There are also security, privacy, multiplexing and data control protocols.

Examples:

  • MetaEye / TeleEye / TeleVyou (Leadtek) for both TV settop and phone systems
  • InfoView (InnoMedia)
  • HyperVPhone 2000 (Aiptek)
  • StarView StarView
  • C-Station (H.324/H.320), CareStation (Motion Media) [bought out older C-Phone]
  • DV324 [was Via-TV VC150 by 8x8 Inc, a company that supplied the embedded videophone for most of the telehomecare products, butwhich has moved more towards IP/SIP solutions)]
  • Beamer Videophones (Vialta)

Our past evaluation of H.324 products (e.g., see Chapter 13 by Tran et al) suggested that while most products are interoperable (often with some effort), there is no way around the choppy nature of the transmitted video, especially if color is desirable (32,34). For home telehealth products, a small screen (e.g., 3” by 5”) is often used so that pixelation is less noticeable. Such small screens can be integrated into standard phones, with cameras integrated and/or connected by wireless means.

These videophones are usually targeted towards telehomecare applications where ease-of-use is a high priority and high-quality video is not critical. There are a number of telehealth products that integrate videoconferencing with vitals, with "telenurse" and "patient" terminals that differ.

There are now products that are H.324/H.323, which makes some sense given the emergence of cable modem / DSL service.

IP Videoconferencing: ITU's H.323 and IETF's SIP

Conferencing over the packet-based circuits, often called Internet Telephony, refers to real-time transport of multimedia telephone calls over the Internet. When only voice is transferred, it is often called Voice over IP (VoIP). VoIP is currently a very big deal as we gradually transition towards an Internet-based phone system. Advantages of IP-based conferencing include low costs (e.g., web cams are as cheap as $20, and Microsoft's client-side H.323-based MSN Messenger (and NetMeeting) packages, and their SIP-based Windows Messenger product and Real Time Communication software suite within the .Net Framework, are free). Another advantage of the complex, multi-faceted standard is the support for multi-point conferences and for interfaces to H.320 systems. But perhaps the biggest advantage is the integration with computer-based data and application sharing.

While an attractive alternative to circuit-based approaches such as those using H.320 or H.324, there are several fundamental disadvantages:

  • Quality of Service: Sending information by asynchronous IP packets is like sending a letter via a collection small postcards that take different paths but end up being re-assembled in order at the final destination. There is no guarantee of a timely arrival of every postcard, especially if the Internet is overloaded as can happen during certain times of the day. Specifically, CODECs multiplex audio, video and/or data information, process and compress the information, and send it as a as a series of packets which are transmitted over the Internet network to the destination address over varying pathways in the network. Upon reaching their destination, packets are reassembled, decompressed and displayed. While there is some built-in redundancy to help with lost or delayed packets, the end result is that there is not guaranteed quality and reliability of service.
  • Security/Confidentiality: Sending IP information is much easier to intercept than a dedicated circuit connection, i.e. it is a less secure medium. Thus we have firewalls, encryption approaches, etc.

Currently, the most common way to implement such calls is via the H.323 protocol, and perhaps 90% of VoIP calls use this detailed ITU-T standard that comes out of the telecommunications community. A popular alternative approach, the IETF's Session Initiation Protocol (SIP), comes out of the internet software engineering community. We will discuss both.

H.323 "describes terminals and other entities that provide multimedia communications services over Packet Based Networks (PBN) which may not provide a guaranteed Quality of Service. H.323 may provide real-time audio, video and/or data communication" (from ITU-T Recommendation H.323 V4). Notice the explicit mention of a lack of guaranteed quality of service. In fact, H.323 serves as an umbrella for a collection of other "best practice" standards.

H.323 entities are:

  • a terminal endpoint on a LAN that must support signaling/control (H.245 and H.225 (includes Q.931 and RAS)), real-time 2-way transfer and communication protocols (often called RTP/RTCP), and audio codecs (including at least G.711 audio, and usually at least one other), with optional support for video (H.261, H.263) and data (T.120) standards,
  • a gateway that serves as an interface between a LAN and circuit-switched network (e.g., IP/PSTN), and can translate communication procedures and protocols between these networks,
  • a gatekeeper that manages a zone (collection of H.323 devices), with certain required functionality: address translation, admission control, bandwidth control, and certain optional functionality related to other call control and call management features),
  • a multi-point control unit (MCU) to support 3 or more endpoints.

An important subset of H.323 is the “voice and data” mode (i.e., video is not required for compliance, but if available must meet certain standards).

To give some history, in the late 1990's, Microsoft's Netmeeting package, freely available, became a de facto "gold standard" for low-end videoconferencing over IP. It was one of 9 IP-based packages that Donal Lauderdale and myself evaluated in 1998. In addition to support for H.323, including the T.120 standard (e.g., including platform-independent support for chat, file transfer, shared white board), it provided support for application-sharing on the Windows platform. For the most part, we found the various products we evaluated to be interoperable, but typically not without some effort. In about 1998 Microsoft also embedded Netmeeting within its MSN Messenger product. While the addition of video (or applications such as powerpoint) to voice was nice, a key problem with Netmeeting was an audio time delay of about a quarter of a second. This was in part built in to their implementation of H.323, perhaps in part because of concerns of unpredictable packet delays. A summary of Microsoft's implementation of T.120 is available. The quality of video for Netmeeting and similar products was strongly a function of bandwidth, and was pretty decent for connections involving two sites on a LAN. Because Netmeeting added value to H.323 via application-sharing on Windows platforms, and also had a "minimal" implementation with an available SDK, many third-party companies provided added value to the H.323/Netmeeting protocol through features such as multi-point conferencing and more convenient phone-like calling options.

Importantly, current state-of-the-art H.323-based VoIP products do not have significant audio time delays, and there are many H.323 vendors. While most H.323 implementations are proprietary, there is also an open source forum (www.openh323.org) with which our group used to participate. It is possible to access the core of the H.323 standard from the ITU site. A good source, one of many available on the web, is the H.323 Forum. The competition with the alternative SIP protocol has helped push improvements into theH.323 often-updated standard, and it is now up to Version 4 (H.323-V4), with there being many H.323 V4 products on the market and more coming.

With Windows XP, Microsoft dropped continued development of H.323-based Netmeeting in favor of the SIP-based collection of standards, discussed below. Microsoft has explicitly embedded SIP within its new .Net framework, and SIP is used for its Windows Messenger product (versus H.323 for its MSN Messenger product). The former has much shorter audio time delays and more features. It is also embedded within the .Net Framework (essentially a library), and thus SIP becomes one of many tools that is available for Windows developers for platforms ranging from mobile (e.g., PocketPC) to desktop.

SIP is "an application layer signaling protocol that defines initiation, modification and termination of interactive, multimedia communication sessions between users." (IETF RFC 2543 SIP). It consists of:

  • a user agent that initiates, receives and terminates calls (with client/server model)
  • a proxy server that relays call signaling, a SIP redirect server, and a SIP registrar that accepts requests and maintains user's whereabouts

SIP (IETF FC 2543) is thus a simpler control protocol which uses textual (ASCII commands) client-server model to create, maintain, modify or terminate multimedia sessions with one or more participants. It is intended to be used as a software module for managing sessions, similar to the strategy of HTTP protocol for the web or SMTP protocol for email. To the programmer, SIP is a toolbox that turns a telephony or multimedia session into a web application that can integrate with other Internet services. In considers user location, user capabilities, user availability, call setup and call handling. A nice summary is available from the SIPCenter, and to get a sense of the degree of commercial activity see the SIPCenter main page.

SIP differs from H.323 at a fundamental level: in SIP, the " intelligence" in distributed out to clients (i.e., their computers) in a more distributed architecture, as opposed to the H.323 model of an intelligent central coordinating site surrounded by "dumb" terminals.

Collectively, H.323 and SIP provide a a suite of standards (see also VoIP standards Reference page at protocols.com. The current "killer application" is not videoconferencing, but VoIP, or what is now often called simply a "digital phone." Billions of dollars rest on who and what coordinates a phone connection. Many believe that there will be a gradual shift away from standard circuit-based PSTN toward an packet-based Internet system, once consumers are convinced that quality-of-service issues can be addressed. This ha largely happened. Indeed, Marquette's telephone backbone is typical: we depend on products from Cisco and Siemens, and both of these companies have products for both the H.323 and SIP standards, and well as traditional digital/analog circuit-based phones. Packet-based approaches are starting to do well at large companies with their own controlled LAN environment. Siemens in particular has come to campus to try to convince Marquette to upgrade its phone network to these new hybrid, multi-standard phones.

Of note is that while most H.323 and SIP system make use of computers, there are "phone" systems that do not require the user to own a computer (e.g., mm146 from Motion Media for H.323 over cable/DSL, DV325 from 8x8 for SIP). In our own lab R&D where we've wanted to integrate our Intelligent Telerehabilitation Assistant (ITA) with videoconferencing capabilities that include mobile (PocketPC) capabilities, we originally spent some time understanding the H.323 standard, then switched to SIP.

Wireless Technologies - General Concepts

Reading material: part of Chapter 11 (Winters), pp. 100-103

  • Within wireless volume vs line-of-sight (e.g., IRDA standard)
  • One-way vs two-way telecommunication protocols
  • Signal strength typically drops with distance
  • Usability analysis
  • Wireless as an "enabling" technology

Classification scheme being used in the IEEE wireless standards process:

  • wMAN (metropolitan-area networks or wide-area WANs)
    • Cell phone technologies
    • "2-G" - digital
      • low-bandwidth voice, 8-14 Kbps voice, 900 MHz / 1.7 GHz
    • "2.5G" - GSM/GPRS (short text messaging, email, pictures), WAP/XML
      • data: 16-64 Kbps (pay by MB or time or flat?)
      • phone or PC cards
    • "3G" - still "coming"
    • Discussion: How needed?
  • wLAN (wireless local area networks)
    • License-free bands: 900 MHz, 2.4 GHz (ISM band)
    • ~ = IEEE 802.11b/g standards (e.g., WiFi)
    • Medium-bandwidth (up to 11 Mbps for 802.11b and 54 Mbps for 802.11g, function of distance)
    • Coverage volume ~ size of a small house (30-1000 m)
  • wPAN (wireless personal area networks)
    • Bluetooth - “Short-range 2-way radio” (10 m, e.g. in room, vehicle)
    • License-free 2.4 GHz ISM band
    • Person-centered voice/data "radios" (up to 721 Kbps for data)
    • Low-power
    • Distributed "radios" that spontaneously network
  • Integrated MAN-LAN-PAN Systems
    • Example: PocketPC
    • Discussion: Wearable devices?

Existing Medical Telemetry Products

  • Medical devices: most communicate via a proprietary system
    • Many operate over selected VHF or UHF one-way broadcast channels
    • Wireless Medical Telemetry Service (WMTS)
      • UHF TV channel 37 (608-614 MHz)
    • Medical Information Bus (IEEE 1073) collection of cross-product interoperability standards
  • Medical Informatics/IT/Electronic Healthcare Records
    • Most use IEEE 802.11b (e.g., hospitals wings at Zablocki VAMC)
    • Dealing with HIPAA challenging but working
  • Tradeoffs
    • Supports non-medical use?
    • Standards-based transmission?
    • Supports Voice, PDAs?
    • Uses hospital IT infrastructure?
    • HIPAA / confidentiality / security?
    • Range?
    • Implementation cost?
    • World-wide products?

Multi-Node Collaborative Conferencing

Recent years have seen a remarkable evolution in networking capabilities, Internet use, available bandwidth and new multimedia standards. A consequence is that there is more potential for multimedia connectivity between people than was previously possible. This includes the formation of "virtual communities" for This section briefly reviews some of the approaches and terminology for such connectivity, which could potentially change the how healthcare services are delivered, and indeed the field of clinical rehabilitation.

A gateway works as a mediator between IP and ISDN protocols. It addresses the occasional need for connecting calls from registered end point nodes (or terminals) to remote parties with H.320 and H.323 protocols. While perhaps less of an issue now that so many products support the H.320 and H.323 protocols, it clearly is valuable. In the Telerehab lab we have access to Gateways both through our MXM software from VCON and our TMS software from Tandberg. The process involves having the caller must first register with the gateway administrator. There are also security settings and call control considerations. To date, we've had limited experience with establishing gateways simply because the need hasn't been there.

A Multipoint Control Unit (MCU) is used for connecting registered end point nodes in a multipoint videoconference. Such conferences may be ad hoc (others invited) or dedicated (pre specified, specific end points). A conferencing bridge is a structured MCU that enables initiation and management of multipoint videoconferences and often of multicast streaming conferences. It may include value-added features such as voice-activated switching (where participants see the video of the speaker whose audio signal is the strongest) or continuous presence (where certain participants are always seen and/or heard).

A gatekeeper is used to connecting end points over WANs, e.g. it manages end point information like node directories and permission levels, performance tasks such as bandwidth allotment and service availability, and perhaps practical administrative functions such as event logs and billing. At the simplest level, a gatekeeper might only support IP-based connectivity. More generally, for a sophisticated gatekeeper many of the end point nodes may have different properties (e.g., mix of H.320, H.323, SIP; each with different bandwidth capacity). Often gatekeepers interact with each other.

| Telerehab Outline | Telerehab History |

©2003-2004 Jack Winters ... BIEN 167 Home