Collaborative Technology:
 Video Conferencing Gateway

Welcome
News
Programs
Workshops & Events
Jefferson Lab
Information Technology
Contact
 


SURA "Mbone"-H.323 Gateway Meeting
Final Report, 12/18/2000

M. F. Yafchak, SURA IT Program Coordinator


I. Background & Introduction IV. Conclusion
II. About the Attendees V. Next Steps
III. Evaluation Appendix A: Attendee Detail


I. Background and Introduction

Multimedia conferencing over IP is rapidly becoming the collaborative technology of choice within the R&E community. This is due to the increased exposure of the broader community to existing video and data conferencing technologies but also to the advent of new video conferencing standards and the application-focused service guarantees that are promised through the development of advanced IP networks (i.e., Internet 2). Unfortunately, as this genre of communication technology is coming of age and being more widely deployed by individuals or within specific projects, various disconnected "camps" of conferencing capabilities and approaches are emerging.

Two important examples of these are ITU H.323 standards-based video conferencing and collaborative conferencing environments being built upon what could historically be called "Mbone" conferencing tools.1 The former has its roots in the "business quality" ITU H.320 video conferencing (over ISDN and/or dedicated circuits) and has been popular for the past several years within the corporate and distance learning communities. The latter have been used for many years by those doing collaborative scientific research over the Internet while also serving as proof-of-concept applications for IP multicasting as it has been implemented and demonstrated on the Internet-based Multicast Backbone, or "Mbone". Each of these IP conferencing approaches today is continuing to evolve to meet the demands of its existing user communities while also increasing its presence (through various means and with varying degrees of intention) within the R&E community at large.

1This naming convention will continue to be used throughout this document. "Mbone-tool based" will broadly refer to "vic", "vat", "rat", "sdr", "wb", and related application components that comprise multicast-capable interactive sessions over the Mbone and current descendant IP multicast-enabled networks, either physical or virtual.

When comparing the two approaches above, aspects of technical merit as well as practical utility (who can do what with the application; what is the extent of user need and demand) must be considered.

The Mbone tools have their origins in the Internet community and are thus generally well behaved in an inter-organizational IP environment. They also include and rely on widely accepted Internet standards for many application components. In addition, Mbone applications are based on open source code, which is definitely a popular and valued concept within the Internet community. The fact that the source code is accessible for modification and promotes sharing of those modifications has served as a catalyst for continuous feedback, development, and ready availability. H.323 conferencing began only a few years ago with the business and professional service communities' desire to switch from relatively expensive fixed bandwidth conferencing to the more flexible and affordable network transport offered by IP.

H.323 is not a particular application but rather a standard that is open and designed to promote interoperability between different vendors' applications by specifying a mandatory level of compliance. Some developer/vendors also sell H.323-compliant software stacks and/or development kits, however, these are most typically offered as a commercial product. (A notable exception to this is the relatively new effort,"OpenH.323". Information about this effort is available on the Internet at http://www.openh323.org.) Because H.323 has its roots in the business and professional conferencing communities, most H.323-compliant products are focused on providing a uniform and broadly acceptable level of conference quality. (Mbone video conferencing applications integrate the same standards for audio/video compression but have not typically been subjected to the same market pressures for a consistent quality that meets previously established user expectations). Also because of historical roots in the commercial and corporate sectors, H.323 implementations tend to behave better in intranet environments rather than on the public Internet (though "Internet savvy" is improving) since Internet use was not emphasized in the earliest versions of the standard.

As collaborative environments and virtual presence evolve more fully, it is highly likely that such environments will incorporate functionality and/or concepts borrowed from both the Mbone tools-based and H.323 implementations of today combined with future developments and adaptations. What is not yet clear is how long it will take for this evolution to occur. Between now and then, the increased presence of both of the aforementioned approaches to conferencing is threatening to develop isolated "islands" of communication within the R&E community. These islands may even come to exist within the same organization, depending on how new communication applications are typically introduced and coordinated.

SURA is a membership organization that represents over 50 R&E institutions within the southern United States. Many SURA members are already actively using or considering the use of video conferencing, either Mbone-tool based, H.323 compliant, or both. Because of this, SURA has a vested interest in understanding the extent, impact, and possible solutions for this problem. As a first step, SURA has undertaken initial examination of whether or not a video conferencing "gateway" could and should be developed. Such a gateway would permit one or more of each type of conferencing endpoint to interact and exchange voice, video, and data.

To examine gateway potential more closely, on September 27, 2000, SURA hosted and facilitated a meeting between key organizations representing each approach within the R&E community. Goals of the meeting included:

  • Raising key technology developers' awareness of each other and each others' work,
  • Evaluation as to the technical feasibility of developing a gateway, generic or between specific implementations,
  • Ascertainment of each organization's view on the practicality and/or necessity of gateway development and on their willingness to participate in potential collaborations.

The remainder of this document reports the results of the discussion, as observed by SURA within the meeting itself and as reported in each organization's response to a list of questions to be considered after the meeting.

[Top]

 

II. About the Attendees

In addition to SURA, the following organizations were represented at the SURA Mbone-H.323 Gateway meeting:

Representatives from the AccessGrid and VRVS were invited to represent their specific conferencing environments that are built either largely or exclusively using Mbone tool-based applications. RADVision was invited as a recognized industry leader in H.323 standards as well as product development to broadly represent the H.323 standards and product development community. Representatives from ViDe, a SURA supported initiative focused on the practical aspects of development and deployment of digital video over IP, was invited to represent the interests of the R&E user community and also to preview potential collaborative project development in this area. (See Appendix A for a list of attendees)

The meeting began with each organization introducing itself to the others and summarizing their position/focus in terms of collaborative conferencing development. Each was also asked to highlight relevant aspects of the technology that they use in current implementations, if applicable. The more detailed information below is predominantly based on each organization's self-provided description and supporting materials.

NCSA/AccessGrid

The AccessGrid's philosophy on communication - the concept of "persistent shared spaces" and a focus on group interaction - is the primary driving force behind their technology explorations and developments. The AccessGrid is composed of AccessGrid sites, which are composed of AccessGrid nodes. Desktop-level participation is not currently a concern. "We are focused on group to group collaboration. We are interested more in enabling natural interactivity among the multiple group members right now than we are in designing and deploying collaborative tools". As deployment has progressed, necessary "show-casing" of the technology (as part of large audience presentations and demonstrations) has also had an influence on development of some aspects. More information about the AccessGrid is available in the attached presentation and on the AccessGrid Web site.

RADVision

The H.323 standard is an international standard for multimedia communication rather than simply a video conferencing application. "RADVision believes that H.323 is the best solution for the videoconferencing market as of today. There are new technology and protocols being introduced to the market. Those protocols are not as mature yet as H.323 is for full-scale deployment. RADVision is pushing for better integration of the new protocols with H.323. RADVision is also looking and helping to scale the H.323 implementation to the Internet". In addition, "despite various approaches and various protocols, [RADVision believes] that the future looks optimistic for the global multimedia communications over IP. This viewpoint is based on two important technical factors: "1. All protocols use RTP/RTCP for the media transmission itself, and 2. The concept of Media Processing decomposed from the Call Control is understood and accepted today. These two facts provide a perfect ground for easier interworking between worlds and systems using different call control / call management standards and their derivations. More information about RADVision is available on the RADVision Web site.

ViDe (Video Development Initiative)
Note: At the request of ViDe, their full response is being included with this report.

ViDe is an international and multi-organizational collaboration between 13 R&E organizations. The goal of the Video Development Initiative (ViDe) is to promote the deployment of digital video in higher education by leveraging collective resources and expertise towards addressing challenges to deployment. These challenges include poor interoperability, volatile standards and high cost. "ViDe is interested in universities' use of emerging video technologies for research and instruction. ViDe is especially interested in human-communication technologies that are network intensive, or that require new network functionality. We have focused our initial interest on collaborative videoconferencing (mainly H.323, but with some focus on IP multicast) and video on demand technologies. ViDe's opinion is that the discussion between AccessGrid, VRVS and RADVision developers was a positive contribution to the research video community, and we commend SURA in taking the initiative to organize this important discussion." More information about ViDe is available on the ViDe Web site.

VRVS Project (Virtual Room Video Conferencing Service)

VRVS is currently the largest-scale Mbone tool-based collaborative conferencing service on the Internet, serving a user community of several thousand scientists worldwide. The focus is on multipoint conferencing (though point to point is supported) and resources are scheduled. In response to a demonstrated need within the user community, VRVS has already developed and deployed an H.323 gateway to their particular implementation. Other current foci include the integration of MPEG2 and H.323 technology as well as development of the next generation of collaborative tools to the current VRVS system. More information about VRVS is available in the attached presentation and on the VRVS Web site.

[Top]

 

III. Evaluation of Gateway Development

Each organization addressed the various aspects of gateway development - technicality, practicality, perceived necessity - to varying degrees in the meeting discussion and also responded directly to the following questions in post-meeting reports to SURA:

  1. Discuss your organization's views on the *technical feasibility* of developing an Mbone-H.323 conferencing gateway, both "generically" and with respect to the specific applications that were presented in the meeting. If you have particular insights into difficult and/or critical development issues, please include those.
  2. Discuss your organization's views on the *necessity and/or practicality* of developing the gateway(s) described above.
  3. Describe what your organization may be willing/capable of doing to "gateway" these technologies to each other, including your willingness to consider collaborative projects (if so, of what type?).

Excerpts of the direct responses are included below, followed by a summary on each point from SURA. This summary is based upon reflection during and after meeting discussion as well as a review of all reported responses.

On the Technical Feasibility of Gateway Development

From the AccessGrid -

"As we discussed, there is more to the Access Grid than Mbone technology. The large numbers of streams we use make interoperability more difficult than simply transcoding streams. There must be a way for H.323 users to dynamically limit the streams they will be receiving. Since we have 4 video streams per one audio stream, using the classic voice-activates-video may not make sense. These issues must be addressed in any gateway discussion.

We also use our home grown version of distributed PowerPoint which delivers high fidelity presentations to all sites with excellent slide change synchronization. This and other tools under development would need to be integrated somehow into the H.323 environment.

During the meeting we learned quite a bit about the complexity of H.323 networks, with the gateways and MCUs. It is not at all clear to me how our Virtual Venue system of organizing meetings and persistent meeting spaces maps into the h.323 phone-call model. I think a lot of the setup and connecting would have to be done by hand. Oh yes, we are mostly open source and want to keep things that way. Any solution that we contribute to must be distributable as part of our open source model."

From RADVision -

"During the presentation, RADVision realized that there isn't any real large implementation of the Mbone network. We learned that groups and organization are using the Mbone tools to build Mbone applications [but] we think that those applications are not specific to the Mbone itself. We believe that those tools can be integrated into H.323.

Technical notes specifically from Orit Levin, RADVision's Director of Product Management and also ITU-T Editor of Annex O for "Work on Internet Technologies Complimentary to H.323":

"The most straightforward and up-to-date architecture would consist of 'media server' or 'media processors' whose job would be FAST processing of RTP streams, their possible re-routing, mixing, switching. These "media processors" would be controlled from the Management/Call Control/Application Servers by standard protocols, e.g., MEGACO (H.248) or MGCP. The Management Servers themselves (detached from the Media itself) would vary from "just" configurable proprietary applications (with Web interfaces) to Mbone applications to H.323 Gatekeepers and SIP stateful proxies, enabling dynamic call establishment with capabilities negotiation. Such a Call Signaling/Management Server would be a perfect place for provisioning of both the Conference Services and the mapping of different Call Signaling approaches that may be supported by the participating endpoints...More then ever, I don't believe in a low level automatic Call Signaling Gateway - between H.323 and SIP or between H.323 and Mbone or between SIP and Mbone). I do believe that, for an Application Server using the 'third party control model', the problem of combining a call between or among end points of different call signaling type is relatively a 'trivial' issue."

From VRVS -

"Basically, everything is possible. It is just a question of resources and time. Since we already developed a Mbone/H.323 gateway embedded in our VRVS system, I would say that it is technically possible. However, a lot of development issues have been faced and others have been fixed. Here are some insights on the four main ones:

  1. Only G.711 audio coding is possible with Mbone tools (Rat/Vat) in order to have the possibility of exchanging audio with an H.323 node. Mbone tools don't support G.722, G.728, etc. One would need a general transcoding piece of software in order to have all the audio coding possible.
  2. H.323 nodes can only display one video [window] at a time [as opposed] to the Mbone tools (Vic). It is absolutely necessary to have a piece of software that sends only one video to the H.323 node (from the speaker or any kind of video [source]).
  3. H.263 video encoding is not yet fully implemented in the Mbone tools so the gateway will not support an H.323 node sending H.263 video streams.
  4. Communication bandwidth set-up is different between [the] Mbone [tools] and H.323. You just adjust bandwidth dynamically using vic (up to 3 Mbps) [as opposed] to the call bandwidth set-up (128k, 256, 384k, etc.) used with H.323. This can cause some asymmetric communication."

From ViDe -

"We observed a bit of 'apples and oranges' problem: the Access Grid is mainly a technology for immersive, multi-location environments and its multicast transport is an 'implementation detail'. Asking if it possible to reduce 20 video/audio streams into any kind of single stream overlooks the main purpose of their project. Anyone tackling an H.323-Access Grid gateway would need a clear definition of the problem they are trying to solve, and we are not convinced that one would want to attempt stuffing the entire 18' wide Access Grid environment onto a PC screen.

We were very interested to hear that none of the developers were very keen on T.120 technology as a reliable base for data sharing, and we plan to explore some of the suggested alternative directions.

Everyone at the meeting agreed that the VRVS solution was 'elegant'. The VRVS backbone was designed as a series of VRVS servers that are networked in order to bridge (tunnel) sections of network where there no IP multicast support exists. This was a practical architectural departure from the 'no server' multicast model, and this design happens to work well for incorporating H.323 users by the addition of a call setup server. The VRVS design handles the case where an H.323 client wishes to connect into the VRVS backbone. RADVision mentioned some preliminary investigations into a "gateway" that would allow a single IP multicast client to connect into an H.323 conference through a gatekeeper.

What ViDe took away from the conference was that it would be "not difficult" to handle a single multicast user coming into a gatekeeper, or a single H.323 user coming into a VRVS multicast server. Perhaps handling end users one-by-one would solve the problem of unattached islands of video conferencing end-users. Technology to connect a multicast backbone such as VRVS with an H.323 "backbone" such as ViDeNet would go beyond this approach, and we believe that two types of gateways would be required: one for multicast coming into an H.323 world, and one for H.323 coming into a type of multicast world. It should be noted that the VRVS world is a subset of the IP multicast world, in that native IP multicast employs no servers at all; to introduce multicast servers into the mix is changing the native IP multicast architecture.

ViDe observes that open source H.323 is critical to the success of any effort to create a multicast-H.323 gateway and strongly advocates the Open H.323 effort. We expect that commercial H.323/SIP communications systems will develop rapidly and we believe commercial vendors should focus on the value they add to standards-based systems, rather than attempts to capture the standard."

Summary Observations from SURA -

It appears that at least limited functionality gateways can be developed between H.323 and specific Mbone tool based conferencing implementations. "Full-featured" gateways from an application point of view - those that smoothly offer all features of one technologies "user experience" for sharing with the others - would present many technical challenges. Several areas of difficulty in gateway development are described above and others can be suggested:

  • Support for multiple audio/video algorithms and the attendant transcoding
  • H.323 being IP multicast aware and capable
  • Establishment of sessions (announcement of availability vs. "dialing" vs. continuous presence)
  • Accommodation of varying multipoint models (centralized vs. decentralized control)
  • How to manage/select multiple views or determine appropriate levels of participation for "space limited" (individual use) endpoints
  • Lip sync
  • Integration of H.323 Supplementary Services (e.g., "hold", "transfer", "forwarding" for sessions)
  • Coordinated support for expansion of optional codecs (e.g., MPEG)

Since the Mbone tool based implementations are not bound to any standard for describing how sessions are established, maintained, etc, there is more freedom on that side for gateway innovation. It is highly likely that "flexing" an H.323-compliant implementation to share full functionality with either of the existing Mbone tool based implementations discussed here would result in some violation of the H.323 standard. This is speaking strictly from the point of view of trying to bridge/duplicate the technical underpinnings of how each implementation's functionality and features are enabled today.

If one considers the concept of a "gateway" at the level of the functionality that a user sees and what that user can then do, changes in the technical underpinnings could result in a greater likelihood of gateway success. In addition, though RADVision was just being introduced to both the AccessGrid and VRVS at this meeting, it was their observation in conversations surrounding the meeting that the user functionality of the Mbone-based implementations could likely be duplicated with an H.323 implementation. This application does not exist today, reflecting the gap between what the standard defines and/or allows and what product developers have yet to produce for the marketplace.

On the Practicality/Necessity of Gateway Development & the Potential for Collaboration

From the AccessGrid -

"Because our resources are so thin and since our research here targets group oriented collaboration and we are working with others installing AG nodes, we don't feel a strong necessity to work on developing this gateway. We do believe such a gateway would be beneficial to both communities and encourage the development of it."

"We learned that Phillipe [the VRVS] has figured out a way to bridge H.323 and Mbone streams. We are interested in that technology as an enabler for solving some of the other above mentioned interoperability problems. The problem is that this work is not open source and we don't see how it can be integrated into our work. Nevertheless, if someone knowledgeable about using this work with H.323 systems wants to experiment with connecting into an AG session, we'd like to try that. Beyond that, we are also willing to try experiments on our own to see if we can duplicate the bridging [that the VRVS] is doing. If we can show that, we would have a project that someone could work on to develop a full function open source bridge."

From RADVision -

"As described before we believe that there isn't a need for an Mbone to H.323 gateway. The main reason [is that] no one uses the Mbone network as a multicast network...As a company, we would work with each one of the groups if [each side realizes] that there is a business case to work together." "Technically we believe that we can build an Mbone gateway. But since it is not really needed, we believe that the right approach is to work with individual groups and try to find a solution to [fit] their application."

From VRVS -

"Since we found that it is critical in our application community (world-wide, distributed and using both technologies), the necessity to develop this gateway arose one year ago. Today, we have reached the point where we have a way to have both technologies working together. But there is still work to be performed."

"Our organization is capable of doing this gateway. More work has to be done especially in doing the transcoding of different Audio and Video compressions. Current resources and priorities will limit us to having only a basic gateway (H.261, G.711) and [using it] with VRVS. We are certainly willing to collaborate in some project trying to build the missing pieces as explained above but not to the detriment of current VRVS development and production. It could only be done with additional resources."

From ViDe -

"Because of the different technologies, different histories and cultures of the end-user communities, and past lack of interoperability, ViDe finds it difficult to predict who would really need or want to use a multicast/H.323 gateway, but we already appreciate the desire for a single end-user with one kind of technology on their desktop to be able to participate in videoconferences that happen to take place in the other type of technology.

ViDe is strongly committed to exploring ways of effectively informing our constituents of the range of options, differences, and compatibilities between the various conferencing solutions. If there were a VRVS/H.323 freely available to ViDeNet, ViDe would use it; we would expect VRVS and/or the H.323 developer to support development of such a gateway. If there were an IP Multicast/H.323 gateway available to ViDeNet, ViDe would use it; we believe that it would be appropriate for anyone seeking to do this to go after research dollars (SURA/NSF/private) to fund its development."

Summary Observations from SURA -

Universally, "vanilla" gateway development is outside the scope of current work for each of these organizations and could not be undertaken without the application of additional resources. No one tried to dispute or conceal this fact but all expressed their appreciation at being part of the meeting and their willingness to collaborate further if the resources (and, in some cases, a specifically defined need) could be identified. In addition to this, during the meeting and through conversations that surrounded the meeting, there were several conversations between different pairings of the organizations that were present, seeking to learn more about specific aspects of the other's work and contemplating further discussion. We hope that this is a sign that SURA's first goal - that of raising awareness of each other and each other's work - was met.

[Top]

 

IV. Conclusion

What has become obvious (or more obvious) throughout the exploration of an Mbone-H.323 gateway is the simple fact that the conceptual communication model and desired scope adopted by any given developer ultimately determines their implementation, including "look and feel", functionality and features, and integration of technology components. The AccessGrid and the VRVS are each specific implementations built towards specific purposes and target audiences. The evolution of each is likely to be a balance between the vision of the developers and response to specific user demand as the "target audience" becomes the "user community". ITU H.323 is not a specific implementation but rather the description for international multimedia over IP capability. As such, the definition of even seemingly small functionality has to remain mindful of "fitting in" with the larger scope of interoperability.

These differences in scope and motivational drivers produce interesting and significant differences in how each organization is likely to approach and also value collaborative development. Whereas the AccessGrid and VRVS may have more technical flexibility in what they could develop or how they might develop it, development is directed and channeled by specific project goals and targeted user demand. H.323 standards developers are not targeting particular user groups or even particular products; they are specifying guidelines that will enable a wide variety of compliant products to inter-operate with each other. Collaborative endeavors would be less guided or restricted in terms of implementation characteristics (what users are being targeted or particular features that are desired) than by how that implementation can be achieved.

Currently, the level of activity that each of these organizations is putting towards exploration and/or development of Mbone-H.323 gateways (which may be none at this time) is fully in keeping with their own project/business objectives and scope. This is expected and understandable. Yet, all of them did indicate their willingness to participate in further discussion and possible collaborative projects in this area if the need can be demonstrated and additional resources applied. This is good news for those who believe that further gateway work is necessary. However, it also means that gateway proponents will need to develop both the plan and the required resource pool for this development and also most likely need to drive the process.

[Top]

 

V. Recommended Next Steps

Through this meeting, we have had significant affirmation of both the technical feasibility of developing a gateway (or gateways) and also the willingness of leaders in each technology area to work together. However, the degree of urgency and necessity for developing such a gateway remains unclear. The logical next step for SURA (and others with interest in the possible bridging of Mbone tool-based applications and H.323 video conferencing) is to undertake closer investigation in order to evaluate: 1) what communities would benefit from the existence of a gateway, and 2) what the specific benefits would be. Questions for this investigation include:

What is the existing and potential overlap between the user communities for each of these technologies? This question is best approached through identification of the current and intended user base for AccessGrid and VRVS deployment within the SURA region matched to an assessment of current and intended H.323 deployment within those identified user communities. Due to the fact that both the AccessGrid and VRVS have functionality that is either dependent on or enhanced via IP multicast, this survey should also attempt to measure the extent of and plans for multicast deployment for those user communities and the networks that interconnect them.

What is the desire/demand for gateway capability on the part of the overlapping user communities identified above? This question should be approached through a survey that considers institutional plans for implementation as well as direct end user acceptance and intent. The survey should also identify varying conceptions of what a gateway would look like and what functionality it might provide.

What resources would be required (time, money, expertise) to develop a gateway application and/or service that would be useful to the SURA community? In order to answer this question, a more detailed technical definition of the end product is required. This definition is highly dependent on information gained from answering the two questions above. Once the question of "who wants what?" is clear, relevant developers should be brought together again to prepare an outline for development and an estimate of the cost.

[Top]

 


Appendix A: SURA Mbone-H.323 Gateway meeting attendee list

September 27, 2000, William Young Library, University of Kentucky

 

Sponsor/Facilitator: SURA (Southeastern Universities Research Association)
Mary Fran Yafchak
IT Program Coordinator
maryfran@sura.org

Invited Attendees:
NCSA Alliance/AccessGrid Terry Disz
Experimental Systems Engineer - Futures Lab
Argonne National Laboratory
disz@mcs.anl.gov

Bill Nickless
Mathematics & Computer Science Division
Argonne National Laboratory
nickless@mcs.anl.gov

Bob Olson
Mathematics & Computer Science Division
Argonne National Laboratory
olson@mcs.anl.gov

 
RADVision Orit Levin
Director of Product Management
orit@radvision.com

Avi Moyal
VP of West Coast Operations
avi@radvision.com

Ram Ayre
Technical Solutions & New Product Research

 
ViDe Jill Gemmill
ViDe Administrative Chair
University of Alabama at Birmingham
jgemmill@uab.edu

Mary Trauner
ViDe VC Technical Representative

Georgia Institute of Technology
mary.trauner@oit.gatech.edu

 
VRVS Phillipe Galvez
VRVS Project Leader
CalTech
philippe.galvez@cern.ch

[Top]



More on Collaborative Technology Initiatives
|
More on general IT Initiatives


This Web site maintained by SURA