A method is disclosed for processing images in a conference between a plurality of video conferencing terminals. The method comprises: providing properties of at least two video conferencing terminals; defining a number of groups based on the properties of the at least two video conferencing terminals; and then, for each of the video conferencing terminals, determining an ordered list of the groups, determining a pane layout for displaying images from the video conferencing terminals, the pane layout being determined based on the properties of the video conferencing terminal on which the layout is to be displayed and the properties of the other video conferencing terminals in the conference, mapping the ordered list of groups to the determined pane layout, and providing at least one composite image stream to the video conferencing terminal according to the determined pane layout and the mapping of the groups. The method allows different types of telepresence endpoints (e.g., from different manufacturers or with different numbers of screens or cameras) to work well together in the same video conference.
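The steps of the method can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the disclosure: the `Terminal` fields, grouping by camera count, ordering groups so that those matching the viewer's screen count come first, and spreading panes evenly across screens. The sketch stops at a per-terminal plan of which sources fill which panes; actual composition of the image streams is not shown.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Terminal:
    name: str
    screens: int  # number of displays (assumed property, must be >= 1)
    cameras: int  # number of camera streams sent (assumed property)

def define_groups(terminals):
    """Group terminal names by camera count (hypothetical grouping rule)."""
    groups = {}
    for t in terminals:
        groups.setdefault(t.cameras, []).append(t.name)
    return groups

def ordered_groups(viewer, groups):
    """Order group keys for one viewer (hypothetical rule): groups whose
    camera count equals the viewer's screen count first, then larger first."""
    return sorted(groups, key=lambda cams: (cams != viewer.screens, -cams))

def pane_layout(viewer, terminals):
    """Return the number of panes on each of the viewer's screens, based on
    the viewer's screen count and how many other terminals are present."""
    others = sum(1 for t in terminals if t.name != viewer.name)
    per_screen = -(-others // viewer.screens)  # ceiling division
    return [per_screen] * viewer.screens

def map_groups_to_panes(viewer, order, groups, layout):
    """Fill panes screen by screen with sources taken in group order."""
    sources = [n for key in order for n in groups[key] if n != viewer.name]
    screens, i = [], 0
    for panes in layout:
        screens.append(sources[i:i + panes])
        i += panes
    return screens

def plan_conference(terminals):
    """Build, for every terminal, the pane assignment that a compositor
    would use to produce that terminal's composite image stream(s)."""
    groups = define_groups(terminals)
    return {
        t.name: map_groups_to_panes(
            t, ordered_groups(t, groups), groups, pane_layout(t, terminals)
        )
        for t in terminals
    }

plan = plan_conference([
    Terminal("A", screens=3, cameras=3),
    Terminal("B", screens=1, cameras=1),
    Terminal("C", screens=1, cameras=1),
])
```

In this example, the three-screen terminal `A` would show `B` and `C` on separate screens, while the single-screen terminals `B` and `C` would each receive one composite with two panes. Other grouping or ordering rules could be substituted without changing the overall flow.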