Research

The ultimate project objective is to enable tele-immersive meetings or entertainment (e.g., online gaming) anytimeanywhere and on any device. This ambitious goal suggests that scalabilityinter-operability, and global optimum are the key properties desired for the developed system and algorithms. More specifically, our targeted solution should be able to cope with various system dynamics in real-life applications, while achieving the best possible performance and functionality that promote and advance tele-immersive user experiences. These system dynamics are, for instance, as follows,
Mainly focused on low-cost, commodity capturing and computing setups, the ITEM project primarily researches on three pillar areas, where the inter-connection is actively explored and interweaved. These three main research areas and some example subtopics are listed below.
  1. Computer vision and image understanding
    • Efficient camera calibration for various sensors
    • Low-resolution, noisy depth video enhancement
    • Stereo matching, optical flow estimation, Structure-from-Motion (SfM)
    • Video object cutout/matting, object tracking
    • 3D reconstruction of objects and environment
  2. Video/data representation, compression, and communication
    • Object-based video coding and delivery
    • View synthesis-driven color-plus-depth video coding
    • Multi-view, 3D video coding and delivery
    • Multi-point video conferencing system
  3. Graphics and human-computer interactions
    • Free-viewpoint image synthesis
    • Computational photography, image relighting
    • Non-photorealistic Rendering (NPR)
    • Interactive video object manipulation
    • Gesture-controlled multimedia content navigation