We develop a novel tele-collaboration system with low latency and high reliability by combining three 4K60p JPEG2000 video streaming systems and a 6-channel echo canceller. Our system realizes the synchronization of multiple video streams without even single frame delay. It can transmit 4K 60p streams with one-way latency of 80 msec, so users can communicate with one another with no unnatural pauses. Using a novel highperformance multi-channel acoustic echo canceller, our system can spatially localize each user's speech to the user's displayed position. We conduct a subjective assessment of our system through confrontational role-playing tasks and find that our system helps the subjects to share their feelings and atmosphere and enhances their cooperation.