Patent No. US11563790 (titled "Motion-Based Frame Rate Adjustment For In-Person Conference Participants") was filed by Zoom Communications Inc on Jan 31, 2022.
’790 is related to the field of video conferencing, specifically addressing the problem of inconsistent video quality due to varying degrees of motion among conference participants. Traditional conferencing systems often capture video streams at a uniform frame rate, which can lead to choppy video for participants who move frequently, while simultaneously wasting resources on participants who remain relatively still. This patent seeks to optimize video quality and resource utilization by dynamically adjusting frame rates based on individual participant activity.
The underlying idea behind ’790 is to analyze an initial video stream to identify regions of interest corresponding to individual conference participants. The system then determines appropriate frame rates for each region based on factors such as motion detected within that region. By capturing or adjusting video streams for each participant at a tailored frame rate, the system aims to improve the overall viewing experience while minimizing the computational and network overhead.
The claims of ’790 focus on a method, a non-transitory computer readable medium, and an apparatus that identify regions of interest within an initial video stream, instruct a video capture device to capture video streams for these regions at an initial frame rate, determine individual frame rates for each region based on participant-specific information, and then adjust the frame rates of the captured video streams accordingly. This dynamic adjustment ensures that each participant's video is rendered at an optimized frame rate.
In practice, the system would first use object detection algorithms to identify and isolate individual participants within a conference room. It would then analyze the video stream for each participant, measuring motion based on pixel changes between frames. Participants exhibiting high levels of activity would be assigned higher frame rates, while those with minimal movement would receive lower frame rates. The system then adjusts the frame rates of the video streams to match these determined values, ensuring a smoother viewing experience for remote participants.
This approach differentiates itself from prior solutions by implementing adaptive frame rate control at the individual participant level. Instead of applying a uniform frame rate to the entire conference room, ’790 tailors the frame rate to each participant's activity. This allows for a more efficient use of network bandwidth and processing power, as resources are allocated where they are most needed. The system can also incorporate other factors, such as participant priority or conversational context, to further refine the frame rate determination process, leading to a more engaging and optimized conference experience.
In the early 2020s when ’790 was filed, video conferencing systems commonly relied on techniques to optimize bandwidth usage and processing power, at a time when transmitting high-resolution video streams from multiple participants simultaneously could strain network resources and client devices. At that time, dynamically adjusting video frame rates based on activity levels within different regions of interest in a video feed was a non-trivial problem, especially when systems commonly relied on computationally intensive algorithms for video analysis and processing.
The examiner approved the application because the prior art of record, specifically Wang, fails to disclose or suggest instructing a video capture device to capture a first video stream associated with a first region of interest and a second video stream associated with a second region of interest at an initial frame rate, and then adjusting the frame rates of both streams independently. While Wang shows capturing two video streams from an adjustable camera, it does not show adjusting the frame rates of the two regions based on the information or activity of the two participants.
This patent contains 20 claims, of which claims 1, 9, and 15 are independent. The independent claims are directed to a method, a non-transitory computer readable medium, and an apparatus, respectively, all generally focused on adjusting video stream frame rates based on regions of interest and information associated with conference participants. The dependent claims generally elaborate on the specifics of identifying regions of interest, determining frame rates, and adjusting video streams.
Definitions of key terms used in the patent claims.
US Latest litigation cases involving this patent.

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.
Date
Description
Get instant alerts for new documents