Predicting Three-Dimensional Features For Autonomous Driving

Patent No. US12014553 (titled "Predicting Three-Dimensional Features For Autonomous Driving") was filed by Tesla Inc on Oct 14, 2021.

What is this patent about?

’553 is related to the field of autonomous vehicle control systems, specifically those employing machine learning. These systems rely on sensor data, such as camera images, to perceive the environment and make driving decisions. A key challenge is creating accurate and robust training data for the machine learning models, especially for predicting complex features like lane lines and the paths of other vehicles.

The underlying idea behind ’553 is to improve the accuracy of machine learning models used in autonomous driving by creating training data that leverages a time series of sensor data . Instead of relying on single snapshots, the system analyzes a sequence of images and sensor readings to determine a more accurate "ground truth" for features in the environment. This ground truth is then used to train the model to predict these features from a single image, enabling more robust and reliable autonomous driving.

The claims of ’553 focus on a system, method, and computer program product that obtain sensor data from a vehicle, determine a three-dimensional feature associated with the sensor data using a machine learning model, and adjust the vehicle's operation based on this feature. The machine learning model is trained using a dataset comprising a ground truth and corresponding sensor data captured over a period of time, where the model predicts the ground truth from a single time series element.

In practice, the system captures a video sequence and odometry data as the vehicle moves. By analyzing this sequence, the system can identify lane lines, even when partially occluded or poorly visible in individual frames. The system then constructs a 3D representation of the lane line based on the most accurate data from the entire sequence. This 3D representation, the ground truth, is then paired with a single image from the sequence to create training data. The trained model can then predict the 3D lane geometry from a single image.

This approach differs from traditional methods that rely on manual annotation of individual images or simpler 2D feature extraction. By using a time series to establish a more accurate ground truth, the system can train models that are more robust to noise, occlusion, and variations in lighting and weather. The ability to predict 3D trajectories also allows for more precise lane keeping and path planning, improving the safety and reliability of autonomous driving.

How does this patent fit in bigger picture?

Technical landscape at the time

In the late 2010s when ’553 was filed, autonomous driving systems were at a time when machine learning models were being increasingly used for perception tasks such as lane detection and object recognition, at a time when significant effort was being directed towards improving the quality and accuracy of training datasets for these models, and at a time when systems commonly relied on sensor data from cameras, lidar, and radar to capture the environment around a vehicle.

Novelty and Inventive Step

Claims were rejected for indefiniteness under 35 U.S.C. 112(b) and for obviousness under 35 U.S.C. 103. The rejection under 35 U.S.C. 103 was based on a combination of Kwant et al. and Song et al. The examiner indicated that certain claims would be allowable if rewritten to overcome the rejections under 35 U.S.C. 112(b) and to include all limitations of the base claim and any intervening claims. The prosecution record does NOT describe the technical reasoning or specific claim changes that led to allowance.

Claims

This patent contains 18 claims, of which claims 1, 10, and 18 are independent. The independent claims focus on a system, a method, and a computer program product, respectively, all relating to adjusting vehicle operation based on a three-dimensional feature determined from sensor data using a machine learning model. The dependent claims generally elaborate on and refine the elements and steps recited in the independent claims.

Key Claim Terms New

Definitions of key terms used in the patent claims.

Term (Source)Support for SpecificationInterpretation
Adjust operation of the vehicle
(Claim 1, Claim 10, Claim 18)
“In some embodiments, the three-dimensional trajectory of the vehicle lane is provided in automatically controlling the vehicle. For example, the three-dimensional trajectory is used to determine lane lines and corresponding drivable space.”Controlling the vehicle based on the determined three-dimensional feature.
Determined ground truth
(Claim 1, Claim 10, Claim 18)
“In some embodiments, a training dataset is determined including by determining for at least a selected time series element in the group of time series elements a corresponding ground truth based on a plurality of time series elements in the group of time series elements. For example, a ground truth is determined by examining the most relevant portions of each element of the group of time series elements including previous and/or subsequent time series elements in the group. The determined ground truth may be a three-dimensional representation of a vehicle lane line, a predicted path for a vehicle, or another similar prediction.”A ground truth determined based on a plurality of time series elements, used to train the machine learning model. The machine learning model is trained to output this ground truth based on an input of at least a portion of the corresponding sensor data.
Three-dimensional feature
(Claim 1, Claim 10, Claim 18)
“In some embodiments, the trained machine learning model is used to predict a three-dimensional representation of one or more features for autonomous driving including lane lines. For example, instead of identifying a lane line in two-dimensions from image data by segmenting an image of a lane line, a three-dimensional representation is generated using the time series of elements and odometry data corresponding to the time series. The three-dimensional representation includes changes in elevation that greatly improve the accuracy of lane line detection and the detection of corresponding lanes and identified drivable paths.”A representation of a feature in three dimensions, used to adjust the operation of a vehicle. The machine learning model is trained to output a ground truth indicative of this feature.
Time series elements
(Claim 1, Claim 10, Claim 18)
“In some embodiments, sensor data is received. The sensor data may include an image (such as video and/or still images), radar, audio, lidar, inertia, odometry, location, and/or other other forms of sensor data. The sensor data includes a group of time series elements. For example, a group of time series elements may include a group of images captured from a camera sensor of a vehicle over a time period.”Elements captured over a period of time by sensors. A plurality of these elements are used to determine a ground truth. A particular time series element is used as input to the machine learning model.

Litigation Cases New

US Latest litigation cases involving this patent.

Case NumberFiling DateTitle
2:25-cv-00742Jul 23, 2025Perceptive Automata Llc V. Tesla, Inc.

Patent Family

Patent Family

File Wrapper

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.

  • Date

    Description

  • Get instant alerts for new documents

US12014553

TESLA INC
Application Number
US17450914
Filing Date
Oct 14, 2021
Status
Granted
Expiry Date
Dec 15, 2039
External Links
Slate, USPTO, Google Patents