Generating Ground Truth For Machine Learning From Time Series Elements

Patent No. US10997461 (titled "Generating Ground Truth For Machine Learning From Time Series Elements") was filed by Tesla Inc on Feb 1, 2019.

What is this patent about?

’461 is related to the field of machine learning, specifically techniques for improving the training of machine learning models used in autonomous driving systems. The background involves the challenge of creating high-quality training datasets for deep learning systems, which traditionally requires significant manual effort in collecting, curating, and annotating data. This is especially difficult for edge cases where the model needs the most improvement, creating a need for automated training data generation.

The underlying idea behind ’461 is to leverage a time series of sensor data to automatically generate ground truth data for training machine learning models. Instead of relying on single data points, the invention uses a sequence of sensor readings (e.g., camera images, lidar data) captured over time to build a more complete and accurate representation of the environment. This is achieved by combining the most reliable information from different points in the time series to create a composite ground truth.

The claims of ’461 focus on a method, computer program product, and system for training a machine learning model. The core process involves receiving sensor data as a group of time series elements, determining a ground truth for the entire group by identifying respective portions of individual time series elements, and then training the model using a dataset comprising this ground truth and a *single*, selected time series element. The trained model then predicts the ground truth based on that single element.

In practice, this allows the system to learn to predict complex environmental features, such as the three-dimensional trajectory of a lane line, from a single image. The system analyzes a video sequence of a vehicle driving along a lane, combining information from multiple frames to create a highly accurate 3D representation of the lane line. This composite 3D lane line then becomes the ground truth, which is paired with a single frame from the video to train the model. The model learns to infer the complete 3D lane line from just that one frame.

This approach differentiates itself from traditional methods by automating the ground truth generation process and improving the accuracy of the training data. Instead of manually labeling data, the system uses the time series to create a more robust and complete representation of the environment. This is particularly useful for features that are difficult to capture in a single frame, such as occluded objects or distant lane markings. By training on this automatically generated, high-quality data, the machine learning model can achieve improved performance and robustness in autonomous driving applications.

How does this patent fit in bigger picture?

Technical landscape at the time

In the late 2010s when ’461 was filed, deep learning systems were being increasingly used for autonomous driving, at a time when training these systems typically involved manually curating and labeling large datasets. When hardware or software constraints made the accurate and efficient generation of training data non-trivial, systems commonly relied on labor-intensive manual annotation processes.

Novelty and Inventive Step

The examiner approved the application because prior art, specifically Vallespi-Gonzalez and Parchami, did not teach generating a training dataset that includes both a single time series element and a ground truth derived from a group of time series elements. Vallespi-Gonzalez only accessed training data that already included ground truth, and Parchami used input images combined with prediction images to generate a cognitive map, rather than using a selected time series element to output ground truth for a group of time series elements.

Claims

This patent includes 20 claims, with claims 1, 19, and 20 being independent. The independent claims are directed to a method, a computer program product, and a system for training a machine learning model using sensor data and a determined ground truth. The dependent claims generally elaborate on and refine the elements and features of the independent claims.

Key Claim Terms New

Definitions of key terms used in the patent claims.

Term (Source)Support for SpecificationInterpretation
Ground truth
(Claim 1, Claim 19, Claim 20)
“For example, a ground truth is determined by examining the most relevant portions of each element of the group of time series elements including previous and/or subsequent time series elements in the group. The determined ground truth may be a three-dimensional representation of a vehicle lane line, a predicted path for a vehicle, or another similar prediction. An element of the group of time series elements is selected and associated with the ground truth.”A reference or standard of accuracy determined for a group of time series elements, used for training a machine learning model.
Machine learning model
(Claim 1, Claim 19, Claim 20)
“The training data set is used to train a machine learning model for generating highly accurate machine learning results. Using the trained machine learning model, a neural network can infer features associated with autonomous driving such as vehicle lanes, drivable space, objects (e.g., pedestrians, stationary vehicles, moving vehicles, etc.), weather (e.g., rain, hail, fog, etc.), traffic control objects (e.g., traffic lights, traffic signs, street signs, etc.), traffic patterns, etc.”A computational model trained using a training dataset to output a ground truth based on an input of a selected time series element.
Selected time series element
(Claim 1, Claim 19, Claim 20)
“An element of the group of time series elements is selected and associated with the ground truth. The selected element and the ground truth are part of the training dataset. As an example, training data is created for predicting a three-dimensional representation of a vehicle lane using only a single image.”A single element from the group of time series elements that is used along with the ground truth to form the training dataset.
Time series elements
(Claim 1, Claim 19, Claim 20)
“For example, a group of time series elements may include a group of images captured from a camera sensor of a vehicle over a time period. In some embodiments, a training dataset is determined including by determining for at least a selected time series element in the group of time series elements a corresponding ground truth based on a plurality of time series elements in the group of time series elements.”A set of data points indexed in time order, captured by sensors on a vehicle over a period of time.
Training dataset
(Claim 1, Claim 19, Claim 20)
“In some embodiments, a training dataset is determined including by determining for at least a selected time series element in the group of time series elements a corresponding ground truth based on a plurality of time series elements in the group of time series elements. An element of the group of time series elements is selected and associated with the ground truth. The selected element and the ground truth are part of the training dataset.”A set of data used to train a machine learning model, comprising a determined ground truth and a selected time series element.

Litigation Cases New

US Latest litigation cases involving this patent.

Case NumberFiling DateTitle
2:25-cv-00742Jul 23, 2025Perceptive Automata Llc V. Tesla, Inc.

Patent Family

Patent Family

File Wrapper

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.

  • Date

    Description

  • Get instant alerts for new documents

US10997461

TESLA INC
Application Number
US16265729
Filing Date
Feb 1, 2019
Status
Granted
Expiry Date
Feb 18, 2039
External Links
Slate, USPTO, Google Patents