Identifying Nucleotides By Determining Phasing

Patent No. US11676275 (titled "Identifying Nucleotides By Determining Phasing") was filed by Illumina Inc on Aug 26, 2021.

What is this patent about?

’275 is related to the field of nucleic acid sequencing and, more generally, to the analysis of image data obtained from multiple reference points. A common challenge in this field is the need to analyze vast amounts of image data generated during sequencing processes, such as sequencing-by-synthesis. Existing methods often struggle to balance the need for rapid data acquisition and storage with the computationally intensive task of image analysis, leading to bottlenecks in the overall sequencing workflow.

The underlying idea behind ’275 is to prioritize image data acquisition and storage while performing image analysis in the background. This is achieved by scheduling image analysis tasks to run when the processor is not actively acquiring or storing image data. This approach allows for efficient use of computational resources, reducing data storage requirements and improving the overall speed and accuracy of sequence data analysis.

The claims of ’275 focus on a system and method for base calling in nucleic acid sequencing . Specifically, the claims cover acquiring fluorescent emissions from labeled nucleotides hybridized to a cluster of identical nucleic acids, calculating the fraction of labeled nucleotides that are out of phase, identifying a particular base called nucleotide based on the acquired fluorescent emissions and the calculated fraction of labeled nucleotides that are out of phase, and assigning a quality score to the particular base called nucleotide based on the identification.

In practice, the invention involves a system with a processor, storage, and a program that manages the acquisition, storage, and analysis of image data. The program identifies instances where acquiring image data conflicts with analyzing existing data, resolving these conflicts by prioritizing data acquisition. This allows the system to perform tasks like template generation, registration, intensity extraction, color matrix estimation, phasing estimation, base calling, and quality scoring in a streamlined manner, even with limited computational resources.

The invention differentiates itself from prior approaches by intelligently scheduling image analysis tasks around data acquisition, rather than treating them as separate, sequential processes. This time-division multiplexing approach reduces the need for high-end computing infrastructure and minimizes data transfer requirements, making high-throughput sequencing more accessible and efficient. The system also incorporates real-time metrics and control nucleic acids to ensure data quality and accuracy.

How does this patent fit in bigger picture?

Technical landscape at the time

In the early 2010s when ’275 was filed, image analysis for sequencing applications was typically implemented using dedicated computing hardware due to the computational intensity of processing large image datasets. At a time when data storage was relatively expensive, systems commonly relied on optimized algorithms to reduce data volume and extract relevant information in real-time. When hardware or software constraints made parallel processing non-trivial, prioritization of data acquisition over analysis was a common strategy to avoid data loss.

Novelty and Inventive Step

The examiner approved the application because the closest prior art failed to teach or suggest acquiring fluorescent emissions from labeled nucleotides hybridized to a cluster of identical nucleic acids bound to a substrate; calculating the fraction of labeled nucleotides in the cluster that are out of phase; identifying a particular base called nucleotide based on the acquired fluorescent emissions and the calculated fraction of labeled nucleotides that are out of phase; and assigning a quality score to the particular base called nucleotide based on the identification.

Claims

This patent contains 25 claims, with independent claims 1, 16, and 19. Independent claims 1 and 16 are directed to a system and method, respectively, for base calling a nucleotide in a nucleic acid molecule using fluorescent emissions and calculating the fraction of out-of-phase labeled nucleotides. Independent claim 19 recites a formula for calculating the fraction of out-of-phase nucleotides. The dependent claims generally elaborate on the system and method, providing details and variations of the base calling process, including specific components, calculations, and quality score assignments.

Key Claim Terms New

Definitions of key terms used in the patent claims.

Term (Source)Support for SpecificationInterpretation
Cluster of identical nucleic acids
(Claim 1, Claim 16)
“The array can include a single copy of a target nucleic acid at each site (also referred to as a feature) or multiple copies having the same sequence can be present at each site or feature. Multiple copies can be produced by amplification methods such as, bridge amplification or emulsion PCR as described in further detail below.”A group of nucleic acid molecules that have the same sequence.
Fluorescent emissions
(Claim 1, Claim 16)
“In pyrosequencing, released PPi can be detected by being immediately converted to adenosine triphosphate (ATP) by ATP sulfurylase, and the level of ATP generated is detected via luciferase-produced photons. The nucleic acids to be sequenced can be attached to features in an array and the array can be imaged to capture the chemiluminescent signals that are produced due to incorporation of a nucleotides at the features of the array.”Light emitted from labeled nucleotides that are hybridized to a cluster of identical nucleic acids bound to a substrate.
Fraction of labeled nucleotides that are out of phase
(Claim 1, Claim 16)
“The methods and systems provided herein can assume that a fixed fraction of molecules at each feature become phased and/or pre-phased at each cycle, in the sense that those molecules fall one base behind in sequencing. Thus, in a preferred embodiment, a phasing estimation is performed to adjust the observed intensities in a way that reduces the noise created by phased molecules.”The proportion of labeled nucleotides in a cluster that are not synchronized with the main sequencing reaction, either lagging behind (phased) or jumping ahead (pre-phased).
Labeled nucleotides
(Claim 1, Claim 16)
“In particular embodiments, each cycle involves simultaneous delivery of four different nucleotide types to the array and each nucleotide type has a spectrally distinct label. Four images can then be obtained, each using a detection channel that is selective for one of the four different labels.”Nucleotides that have been modified to emit a detectable signal, such as fluorescence.
Quality score
(Claim 1, Claim 16)
“Quality scoring refers to the process of assigning a quality score to each base call. Accordingly, presented herein are methods and systems for evaluating the quality of a base call from a sequencing read. In some embodiments, the methods can comprise the steps of: (a) calculating a set of predictor values for the base call; (b) using the predictor values to look up a quality score in a quality table.”A value assigned to a base call that indicates the probability of error in the identification of the nucleotide.

Patent Family

Patent Family

File Wrapper

The dossier documents provide a comprehensive record of the patent's prosecution history - including filings, correspondence, and decisions made by patent offices - and are crucial for understanding the patent's legal journey and any challenges it may have faced during examination.

  • Date

    Description

  • Get instant alerts for new documents

US11676275

ILLUMINA INC
Application Number
US17445994
Filing Date
Aug 26, 2021
Status
Granted
Expiry Date
Apr 7, 2031
External Links
Slate, USPTO, Google Patents