Once individual cells are successfully isolated and placed into wells, droplets, or microfluidic chambers, the next critical step is to capture the mRNA molecules and convert them into complementary DNA (cDNA). This step forms the molecular foundation of the entire scRNA-seq workflow.
Poly-A Tailing and Oligo(dT) Priming
Most eukaryotic mRNA molecules have a polyadenylated (poly-A) tail at their 3’ ends. This tail allows selective targeting of mRNA using oligo(dT) primers, short stretches of thymidine (T) nucleotides.
These oligo(dT) primers are immobilized on beads or attached to surfaces (e.g., Gel Beads in Emulsion, or GEMs in 10x Genomics).
The primer often contains three domains:
- Oligo(dT) sequence to hybridize with mRNA poly-A tails
- A cell barcode that uniquely identifies the originating cell
- A Unique Molecular Identifier (UMI) that uniquely tags each mRNA molecule
For more Click here NIH Reference on Primer Design
Cell-Specific Barcoding
Each cell is assigned a unique barcode sequence, typically ~12–16 nucleotides long. This barcode ensures that after pooling and sequencing, the data from each cell can be distinguished during bioinformatic analysis.
- In droplet-based methods like Drop-seq and 10x Genomics, a single cell is co-encapsulated with a barcoded bead inside a microdroplet.
- In microwell or nanowell platforms, spatial information assigns barcodes per well.
This cell barcode is attached to every cDNA molecule generated from that cell’s mRNA, allowing demultiplexing of sequencing reads into distinct cells.
Unique Molecular Identifiers (UMIs)
A UMI is a short (typically 8–10 bp) random sequence that labels each individual mRNA molecule prior to PCR amplification.
Since PCR introduces bias—some molecules are overamplified—UMIs help quantify actual mRNA abundance by counting only unique sequences.
This approach eliminates duplicate artifacts, ensuring molecule-level resolution of gene expression.
Learn more about UMIs
Reverse Transcription (RT)
The barcoded mRNA is reverse transcribed into cDNA using reverse transcriptase enzymes. This reaction takes place:
- Either inside each droplet (in droplet-based systems)
- Or within each microwell (for plate-based systems)
During RT:
- The oligo(dT) primer is extended to form the first-strand cDNA
- The barcode and UMI become integrated into the cDNA
- Enzymes used include SuperScript II, SMARTScribe, or thermostable variants for better efficiency
SMART-seq2 uses template switching during RT to capture full-length transcripts, enabling isoform analysis. This differs from 3’-tag protocols that only capture the end of transcripts.