Three biological replicates per condition (control vs. treated), each a separately cultured HEK293T flask passaged ≤20 times. One technical replicate (independent library) from a pooled reference RNA is included across batches to estimate technical variance. Input is fixed at 500 ng total RNA (RIN ≥ 8.0). Libraries are processed in randomized plate order to avoid positional batch effects; the operator is blinded to condition via barcoded tube labels. Single fragmentation time point (94°C, 8 min) is used to target 200-300 bp inserts. Twelve unique dual-index (UDI) barcodes allow pooling of all 6 samples plus controls on one lane.
BSL-1 for HEK293T culture (use BSL-2 practices if transfected with viral vectors). TRIzol/phenol-chloroform extraction is corrosive and toxic — perform in a chemical fume hood, wear nitrile gloves, lab coat, and goggles; collect phenol waste in designated halogenated/organic waste. Guanidinium salts must never contact bleach (releases cyanide gas). Ethanol is flammable. Dispose of biological waste per institutional BSL-1 protocol; RNase contamination control requires dedicated reagents and gloves.
Positive control: Universal Human Reference RNA (UHRR) processed in parallel to confirm library construction and provide a cross-batch anchor. Negative/no-template control: water through the full workflow to detect adapter-dimer contamination and reagent carryover (should yield no quantifiable Qubit signal and no Bioanalyzer peak). ERCC spike-in mix (1 µL of 1:100 dilution per 500 ng) as an internal quantitative control for dynamic range and strand specificity. A no-RT control on one sample confirms absence of gDNA contamination.
Final library yield 10-50 nM at 20 µL. Bioanalyzer shows a single symmetric peak with insert mode ~250-300 bp and <2% adapter dimer (~120 bp). After sequencing (PE150), >90% reads pass filter, duplication <25%, rRNA contamination <5%, exonic mapping rate >80%, and strand specificity >95% (RSeQC). ERCC log2(observed) vs log2(expected) R² > 0.92 across the dynamic range.
To construct directional (strand-specific) Illumina-compatible RNA-seq libraries from 500 ng of high-quality HEK293T total RNA via poly(A) capture, fragmentation, dUTP-marked second-strand synthesis, adapter ligation, and PCR enrichment. The protocol targets a final library insert size of 200-300 bp and a yield sufficient for 30-40 million paired-end reads per sample, enabling quantification of protein-coding and lncRNA transcripts with correct strand orientation preserved (>95% of reads assigned to the expected strand).
Independent variable: experimental condition (control vs. treated HEK293T). Dependent variables: per-gene read counts / normalized expression (TPM), library yield (nM), insert-size distribution, and percent strand specificity. Controlled variables: input mass (500 ng), RIN threshold (≥8.0), fragmentation time (8 min), PCR cycle number (held constant within batch), adapter concentration, passage number (≤20), and bead lot.
If poly(A)+ mRNA is selected, chemically fragmented, and the second strand is synthesized with dUTP followed by USER/UDG digestion prior to PCR, then only the first-strand-derived cDNA will amplify, yielding libraries in which read strandedness faithfully reports transcript orientation (expected >95% strand specificity by RSeQC infer_experiment.py), distinguishing sense from antisense transcription at overlapping loci.
Demultiplex with bcl2fastq/BCL Convert. Trim adapters with fastp. Align to GRCh38 with STAR (two-pass) or quantify with Salmon (--libType ISR for dUTP). QC with FastQC, RSeQC (inferexperiment, readdistribution, geneBody_coverage), and Picard CollectRnaSeqMetrics. Generate gene-level counts via featureCounts (-s 2 for reverse strandedness) or tximport from Salmon. Normalize with DESeq2 median-of-ratios or edgeR TMM; report TPM for visualization.
Differential expression with DESeq2 negative-binomial Wald test, n=3 biological replicates per group, Benjamini-Hochberg FDR correction at α = 0.05, with independent filtering and apeglm log2FC shrinkage. Power: with 3 replicates and a typical biological CV of 0.4, the design detects ~2-fold changes for moderately expressed genes (baseMean >50) at ~80% power. Report adjusted p-values and effect sizes; flag genes with |log2FC| > 1 and padj < 0.05.