Stool DNA from study subjects (one library per subject), processed in randomized plate position with at least one extraction blank and one no-template PCR control per 96-well plate, plus one ZymoBIOMICS mock community per plate. First-PCR cycle number is fixed at 25; index PCR at 8 cycles. Input is standardized to 12.5 ng genomic DNA per first PCR. Each plate uses a distinct dual-index set to prevent cross-run barcode bleed. The bioinformatician is blinded to subject metadata during denoising/QC.
BSL-2 for human stool and stool-derived DNA (potential enteric pathogens). Handle raw stool and extractions in a biosafety cabinet, wear gloves, lab coat, and eye protection; decontaminate with 10% bleach then 70% ethanol. Bead-beating can aerosolize — keep tubes sealed and open in the cabinet. Ethanol and SPRI reagents are flammable/irritant. Autoclave or incinerate biohazardous waste; never mix bleach with guanidinium-based lysis buffers.
Mock community (ZymoBIOMICS) as a positive control to validate taxonomic accuracy and quantify bias. Extraction blank (buffer-only through DNA extraction) to capture kit/lab contaminants. No-template PCR control (water instead of DNA) to detect reagent/primer contamination. A high-biomass positive sample anchors run-to-run consistency. PhiX spike-in provides base-diversity for the low-complexity amplicon run. Index-hopping is monitored via unused barcode combinations.
Clean ~550 bp first-PCR band, no band in NTC, final library ~630 bp. Mock community recovers all expected taxa with relative abundances within ±15% of the theoretical values. Per-sample read depth >20,000 merged reads, Q30 >70% on R2 (challenging at 2x300), and contaminant ASVs (from blanks) constitute <2% of high-biomass sample reads. Pooled library quantified accurately by qPCR before loading.
To amplify the bacterial 16S rRNA V3-V4 hypervariable region (~460 bp) from human stool extracted DNA using locus-specific primers bearing Illumina overhang adapters, followed by a dual-index PCR to attach unique sample barcodes and sequencing adapters. The protocol targets balanced, contamination-controlled amplicon libraries suitable for paired-end 2x300 MiSeq sequencing and amplicon sequence variant (ASV)-level taxonomy.
Independent variable: subject/sample group. Dependent variables: ASV relative abundances, alpha diversity (Shannon, observed ASVs), beta diversity distances, read depth per sample, and contaminant ASV fraction. Controlled variables: input DNA mass (12.5 ng), primer set and concentration, first-PCR cycles (25), index-PCR cycles (8), polymerase lot, PhiX fraction, and loading concentration (8-10 pM).
If the V3-V4 region is amplified with a limited-cycle first PCR and contamination is tracked with extraction-blank and no-template controls plus a defined mock community, then the resulting libraries will faithfully recover the input community composition (mock community within ±15% of expected relative abundances) while flagging and permitting removal of reagent/lab contaminant ASVs.
Demultiplex on the MiSeq. Process with QIIME2 or DADA2: primer-trim (cutadapt), quality-truncate R1/R2 by quality profiles, denoise to ASVs (DADA2), merge pairs, remove chimeras. Assign taxonomy against SILVA 138 or GTDB with a naive Bayes classifier. Remove contaminants identified by the decontam package using the extraction blanks (prevalence/frequency method). Compute alpha/beta diversity; rarefy or use compositional (CLR) transforms.
Differential abundance with ANCOM-BC or DESeq2 on ASV/genus counts, with Benjamini-Hochberg FDR at α = 0.05 to control multiple comparisons across taxa. Beta-diversity differences tested by PERMANOVA (adonis2, 999 permutations) on Bray-Curtis/UniFrac distances; alpha diversity compared with Wilcoxon/Kruskal-Wallis. Report effect sizes and adjusted p-values. Power depends on sample size and effect; pre-specify n to detect target diversity differences.