rdoe_persona_userprofile
rdoe_persona_select_up
Article

Whole-genome sequencing vs. whole-exome sequencing vs. custom panels

Whole-genome sequencing (WGS), whole-exome sequencing (WES), or custom probe panels: how do you decide?

It’s easy to be overwhelmed by the number of workflows that are available for NGS. While cost and turnaround time are important considerations, the most import question is What are you trying to learn about your samples? Here, we provide an overview of three general categories of NGS workflows and the types of information that each provides: whole-genome sequencing (WGS, also known as full-genome sequencing or genome-wide sequencing), whole-exome sequencing (WES), and target enrichment using panels that capture other genomic regions.

Whole-genome sequencing

WGS, as its name implies, yields sequence data across the entire genome—which, for humans, contains ~3 billion base pairs1. This includes (a) protein-coding regions, comprising ~2–5% of the genome (the exome),2,3,4 and (b) the rest of the genome, including intergenic and regulatory regions, intron sequences, and regions corresponding to noncoding RNAs. 

WGS can identify genetic variants in coding and noncoding regions, including insertions and deletions (indels), chromosomal rearrangements, copy number variations (CNVs), and single-nucleotide polymorphisms (SNPs). WGS also provides the basis for genome-wide association studies (GWAS), used for finding associations between genetic variants and particular traits or diseases. Unlike many targeted approaches, WGS can also identify variants that change mRNA splicing patterns or that impact the expression of noncoding RNAs, many of which have critical cellular functions5,6

While WGS is the most comprehensive and least biased of the workflows discussed here, it requires the greatest amount of sequencing (which is expensive) and the most intensive analysis (which is time-consuming and requires specialized expertise). In addition, because the data from WGS spans the entire genome, it typically does not achieve the depth of sequencing required to answer questions about somatic mutations or rare variants.

Whole-exome sequencing

WES is a more focused method that primarily captures regions that encode proteins, though some WES panels also include regulatory regions. This selection process is called target enrichment, and is typically carried out during sample preparation using biotinylated probes complementary to the desired sequences (an exome panel, for WES).

Because the majority of known disease-associated variants occur within coding regions, WES is often used to detect these mutations. Compared to WGS, WES greatly reduces sequencing costs by focusing on only ~2–5% of the genome. Thus, for researchers whose focus is protein-coding regions, WES offers a cost-effective alternative to WGS by enabling deeper sequence coverage of target regions and simplifying data analysis. 

When choosing between WGS and WES, it is important to remember that (1) WES data includes only genome regions covered by the exome panel, thus limiting the discovery of variants in other regions and the detection of structural variants, and (2) even the best target enrichment workflows are prone to some degree of target dropout and coverage bias, especially in GC- or AT-rich regions—although well-designed, high-quality probe panels can greatly reduce these issues. 

Other target enrichment panels, including custom-designed panels

Target enrichment panels designed to capture other regions offer an efficient way to gain genetic insights into specific biological processes or disorders. These panels cover highly curated combinations of genomic regions, and can be designed to include a combination of coding and noncoding regions and/or genes associated with particular conditions—such as specific cancers, metabolic disorders, and neurological or cardiac disorders. Custom panels can also target the genomes of other organisms; for example, targeted NGS of bacteria, fungi, and viruses enables the detection of these pathogens in complex samples where the majority of the nucleic acids is from the host.

Although these smaller panels ensure that most of the sequencing resources are devoted to the specific research question, their success depends on many factors, including expert probe design and high-quality probes, as noted above for WES. 

In the end, the research question itself is the best guide for choosing the NGS workflow.

References
 
  1. Bates S. Base Pair. Genome.gov. Published July 23, 2024. https://www.genome.gov/genetics-glossary/Base-Pair#:~:text=One%20copy%20of%20the%20human
  2. Comfort N. Genetics: We are the 98%. Nature. 2015;520(7549):615-616. doi:https://doi.org/10.1038/520615a
  3. Hartford CCR, Lal A. When Long Noncoding Becomes Protein Coding. Molecular and Cellular Biology. 2020;40(6). doi:https://doi.org/10.1128/MCB.00528-19
  4. National Centre for Biotechnology Information. National Center for Biotechnology Information. Nih.gov. Published 2024. https://www.ncbi.nlm.nih.gov/
  5. Ellingford JM, Ahn JW, Bagnall RD, et al. Recommendations for clinical interpretation of variants found in non-coding regions of the genome. Genome Medicine. 2022;14(1). doi:https://doi.org/10.1186/s13073-022-01073-3
  6. Tabarini N, Biagi E, Uva P, et al. Exploration of Tools for the Interpretation of Human Non-Coding Variants. International Journal of Molecular Sciences. 2022;23(21):12977. doi:https://doi.org/10.3390/ijms232112977

For Research Use Only. Not for use in diagnostic procedures.