Development of an extensive workflow for comprehensive clinical pharmacogenomic profiling: lessons from a pilot study on 100 whole exome sequencing data

Tafazoli, Alireza; van der Lee, Maaike; Swen, Jesse J.; Zeller, Anna; Wawrusiewicz-Kurylonek, Natalia; Mei, Hailiang; Vorderman, Ruben H. P.; Konopko, Krzysztof; Zankiewicz, Andrzej; Miltyk, Wojciech

doi:10.1038/s41397-022-00286-4

Download PDF

Article
Open access
Published: 13 August 2022

Development of an extensive workflow for comprehensive clinical pharmacogenomic profiling: lessons from a pilot study on 100 whole exome sequencing data

The Pharmacogenomics Journal volume 22, pages 276–283 (2022)Cite this article

3401 Accesses
5 Citations
Metrics details

Subjects

Abstract

This pilot study is aimed at implementing an approach for comprehensive clinical pharmacogenomics (PGx) profiling. Fifty patients with cardiovascular diseases and 50 healthy individuals underwent whole-exome sequencing. Data on 1800 PGx genes were extracted and analyzed through deep filtration separately. Theoretical drug induced phenoconversion was assessed for the patients, using sequence2script. In total, 4539 rare variants (including 115 damaging non-synonymous) were identified. Four publicly available PGx bioinformatics algorithms to assign PGx haplotypes were applied to nine selected very important pharmacogenes (VIP) and revealed a 45–70% concordance rate. To ensure availability of the results at point-of-care, actionable variants were stored in a web-hosted database and PGx-cards were developed for quick access and handed to the study subjects. While a comprehensive clinical PGx profile could be successfully extracted from WES data, available tools to interpret these data demonstrated inconsistencies that complicate clinical application.

A call for increased inclusivity and global representation in pharmacogenetic testing

Article Open access 22 February 2024

Clinical pharmacogenetic analysis in 5,001 individuals with diagnostic Exome Sequencing data

Article Open access 18 February 2022

Application of long-read sequencing to elucidate complex pharmacogenomic regions: a proof of principle

Article Open access 05 November 2021

Introduction

Pharmacogenomics (PGx) is aimed at reducing adverse drug reactions (ADRs) and lack of efficacy by adjusting drug therapy based on an individual’s genetic profile. Many single gene-drug interactions have been described so far. For interactions with the highest evidence, guidelines and recommendations are available [1]. Not only patients, but also healthy individuals may benefit from PGx testing for future prescriptions by saving the PGx data in electronic health records (EHR) [2]. Currently, targeted genotyping is standard practice in most PGx laboratories for the identification of important variants in the pharmacogenes. However, these panel-based tests are not able to identify rare genomic variants which are expected to have a substantial impact on a patient’s drug response. Sequencing-based methods, on the other hand, are capable of detecting most of the rare variants [3, 4]. These additional variants may help to better explain and predict drug-related phenotypes. Several groups have investigated the utility of next-generation sequencing (NGS) for PGx, both with the use of whole-exome sequencing (WES) as well as whole-genome sequencing (WGS) [5,6,7,8,9,10,11]. Such studies for example, demonstrated the utilization of WGS for the identification of putatively functional variants within well-known pharmacogenes. The result successfully represented the missing causative variants underlying drug response phenotypes [12]. However, state-of-the-art high throughput sequencing approaches result in a large amount of data, making it necessary to develop more powerful PGx-bioinformatics tools as well as assess the clinical validity and utility of sequencing-based tests [13]. Multiple tools have been developed and tested in NGS-based PGx studies [14, 15]. We provided a comprehensive review of such tools and their functional algorithms previously [16]. The performance of available haplotyping tools was also compared for CYP2D6 before. The study showed that while the overall performance was good, there were discrepancies between the individual tools. Nevertheless, a comparison of the utility of these tools for clinical PGx samples and a wide range of genes is yet to be made [17]. In this pilot study, we aim to develop an approach for comprehensive clinical PGx profiling of 100 participants. Also, we introduce a method of deep filtration for dealing with variants in less-studied drug-related genes.

Methods

Sample collection

Blood samples from 100 participants of a local and longitudinal observational biomedical project were obtained (50 cardiovascular patients with pulmonary hypertension and ischemic disease and 50 healthy individuals with common demographic features as the control group). The project was approved by the Medical University of Bialystok bioethics committee (approval code: R-I-002/630/2018) and all participants provided informed consent.

Whole-exome sequencing and primary analysis of data

DNA extraction, NGS library preparation, and quality assessment were performed according to standard manufacturer protocols (Supplementary Material). Pre-Capture Pooling Human All Exon V7” was used. The SureSelect^XT kits provide a target enrichment system for Illumina paired-end multiplexed sequencing library preparation. Sequencing was performed using the Illumina NovaSeq 6000 instrument. A standard bioinformatics analysis pipeline for raw data was employed for both GRCh37 and GRCh38 genome builds (Supplementary Material). In short, low-quality bases (read depth <10) were omitted and reads were aligned to the reference genome with the BWA-mem (Burrows-Wheeler) algorithm.

Data filtration and functional assessment

For variant interpretation, annotation, and initial functional assessment SnpEff and ENSEMBL-VEP were used [18, 19]. For the PGx assessment, a list of 1800 drug-related genes was prepared. These genes include metabolizer enzymes, drug transporters, drug receptors, and drug-target molecules. They were collected from the PharmGKB comprehensive gene list (only genes with at least one annotated variant extracted) (n = 1707), all CPIC gene-drug lists (n = 119), and the FDA table of “Pharmacogenomic Biomarkers” in drug labeling (n = 132), plus a systematic search in PubMed for any unannotated but newly introduced drug-related genes (n = 17). The relevant keywords selected and specified period applied for choosing state-of-the-art articles. Both the abstract and main text were evaluated systematically and the final result was added to our comprehensive gene list (Supplementary Material). Duplicates were then removed and variant call format (VCF) files were filtered to contain only these 1800 genes’ regions. Variants within these regions were flagged if they were predicted as pathogenic or likely pathogenic by SnpEff and VEP and went through multiple in silico prediction tools including VarSeq (Golen Helix™), Ensembl variant table, gnomAD, and ExAC report on selected variants, Varsome, and VarAFT. Also, SWISS-MODEL [20] and PyMol 2.4 [21] were applied to selected variants (predicted as highly damaging) for the implementation of homology modeling for confirmation of the negative effects of amino acid changes in the related protein.

PGx analysis with multiple dedicated bioinformatics tools

For analysis of variants in known and well-established PGx genes we used four PGx-dedicated tools: Stargazer (V.1.0.8), Aldy (V.3.3), PharmCAT (V.0.8.0), and PharmaKU which uses Stargazer V1.2.2 [22,23,24,25,26]. For Aldy, PharmCAT, and PharmaKU, GRCh38 BAM and VCF files were used and the tools were run according to their instructions in the accompanying documentation. Stargazer only works with GRCh37, hence the GRCh37-based VCF-only mode was used according to the documentation. All VCFs used in this section were the original files from the first standard analysis steps for NGS output containing all genes without any pre-filtering. Hence, results were obtained from all genes included in the tools which differed between tools. However, core pharmacogenes (defined as actionable in PGx guideline providers) were covered by all.

Haplotype/diplotype evaluation for well-known pharmacogenes

First, we made a comparison table for results from selected PGx-bioinformatics tools for nine core pharmacogenes: CYP2B6, CYP2C19, CYP2C9, CYP2D6, CYP3A5, DPYD, SLCO1B1, UGT1A1, and VKORC1. These genes are all present in guidelines from PharmGKB, CPIC, and DPWG in addition to being annotated in PharmVar. Results differed depending on the algorithm and variants used by each tool. A “3 vs. 1” conflict rule was used: if the same diplotype and phenotype were called by 3 out of the 4 tools, that was considered the correct assignment. If there was no majority agreement or if one tool did not give any calls, randomly selected discrepancies (to a total of 20 discrepancies) were manually investigated with the use of PharmVar to assess what the correct assignment was. The outcome resulted in the conclusion that Stargazer was most often correct. Hence, for discrepancies, the Stargazer assignment was selected as the correct call.

Possible drug-drug-gene interactions based on the final predicted metabolizer phenotypes or star alleles were theoretically assessed for all patients. For this, the registered demographic data and complete history of drugs plus clinical manifestations in the case of patients with reported ADRs’ phenotype were used. Possible mismatches between the individuals’ genotype-based prediction of drug metabolism and the true capacity to metabolize drugs were identified by freely available resources as a model of phenoconversion assessment for high throughput DNA sequencing data. Figure 1 illustrates our complete workflow for NGS-based clinical PGx tests for individuals.

**Fig. 1: Designed workflow for NGS-based comprehensive clinical PGx test data analysis.**

Electronic health records and data storage

Haplotypes and phenotypes assigned in the previous step are included in the EHR in university hospital in Bialystok to guide future drug therapy for all participants. Additionally, results are reported back to the participants using a special PGx card as well. Such reporting methods are to allow the utilization of the information based on the provided guidelines or recommendations by CPIC, FDA, DPWG, or other guidelines. Each participant’s profile in the current study includes specific records with information related to CPIC and DPWG guidelines plus novel variant data in less-known drug-related genes. Access to this database is provided through a publicly available Internet webpage, entitled “clinicalpgx.pl.” (Figs. 2 and 3).

**Fig. 2: Designed medication safety card for reaching out to individuals’ clinical PGx test result.**

Results

WES data analysis for clinical PGx practice

WES resulted in 1026.75 Gb of 101 bp paired-end reads the output and 93.92% of reads with Q > 30. On average, 30,000 variants were identified for each sample. PGx-VCF files displayed between 3300–3600 identified variants in 1800 drug-related genes for each sample. In total 299,297 unique variants from all samples passed the genotype quality and desired read depth (DP) filtrations. Out of the 299,297 variants, 4539 (1.51%) were identified as rare variants with minor allele frequency ≤ 0.01 based on data from the 1K genomes, gnomAD, and ExAC databases. Also, the approach revealed 36 variants within our nine core pharmacogenes, with 28 of them considered rare and/or extremely rare in 1K genomes and gnomAD. These 36 variants were not in CPIC or PharmGKB. Overall, of the 4539 rare variants, there were 21 frameshift, 19 in-frame deletion/insertion, 50 intronic, 18 splice-site, 26 stop codon, 1804 synonymous, and 2447 missense non-synonymous variants in coding regions plus 154 other types of changes (i.e., 27 UTR, 9 initial codon variants, etc.) found (Fig. 4). Multiple functional assessment algorithms identified 115 of the non-synonymous rare variants as damaging. The final step integrated with in silico analysis methods like extra deep filtration, deep computational analysis, and machine learning approaches alongside protein modeling implementation to inferring and providing higher accuracy rate in functional predictions for variants, particularly in less-known drug-related genes. Next, we checked the ability of common genetic bioinformatics tools (SIFT, Polyphen2, FATHMM, Mutation taster, Mutation Assessor, and CAAD) to identify known impactful variants in pharmacogenes. The evaluation analyzed a list of selected 39 interpreted variants (based on the U-PGx [27] consortiums panel which were previously analyzed through our other investigations and confirmed in PharmVAR) from 11 core pharmacogenes (CYP2B6, CYP2C19, CYP2C9, CYP2D6, CYP3A5, DPYD, F5L, SLCO1B1, TPMT, UGT1A1, and VKORC1). The results showed that most of the bioinformatics tools used were not successful in identifying these variants as potentially deleterious or impactful. This was particularly evident in the variants associated with a decrease or increase in function. However, variants that are known to be completely deleterious (loss of function) in PGx were identified in half of the cases. This was considered while we used ExAC-LOF as one of the main filtration tools for the detection of rare variants in our study. The tool contains information on loss of function variants from ExAC, which was one of our main databases for highlighting rare variants as well.

**Fig. 4: Total rare variants in WES result for all samples.**

As the common bioinformatics tools like SIFT, Polyphen2, CAAD, etc. were shown to be not suitable for the identification of impactful PGx variants, we may do the pre-filtration of WES data for obtaining only PGx-related genes VCF file and use that in common tools. In this case, all the identified malfunction variants (for example loss of function, which are mostly highlighted by common tools) come from drug-related genes making the downstream processing and detailed analysis more manageable. Then we can go further and do more in silico analysis by other available tools as the confirmation approach for predicted functional assessments.

Bioinformatics tools for NGS-PGx outcome (comparison of tools result)

Besides the prediction of yet unused variants in pharmacogenes, we have also explored the results from PGx-dedicated tools for the assignment of *-haplotypes based on well-known pharmacovariants. We evaluated these results in detail and evaluate the potential use of such tools for clinical practice. Four different software tools were used for assigning diplotypes (star alleles) and phenotypes. However, the result from these bioinformatics tools showed discrepancies. While Aldy and Stargazer showed the most similarity for nine core pharmacogenes, PharmCAT was unable to call star alleles for every gene. PharmaKU uses Stargazer as its basis (v V1.2.2) and accepts genome version GRCh38 as well [26]. While the outcome was mostly concordant with Stargazer, there were discrepancies (one per ten calls) mainly due to default calls in PharmaKU, especially in the absence of input data. If data on a gene or its variants was missing, Stargazer would not provide any results, PharmaKU on the other hand would assume the genes were entire wildtype and calls a *1/*1 haplotype. Figure 5 displays the concordance rate for PGx-dedicated bioinformatics tools and for each selected pharmacogene in detail. Also, Table 1 displays multi-tools’ discrepancies reports and provides more details on calling star alleles. The overall result brought some important insights for these tool’s functions, which are worth considering while using such tools in clinical PGx tests: (1) most common cause of discrepancies comes from differences in the variants each tool uses. Also, not all the tools implement phasing for haplotype detection. For example, while Stargazer uses Beagle as a built-in algorithm for running the phasing for the samples, PharmCAT works best with a phased VCF file as input data. Aldy uses only unphased data. (2) the genotype and phenotype assignments are not the same in every tool. For example, Aldy’s variant to haplotype translation does not always match PharmVar (i.e., CYP2B6*4 may call as *1). PharmaKU, on the other hand, does not provide any information on variants as a web-based report with only star alleles and predicted phenotype. Regarding phenotype assignments, Stargazer’s phenotype predictions do not always match the guidelines (e.g., CYP2B6*1/*6 is translated as a normal metabolizer instead of intermediate). Additionally, Aldy does not provide phenotype translation in the result, while the other evaluated tools have that. Finally, (3) the transparency and ease of use are different. Differences occur in all aspects of these tools: for example, the necessity for pre-processing of the input data (Aldy, PharmCAT, and complete mode for Stargazer), the comprehensiveness of the report, the technical features, and the genotype and phenotype translation.

**Fig. 5: PGx-dedicated bioinformatics algorithms concordance rate for selected nine core pharmacogenes in the current study.**

Table. 1 Multi PGx-dedicated bioinformatics tools’ discrepancies report for each core pharmacogene in current study and the total measurement for such conflicts in 100 WES data.

Full size table

Actionable pharmacovariants in individuals

The most “non-normal” phenotypes were identified for CYP2D6. CYP3A5, on the other hand, was the most consistent with almost all samples having a poor metabolizer phenotype. The overall frequency of abnormal alleles leads to aberrant phenotypes within our participants for nine core pharmacogenes were as follow: CYP2B6 (47%), CYP2C19 (17%), CYP2C9 (31%), CYP2D6 (60%), CYP3A5 (90%), DPYD (6%), SLCO1B1 (47%), UGT1A1 (18%), and VKORC1 (47%). Figure 6 indicates the frequency for each allele in selected genes and linked phenotypes in detail. Moreover, for running the theoretically phenoconversion measurements, particularly drug-drug-gene interactions on a large number of samples, we used Sequence2script [28] to identify any potential drug-drug-gene interactions. The assessment, however, showed almost no changes in drug response phenotypes and dosage modifications for our samples. Finally, all phenotypes are included on the PGx result card and on the website (clinicalpgx.pl), which contains both CPIC and DPWG guidelines to allow them to be accessed by the participants and their healthcare providers. An example of the result from our approach for actionable pharmacovariants and prescription recommendations is provided on “clinicalpgx.pl/data” for anonymous person.

**Fig. 6: Different metabolizer for nine core pharmacogenes.**

Discussion

The result of our study on PGx profiling from WES data may be helpful for utilization of bioinformatics tools, specifically PGx-dedicated algorithms, in daily clinical practice. Our investigation also displayed the advantage of pre-filtration of VCF files for only drug-related genes in order to help for identification of more pharmacovariants within such genes. We demonstrated higher accuracy of PGx independent bioinformatics tools, particularly in clinical research, compared to web-hosted algorithms like PharmaKU as the tool could be used just as a confirmation for Stargazer result, where the input data provided correctly. While the web-based PGx haplotype tools are easier to use, they also seem to be less accurate than the more transparent command line-based programs. In order to touch on the advantages of more in-depth methods in clinical reports, the selection and utilization of correct PGx-dedicated bioinformatics tool(s) must be considered by test centers as well. True applications of PGx bioinformatics algorithms in clinics will bring several advantages, not only in biomarker identification but also in physicians’ accurate decision-making and drug stratification [29]. Choosing the right tool and annotation databases in addition to setting up a consistent workflow for routine practice in clinical centers requires advanced knowledge and awareness of existing tools or resources and their functional approaches in variant interpretations [30].

Even though all the available PGx-dedicated bioinformatics tools are limited to the specific number of pharmacogenes and included the distinct number of pharmacovariants in their panel, most of these curated genetic variations come with clinical guidelines and annotations for treatment modifications as well. Hence, applying multi-tools for including more genes and variants seems reasonable. So far, most studies reported the advantages of using PGx bioinformatics tools in clinical investigations but as a separate entity [31, 32]. For those reported the multi-tool utilization, again not all of the genes in all samples were evaluated in that way [33]. Among PGx-dedicated bioinformatics algorithms, we propose to use at least two of such tools for providing more confident haplotype calls. However, an important limitation of applying different tools would be the necessity for running the alignment part for different reference genomes as some of them might need GRCh37 while the others work with GRCh38. In our study, we tried to use NCBI’s genome remapping service (https://www.ncbi.nlm.nih.gov/genome/tools/remap) to perform this re-alignment and liftover from GRCh38 to GRCh37. However, after evaluating of results, we noticed that the approach led to the exclusion of some important PGx variants. Therefore, we decided to perform a separate realignment with GRCh37 assembly as the reference for the raw data. These data were subsequently used by Stargazer.

The outcome of our result in the adaptation of PGx-dedicated bioinformatics tools for clinical interpretation of PGx variants may help clinicians to improve the implementation of the NGS-guided clinical PGx tests. Once the utilization of such computational assessments is established in the center, the related healthcare system may benefit from the fast and more accurate PGx marker diagnosis in a shorter turnaround time.

Also, it is worth considering that not all types of PGx variants may be identified by common bioinformatics tools. As we have shown, increased and decreased functions (rapid and intermediate metabolizers) are mostly ignored by tools like SIFT, Polyphen2, FATHMM, Mutation taster, Mutation Assessor, etc. Therefore, it might be valuable to filter PGx regions from VCF files to select candidates for in silico validation studies as opposed to using inaccurate in silico tools for the assessment of variant impact. This type of approach would be for novel variants with unknown significance, which have an impact on the protein (e.g., missense or frameshift variants) [34]. Nevertheless, the workflow in this level for the current study brought many interesting outcomes for novel and/or not-annotated variants in our samples, which the interpretation and further analysis are still in progress. Today, computational assessments proved to be a promising approach for the translation of novel variants into healthcare [35, 36].

Besides gene-drug interactions it is also of importance to consider phenoconversion for improving accuracy rate for phenotype prediction in personalized therapy area too. Under the influence of comedication the activity of an enzyme can switch, for example from intermediate to normal metabolizer due to an inducer effect. For instance, proton-pump inhibitors can reduce CYP2C19 activity and thereby convert a normal metabolizer phenotype to an intermediate metabolizer phenotype. This can, in turn, has an impact on other drugs used by the patients. The use of both proton-pump inhibitors (CYP2C19 inhibitor) and clopidogrel (CYP2C19 substrate) is highly likely in a cardiovascular cohort such as ours, therefore it is important to be aware of these types of drug-drug-gene interactions. Future research and investigations will need to take comedication into account when studying the impact of PGx on clinical outcomes.

Integration of computational assessment and bioinformatic functional analysis of pharmacovariants within high throughput DNA sequencing data is rapidly expanding. However, while a comprehensive clinical PGx profile could be successfully extracted from WES data, available tools to interpret such data are not consistent for all pharmacogenes and show several discrepancies compared to each other. Moreover, WES data demonstrates an abundance of variants not yet used in clinical practice. To bring the translation of such technologies into daily clinical setting the clinical validity and utility of dedicated bioinformatics tools should be investigated more.

Data availability

The data are available from the corresponding author at Department of Analysis and Bioanalysis of Medicines, Medical University of Bialystok on reasonable request.

References

Abdullah-Koolmees H, Van Keulen AM, Nijenhuis M, Deneer VH. Pharmacogenetics guidelines: overview and comparison of the DPWG, CPIC, CPNDS, and RNPGx guidelines. Front Pharmacol. 2021;11:1976.
Scott SA. Clinical pharmacogenomics: opportunities and challenges at point of care. Clin Pharmacol Ther. 2013;93:33–5.
Article CAS PubMed Google Scholar
Kozyra M, Ingelman-Sundberg M, Lauschke VM. Rare genetic variants in cellular transporters, metabolic enzymes, and nuclear receptors can be important determinants of interindividual differences in drug response. Genet Med. 2017;19:20–9.
Article CAS PubMed Google Scholar
Lopes JL, Harris K, Karow MB, Peterson SE, Kluge ML, Kotzer KE, et al. Targeted genotyping in clinical pharmacogenomics: what is missing? J Mol Diagn. 2022;24:253–61.
Yang Y, Botton MR, Scott ER, Scott SA. Sequencing the CYP2D6 gene: from variant allele discovery to clinical pharmacogenetic testing. Pharmacogenomics. 2017;18:673–85.
Article CAS PubMed PubMed Central Google Scholar
van der Lee M, Allard WG, Bollen S, Santen GW, Ruivenkamp CA, Hoffer MJ, et al. Repurposing of diagnostic whole exome sequencing data of 1,583 individuals for clinical pharmacogenetics. Clin Pharmacol Ther. 2020;107:617–27.
Article PubMed Google Scholar
Cousin MA, Matey ET, Blackburn PR, Boczek NJ, McAllister TM, Kruisselbrink TM, et al. Pharmacogenomic findings from clinical whole exome sequencing of diagnostic odyssey patients. Mol Genet Genom Med. 2017;5:269–79.
Article Google Scholar
Santos M, Niemi M, Hiratsuka M, Kumondai M, Ingelman-Sundberg M, Lauschke VM, et al. Novel copy-number variations in pharmacogenes contribute to interindividual differences in drug pharmacokinetics. Genet Med. 2018;20:622–9.
Article CAS PubMed Google Scholar
Price MJ, Carson AR, Murray SS, Phillips T, Janel L, Tisch R, et al. First pharmacogenomic analysis using whole exome sequencing to identify novel genetic determinants of clopidogrel response variability: results of the genotype information and functional testing (GIFT) exome study. J Am Coll Cardiol. 2012;59:E9–E.
Article Google Scholar
Mizzi C, Peters B, Mitropoulou C, Mitropoulos K, Katsila T, Agarwal MR, et al. Personalized pharmacogenomics profiling using whole-genome sequencing. Pharmacogenomics. 2014;15:1223–34.
Article CAS PubMed Google Scholar
Sivadas A, Salleh M, Teh L, Scaria V. Genetic epidemiology of pharmacogenetic variants in South East Asian Malays using whole-genome sequences. Pharmacogenomics J. 2017;17:461–70.
Article CAS PubMed Google Scholar
Choi J, Tantisira KG, Duan QL. Whole genome sequencing identifies high-impact variants in well-known pharmacogenomic genes. Pharmacogenomics J. 2019;19:127–35.
Article CAS PubMed Google Scholar
Ng D, Hong CS, Singh LN, Johnston JJ, Mullikin JC, Biesecker LG. Assessing the capability of massively parallel sequencing for opportunistic pharmacogenetic screening. Genet Med. 2017;19:357–61.
Article CAS PubMed Google Scholar
Gordon A, Fulton RS, Qin X, Mardis ER, Nickerson DA, Scherer S. PGRNseq: a targeted capture sequencing panel for pharmacogenetic research and implementation. Pharmacogenet Genomics. 2016;26:161.
Article CAS PubMed PubMed Central Google Scholar
Gulilat M, Lamb T, Teft WA, Wang J, Dron JS, Robinson JF, et al. Targeted next generation sequencing as a tool for precision medicine. BMC Med Genomics. 2019;12:1–17.
Article CAS Google Scholar
Tafazoli A, Guchelaar H-J, Miltyk W, Kretowski AJ, Swen JJ. Applying next-generation sequencing platforms for pharmacogenomic testing in clinical practice. Front Pharmacol. 2021;12:2025.
Article Google Scholar
Twesigomwe D, Wright GE, Drögemöller BI, da Rocha J, Lombard Z, Hazelhurst S. A systematic comparison of pharmacogene star allele calling bioinformatics algorithms: a focus on CYP2D6 genotyping. NPJ Genom Med. 2020;5:1–11.
Article Google Scholar
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92.
Article CAS PubMed PubMed Central Google Scholar
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, et al. The ensembl variant effect predictor. Genome Biol. 2016;17:1–14.
Article Google Scholar
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–303.
Article CAS PubMed PubMed Central Google Scholar
DeLano WL. The PyMOL molecular graphics system. 2002. http://www.pymol.org.
Lee S-B, Wheeler MM, Patterson K, McGee S, Dalton R, Woodahl EL, et al. Stargazer: a software tool for calling star alleles from next-generation sequencing data using CYP2D6 as a model. Genet Med. 2019;21:361–72.
Article CAS PubMed Google Scholar
Lee SB, Wheeler MM, Thummel KE, Nickerson DA. Calling star alleles with stargazer in 28 pharmacogenes with whole genome sequences. Clin Pharmacol Ther. 2019;106:1328–37.
Article CAS PubMed PubMed Central Google Scholar
Numanagić I, Malikić S, Ford M, Qin X, Toji L, Radovich M, et al. Allelic decomposition and exact genotyping of highly polymorphic and structurally variant genes. Nat Commun. 2018;9:1–11.
Article Google Scholar
Sangkuhl K, Whirl‐Carrillo M, Whaley RM, Woon M, Lavertu A, Altman RB, et al. Pharmacogenomics clinical annotation tool (Pharm CAT). Clin Pharmacol Ther. 2020;107:203–10.
Article CAS PubMed Google Scholar
John SE, Channanath AM, Hebbar P, Nizam R, Thanaraj TA, Al-Mulla F. PharmaKU: a web-based tool aimed at improving outreach and clinical utility of pharmacogenomics. J Pers Med. 2021;11:210.
Article PubMed PubMed Central Google Scholar
van der Wouden CH, van Rhenen MH, Jama WO, Ingelman‐Sundberg M, Lauschke VM, Konta L, et al. Development of the PG x‐Passport: a panel of actionable germline genetic variants for pre‐emptive pharmacogenetic testing. Clin Pharmacol Ther. 2019;106:866–73.
Article PubMed PubMed Central Google Scholar
Bousman CA, Wu P, Aitchison KJ, Cheng T. Sequence2Script: a web-based tool for translation of pharmacogenetic data into evidence-based prescribing recommendations. Front Pharmacol. 2021;12:238.
Overkleeft R, Tommel J, Evers AW, den Dunnen JT, Roos M, Hoefmans M-J, et al. Using personal genomic data within primary care: a bioinformatics approach to pharmacogenomics. Genes. 2020;11:1443.
Article CAS PubMed Central Google Scholar
Tong H, Phan NV, Nguyen TT, Nguyen DV, Vo NS, Le L. Review on databases and bioinformatic approaches on pharmacogenomics of adverse drug reactions. Pharmacogenomics Pers Med. 2021;14:61.
Google Scholar
Ly RC, Shugg T, Ratcliff R, Osei W, Lynnes TC, Pratt VM, et al. Analytical validation of a computational method for pharmacogenetic genotyping from clinical whole exome sequencing. J Mol Diagn. 2022;24:576–85.
Wankaew N, Chariyavilaskul P, Chamnanphon M, Assawapitaksakul A, Chetruengchai W, Pongpanich M, et al. Genotypic and phenotypic landscapes of 51 pharmacogenes derived from whole-genome sequencing in a Thai population. PLoS ONE. 2022;17:e0263621.
Article CAS PubMed PubMed Central Google Scholar
Caspar SM, Schneider T, Meienberg J, Matyas G. Added value of clinical sequencing: WGS-based profiling of pharmacogenes. Int J Mol Sci. 2020;21:2308.
Article CAS PubMed Central Google Scholar
Siamoglou S, Koromina M, Hishinuma E, Yamazaki S, Tsermpini E-E, Kordou Z, et al. Identification and functional validation of novel pharmacogenomic variants using a next-generation sequencing-based approach for clinical pharmacogenomics. Pharmacol Res. 2022;176:106087.
Zhou Y, Mkrtchian S, Kumondai M, Hiratsuka M, Lauschke VM. An optimized prediction framework to assess the functional impact of pharmacogenetic variants. Pharmacogenomics J. 2019;19:115–26.
Article CAS PubMed Google Scholar
Pandi M-T, Koromina M, Tsafaridis I, Patsilinakos S, Christoforou E, van der Spek PJ, et al. A novel machine learning-based approach for the computational functional assessment of pharmacogenomic variants. Hum Genomics. 2021;15:1–13.
Article Google Scholar

Download references

Funding

This article was conducted within the projects which have received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant, agreement no. 754432, and the Polish Ministry of Science and Higher Education and from financial resources for science in 2018–2023 granted for the implementation of an international co-financed project.

Author information

Authors and Affiliations

Department of Analysis and Bioanalysis of Medicines, Faculty of Pharmacy with the Division of Laboratory Medicine, Medical University of Bialystok, 15-089, Bialystok, Poland
Alireza Tafazoli & Wojciech Miltyk
Clinical Research Centre, University Hospital of Bialystok, 15-276, Bialystok, Poland
Alireza Tafazoli & Anna Zeller
Department of Clinical Pharmacy and Toxicology, Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
Maaike van der Lee & Jesse J. Swen
Department of Clinical Genetics, Medical University of Białystok, 15-089, Białystok, Poland
Natalia Wawrusiewicz-Kurylonek
Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
Hailiang Mei & Ruben H. P. Vorderman
Department of Photonics, Electronics, and Lighting Technology, Faculty of Electrical Engineering, Bialystok University of Technology, 15-351, Bialystok, Poland
Krzysztof Konopko & Andrzej Zankiewicz

Authors

Alireza Tafazoli
View author publications
You can also search for this author in PubMed Google Scholar
Maaike van der Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jesse J. Swen
View author publications
You can also search for this author in PubMed Google Scholar
Anna Zeller
View author publications
You can also search for this author in PubMed Google Scholar
Natalia Wawrusiewicz-Kurylonek
View author publications
You can also search for this author in PubMed Google Scholar
Hailiang Mei
View author publications
You can also search for this author in PubMed Google Scholar
Ruben H. P. Vorderman
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Konopko
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Zankiewicz
View author publications
You can also search for this author in PubMed Google Scholar
Wojciech Miltyk
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AT designed the study, performed most of the lab work and ran both common and PGx-dedicated bioinformatics algorithms, did WES data interpretations, analyzed the PGx-related tools data, designed the webpage for clinical PGx data and medication safety card, and wrote the entire manuscript draft, MvdL helped in running PGx-dedicated bioinformatics algorithms and analyzing the related data, and edit the manuscript, AZ and NWK helped and ran some parts of lab work, HM and RHPV did the raw WES data analysis for 100 samples, KK and AZ created and launched the webpage for clinical PGx data plus medication safety card, JJS and WM supervised the entire study and edited final version of the manuscript.

Corresponding author

Correspondence to Wojciech Miltyk.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tafazoli, A., van der Lee, M., Swen, J.J. et al. Development of an extensive workflow for comprehensive clinical pharmacogenomic profiling: lessons from a pilot study on 100 whole exome sequencing data. Pharmacogenomics J 22, 276–283 (2022). https://doi.org/10.1038/s41397-022-00286-4

Download citation

Received: 03 July 2022
Revised: 20 July 2022
Accepted: 25 July 2022
Published: 13 August 2022
Issue Date: December 2022
DOI: https://doi.org/10.1038/s41397-022-00286-4

This article is cited by

Pharmacovariome scanning using whole pharmacogene resequencing coupled with deep computational analysis and machine learning for clinical pharmacogenomics
- Alireza Tafazoli
- John Mikros
- George P. Patrinos
Human Genomics (2023)