METHODS: C0 were retrieved from a large neonatal vancomycin dataset. Individual estimates of AUC0-24 were obtained from Bayesian post hoc estimation. Various ML algorithms were used for model building to C0 and AUC0-24. An external dataset was used for predictive performance evaluation.
RESULTS: Before starting treatment, C0 can be predicted a priori using the Catboost-based C0-ML model combined with dosing regimen and nine covariates. External validation results showed a 42.5% improvement in prediction accuracy by using the ML model compared with the population pharmacokinetic model. The virtual trial showed that using the ML optimized dose; 80.3% of the virtual neonates achieved the pharmacodynamic target (C0 in the range of 10-20 mg/L), much higher than the international standard dose (37.7-61.5%). Once therapeutic drug monitoring (TDM) measurements (C0) in patients have been obtained, AUC0-24 can be further predicted using the Catboost-based AUC-ML model combined with C0 and nine covariates. External validation results showed that the AUC-ML model can achieve an prediction accuracy of 80.3%.
CONCLUSION: C0-based and AUC0-24-based ML models were developed accurately and precisely. These can be used for individual dose recommendations of vancomycin in neonates before treatment and dose revision after the first TDM result is obtained, respectively.
OBJECTIVE: The aim of this proof-of-concept study was to evaluate whether combining population pharmacokinetic and machine learning approaches could provide a more accurate prediction of the clearance of renally eliminated drugs in individual neonates.
METHODS: Six drugs that are primarily eliminated by the kidneys were selected (vancomycin, latamoxef, cefepime, azlocillin, ceftazidime, and amoxicillin) as 'proof of concept' compounds. Individual estimates of clearance obtained from population pharmacokinetic models were used as reference clearances, and diverse machine learning methods and nested cross-validation were adopted and evaluated against these reference clearances. The predictive performance of these combined methods was compared with the performance of two other predictive methods: a covariate-based maturation model and a postmenstrual age and body weight scaling model. Relative error was used to evaluate the different methods.
RESULTS: The extra tree regressor was selected as the best-fit machine learning method. Using the combined method, more than 95% of predictions for all six drugs had a relative error of < 50% and the mean relative error was reduced by an average of 44.3% and 71.3% compared with the other two predictive methods.
CONCLUSION: A combined population pharmacokinetic and machine learning approach provided improved predictions of individual clearances of renally cleared drugs in neonates. For a new patient treated in clinical practice, individual clearance can be predicted a priori using our model code combined with demographic data.
METHODS: We evaluated 88 breast cancer risk variants that were identified previously by GWAS in 11,760 cases and 11,612 controls of Asian ancestry. SNPs confirmed to be associated with breast cancer risk in Asian women were used to construct a polygenic risk score (PRS). The relative and absolute risks of breast cancer by the PRS percentiles were estimated based on the PRS distribution, and were used to stratify women into different levels of breast cancer risk.
RESULTS: We confirmed significant associations with breast cancer risk for SNPs in 44 of the 78 previously reported loci at P
METHODS: The development data set comprised 138,309 women from 17 case-control studies. PRSs were generated using a clumping and thresholding method, lasso penalized regression, an Empirical Bayes approach, a Bayesian polygenic prediction approach, or linear combinations of multiple PRSs. These PRSs were evaluated in 89,898 women from 3 prospective studies (1592 incident cases).
RESULTS: The best performing PRS (genome-wide set of single-nucleotide variations [formerly single-nucleotide polymorphism]) had a hazard ratio per unit SD of 1.62 (95% CI = 1.46-1.80) and an area under the receiver operating curve of 0.635 (95% CI = 0.622-0.649). Combined Asian and European PRSs (333 single-nucleotide variations) had a hazard ratio per SD of 1.53 (95% CI = 1.37-1.71) and an area under the receiver operating curve of 0.621 (95% CI = 0.608-0.635). The distribution of the latter PRS was different across ethnic subgroups, confirming the importance of population-specific calibration for valid estimation of breast cancer risk.
CONCLUSION: PRSs developed in this study, from association data from multiple ancestries, can enhance risk stratification for women of Asian ancestry.
METHODS: We evaluated a truncating variant, p.Arg798Ter (rs137852986), and 10 missense variants of BRIP1, in 48 144 cases and 43 607 controls of European origin, drawn from 41 studies participating in the Breast Cancer Association Consortium (BCAC). Additionally, we sequenced the coding regions of BRIP1 in 13 213 cases and 5242 controls from the UK, 1313 cases and 1123 controls from three population-based studies as part of the Breast Cancer Family Registry, and 1853 familial cases and 2001 controls from Australia.
RESULTS: The rare truncating allele of rs137852986 was observed in 23 cases and 18 controls in Europeans in BCAC (OR 1.09, 95% CI 0.58 to 2.03, p=0.79). Truncating variants were found in the sequencing studies in 34 cases (0.21%) and 19 controls (0.23%) (combined OR 0.90, 95% CI 0.48 to 1.70, p=0.75).
CONCLUSIONS: These results suggest that truncating variants in BRIP1, and in particular p.Arg798Ter, are not associated with a substantial increase in breast cancer risk. Such observations have important implications for the reporting of results from breast cancer screening panels.
METHODS: We conducted a fine-mapping analysis in 55,540 breast cancer cases and 51,168 controls from the Breast Cancer Association Consortium.
RESULTS: Conditional analyses identified two independent association signals among women of European ancestry, represented by rs9790517 [conditional P = 2.51 × 10(-4); OR, 1.04; 95% confidence interval (CI), 1.02-1.07] and rs77928427 (P = 1.86 × 10(-4); OR, 1.04; 95% CI, 1.02-1.07). Functional annotation using data from the Encyclopedia of DNA Elements (ENCODE) project revealed two putative functional variants, rs62331150 and rs73838678 in linkage disequilibrium (LD) with rs9790517 (r(2) ≥ 0.90) residing in the active promoter or enhancer, respectively, of the nearest gene, TET2. Both variants are located in DNase I hypersensitivity and transcription factor-binding sites. Using data from both The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), we showed that rs62331150 was associated with level of expression of TET2 in breast normal and tumor tissue.
CONCLUSION: Our study identified two independent association signals at 4q24 in relation to breast cancer risk and suggested that observed association in this locus may be mediated through the regulation of TET2.
IMPACT: Fine-mapping study with large sample size warranted for identification of independent loci for breast cancer risk.