OBJECTIVE: To develop and validate a deep learning model using readily available clinical information to predict treatment success with the first ASM for individual patients.
DESIGN, SETTING, AND PARTICIPANTS: This cohort study developed and validated a prognostic model. Patients were treated between 1982 and 2020. All patients were followed up for a minimum of 1 year or until failure of the first ASM. A total of 2404 adults with epilepsy newly treated at specialist clinics in Scotland, Malaysia, Australia, and China between 1982 and 2020 were considered for inclusion, of whom 606 (25.2%) were excluded from the final cohort because of missing information in 1 or more variables.
EXPOSURES: One of 7 antiseizure medications.
MAIN OUTCOMES AND MEASURES: With the use of the transformer model architecture on 16 clinical factors and ASM information, this cohort study first pooled all cohorts for model training and testing. The model was trained again using the largest cohort and externally validated on the other 4 cohorts. The area under the receiver operating characteristic curve (AUROC), weighted balanced accuracy, sensitivity, and specificity of the model were all assessed for predicting treatment success based on the optimal probability cutoff. Treatment success was defined as complete seizure freedom for the first year of treatment while taking the first ASM. Performance of the transformer model was compared with other machine learning models.
RESULTS: The final pooled cohort included 1798 adults (54.5% female; median age, 34 years [IQR, 24-50 years]). The transformer model that was trained using the pooled cohort had an AUROC of 0.65 (95% CI, 0.63-0.67) and a weighted balanced accuracy of 0.62 (95% CI, 0.60-0.64) on the test set. The model that was trained using the largest cohort only had AUROCs ranging from 0.52 to 0.60 and a weighted balanced accuracy ranging from 0.51 to 0.62 in the external validation cohorts. Number of pretreatment seizures, presence of psychiatric disorders, electroencephalography, and brain imaging findings were the most important clinical variables for predicted outcomes in both models. The transformer model that was developed using the pooled cohort outperformed 2 of the 5 other models tested in terms of AUROC.
CONCLUSIONS AND RELEVANCE: In this cohort study, a deep learning model showed the feasibility of personalized prediction of response to ASMs based on clinical information. With improvement of performance, such as by incorporating genetic and imaging data, this model may potentially assist clinicians in selecting the right drug at the first trial.
METHODS: Eighteen students with prior experience in traditional PDPBL processes participated in the study, divided into three groups to perform PDPBL sessions with various triggers from pharmaceutical chemistry, pharmaceutics, and clinical pharmacy fields, while utilizing chat AI provided by ChatGPT to assist with data searching and problem-solving. Questionnaires were used to collect data on the impact of ChatGPT on students' satisfaction, engagement, participation, and learning experience during the PBL sessions.
RESULTS: The survey revealed that ChatGPT improved group collaboration and engagement during PDPBL, while increasing motivation and encouraging more questions. Nevertheless, some students encountered difficulties understanding ChatGPT's information and questioned its reliability and credibility. Despite these challenges, most students saw ChatGPT's potential to eventually replace traditional information-seeking methods.
CONCLUSIONS: The study suggests that ChatGPT has the potential to enhance PDPBL in pharmacy education. However, further research is needed to examine the validity and reliability of the information provided by ChatGPT, and its impact on a larger sample size.
SUMMARY ANSWER: The inter-observer agreement among embryologists deciding whether to freeze blastocysts of marginal morphology was low and was not improved by a modified grading system.
WHAT IS KNOWN ALREADY: While previous research on inter-observer variability on the decision of which embryo to transfer from a cohort of blastocysts is good, the impact of grading variability regarding decision to freeze borderline blastocysts has not been investigated. Agreement for inner cell mass (ICM) and trophectoderm (TE) grade is only fair, factors which contribute to the grade that influences decision to freeze.
STUDY DESIGN, SIZE, DURATION: This was a prospective study involving 18 embryologists working at four different IVF clinics within a single organisation between January 2019 and July 2019.
PARTICIPANTS/MATERIALS, SETTING, METHODS: All embryologists currently practicing blastocyst grading at a multi-site organisation were invited to participate. The survey was comprised of blastocyst images in three planes and asked (i) the likelihood of freezing and (ii) whether the blastocyst would be frozen based on visual assessment. Blastocysts varied by quality and were categorised as either top (n = 20), borderline (n = 60) or non-viable/degenerate quality (n = 20). A total of 1800 freeze decisions were assessed. To assess the impact of grading criteria on inter-observer agreement for decision to freeze, the survey was taken once when the embryologists used the Gardner criteria and again 6 months after transitioning to a modified Gardner criterion with four grades for ICM and TE. The fourth grade was introduced with the aim to promote higher levels of agreement for the clinical usability decision when the blastocyst was of marginal quality.
MAIN RESULTS AND THE ROLE OF CHANCE: The inter-observer agreement for decision to freeze was near perfect (kappa 1.0) for top and non-viable/degenerate quality blastocysts, and this was not affected by the blastocysts grading criteria used (top quality; P = 0.330 and non-viable/degenerate quality; P = 0.18). In contrast, the cohort of borderline blastocysts received a mixed freeze rate (average 52.7%) during the first survey, indicative of blastocysts that showed uncertain viability and promoting significant disagreement for decision to freeze among the embryologists (kappa 0.304). After transitioning to a modified Gardner criteria with an additional grading tier, the average freeze rate increased (64.8%; P
METHOD: In the present study, three separate data cohorts containing 1288 breast lesions from three countries (Malaysia, Iran, and Turkey) were utilized for MLmodel development and external validation. The model was trained on ultrasound images of 725 breast lesions, and validation was done separately on the remaining data. An expert radiologist and a radiology resident classified the lesions based on the BI-RADS lexicon. Thirteen morphometric features were selected from a contour of the lesion and underwent a three-step feature selection process. Five features were chosen to be fed into the model separately and combined with the imaging signs mentioned in the BI-RADS reference guide. A support vector classifier was trained and optimized.
RESULTS: The diagnostic profile of the model with various input data was compared to the expert radiologist and radiology resident. The agreement of each approach with histopathologic specimens was also determined. Based on BI-RADS and morphometric features, the model achieved an area under the receiver operating characteristic (ROC) curve (AUC) of 0.885, which is higher than the expert radiologist and radiology resident performances with AUC of 0.814 and 0.632, respectively in all cohorts. DeLong's test also showed that the AUC of the ML protocol was significantly different from that of the expert radiologist (ΔAUCs = 0.071, 95%CI: (0.056, 0.086), P = 0.005).
CONCLUSIONS: These results support the possible role of morphometric features in enhancing the already well-excepted classification schemes.
MATERIALS AND METHODS: An electronic search was carried out using databases such as PubMed, Scopus, and the Web of Science Core Collection. Two reviewers searched the databases separately and concurrently. The initial search was conducted on 6 July 2021. The publishing period was unrestricted; however, the search was limited to articles involving human participants and published in English. Combinations of Medical Subject Headings (MeSH) phrases and free text terms were used as search keywords in each database. The following data was taken from the methods and results sections of the selected papers: The amount of AI training datasets utilized to train the intelligent system, as well as their conditional properties; Unilateral CLP, Bilateral CLP, Unilateral Cleft lip and alveolus, Unilateral cleft lip, Hypernasality, Dental characteristics, and sagittal jaw relationship in children with CLP are among the problems studied.
RESULTS: Based on the predefined search strings with accompanying database keywords, a total of 44 articles were found in Scopus, PubMed, and Web of Science search results. After reading the full articles, 12 papers were included for systematic analysis.
CONCLUSIONS: Artificial intelligence provides an advanced technology that can be employed in AI-enabled computerized programming software for accurate landmark detection, rapid digital cephalometric analysis, clinical decision-making, and treatment prediction. In children with corrected unilateral cleft lip and palate, ML can help detect cephalometric predictors of future need for orthognathic surgery.