Autism spectrum disorder (ASD) is caused by combined genetic and environmental factors. Genetic heritability in ASD is estimated as 60-90%, and genetic investigations have revealed many monogenic factors. We analyzed 405 patients with ASD using family-based exome sequencing to detect disease-causing single-nucleotide variants (SNVs), small insertions and deletions (indels), and copy number variations (CNVs) for molecular diagnoses. All candidate variants were validated by Sanger sequencing or quantitative polymerase chain reaction and were evaluated using the American College of Medical Genetics and Genomics/Association for Molecular Pathology guidelines for molecular diagnosis. We identified 55 disease-causing SNVs/indels in 53 affected individuals and 13 disease-causing CNVs in 13 affected individuals, achieving a molecular diagnosis in 66 of 405 affected individuals (16.3%). Among the 55 disease-causing SNVs/indels, 51 occurred de novo, 2 were compound heterozygous (in one patient), and 2 were X-linked hemizygous variants inherited from unaffected mothers. The molecular diagnosis rate in females was significantly higher than that in males. We analyzed affected sibling cases of 24 quads and 2 quintets, but only one pair of siblings shared an identical pathogenic variant. Notably, there was a higher molecular diagnostic rate in simplex cases than in multiplex families. Our simulation indicated that the diagnostic yield is increasing by 0.63% (range 0-2.5%) per year. Based on our simple simulation, diagnostic yield is improving over time. Thus, periodical reevaluation of ES data should be strongly encouraged in undiagnosed ASD patients.
Cornelia de Lange Syndrome (CdLS) is a rare, dominantly inherited multisystem developmental disorder characterized by highly variable manifestations of growth and developmental delays, upper limb involvement, hypertrichosis, cardiac, gastrointestinal, craniofacial, and other systemic features. Pathogenic variants in genes encoding cohesin complex structural subunits and regulatory proteins (NIPBL, SMC1A, SMC3, HDAC8, and RAD21) are the major pathogenic contributors to CdLS. Heterozygous or hemizygous variants in the genes encoding these five proteins have been found to be contributory to CdLS, with variants in NIPBL accounting for the majority (>60%) of cases, and the only gene identified to date that results in the severe or classic form of CdLS when mutated. Pathogenic variants in cohesin genes other than NIPBL tend to result in a less severe phenotype. Causative variants in additional genes, such as ANKRD11, EP300, AFF4, TAF1, and BRD4, can cause a CdLS-like phenotype. The common role that these genes, and others, play as critical regulators of developmental transcriptional control has led to the conditions they cause being referred to as disorders of transcriptional regulation (or "DTRs"). Here, we report the results of a comprehensive molecular analysis in a cohort of 716 probands with typical and atypical CdLS in order to delineate the genetic contribution of causative variants in cohesin complex genes as well as novel candidate genes, genotype-phenotype correlations, and the utility of genome sequencing in understanding the mutational landscape in this population.