Amelogenin paralogs on Chromosome X (AMELX) and Y (AMELY) are commonly used sexing markers. Interstitial deletion of Yp involving the AMELY locus has previously been reported. The combined frequency of the AMELY null allele in Singapore and Malaysia populations is 2.7%, 0.6% in Indian and Malay ethnic groups respectively. It is absent among 541 Chinese screened. The null allele in this study belongs to 3 Y haplogroups; J2e1 (85.7%), F* (9.5%) and D* (4.8%). Low and high-resolution STS mapping, followed by sequence analysis of breakpoint junction confirmed a large deletion of 3 to 3.7-Mb located at the Yp11.2 region. Both breakpoints were located in TSPY repeat arrays, suggesting a non-allelic homologous recombination (NAHR) mechanism of deletion. All regional null samples shared identical breakpoint sequences according to their haplogroup affiliation, providing molecular evidence of a common ancestry origin for each haplogroup, and at least 3 independent deletion events recurred in history. The estimated ages based on Y-SNP and STR analysis were approximately 13.5 +/- 3.1 kyears and approximately 0.9 +/- 0.9 kyears for the J2e1 and F* mutations, respectively. A novel polymorphism G > A at Y-GATA-H4 locus in complete linkage disequilibrium with J2e1 null mutations is a more recent event. This work re-emphasizes the need to include other sexing markers for gender determination in certain regional populations. The frequency difference among global populations suggests it constitutes another structural variation locus of human chromosome Y. The breakpoint sequences provide further information to a better understanding of the NAHR mechanism and DNA rearrangements due to higher order genomic architecture.
Massively parallel sequencing (MPS) can identify sequence variation within short tandem repeat (STR) alleles as well as their nominal allele lengths that traditionally have been obtained by capillary electrophoresis. Using the MiSeq FGx Forensic Genomics System (Illumina), STRait Razor, and in-house excel workbooks, genetic variation was characterized within STR repeat and flanking regions of 27 autosomal, 7 X-chromosome and 24 Y-chromosome STR markers in 777 unrelated individuals from four population groups. Seven hundred and forty six autosomal, 227 X-chromosome, and 324 Y-chromosome STR alleles were identified by sequence compared with 357 autosomal, 107 X-chromosome, and 189 Y-chromosome STR alleles that were identified by length. Within the observed sequence variation, 227 autosomal, 156 X-chromosome, and 112 Y-chromosome novel alleles were identified and described. One hundred and seventy six autosomal, 123 X-chromosome, and 93 Y-chromosome sequence variants resided within STR repeat regions, and 86 autosomal, 39 X-chromosome, and 20 Y-chromosome variants were located in STR flanking regions. Three markers, D18S51, DXS10135, and DYS385a-b had 1, 4, and 1 alleles, respectively, which contained both a novel repeat region variant and a flanking sequence variant in the same nucleotide sequence. There were 50 markers that demonstrated a relative increase in diversity with the variant sequence alleles compared with those of traditional nominal length alleles. These population data illustrate the genetic variation that exists in the commonly used STR markers in the selected population samples and provide allele frequencies for statistical calculations related to STR profiling with MPS data.
The human sex test in forensic multiplexes is based on the amelogenin gene on both the X and Y chromosomes commonly used in sex genotyping. In this study of 338 male individuals in a Malaysian population comprising Malays, Chinese and Indians, using the AmpFlSTR Profiler Plus kit, the amelogenin test gave a significant proportion of null alleles in the Indian ethnic group (3.6% frequency) and 0.88% frequency in the Malay ethnic group due to a deletion of the gene on the Y chromosome. This sex test also failed in a forensic casework sample. Failure of the amelogenin test highlights the need for more reliable sex determination than is offered by the amelogenin locus in the Malay and Indian populations. The gender of the Indian-Malay amelogenin nulls was confirmed by the presence of three Y-STR alleles (DYS438, DYS390 and DYS439). For the Indian ethnic group, one of the Y-STR forms a stable haplotype with the amelogenin null. The amelogenin-deletion individuals also showed a null with a male-specific minisatellite MSY1, indicating that a very large deletion was involved that included the amelogenin and the MSY1 loci on the short arm of the Y chromosomes (Yp).