FINDINGS: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
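The scaffold N50 reported above is the length of the scaffold at which the cumulative length, counting from the longest scaffold down, first reaches half of the total assembly size. The following is a minimal sketch of that calculation; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    # Accumulate from the longest scaffold until half the assembly is covered
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths (arbitrary units), not real assembly data
toy_lengths = [8, 6, 5, 4, 3, 2, 1, 1]
print(n50(toy_lengths))  # prints 5
```

In the toy example the total is 30, and the three longest scaffolds (8 + 6 + 5 = 19) are the first to cover half of it, so the N50 is 5.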
CONCLUSIONS: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
METHODS: To discover novel pancreatic cancer risk loci and possible causal genes, we performed a pancreatic cancer transcriptome-wide association study in Europeans using three approaches: FUSION, MetaXcan, and Summary-MulTiXcan. We integrated genome-wide association studies summary statistics from 9040 pancreatic cancer cases and 12 496 controls, with gene expression prediction models built using transcriptome data from histologically normal pancreatic tissue samples (NCI Laboratory of Translational Genomics [n = 95] and Genotype-Tissue Expression v7 [n = 174] datasets) and data from 48 different tissues (Genotype-Tissue Expression v7, n = 74-421 samples).
RESULTS: We identified 25 genes whose genetically predicted expression was statistically significantly associated with pancreatic cancer risk (false discovery rate < .05), including 14 candidate genes at 11 novel loci (1p36.12: CELA3B; 9q31.1: SMC2, SMC2-AS1; 10q23.31: RP11-80H5.9; 12q13.13: SMUG1; 14q32.33: BTBD6; 15q23: HEXA; 15q26.1: RCCD1; 17q12: PNMT, CDK12, PGAP3; 17q22: SUPT4H1; 18q11.22: RP11-888D10.3; and 19p13.11: PGPEP1) and 11 at six known risk loci (5p15.33: TERT, CLPTM1L, ZDHHC11B; 7p14.1: INHBA; 9q34.2: ABO; 13q12.2: PDX1; 13q22.1: KLF5; and 16q23.1: WDR59, CFDP1, BCAR1, TMEM170A). The association for 12 of these genes (CELA3B, SMC2, and PNMT at novel risk loci and TERT, CLPTM1L, INHBA, ABO, PDX1, KLF5, WDR59, CFDP1, and BCAR1 at known loci) remained statistically significant after Bonferroni correction.
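The gap between the 25 genes passing the false discovery rate threshold and the 12 surviving Bonferroni correction reflects how much stricter the latter is. The sketch below contrasts the two procedures on made-up p-values. It assumes the Benjamini-Hochberg procedure for the false discovery rate, which the text does not name, and the function names and p-values are hypothetical.

```python
def bonferroni(pvalues, alpha=0.05):
    """Flag tests whose p-value is below alpha divided by the number of tests."""
    m = len(pvalues)
    return [p < alpha / m for p in pvalues]


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for five genes (not from the study)
toy_p = [0.001, 0.012, 0.02, 0.045, 0.6]
fdr_hits = sum(q < 0.05 for q in benjamini_hochberg(toy_p))
bonf_hits = sum(bonferroni(toy_p))
print(fdr_hits, bonf_hits)  # prints 3 1
```

With these toy values, three tests pass at a false discovery rate below 0.05, while only one passes the Bonferroni threshold of 0.05 / 5 = 0.01.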
CONCLUSIONS: By integrating gene expression and genotype data, we identified novel pancreatic cancer risk loci and candidate functional genes that warrant further investigation.
RESULTS: In this study, we propose the Context Based Dependency Network (CBDN), a method that infers gene regulatory networks, including regulatory directions, from gene expression data alone. To determine the regulatory direction, CBDN computes the influence of a source gene on a target gene by evaluating how much the expression dependencies between the target gene and the other genes change when conditioning on the source gene. CBDN extends the data processing inequality by incorporating the dependency direction to distinguish between direct and transitive relationships between genes. We also define two types of important regulators, which can influence a majority of the genes in the network directly or indirectly. CBDN can detect both types by averaging the influence functions of a candidate regulator on the other genes. In our experiments with simulated and real data, even with the regulatory direction taken into account, CBDN outperforms state-of-the-art approaches for inferring gene regulatory networks. CBDN identifies important regulators in the predicted networks: (1) TYROBP influences a batch of genes related to Alzheimer's disease; (2) ZNF329 and RB1 significantly regulate the genes of the 'mesenchymal' gene expression signature for brain tumors.
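For readers unfamiliar with the data processing inequality mentioned above, the sketch below shows its classical undirected use: in every triplet of genes, the weakest pairwise dependency is treated as transitive and removed. This is not the CBDN algorithm and does not recover regulatory direction; it only illustrates the baseline idea that CBDN is described as extending. The function name `dpi_prune`, the `tolerance` parameter, and the toy scores are assumptions made for illustration.

```python
from itertools import combinations


def dpi_prune(genes, score, tolerance=0.0):
    """Drop the weakest edge of each gene triplet from a symmetric score matrix.

    `score[i][j]` is any symmetric dependency measure (for example mutual
    information) between genes i and j. Returns the surviving edges.
    """
    n = len(genes)
    to_remove = set()
    for i, j, k in combinations(range(n), 3):
        edges = [((i, j), score[i][j]),
                 ((i, k), score[i][k]),
                 ((j, k), score[j][k])]
        weakest_edge, weakest = min(edges, key=lambda e: e[1])
        others = [s for e, s in edges if e != weakest_edge]
        # Mark the weakest edge as transitive if it is clearly below the other two
        if weakest < min(others) * (1 - tolerance):
            to_remove.add(weakest_edge)
    return {(genes[a], genes[b]): score[a][b]
            for a, b in combinations(range(n), 2)
            if (a, b) not in to_remove}


# Hypothetical chain A - B - C: the A-C dependency is only transitive
genes = ["A", "B", "C"]
score = [[0.0, 0.8, 0.4],
         [0.8, 0.0, 0.7],
         [0.4, 0.7, 0.0]]
print(sorted(dpi_prune(genes, score)))  # prints [('A', 'B'), ('B', 'C')]
```

All triplets are evaluated against the original scores, so the order in which triplets are visited does not affect which edges are removed.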
CONCLUSION: Using gene expression data alone, CBDN can efficiently infer the existence of gene-gene interactions as well as their regulatory directions. The constructed networks help identify important regulators of complex diseases.
METHODS: This review was performed following the PRISMA guidelines. A systematic search was conducted in the electronic databases PubMed and Web of Science to identify articles focussed on gene expression and approaches for osteoblast and osteoclast differentiation.
RESULTS: Six articles were included in this review; these were original articles on the in vitro differentiation of human stem cells into osteoblasts and osteoclasts that involved gene expression profiling. Quantitative polymerase chain reaction (qPCR) was the most commonly used gene expression technique for detecting differentiated human osteoblasts and osteoclasts. A total of 16 genes were found to be related to osteoblast and osteoclast differentiation.
CONCLUSION: Qualitative gene expression information provided by qPCR, rather than the evaluation of relative gene expression, could become a standard for analysing the differentiation of human stem cells into osteoblasts and osteoclasts. RUNX2 and CTSK could be applied to detect osteoblasts and osteoclasts, respectively, while RANKL could be applied to detect both osteoblasts and osteoclasts. This review provides future researchers with a central source of relevant information on the wide variety of gene expression approaches used to analyse the differentiation of human osteoblast and osteoclast cells. In addition, these findings should enable researchers to conduct studies involving the differentiation of isolated human stem cells into osteoblasts and osteoclasts accurately and efficiently.
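For context on the relative gene expression mentioned above, qPCR data are often summarised with the 2^(-ΔΔCt) method, which compares a target gene's cycle threshold (Ct) against a reference gene in treated and control samples. The sketch below is only an illustration of that widely used calculation; the review does not state which formula the included studies used, and the function name, the choice of reference gene, and all Ct values are hypothetical.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene by the 2^(-ddCt) method."""
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2 ** (-delta_delta_ct)


# Hypothetical Ct values: a target such as RUNX2 against an assumed
# reference gene, in differentiated cells versus undifferentiated controls
fold = relative_expression(ct_target_sample=22.0, ct_ref_sample=18.0,
                           ct_target_control=25.0, ct_ref_control=18.0)
print(fold)  # prints 8.0
```

Here the target crosses the threshold three cycles earlier in the differentiated sample after normalisation, which corresponds to an eightfold higher relative expression under the assumption of perfect amplification efficiency.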
METHODS: RNA was isolated from peripheral whole blood samples (2 × 10 ml) collected in EDTA vacutainers from nasopharyngeal carcinoma (NPC) patients and controls. Gene expression patterns from 99 samples (66 NPC; 33 controls) were assessed using the Affymetrix array. We also collected expression data from 447 patients with other cancers (201 patients) and non-cancer conditions (246 patients). Multivariate logistic regression analysis was used to obtain biomarker signatures differentiating NPC samples from controls and other diseases. Differences were also analysed within a subset (n = 28) of a pre-intervention case cohort of patients whom we followed post-treatment.
RESULTS: A blood-based gene expression signature composed of three genes (LDLRAP1, PHF20, and LUC7L3) is able to differentiate NPC from various other diseases and from unaffected controls with high accuracy (area under the receiver operating characteristic curve of over 0.90). By subdividing our NPC cohort according to the degree of patient response to treatment, we were able to identify a blood gene signature that may be able to guide the selection of treatment.
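The area under the receiver operating characteristic curve quoted above can be read as the probability that a randomly chosen case receives a higher signature score than a randomly chosen control. A minimal sketch of that calculation follows; the function name `roc_auc`, the labels, and the scores are hypothetical and do not come from the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of case/control pairs ranked correctly (ties count 0.5)."""
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Hypothetical predicted probabilities from a logistic model (1 = case, 0 = control)
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
print(round(roc_auc(labels, scores), 3))  # prints 0.889
```

In the toy data, eight of the nine case/control pairs are ordered correctly, giving an AUC of 8/9. The pairwise loop is quadratic in sample size, which is acceptable for cohorts of the size described here.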
CONCLUSION: We have identified a blood-based gene signature that accurately distinguished NPC patients from controls and from patients with other diseases. The genes in the signature, LDLRAP1, PHF20, and LUC7L3, are known to be involved in carcinoma of the head and neck, tumour-associated antigens, and/or cellular signalling. We have also identified blood-based biomarkers that are potentially able to predict which patients are more likely to respond to treatment for NPC. These findings have significant clinical implications for optimizing NPC therapy.