Keyword extraction is essential for identifying influential keywords in large documents, as research repositories grow in volume day by day. The research community is drowning in data and starving for information. Keywords are the words that precisely describe the theme of a whole document in just a few terms. Many state-of-the-art approaches are available for extracting keywords from large document collections, and they fall into three types: statistical, machine learning, and graph-based methods. Machine learning approaches require a large training dataset that must be built manually by domain experts, which is sometimes difficult to produce. This research therefore focused on enhancing state-of-the-art graph-based methods to extract keywords when no training dataset is available. It first converted a handcrafted dataset, collected from impact factor journals, into n-gram combinations ranging from unigrams to pentagrams (five-word sequences), and then enhanced traditional graph-based approaches. All methods were applied to this handcrafted dataset. Domain experts performed a user study, and the results of every method were evaluated against it using precision, recall, and F-measure as evaluation metrics. The results showed that the proposed method (FNG-IE) performed well, scoring close to the machine learning approaches.
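Two of the steps named in this abstract, building n-gram candidates from unigrams up to five-word sequences and scoring extracted keywords with precision, recall, and F-measure, can be sketched as follows. This is only an illustration under stated assumptions: the function names `ngrams` and `precision_recall_f1` are hypothetical, tokenization is assumed to have been done already, and the sketch does not reproduce the FNG-IE graph ranking itself.

```python
from typing import Iterable, List, Tuple


def ngrams(tokens: List[str], n_min: int = 1, n_max: int = 5) -> List[Tuple[str, ...]]:
    """Return every contiguous n-gram with n_min <= n <= n_max.

    The default range (1 to 5) mirrors the unigram-to-pentagram range
    mentioned in the abstract; the function name is an assumption.
    """
    candidates = []
    for n in range(n_min, n_max + 1):
        for start in range(len(tokens) - n + 1):
            candidates.append(tuple(tokens[start:start + n]))
    return candidates


def precision_recall_f1(
    extracted: Iterable[str], gold: Iterable[str]
) -> Tuple[float, float, float]:
    """Compare extracted keywords with an expert-provided gold set.

    Matching here is exact string equality, which is an assumption;
    a real evaluation might normalize case or stem the terms first.
    """
    extracted_set, gold_set = set(extracted), set(gold)
    true_positives = len(extracted_set & gold_set)
    precision = true_positives / len(extracted_set) if extracted_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

For example, `ngrams("graph based keyword extraction".split(), 1, 2)` yields the four unigrams followed by the three bigrams, and the scoring function can then be applied to whichever candidates a ranking method selects.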
Monkeypox is an enveloped, double-stranded DNA virus that belongs to the Orthopoxvirus genus of the Poxviridae family. The virus can transmit from human to human through direct contact with respiratory secretions, infected animals and humans, or contaminated objects, and can cause mutations in the human body. In May 2022, monkeypox cases were reported in many countries. Because of its transmission characteristics, the WHO declared monkeypox a public health emergency of international concern on July 23, 2022. This study analyzes the gene mutation rate computed from the most recent NCBI monkeypox dataset. The collected data are prepared so that nucleotide and codon mutations can be identified independently. Additionally, depending on the size and availability of the gene dataset, the computed mutation rate is split into three categories: Canada, Germany, and the rest of the world. The genome mutation rate of the monkeypox virus is predicted using a deep learning-based Long Short-Term Memory (LSTM) model and compared with a Gated Recurrent Unit (GRU) model. The LSTM model yields Root Mean Square Error (RMSE) values of 0.09 and 0.08 for testing and training, respectively. Using this time series analysis method, the prospective mutation rate of the 50th patient has been predicted. Note that this is a new report on monkeypox gene mutation. It is found that the nucleotide mutation rates are decreasing and that the balance between the bi-directional rates is maintained.
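The RMSE figures reported here, and the idea of directed (bi-directional) nucleotide substitutions, can be made concrete with a small sketch. The abstract does not define how its mutation rate is computed, so the per-position substitution rate below is an assumption for illustration only, as are the function names `rmse`, `substitution_counts`, and `mutation_rate`; the sequences are assumed to be aligned and of equal length already.

```python
import math
from collections import Counter
from typing import Sequence


def rmse(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Root mean square error between observed and predicted values."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def substitution_counts(reference: str, sample: str) -> Counter:
    """Count directed substitutions between two aligned sequences.

    Keys are (reference_base, sample_base) pairs, so A->G and G->A are
    tallied separately. Positions with gaps or ambiguous bases are
    skipped, which is an assumption made for this sketch.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to equal length")
    bases = set("ACGT")
    return Counter(
        (r, s)
        for r, s in zip(reference.upper(), sample.upper())
        if r != s and r in bases and s in bases
    )


def mutation_rate(reference: str, sample: str) -> float:
    """Substitutions per aligned position (an assumed definition)."""
    if not reference:
        raise ValueError("reference sequence must not be empty")
    return sum(substitution_counts(reference, sample).values()) / len(reference)
```

Comparing the counts for a pair such as `("A", "G")` and `("G", "A")` across samples is one simple way to inspect whether opposite-direction substitutions stay in balance, and `rmse` can be applied to any series of observed versus model-predicted rates.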