A Laplacian scoring algorithm for gene selection and the Gini coefficient to identify the genes whose expression varied least across a large set of samples were the state-of-the-art methods used here. These methods have not been trialed for their feasibility in cheminformatics. This was a maiden attempt to investigate a complete comparative analysis of an anthraquinone and chalcone derivatives-based virtual combinatorial library. This computational "proof-of-concept" study illustrated the combinatorial approach used to explain how the structure of the selected natural products (NPs) undergoes molecular diversity analysis. A virtual combinatorial library (1.6 M) based on 20 anthraquinones and 24 chalcones was enumerated. The resulting compounds were optimized to the near drug-likeness properties, and the physicochemical descriptors were calculated for all datasets including FDA, Non-FDA, and NPs from ZINC 15. UMAP and PCA were applied to compare and represent the chemical space coverage of each dataset. Subsequently, the Laplacian score and Gini coefficient were applied to delineate feature selection and selectivity among properties, respectively. Finally, we demonstrated the diversity between the datasets by employing Murcko's and the central scaffolds systems, calculating three fingerprint descriptors and analyzing their diversity by PCA and SOM. The optimized enumeration resulted in 1,610,268 compounds with NP-Likeness, and synthetic feasibility mean scores close to FDA, Non-FDA, and NPs datasets. The overlap between the chemical space of the 1.6 M database was more prominent than with the NPs dataset. A Laplacian score prioritized NP-likeness and hydrogen bond acceptor properties (1.0 and 0.923), respectively, while the Gini coefficient showed that all properties have selective effects on datasets (0.81-0.93). Scaffold and fingerprint diversity indicated that the descending order for the tested datasets was FDA, Non-FDA, NPs and 1.6 M. Virtual combinatorial libraries based on NPs can be considered as a source of the combinatorial compound with NP-likeness properties. Furthermore, measuring molecular diversity is supposed to be performed by different methods to allow for comparison and better judgment.
In this study, palm oil mill effluent (POME) was solubilized by batch thermo-alkaline pre-treatments. A three-factor central composite design (CCD) was applied to identify the optimum COD solubilization condition. The individual and interactive effects of three factors, temperature, NaOH concentration and reaction time, on solubilization of POME were evaluated by employing response surface methodology (RSM). The experimental results showed that temperature, NaOH concentration and reaction time all had an individual significant effect on the solubilization of POME. But these three factors were independent, or there was insignificant interaction on the response. The maximum COD solubilization of 82.63% was estimated under the optimum condition at 32.5 degrees C, 8.83g/L of NaOH and 41.23h reaction time. The confirmation experiment of the predicted optimum conditions verified that the RSM with the central composite design was useful for optimizing the solubilization of POME.
An intelligent prediction system has been developed to discriminate drug-like and non drug-like molecules pattern. The system is constructed by using the application of advanced version of standard multilayer perceptron (MLP) neural network called Hybrid Multilayer Perceptron (HMLP) neural network and trained using Modified Recursive Prediction Error (MRPE) training algorithm. In this work, a well understood and easy excess Rule of Five + Veber filter properties are selected as the topological descriptor. The main idea behind the selection of this simple descriptor is to assure that the system could be used widely, beneficial and more advantageous regardless at all user level within a drug discovery organization.
A lipase catalysed enantioselective hydrolysis process under in situ racemization of the remaining (R)-ibuprofen ester substrate with sodium hydroxide as the catalyst was developed for the production of S-ibuprofen from (R,S)-ibuprofen ester in isooctane. Detailed investigations on parameters study indicated that 0.5 M NaOH, addition of 20% (v/v) co-solvent (dimethyl sulphoxide), operating temperature of 45 degrees C, and 40 mmol/L substrate gave 86% conversion and 99.4% optical purity of S-ibuprofen in dynamic kinetic resolution. Meanwhile, in common enzymatic kinetic resolution process, only 42% conversion of the racemate and 93% enantiomeric excess of the product was obtained which are of lower values as compared to dynamic kinetic resolution. The S-ibuprofen produced during each process was evaluated and approximately 50% increment in concentration of S-acid product was produced when dynamic kinetic resolution was applied into the process.