A proof-of-concept medicinal herbs identification scheme using machine learning classifiers is proposed in the form of an automated computational package. The scheme makes use of two-dimensional correlation Fourier Transformed Infrared (FTIR) fingerprinting maps derived from the FTIR of raw herb spectra as digital input. The prototype package admits a collection of 11 machine learning classifiers to form a voting pool. A common set of oversampled dataset containing 5 different herbal classes is used to train the pool of classifiers on a one-verses-others manner. The collections of trained models, dubbed the voting classifiers, are deployed in a collective manner to cast their votes to support or against a given inference fingerprint whether it belongs to a particular class. By collecting the votes casted by all voting classifiers, a logically designed scoring system will select out the most probable guess of the identity of the inference fingerprint. The same scoring system is also capable of discriminating an inference fingerprint that does not belong to any of the classes the voting classifiers are trained for as the 'others' type. The proposed classification scheme is stress-tested to evaluate its performance and expected consistency. Our experimental runs show that, by and large, a satisfactory performance of the classification scheme of up to 90 % accuracy is achieved, providing a proof-of-concept viability that the proposed scheme is a feasible, practical, and convenient tool for herbal classification. The scheme is implemented in the form of a packaged Python code, dubbed the "Collective Voting" (CV) package, which is easily scalable, maintained and used in practice.
* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.