Introduction

nCoronavirus (2019-nCoV) breakout took place in December 2019 in Wuhan city of China. Initially, a lot of cases of unknown etiology regarding pneumonia were reported. Reported patients worked or lived near the local Huanan seafood wholesales market. Associated symptoms are acute respiratory infection and, in some patients, rapidly developing acute respiratory distress syndromes (ARDS), severe acute respiratory syndromes (SARS), Middle East Respiratory Syndrome (MERS), acute respiratory failure, and other serious complication. Mild symptoms are seen in most patients with good prognosis. A lot of casualties are reported of those patients having symptoms of severe pneumonia, pulmonary edema, acute respiratory distress syndrome, or multiple organ failure. Chinese Center for Disease Control and Prevention (CDC) identified novel coronavirus on January 7, from throat swab sample of a patient. World Health Organization (WHO) named this virus as 2019-nCoV. Currently, epidemiological and clinical attributes of 2019-nCoV are freighting causing alarming situation globally [1].

Coronavirus is classified into α (including human alphacoronavirus 229E, NL63), β (including beta-coronavirus OC43, and HKU1), γ, and δ genera. These viruses are identified in a wide range of animal species. In humans, there are six previously reported human coronaviruses that can be transmitted between humans, which are responsible for causing mild upper respiratory disorders. SARS having 10% fatality rate and MERS with 35% fatality rate are also beta-categorized coronavirus and are highly pathogenic in humans. Both viruses are of zoonotic origin [2,3,4]. SARS outbreak took place in 2003, transmitted through mammals into humans especially through bats [5]. Novel coronavirus 2019-nCoV contains Protease Mpro encoded by RNA of the virus, initially reported in 2019 in the city of China, Wuhan. From Wuhan, seafood market pneumonia virus’ genome sequence was determined (NCBI genome ID MN908947, GenBank MN908947.3) published by Wu Et al. (LOCUS MN90894, 23-JAN-2020) by multiple sequence alignments with known SARS proteases. It showed similarity with SARS crystal structure with the highest accuracy [6]. Mpro is considered a potent target for developing viral inhibitor drugs toward coronavirus [7].

Coronavirus is spreading very quickly; it would be a more reasonable and attractive strategy to develop wide-spectrum inhibitory drugs against this virus, instead of following individual strategy for drug designing. This type of drug development would provide the first line of defense against future emerging CoV-associated ailments like SARS. Development of wide-spectrum drugs requires possible conserved target sequence within whole genus coronavirus. For identifying the possible potent target, extensive research was done and Mpro (molecular weight 34 kDa) was identified as targeted protease, overall controlling RNA replication and transcription. Mpro is the main CoV protease, sharing highly conservative substrate-recognition pocket by comparing four crystal structures. Homology model also represents all three genetic clusters of genus coronavirus. CoV genome sequence mutates with high frequency [8]. For all known RNA viruses, coronaviruses have the largest genome ranges from 26 to 32 kb in length. Apart from encoding structural proteins, the major portion of the viral genome is transcribed and translated into a polypeptide that ultimately encodes for those essential proteins involving in viral reapplication and genome expression. The ~ 306 aa length protease is also encoded by polypeptide, and this polypeptide is processed finally into a functional protein. Picornavirus and Mpro shares similar cleavage-site specificity, so this Mpro is also known as 3C-like protease (3CLpro). Extensive research showed that Mpro from different coronaviruses is highly conserved in terms of 3D structure and structure, which is why Mpro is a potential target for designing anti-coronaviral drug [9].

Bioinformatics that is an interdisciplinary field of mathematics, science, and computer science provides very meaningful results regarding the analysis of exome sequencing [10]. Computational tools make sure to assess users from data storage to data retrieval, data analysis, its annotation, and ultimately provide visualization of results for the understanding of biological system thoroughly [11,12,13,14]. In silico methods use computational approaches that are cost-effective and are predictive methods for chemical compounds before carrying out a scientific laboratory experiment [15]. Computational tools have a lot of worth today [16,17,18,19,20]. Meaningful outcomes coming from these computational tools give us basic research in the biomedical regime. With time, more and more databases are being included in the study [21]. In this study, computational-based approaches are utilized to identify the potent inhibiting candidates of Mpro of 2019-nCoV. Through molecular docking and DFT-based computations, reactivity and binding of compounds are analyzed with Mpro of 2019-nCoV, while ADMET properties are computed to represent their suitability for human administration.

Material and methods

The overall flow of methodology opted in this study is shown in Fig. 1.

Fig. 1
figure 1

Flowchart of methodology

Ligands and receptor collection

A vast range of phytochemicals having alkaloid, flavonoid, terpenoid, and sterol nature was gathered through extensive study of scientific literature. A total of 2750 plant-extracted compounds were collected, with previously reported biological and pharmacological characteristics, inhabited locally in Pakistan and India (sub-continent region). PubChem database was used to retrieve 2D and 3D structures of phytochemicals. For further use, phytochemical files were converted into PDB format by using Discovery Studio [22]. Mpro (PDB ID 6LU7), PDB file of this protein, was downloaded from RCSB Protein Data Bank. For 3D visualization of protease Mpro and its preprocessing (removal of water and ligands), UCSF Chimera software was used [23].

Predicting drug-likeness and ADMET properties

ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties of selected phytochemicals were studied through PreADMET server for screening to select those lead compounds having drug-like properties against 2019-nCoV. Lipinski’s rule of five was used to estimate the drug-likeness properties of selected phytochemicals. Pharmacokinetics and pharmacodynamics characteristics of various phytochemicals are analyzed by evaluating their drug-likeness characteristics. The following set of criteria was used for compound filtering: Lipinski’s violation = 0, solubility = high, GI absorption = high or moderate, blood-brain barrier permeability = no, and toxicity = zero/nil.

Molecular docking and binding energy values evaluation

Molecular docking of protease Mpro (6LU7) was done against selected 108 phytochemicals that were extracted from various plant species using computational tools like AutoDock and AutoDock Vina. AutoDock tool combines ligand and receptor’s interaction information by providing meaningful results. To perform docking, PDBQT files for both phytochemicals and protease Mpro are made through AutoDock tool. This format is required to run Vina command, and finally, binding affinity values of selected phytochemicals against the receptor (protein) is attained, only compounds with the highest binding affinity values are selected for DFT analysis. The grid size of Mpro was 40 × 44 × 62 Ǻ with dimensions x = − 14.152, y = 13.429, and z = 67.842.

DFT analysis and study of reactivity

To check out the reactivity of selected plant extracted compounds, density-functional theory (DFT)-based computations were performed against receptor protein. It is performed for computation of quantum mechanical calculations of most reactive and active compound [24] DFT by applying Becke, 3-parameter, which uses the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO) [25]. B3LYP functional method was applied for DFT calculations, while for calculating band energy gap, (ΔE) expression of ELUMO–EHOMO was used [6]. For all DFT-based computations, the ORCA program was used for energy calculations.

Results

ADMET results

Phytochemicals were screened to analyze the ADMET properties. ADMET analysis is primarily carried out to identify the potentials of these compounds to be used as a human drug in the future during clinical experiments. The detailed ADMET analysis of the 2750 phytochemicals, in which, a total of 1061 phytochemicals did not follow the Lipinski’s rule. The remaining 1689 compounds were further analyzed using the approach BBB (blood-brain barrier) permeability. Out of 1689 compounds, 756 phytochemicals exhibited the properties of non-BBB permeability. In these compounds, a total of 650 compounds showed high GI absorption and solubility characteristics. The toxicity, as well as carcinogenicity properties of the phytochemicals, was studied. After a thorough ADMET analysis, a total of 108 compounds were screened out in the end based on their drug-likeness properties. The docking of 108 phytochemicals was then performed to study the binding properties of ligands and the target protein.

Molecular docking of phytochemicals against Mpro of 2019-nCoV

After the screening of a vast variety of phytochemicals, 108 phytochemicals were docked against the target protein named Mpro of 2019-nCoV. The binding properties were calculated using AutoDock tool and AutoDock Vina. All compounds showed the binding potential and interact with the Mpro. A cut-off value was applied, and the compounds having binding affinity ≥ − 6.5 kcal/mol values were selected. Out of 108 phytochemicals, 60 compounds were obtained after applying the cut-off value. The compounds named Robustone and SchizolaenoneB having binding affinity value of − 7.8 kcal/mol (Ki = 1.889 μM) exhibited strong potential as inhibitors of Mpro activity. This was followed by Osajin and IsosilybinA which exhibited the binding affinity value of − 7.7 kcal/mol (Ki = 2.237 μM). All selected compounds with binding affinity values ranging from − 7.8 to − 6.5 kcal/mol have shown enormous potential to be used as inhibitors against the Mpro of the recent strain of coronavirus (Table 1). Diagrams of the interaction of phytochemicals were studied through discovery studio, as shown in Fig. 2.

Table 1 Binding affinity values after applying cut off of − 6.5 kcal/mol for phytochemicals docked against Mpro of 2019-nCoV
Fig. 2
figure 2

Interactions of phytochemicals within the binding pocket of Mpro of 2019-nCoV a Robustone, b SchizolaenoneB, c Osajin, d IsosilybinA, and e SilybinA

DFT (density-functional theory) analysis

DFT analysis was performed to analyze the reactivity characteristics of the phytochemicals. The complexes were analyzed for reactivity based on the band energy gaps. Among all the analyzed complexes, the ligands showed various LUMO and HOMO energies, thus, the band energy gaps also differed. However, minimum band energy gaps were observed for Robustone (0.112 kcal/mol), SchizolaenoneB (0.112 kcal/mol), Osajin (0.113 kcal/mol), IsosilybinA (0.114 kcal/mol), SilybinA (0.115 kcal/mol), Silydianin (0.117 kcal/mol), and Isopomiferin (0.118 kcal/mol). Therefore, these results show that the aforementioned showed the highest reactivity with the Mpro of 2019-nCoV. Comprehensive results are shown in Table 2.

Table 2 Results of DFT-based band-energy-gaps computations

Discussion

Coronaviruses are the pathogenic viruses belonging to coronaviridae family. They are enveloped positive stranded RNA viruses. The name Corona has been given based on spike-like protein protruding out from the viral envelopes, thus making a corona-like shape [26]. They exhibit a wide spectrum of life-threatening symptoms in patients, predominantly affecting the respiratory and gastrointestinal part. There are six different types of the strain of coronavirus which affect the humans causing complications ranging from flu to severe acute respiratory syndrome (SARS) and MERS [27]. Recently, a novel strain of coronavirus named 2019-nCoV is causing havoc around the world. There are no vaccines, therapy, or drug that has been approved effective until now against this deadly strain of the virus. To avoid a dire consequence, there is an instant need to cope up the situation by identifying potential inhibitors against 2019-nCoV. This study was preliminary carried out to screen effective inhibitors against the Mpro of 2019-nCoV.

In the recent study, 2750 phytochemicals were collected from various literature sites, articles, and online databases based on their massive medicinal characteristics against a vast range of diseases and health complications. The compounds were screened through ADMET analysis based on their drug-likeness properties. The PreADMET server was used which is an online tool for studying the drug-likeness properties of the different compounds. A total of 108 phytochemicals were eventually obtained which further follow through docking and DFT analysis.

Molecular docking provides a paramount source for analyzing the binding properties of compounds based on the computer-aided analysis. The docking of 108 compounds was performed against the target protein Mpro of 2019-nCoV. The AutoDock tool and AutoDock Vina were used for docking purpose. All the compounds exhibited great binding properties with Mpro. A total of 60 compounds were selected after applying cut-off value of ≥ − 6.5 kcal/mol. These compounds reported in this study exhibit great potential as future drugs against Mpro of 2019-nCoV. The compound named, Robustone, is a flavonoid compound which has been previously reported as an antioxidant [28], inhibitor against the protease of dengue virus [29], exhibited great binding affinity properties against Mpro of 2019-nCoV. Lycopene has been known to be an effective antioxidant [30] as well as an anti-proliferative compound against several cancers [31]. Isopomiferin which is a derivative of plant-based compound pomiferin has been known to exhibit anti-microbial properties, anti-inflammatory agent, effective agent against diabetes and a potent antioxidant (Table 1) [32, 33].

Density functional theory is a computer-based approach with procuring immense popularity in the field of in silico analysis. The analysis of the electronic characteristics of the compounds plays an integral role in understanding their pharmacological properties. In DFT analysis, HOMO and LUMO energies of the compounds were calculated by using computer-based program ORCA. The gap energies exhibiting the molecular reactivity of the compounds were calculated. As the gap energy increases, it leads to compound with decrease reactivity and vice versa [34]. Herein, Robustone, SchizolaenoneB, Osajin, IsosilybinA, SilybinA, Silydianin, and Isopomiferin showed the highest reactivity by depicting minimal band energy gaps. These results were also correlated with the binding affinity and such correlation has already been proved and reported by Hussain et al. [35].

Conclusion

Coronaviruses are preliminary zoonotic viruses leading to SARS and pneumonia-like health complications in humans. With the growing recent unrestrained cases of the new strain of 2019-nCoV, there is an immediate need to find effective drugs against this life-threatening viral strain. This work was carried out to analyze the medicinal properties of the wide range of plant-based compounds against the target protein Mpro of 2019-nCoV. ADMET properties were studied to check the compound’s efficacy for human usage in the future, all the phytochemicals exhibited good drug-like properties. Docking was performed to analyze the binding interaction of the ligands (phytochemicals) with the target protein, the selected compounds showed magnificent binding interaction with the target protein. DFT analysis of the phytochemicals also showed satisfactory outcomes. Thus, these compounds can be analyzed through the experiment in the future clinical trials of drugs against 2019-nCoV.