Commit c5fd28c

Update examples and README
1 parent aa1c2fa commit c5fd28c

5 files changed: +43 -40 lines changed

README.md

+7-4
Original file line number | Diff line number | Diff line change
@@ -1,20 +1,23 @@
11
# MONSTROUS
22

3+
MOlecular traNSporT inhibitoR and substrate predictOr Utility Server (MONSTROUS) is a computational transporter profiler that predicts the potential of a chemical to interact with transporters recommended for testing in drug development by regulatory agencies. Currently, these transporters are considered to be a major player in determining the safety and efficacy of drugs. MONSTROUS utilizes either graph convolutional neural networks or similarity-based cheminformatics approaches to screen query chemicals against 12 transporters widely expressed in various tissues, including liver, brain, and kidney, and makes predictions as to their potential to be inhibitors as well as substrates.
4+
35
##### Supporting information for paper:
46
Title:
57
Authors:
68
Journal:
79
### Intro
8-
This repository contains the data and models used to make MONSTROUS's predictions and has sorted this data into 3 sections:
10+
This repository contains the data and models used to make MONSTROUS's predictions and has sorted this data into 4 sections:
911
- A GCNN folder containing the data for our GCNN transporters. This data includes csv files containing lists of compounds for each GCNN transporter as well as their GCNN models.
1012
- A Similarity Approach folder containing the data for our non-GCNN transporters. This data includes csv files containing lists of compounds for each transporter.
1113
- A python folder containing the code that will run the MONSTROUS command line tool.
14+
- An examples folder containing an example input as well as output files for each output format the tool supports.
1215

1316
### GCNN
14-
The GCNN folder contains two subfolders: compounds and models. In the compounds folder are CSV files for each transporter, containing a list of compounds that are _____ for the given transporter. These compounds are used in generating the applicability domain for that transporter. In the models folder we hold the models for each GCNN transporter. These models are used to generate the values for GCNN transporters
17+
The GCNN folder contains two subfolders: compounds and models. In the compounds folder are CSV files for each transport protein, containing a list of reference compounds that are known inhibitors or substrates for the given transporter. These compounds are used in generating the applicability domain for that transporter. In the models folder we hold the models for each GCNN transporter. These models are used to generate the values for GCNN transporters
1518

1619
### Similarity Approach
17-
The similarity approach folder contains CSV files for each transporter, containing a list of compounds that are _____ for the given transporter. These compounds are used in generating the applicability domain for that transporter, as well as generating the values for similarity approach transporters
20+
The similarity approach folder contains CSV files for each transport protein, containing a list of reference compounds that are known inhibitors or substrates for the given transporter. These compounds are used in generating the applicability domain for that transporter, as well as generating the values for similarity approach transporters
1821

1922
## MONSTROUS Command Line Tool
2023

@@ -31,7 +34,7 @@ Next, navigate to this repository's folder and enter the following command to in
3134

3235
### Running the MONSTROUS command line tool
3336

34-
Once everything is installed, you can then run the script by running `python python/monstrous_clt.py` followed by any of the following tags (and must include the `-i` , input file, tag):
37+
Once everything is installed, you can then run the script by running `python python/monstrous_clt.py` followed by any of the following tags (and must include the `-i` , input file and tag):
3538
- `-h` or `--help`: Shows a help message explaining these tags.
3639
- `-i [INPUT]` or `--input [INPUT]`: The file location of a .CSV file whose first column is 'Name' and whose second is 'SMILES' and contains the list of SMILES to be submitted.
3740
- `-o [OUTPUT]` or `--output [OUTPUT]`: The output file path
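The input format and invocation described in the README lines above can be sketched as follows. This is a minimal illustration, not part of the commit: the helper names `write_input_csv` and `build_command` and the file names `query.csv` and `predictions.csv` are hypothetical, and the sketch only assembles the command rather than running the tool. It uses only the `-i` and `-o` tags listed above.

```python
import csv
import sys


def write_input_csv(path, compounds):
    """Write (name, SMILES) pairs with the 'Name' and 'SMILES' header the README asks for."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["Name", "SMILES"])  # first column 'Name', second 'SMILES'
        writer.writerows(compounds)


def build_command(input_path, output_path):
    """Assemble the command line from the README; the paths are supplied by the caller."""
    return [
        sys.executable,
        "python/monstrous_clt.py",
        "-i", str(input_path),
        "-o", str(output_path),
    ]


if __name__ == "__main__":
    # Hypothetical file names, chosen only for this example.
    write_input_csv("query.csv", [("Metformin", "CN(C)C(=N)N=C(N)N")])
    print(" ".join(build_command("query.csv", "predictions.csv")))
```

The resulting list can be handed to `subprocess.run` from the repository root once the dependencies are installed.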
+4-4
@@ -1,16 +1,16 @@
11
Name,SMILES (original),SMILES (standardized),BCRP_inhibitor,BSEP_inhibitor,MRP1_inhibitor,OATP1B1_inhibitor,OATP1B3_inhibitor,PGP_inhibitor,MATE1_inhibitor,MATE2_inhibitor,OAT1_inhibitor,OAT3_inhibitor,OCT2_inhibitor,MRP2_inhibitor
22
Verapamil,COC1=C(OC)C=C(CCN(C)CCCC(C#N)(C(C)C)C2=CC(OC)=C(OC)C=C2)C=C1,COc1ccc(CCN(C)CCCC(C#N)(c2ccc(OC)c(OC)c2)C(C)C)cc1OC,True,False,True,False,False,True,False,False,False,False,False,False
3-
Lopinavir,CC1=C(C(=CC=C1)C)OCC(=O)NC(CC2=CC=CC=C2)C(CC(CC3=CC=CC=C3)NC(=O)C(C(C)C)N4CCCNC4=O)O,Cc1cccc(C)c1OCC(=O)NC(Cc1ccccc1)C(O)CC(Cc1ccccc1)NC(=O)C(C(C)C)N1CCCNC1=O,False,False,True,True,True,True,False,False,False,False,False,False
3+
Lopinavir,CC1=C(C(=CC=C1)C)OCC(=O)NC(CC2=CC=CC=C2)C(CC(CC3=CC=CC=C3)NC(=O)C(C(C)C)N4CCCNC4=O)O,Cc1cccc(C)c1OCC(=O)NC(Cc1ccccc1)C(O)CC(Cc1ccccc1)NC(=O)C(C(C)C)N1CCCNC1=O,False,False,True,True,True,True,True,True,False,False,True,False
44
Probenecid,CCCN(CCC)S(=O)(=O)C1=CC=C(C=C1)C(=O)O,CCCN(CCC)S(=O)(=O)c1ccc(C(=O)O)cc1,False,False,True,False,False,False,False,False,True,True,False,False
5-
Saquinavir,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(C(CC3=CC=CC=C3)NC(=O)C(CC(=O)N)NC(=O)C4=NC5=CC=CC=C5C=C4)O,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(O)C(Cc1ccccc1)NC(=O)C(CC(N)=O)NC(=O)c1ccc2ccccc2n1,False,False,True,False,False,True,False,False,False,False,False,False
5+
Saquinavir,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(C(CC3=CC=CC=C3)NC(=O)C(CC(=O)N)NC(=O)C4=NC5=CC=CC=C5C=C4)O,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(O)C(Cc1ccccc1)NC(=O)C(CC(N)=O)NC(=O)c1ccc2ccccc2n1,False,False,True,False,False,True,True,True,False,False,False,False
66
Bromosulphophthalein,[O-]S(=O)(=O)c1c(O)ccc(c1)C3(OC(=O)c2c(Br)c(Br)c(Br)c(Br)c23)c4ccc(O)c(c4)S([O-])(=O)=O,O=C1OC(c2ccc(O)c(S(=O)(=O)O)c2)(c2ccc(O)c(S(=O)(=O)O)c2)c2c(Br)c(Br)c(Br)c(Br)c21,True,False,True,True,True,False,False,False,False,False,False,False
77
Methotrexate,CN(CC1=CN=C2C(=N1)C(=NC(=N2)N)N)C3=CC=C(C=C3)C(=O)NC(CCC(=O)O)C(=O)O,CN(Cc1cnc2nc(N)nc(N)c2n1)c1ccc(C(=O)NC(CCC(=O)O)C(=O)O)cc1,False,False,True,False,False,False,False,False,False,False,False,False
88
Furosemide,C1=COC(=C1)CNC2=CC(=C(C=C2C(=O)O)S(=O)(=O)N)Cl,NS(=O)(=O)c1cc(C(=O)O)c(NCc2ccco2)cc1Cl,False,False,True,False,False,False,False,False,False,False,False,False
99
Metformin,CN(C)C(=N)N=C(N)N,CN(C)C(=N)N=C(N)N,False,False,True,False,False,False,False,False,False,False,False,False
1010
Cimetidine,CC1=C(N=CN1)CSCCNC(=NC)NC#N,CN=C(NC#N)NCCSCc1nc[nH]c1C,False,False,True,False,False,False,True,False,False,False,True,False
11-
Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,False,False,False,False,False,False,False,False,False,False,False,False
11+
Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,False,False,False,False,False,False,False,False,True,True,False,False
1212
Oestrone,CC12CCC3C(C1CCC2=O)CCC4=C3C=CC(=C4)O,CC12CCC3c4ccc(O)cc4CCC3C1CCC2=O,False,False,True,False,False,False,True,True,False,False,True,False
13-
Pravastatin,CCC(C)C(=O)OC1CC(C=C2C1C(C(C=C2)C)CCC(CC(CC(=O)O)O)O)O,CCC(C)C(=O)OC1CC(O)C=C2C=CC(C)C(CCC(O)CC(O)CC(=O)O)C21,False,False,True,True,True,False,False,False,False,True,False,False
13+
Pravastatin,CCC(C)C(=O)OC1CC(C=C2C1C(C(C=C2)C)CCC(CC(CC(=O)O)O)O)O,CCC(C)C(=O)OC1CC(O)C=C2C=CC(C)C(CCC(O)CC(O)CC(=O)O)C21,False,False,True,True,True,False,False,False,False,True,False,True
1414
Delaviridine,CC(C)NC1=C(N=CC=C1)N1CCN(CC1)C(=O)C1=CC2=C(N1)C=CC(NS(C)(=O)=O)=C2,CC(C)Nc1cccnc1N1CCN(C(=O)c2cc3cc(NS(C)(=O)=O)ccc3[nH]2)CC1,True,False,True,False,False,True,False,False,False,False,False,False
1515
Loperamide,CN(C)C(=O)C(CCN1CCC(O)(CC1)C1=CC=C(Cl)C=C1)(C1=CC=CC=C1)C1=CC=CC=C1,CN(C)C(=O)C(CCN1CCC(O)(c2ccc(Cl)cc2)CC1)(c1ccccc1)c1ccccc1,False,False,True,False,False,True,False,False,False,False,False,False
1616
Rifampicin,CC1C=CC=C(C(=O)NC2=C(C(=C3C(=C2O)C(=C(C4=C3C(=O)C(O4)(OC=CC(C(C(C(C(C(C1O)C)O)C)OC(=O)C)C)OC)C)C)O)O)C=NN5CCN(CC5)C)C,COC1C=COC2(C)Oc3c(C)c(O)c4c(O)c(c(C=NN5CCN(C)CC5)c(O)c4c3C2=O)NC(=O)C(C)=CC=CC(C)C(O)C(C)C(O)C(C)C(OC(C)=O)C1C,False,False,True,False,True,False,False,False,False,False,False,True
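To see which predictions a changed row actually flips, the removed and added versions of a row can be compared column by column. The sketch below does this for the Procainamide row of the inhibitor file shown above; the function name `changed_columns` is hypothetical and the comparison is an illustration, not part of the repository.

```python
import csv


def changed_columns(header_line, old_line, new_line):
    """Return {column: (old_value, new_value)} for every column whose value differs."""
    header, old, new = (
        next(csv.reader([line])) for line in (header_line, old_line, new_line)
    )
    return {name: (o, n) for name, o, n in zip(header, old, new) if o != n}


HEADER = (
    "Name,SMILES (original),SMILES (standardized),BCRP_inhibitor,BSEP_inhibitor,"
    "MRP1_inhibitor,OATP1B1_inhibitor,OATP1B3_inhibitor,PGP_inhibitor,MATE1_inhibitor,"
    "MATE2_inhibitor,OAT1_inhibitor,OAT3_inhibitor,OCT2_inhibitor,MRP2_inhibitor"
)
# Removed (-) and added (+) versions of the Procainamide row from the diff above.
OLD = (
    "Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,"
    "False,False,False,False,False,False,False,False,False,False,False,False"
)
NEW = (
    "Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,"
    "False,False,False,False,False,False,False,False,True,True,False,False"
)

if __name__ == "__main__":
    for column, (before, after) in changed_columns(HEADER, OLD, NEW).items():
        print(f"{column}: {before} -> {after}")
    # OAT1_inhibitor: False -> True
    # OAT3_inhibitor: False -> True
```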
+8-8
@@ -1,16 +1,16 @@
11
Name,SMILES (original),SMILES (standardized),BCRP_substrate,MRP1_substrate,PGP_substrate,BSEP_substrate,MATE1_substrate,MATE2_substrate,OAT1_substrate,OAT3_substrate,OATP1B1_substrate,OATP1B3_substrate,OCT2_substrate,MRP2_substrate
22
Verapamil,COC1=C(OC)C=C(CCN(C)CCCC(C#N)(C(C)C)C2=CC(OC)=C(OC)C=C2)C=C1,COc1ccc(CCN(C)CCCC(C#N)(c2ccc(OC)c(OC)c2)C(C)C)cc1OC,False,True,True,False,False,False,False,False,False,False,False,False
3-
Lopinavir,CC1=C(C(=CC=C1)C)OCC(=O)NC(CC2=CC=CC=C2)C(CC(CC3=CC=CC=C3)NC(=O)C(C(C)C)N4CCCNC4=O)O,Cc1cccc(C)c1OCC(=O)NC(Cc1ccccc1)C(O)CC(Cc1ccccc1)NC(=O)C(C(C)C)N1CCCNC1=O,False,False,True,False,False,False,False,False,False,False,False,True
4-
Probenecid,CCCN(CCC)S(=O)(=O)C1=CC=C(C=C1)C(=O)O,CCCN(CCC)S(=O)(=O)c1ccc(C(=O)O)cc1,True,False,False,False,False,False,False,False,False,False,False,True
3+
Lopinavir,CC1=C(C(=CC=C1)C)OCC(=O)NC(CC2=CC=CC=C2)C(CC(CC3=CC=CC=C3)NC(=O)C(C(C)C)N4CCCNC4=O)O,Cc1cccc(C)c1OCC(=O)NC(Cc1ccccc1)C(O)CC(Cc1ccccc1)NC(=O)C(C(C)C)N1CCCNC1=O,False,False,True,True,False,False,False,False,False,False,False,True
4+
Probenecid,CCCN(CCC)S(=O)(=O)C1=CC=C(C=C1)C(=O)O,CCCN(CCC)S(=O)(=O)c1ccc(C(=O)O)cc1,True,False,False,False,True,True,False,True,False,False,False,True
55
Saquinavir,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(C(CC3=CC=CC=C3)NC(=O)C(CC(=O)N)NC(=O)C4=NC5=CC=CC=C5C=C4)O,CC(C)(C)NC(=O)C1CC2CCCCC2CN1CC(O)C(Cc1ccccc1)NC(=O)C(CC(N)=O)NC(=O)c1ccc2ccccc2n1,False,False,True,True,False,False,False,False,False,True,False,True
66
Bromosulphophthalein,[O-]S(=O)(=O)c1c(O)ccc(c1)C3(OC(=O)c2c(Br)c(Br)c(Br)c(Br)c23)c4ccc(O)c(c4)S([O-])(=O)=O,O=C1OC(c2ccc(O)c(S(=O)(=O)O)c2)(c2ccc(O)c(S(=O)(=O)O)c2)c2c(Br)c(Br)c(Br)c(Br)c21,True,True,False,False,False,False,False,False,True,True,False,True
77
Methotrexate,CN(CC1=CN=C2C(=N1)C(=NC(=N2)N)N)C3=CC=C(C=C3)C(=O)NC(CCC(=O)O)C(=O)O,CN(Cc1cnc2nc(N)nc(N)c2n1)c1ccc(C(=O)NC(CCC(=O)O)C(=O)O)cc1,True,False,False,False,False,False,False,True,False,False,False,True
8-
Furosemide,C1=COC(=C1)CNC2=CC(=C(C=C2C(=O)O)S(=O)(=O)N)Cl,NS(=O)(=O)c1cc(C(=O)O)c(NCc2ccco2)cc1Cl,True,False,False,False,False,False,False,False,False,False,False,False
9-
Metformin,CN(C)C(=N)N=C(N)N,CN(C)C(=N)N=C(N)N,True,True,False,False,False,False,False,False,False,False,False,False
8+
Furosemide,C1=COC(=C1)CNC2=CC(=C(C=C2C(=O)O)S(=O)(=O)N)Cl,NS(=O)(=O)c1cc(C(=O)O)c(NCc2ccco2)cc1Cl,True,False,False,False,False,False,False,True,False,False,False,False
9+
Metformin,CN(C)C(=N)N=C(N)N,CN(C)C(=N)N=C(N)N,True,True,False,False,True,True,False,False,False,False,True,False
1010
Cimetidine,CC1=C(N=CN1)CSCCNC(=NC)NC#N,CN=C(NC#N)NCCSCc1nc[nH]c1C,True,False,False,False,True,True,True,True,False,False,True,False
11-
Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,False,False,True,False,True,True,False,False,False,False,False,False
12-
Oestrone,CC12CCC3C(C1CCC2=O)CCC4=C3C=CC(=C4)O,CC12CCC3c4ccc(O)cc4CCC3C1CCC2=O,True,False,False,False,True,True,False,True,True,False,False,True
13-
Pravastatin,CCC(C)C(=O)OC1CC(C=C2C1C(C(C=C2)C)CCC(CC(CC(=O)O)O)O)O,CCC(C)C(=O)OC1CC(O)C=C2C=CC(C)C(CCC(O)CC(O)CC(=O)O)C21,True,True,False,True,False,False,False,True,False,True,False,True
11+
Procainamide,CCN(CC)CCNC(=O)C1=CC=C(C=C1)N,CCN(CC)CCNC(=O)c1ccc(N)cc1,False,False,True,False,True,True,True,True,False,False,False,True
12+
Oestrone,CC12CCC3C(C1CCC2=O)CCC4=C3C=CC(=C4)O,CC12CCC3c4ccc(O)cc4CCC3C1CCC2=O,True,False,False,True,True,True,False,True,True,False,False,True
13+
Pravastatin,CCC(C)C(=O)OC1CC(C=C2C1C(C(C=C2)C)CCC(CC(CC(=O)O)O)O)O,CCC(C)C(=O)OC1CC(O)C=C2C=CC(C)C(CCC(O)CC(O)CC(=O)O)C21,True,True,False,True,False,False,False,True,True,True,False,True
1414
Delaviridine,CC(C)NC1=C(N=CC=C1)N1CCN(CC1)C(=O)C1=CC2=C(N1)C=CC(NS(C)(=O)=O)=C2,CC(C)Nc1cccnc1N1CCN(C(=O)c2cc3cc(NS(C)(=O)=O)ccc3[nH]2)CC1,True,False,True,False,False,False,False,False,False,False,False,False
15-
Loperamide,CN(C)C(=O)C(CCN1CCC(O)(CC1)C1=CC=C(Cl)C=C1)(C1=CC=CC=C1)C1=CC=CC=C1,CN(C)C(=O)C(CCN1CCC(O)(c2ccc(Cl)cc2)CC1)(c1ccccc1)c1ccccc1,False,False,True,False,False,False,False,False,False,False,False,False
15+
Loperamide,CN(C)C(=O)C(CCN1CCC(O)(CC1)C1=CC=C(Cl)C=C1)(C1=CC=CC=C1)C1=CC=CC=C1,CN(C)C(=O)C(CCN1CCC(O)(c2ccc(Cl)cc2)CC1)(c1ccccc1)c1ccccc1,False,False,True,False,False,False,False,True,True,False,False,False
1616
Rifampicin,CC1C=CC=C(C(=O)NC2=C(C(=C3C(=C2O)C(=C(C4=C3C(=O)C(O4)(OC=CC(C(C(C(C(C(C1O)C)O)C)OC(=O)C)C)OC)C)C)O)O)C=NN5CCN(CC5)C)C,COC1C=COC2(C)Oc3c(C)c(O)c4c(O)c(c(C=NN5CCN(C)CC5)c(O)c4c3C2=O)NC(=O)C(C)=CC=CC(C)C(O)C(C)C(O)C(C)C(OC(C)=O)C1C,True,True,True,False,False,False,False,False,True,True,False,True
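The example output files above hold one `True`/`False` column per transporter, named `<TRANSPORTER>_inhibitor` or `<TRANSPORTER>_substrate`. A minimal sketch of reading such a file and listing the transporters predicted positive for each compound is shown below. The function name `positive_predictions` is hypothetical; the two sample rows are the Metformin and Cimetidine rows from the updated substrate file above.

```python
import csv
import io


def positive_predictions(csv_text, suffix):
    """Map each compound name to the transporters whose '<name><suffix>' column is 'True'."""
    reader = csv.DictReader(io.StringIO(csv_text))
    result = {}
    for row in reader:
        result[row["Name"]] = [
            column[: -len(suffix)]
            for column, value in row.items()
            if column.endswith(suffix) and value == "True"
        ]
    return result


SAMPLE = "\n".join([
    "Name,SMILES (original),SMILES (standardized),BCRP_substrate,MRP1_substrate,"
    "PGP_substrate,BSEP_substrate,MATE1_substrate,MATE2_substrate,OAT1_substrate,"
    "OAT3_substrate,OATP1B1_substrate,OATP1B3_substrate,OCT2_substrate,MRP2_substrate",
    "Metformin,CN(C)C(=N)N=C(N)N,CN(C)C(=N)N=C(N)N,"
    "True,True,False,False,True,True,False,False,False,False,True,False",
    "Cimetidine,CC1=C(N=CN1)CSCCNC(=NC)NC#N,CN=C(NC#N)NCCSCc1nc[nH]c1C,"
    "True,False,False,False,True,True,True,True,False,False,True,False",
])

if __name__ == "__main__":
    for name, transporters in positive_predictions(SAMPLE, "_substrate").items():
        print(name, transporters)
    # Metformin ['BCRP', 'MRP1', 'MATE1', 'MATE2', 'OCT2']
    # Cimetidine ['BCRP', 'MATE1', 'MATE2', 'OAT1', 'OAT3', 'OCT2']
```

Passing `"_inhibitor"` as the suffix applies the same function to the inhibitor file.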
