"everything" subset (# 10) - 5627809 entries

General Information

Selection criteria:
Created_by jji@cgl.ucsf.edu, last updated 2007-03-06
Note: Purchasable and non-purchasable
Flags: nuisance: undefined, vernalis: undefined, asymmetric atoms: undefined, asymmetric double bonds:undefined, alt nuisance: undefined

Contents: General Information   Property Distributions   Clustering and Diversity   Downloads

Property Distributions

Clustering and Diversity

We sort the ligands by molecular weight as a proxy for complexity. We then use the algorithm of Bienfait to incrementally select compounds that differ from all previous by the given Tanimoto cutoff. This is a very cheap clustering technique that scales well with set size and gives some indication of the amount of chemical redundancy in the dataset at various Tanimoto levels. N/A indicates that clustering is pending.

All5627809
90%N/A
80%N/A
70%N/A
60%N/A

Downloads

Molecules are available in four formats: isomeric SMILES, mol2, SDF and flexibase. Molecules are represented as a single pH=7 form. Additional representations (protonation variants and tautomers) are available in three incremental subsets to augment the single representative: medium pH (5.75 to 8.25), high pH 7.0-9.5 (e.g. for docking to metals), and low pH 4.5-7.0 (e.g. for docking to a positively charged binding site).
Larger files are broken up into slices to faciliate downloading. You may download individual slices or use c-shell scripts to download a single representation (pH 7.0), all Usual ligands (pH 5.75 to 8.25), ligands for metals (5.75-9.5) or All. Note that these sets are overlapping so you do not want to download both Metals and All.

We expect dockers will want to just download the "Usual" subset. Chemical informaticists who require only a single form of each molecule may want just the "Single" representation. If files appear to be missing or incomplete, please try again tomorrow as the export may still be in progress. If problems persist for 48 hours please complain to comments at docking dot org

File format Individual slicesScripts to download slices
SMILES Reference   mid pH   high pH   low pH  
mol2 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 199 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255   0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23   0 1 2 3   2 0 1 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50   Single Usual Metals All
SDF 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 199 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255   0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23   0 1 2 3   0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50   Single Usual Metals All
Flexibase  Single Usual Metals All