Multi Query 2D Similarity
Category Paths
Follow one of these paths in the Orion user interface, to find the floe.
Product-based/GraphSim
Role-based/Computational Chemist
Solution-based/Virtual-screening/DB Search/2D Similarity and SubSearch
Task-based/Virtual Screening - Ligand-Based
Description
For a set of input molecules, this floe computes the highest 2D Tanimoto similarity to any query molecule. Users can optionally filter these molecules by the computed Tanimoto. The floe can then either output all the input molecules or optionally create a hit list of top scoring molecules.
Promoted Parameters
Title in user interface (promoted name)
Inputs
Query Molecules (query_molecules): Query molecules for the 2D Tanimoto calculation.
Required
Type: data_source
Input Molecules (input_molecules): Dataset with molecules to screen. For each molecule, the 2D Tanimoto to all the query molecules will be calculated and the molecule will be assigned a 2D Tanimoto equal to the highest Tanimoto to any query.
Required
Type: data_source
Outputs
Output Dataset (output_dataset): Output dataset to write to
Required
Type: dataset_out
Default: Multi Query 2D Similarity Results
Options
Enable Hit List (enable_hit_list): If set to On, the output will be a sorted list of the top scoring molecules by 2D Tanimoto. If set to Off, all molecules will be directly outputted to the output dataset unsorted.
Required
Type: boolean
Default: True
Choices: [True, False]
Hit List Size (hit_list_size): Size of the output hit list. If ‘Enable Hit List’ is Off, this parameter will have no effect.
Required
Type: integer
Default: 100000
Fingerprint Type (fingerprint_type): The type of fingerprint to use in the Tanimoto calculation.
Required
Type: string
Default: Circular
Choices: [‘Circular’, ‘Path’, ‘Tree’]
Use Virtual Screening Fingerprint Variant (use_virtual_screening_fingerprint_variant): If On, the virtual screening variant of the selected fingerprint will be used. The virtual screening variant treats certain functional groups identically regardless of their pKa state, for example, protonated and unprotonated carboxylic acids.
Type: boolean
Default: True
Choices: [True, False]
Options: Filtering
Filter Tanimotos Higher Than (filter_tanimotos_higher_than): If this parameter is set, any molecule with a Tanimoto higher than this value will be discarded.
Type: decimal
Filter Tanimotos Lower Than (filter_tanimotos_lower_than): If this parameter is set, any molecule with a Tanimoto lower than this value will be discarded.
Type: decimal
Input Fields
Molecule Field (molecule_field): Field on the input records containing the molecule to process. If unspecified, the primary molecule field will be used.
Type: field_parameter::mol
Query Molecule Field (query_molecule_field): Field on the input query records containing the query molecules. If unspecified, the primary molecule field will be used.
Type: field_parameter::mol
Output Fields
Tanimoto Field (tanimoto_field): Field on the output records the computed Tanimoto of each molecule will be stored in.
Required
Type: field_parameter::float
Default: 2D Tanimoto
Fingerprint Type Field (fingerprint_type_field): Field on the output records with the name of the fingerprint type used to calculate the Tanimoto. Creation of this field can be suppressed by clearing the default value of this parameter.
Type: field_parameter::string
Default: Fingerprint Type
Best Query SMILES Field (best_query_smiles_field): Field on the output records containing the SMILES of the query molecule with the highest Tanimoto to the input molecule. Creation of this field can be suppressed by clearing the default value of this parameter.
Type: field_parameter::string
Default: Best Query SMILES
Best Query Title Field (best_query_title_field): Field on the output records containing the title of the query molecule with the highest Tanimoto to the input molecule. Creation of this field can be suppressed by clearing the default value of this parameter.
Type: field_parameter::string
Default: Best Query Title