OEIFlavor

The OEIFlavor namespace encodes symbolic constants used as bit-masks to indicate how to write various file formats by OEChem TK.

Note

A very important introduction (with examples) as to how to manipulate these bit-masks is given Flavored Input and Output

The OEIFlavor_Generic namespace within OEIFlavor defines control bit-masks that are common to all of the input formats. Although these generic bits are common to all file formats, they may be specified independently for each file format.

See also

The interpretation of many of these flavor values, is identical to those passed to the corresponding OEChem TK low-level file format readers.

This namespace contains the following namespaces:

OEIFlavor::CAN

The OEIFlavor_CAN namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_CAN format file by the OEParseSmiles function.

This namespace contains the following constants:

Default

OEIFlavor::CDX

The OEIFlavor_CDX namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_CDX format file by the OEReadCDXFile low-level file reader.

This namespace contains the following constants:

Default

OEIFlavor::CIF

The OEIFlavor_CIF namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_CIF format file by the OEReadCIFFile low-level file reader.

This namespace contains the following constants, that mimic the XYZ reader.

FormalCrg

ImplicitH

BondOrder

Rings

The reader stores excluded sites and sites left after filtering. The filtering of these sites is based on the flavors listed below.

OccFilterOneHalf

If occupancy is noted in the CIF file, only the sites with occupancy larger than one half is kept. If occupancy is not defined in the CIF file, the occupancies are set to 1 to prevent filtering.

RemovePBCImages

Each site read is checked to see if it is equal to a previously read site up to periodic wrapping in fractional coordinates.

NormalizeHydPos

Hydrogen positions and bond lengths are normalized if necessary.

BondHydToClosest

Enforces that hydrogen atoms are bonded to their closest heavy atom in space.

RemoveQuestionMarkInLabel

Sites with a question mark after atom labels are assigned are filtered out. These extra sites are sometimes alternative positions of existing sites.

Default

Combination of: OEIFlavor_CIF_FormalCrg, OEIFlavor_CIF_ImplicitH, OEIFlavor_CIF_BondOrder, OEIFlavor_CIF_Rings, OEIFlavor_CIF_OccFilterOneHalf, OEIFlavor_CIF_RemovePBCImages, OEIFlavor_CIF_NormalizeHydPos, OEIFlavor_CIF_BondHydToClosest, OEIFlavor_CIF_RemoveQuestionMarkInLabel flags.

OEIFlavor::CSV

This namespace contains constants for modifying the behavior of how the OEFormat_CSV file format is read.

See also

CSV File Format

Header

Attempt to interpret the first line of the .csv file as a header line containing column names. The header line will be parsed into an OECSVHeader object and stored on the oemolistream object, retrievable by oemolstreambase.GetCSVHeader.

If this flag is not set, or the first column of the first line does not contain a valid SMILES string, the OECSVHeader will be initialized with the number of columns present in the first line of the file. The columns will then be named and parsed in the following order: SMILES, TITLE, OE_CSV_COLUMN_1, OE_CSV_COLUMN_2, …, OE_CSV_COLUMN_N.

Default

Set to include OEIFlavor_CSV_Header to automatically parse column names into molecule SD data.

DEFAULT

Synonym for OEIFlavor_CSV_Default.

OEIFlavor::CXSMILES

The OEIFlavor_CXSMILES namespace encodes symbolic constants used as bit-masks that control the processing performed by the OEParseSmiles function.

This namespace contains the following constants:

Strict

Canon

DEFAULT

Default

Same as the OEIFlavor_CXSMILES_DEFAULT constant.

OEIFlavor::FASTA

The OEIFlavor_FASTA namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_FASTA format file by the OEReadFASTAFile low-level file reader.

This namespace contains the following constants:

Default

No special options are selected.

CustomResidues

Instructs the FASTA reader to interpret values surrounded in square brackets, [foo], as custom residues. The structure of these custom residues must first be registered with the OEAddCustomFASTAResidue function.

Warning

This is an experimental API that may change in the future.

EmbeddedSMILES

Instructs the FASTA reader to interpret values surrounded in square brackets, [CCC], as SMILES strings if a custom residue with the same name can not be found. The SMILES can optionally contain attachment points back to the peptide backbone. For example, [[R1]CSC[R15]] specifies to attach to the first and fifteenth residue in the peptide, effectively cyclizing the peptide.

Warning

This is an experimental API that may change in the future.

OEIFlavor::Generic

This namespace contains the following constants:

OEAroModelDaylight

This flag indicates that the Daylight aromaticity model is used to perceive aromaticity by the OEReadMolecule high-level reader.

OEAroModelOpenEye

This flag indicates that the OpenEye aromaticity model is used to perceive aromaticity by the OEReadMolecule high-level reader.

OEAroModelTripos

This flag indicates that the Tripos aromaticity model is used to perceive aromaticity by the OEReadMolecule high-level reader.

OEAroModelMMFF

This flag indicates that the MMFF aromaticity model is used to perceive aromaticity by the OEReadMolecule high-level reader.

OEAroModelMDL

This flag indicates that the MDL aromaticity model is used to perceive aromaticity by the OEReadMolecule high-level reader.

AroMask

Combination of OEIFlavor_Generic_OEAroModelDaylight, OEIFlavor_Generic_OEAroModelOpenEye, OEIFlavor_Generic_OEAroModelTripos, OEIFlavor_Generic_OEAroModelMMFF and OEIFlavor_Generic_OEAroModelMDL flags.

Rings

This flag indicates the rings are perceived by the OEReadMolecule high-level reader by calling the OEFindRingAtomsAndBonds function.

GenericMask

Combination of OEIFlavor_Generic_AroMask and OEIFlavor_Generic_Rings flags.

SpecificMask

DEFAULT

Default

Same as the OEIFlavor_Generic_DEFAULT constant.

OEIFlavor::ISM

The OEIFlavor_ISM namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_ISM format file by the OEParseSmiles function.

This namespace contains the following constants:

Default

OEIFlavor::MDL

The OEIFlavor_MDL namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_MDL format file by the OEReadMDLFile low-level file reader.

This namespace contains the following constants:

Default

FixBondMarks

This input flavor requests that the reader make an attempt to repair stereocenters that throw warnings due to issues identified by OEMDLStereoFromBondStereo. With this input flavor enabled, the warnings will continue to be issued to allow user inspection and verification of the modified bond marks. However, once the corrected structure(s) have been rewritten with any format that captures bond stereomark information, a re-read of the structure(s) should emit fewer stereocenter correction warnings. This input flavor is not enabled for the default OEFormat_MDL setting.

SuppressEmptyMolSkip

This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.

SuppressImp2ExpENHSTE

This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format molfile.

ForceImp2ExpENHSTE

This input flavor performs implicit to explicit enhanced stereogroup conversions for an input V2000 or V3000 format molfile.

OEIFlavor::MMCIF

The OEIFlavor_MMCIF namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_MMCIF format file by the OEReadCIFFile low-level file reader.

This namespace contains the following constants.

NoAltLoc

This flavor is obsolete. The MMCIF reader does not by default read alternate locations any longer

ALTLOC

This flavor is required when OEAltLocationFactory is to be used so that all the atoms with alternate location codes are retained.

DEFAULT

Same as the OEIFlavor_MMCIF_DEFAULT constant.

SpruceDefault

Combination of OEIFlavor_MMCIF_DEFAULT, OEIFlavor_MMCIF_ALTLOC, flags.

OEIFlavor::MMOD

The OEIFlavor_MMOD namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_MMOD format file by the OEReadMacroModelFile low-level file reader.

This namespace contains the following constants:

FormalCrg

This flag indicates that high-level molecular file reader calls the OEAssignFormalCharges function to assign formal charges.

Default

OEIFlavor::MOL2

The OEIFlavor_MOL2 namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecular file reader OEReadMolecule after reading OEFormat_MOL2 format file by the OEReadMol2File low-level file reader.

This namespace contains the following constants:

M2H

Forcefield

This specifies a commonly used variant of MOL2, wherein the chemical element for an atom is inferred from the atom name, but not using the second character if it is capitalized. For example:

  • CA specifies carbon (not calcium)

  • Ca specifies calcium

  • NA specifies nitrogen (not sodium)

  • Na specifies sodium

  • HA specifies hydrogen (not hahnium)

Exceptions were made for atom names beginning with FE, ZN, MG, MN, CL, and BR: these specify chemical elements Fe, Zn, Mg, Mn, Cl, and Br, respectively.

AllMask

DEFAULT

Default

OEIFlavor::MOL2H

This namespace contains the following constants:

M2H

AllMask

DEFAULT

Default

OEIFlavor::OEB

This namespace contains the following constants:

Default

OEIFlavor::OEZ

This namespace contains the following constants:

Default

OEIFlavor::PDB

This namespace contains the following constants:

TER

Controls whether the PDB file reader should separate consecutive molecules at TER records. OEIFlavor_PDB_TER is off by default.

END

Controls whether the PDB file reader should separate consecutive molecules at END records. OEIFlavor_PDB_END is on by default.

ENDM

Controls whether the PDB file reader should separate consecutive molecules at ENDM records.

OEIFlavor_PDB_ENDM is on by default.

TerMask

Combination of OEIFlavor_PDB_TER and OEIFlavor_PDB_END and OEIFlavor_PDB_ENDM flags.

Note

By default, only OEIFlavor_PDB_END and OEIFlavor_PDB_ENDM are on (and OEIFlavor_PDB_TER is off), meaning that different chains are read into the a single molecule, but different NMR models and concatenated PDB files are treated as sequential molecules.

If OEIFlavor_PDB_END, OEIFlavor_PDB_ENDM and OEIFlavor_PDB_TER are all turned off, all of the ATOM and HETATM records in a single PDB file will be read into a single OEMolBase.

ALL

It used to indicate that all of the atom records in the input file should be read into the OEMolBase. By default, the OEReadPDBFile function ignores/omits ATOM and HETATM records that represent pseudo or dummy atoms and/or alternate conformations. Without OEIFlavor_PDB_ALL, the PDB file reader ignores all atoms whose alternate location indicator is other than ' ', A or 1, all atoms with atom names beginning " Q", all atoms with residue name "DUM", and all atoms with coordinates 9999.000, 9999.000, 9999.000 (as used by XPLOR/CNS to represent dummy atoms).

ALTLOC

Combination of OEIFlavor_Generic_Default, OEIFlavor_PDB_Default, and OEIFlavor_PDB_ALL flags. This flavor is required when OEAltLocationFactory is to be used so that all the atoms with alternate location codes are retained.

DATA

Controls whether the PDB file reader preserves the PDB file’s header data as OEMolBase’s generic data. This option currently preserves the following records:

Note

Modifying or reordering the molecule may invalidate the atom serial numbers used in some of the PDB records.

CHARGE

It is used to indicate that the contents of the b-factor column in the input PDB file contains a partial charge, and should be stored in the ‘partial charge’ property of an atom, instead of the ‘b-factor’ property. This value can then be retrieved using the OEAtomBase.GetPartialCharge method.

RADIUS

It is used to indicate that the contents of the occupancy column in the input PDB file contains a radius, and should be stored in the ‘radius’ property of an atom instead of the ‘occupancy’ property. This value can then be retrieved using the OEAtomBase.GetRadius method.

DELPHI

Combinations of OEIFlavor_PDB_CHARGE and OEIFlavor_PDB_RADIUS flags.

FORMALCHARGE

Controls whether the PDB reader should read the formal charge of the atom from column 79 and 80. Note, this flavor is not turned on default because the subsequent perception steps would not take this formal charge into account when filling valences with bonds and hydrogens.

SecStruct

This controls whether protein secondary structure is perceived after a file is read in. It is on by default. If this flavor is off, secondary structure information specified in the input PDB file (through SHEET and HELIX records) is retained.

BasicMask

FormalCrg

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

ImplicitH

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

BondOrder

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

Rings

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

Connect

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

ExtraMask

Note

This flag is only relevant in the high-level file reader (OEReadMolecule) and not in OEReadPDBFile.

AllMask

SpruceDefault

Combination of OEIFlavor_PDB_DEFAULT, OEIFlavor_PDB_ALTLOC, flags.

OEIFlavor::RDF

The OEIFlavor_RDF namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_RDF format file by the OEReadMDLFile low-level file reader.

Default

FixBondMarks

This input flavor requests that the reader make an attempt to repair stereocenters that throw warnings due to issues identified by OEMDLStereoFromBondStereo. With this input flavor enabled, the warnings will continue to be issued to allow user inspection and verification of the modified bond marks. However, once the corrected structure(s) have been rewritten with any format that captures bond stereomark information, a re-read of the structure(s) should emit fewer stereocenter correction warnings. This input flavor is not enabled for the default OEFormat_RDF setting.

SuppressEmptyMolSkip

This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.

SuppressImp2ExpENHSTE

This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format RDFile.

ForceImp2ExpENHSTE

This input flavor performs implicit to explicit enhanced stereogroup conversions for an input RDFile.

OEIFlavor::SDF

The OEIFlavor_SDF namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_SDF format file by the OEReadMDLFile low-level file reader.

This namespace contains the following constants:

Default

FixBondMarks

This input flavor requests that the reader make an attempt to repair stereocenters that throw warnings due to issues identified by OEMDLStereoFromBondStereo. With this input flavor enabled, the warnings will continue to be issued to allow user inspection and verification of the modified bond marks. However, once the corrected structure(s) have been rewritten with any format that captures bond stereomark information, a re-read of the structure(s) should emit fewer stereocenter correction warnings. This input flavor is not enabled for the default OEFormat_SDF setting.

SuppressEmptyMolSkip

This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.

SuppressImp2ExpENHSTE

This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format SDFile.

ForceImp2ExpENHSTE

This input flavor performs implicit to explicit enhanced stereogroup conversions for an input V2000 or V3000 format SDFile.

OEIFlavor::SKC

The OEIFlavor_SKC namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_SKC format file by the OEReadSketchFile low-level file reader.

This namespace contains the following constants:

Default

OEIFlavor::SMI

The OEIFlavor_SMI namespace encodes symbolic constants used as bit-masks that control the processing performed by the OEParseSmiles function.

This namespace contains the following constants:

Strict

Canon

DEFAULT

Default

Same as the OEIFlavor_SMI_DEFAULT constant.

OEIFlavor::USM

The OEIFlavor_USM namespace encodes symbolic constants used as bit-masks that control the processing performed by the OEParseSmiles function.

This namespace contains constants.

Default

OEIFlavor::XYZ

The OEIFlavor_XYZ namespace encodes symbolic constants used as bit-masks that control the processing performed by the high-level molecule file reader OEReadMolecule after reading OEFormat_XYZ format file by the OEReadXYZFile low-level file reader.

This namespace contains the following constants:

FormalCrg

ImplicitH

BondOrder

Rings

Connect

ExtraMask

Default