OEIFlavor¶
The OEIFlavor
namespace encodes symbolic
constants used as bit-masks to indicate how to write various
file formats by OEChem TK.
Note
A very important introduction (with examples) as to how to manipulate these bit-masks is given Flavored Input and Output
The OEIFlavor::Generic
namespace within
OEIFlavor
defines control bit-masks that are
common to all of the input formats.
Although these generic bits are common to all file formats, they
may be specified independently for each file format.
See also
oemolistream::SetFormat
method to set the flavor of an input molecule streamoemolstreambase::GetFormat
method to retrieve the flavor of a molecule stream
The interpretation of many of these flavor values, is identical to those passed to the corresponding OEChem TK low-level file format readers.
This namespace contains the following namespaces:
OEIFlavor::CAN¶
The OEIFlavor::CAN
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::CAN
format file by
the OEParseSmiles
function.
This namespace contains the following constants:
Default
OEIFlavor::CDX¶
The OEIFlavor::CDX
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::CDX
format file by
the OEReadCDXFile
low-level file reader.
This namespace contains the following constants:
Default
OEIFlavor::CIF¶
The OEIFlavor::CIF
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::CIF
format file by the
OEReadCIFFile
low-level file reader.
This namespace contains the following constants, that mimic the XYZ reader.
FormalCrg
ImplicitH
BondOrder
Rings
The reader stores excluded sites and sites left after filtering. The filtering of these sites is based on the flavors listed below.
OccFilterOneHalf
If occupancy is noted in the CIF file, only the sites with occupancy larger than one half is kept. If occupancy is not defined in the CIF file, the occupancies are set to 1 to prevent filtering.
RemovePBCImages
Each site read is checked to see if it is equal to a previously read site up to periodic wrapping in fractional coordinates.
NormalizeHydPos
Hydrogen positions and bond lengths are normalized if necessary.
BondHydToClosest
Enforces that hydrogen atoms are bonded to their closest heavy atom in space.
RemoveQuestionMarkInLabel
Sites with a question mark after atom labels are assigned are filtered out. These extra sites are sometimes alternative positions of existing sites.
Default
Combination of:
OEIFlavor::CIF::FormalCrg
,
OEIFlavor::CIF::ImplicitH
,
OEIFlavor::CIF::BondOrder
,
OEIFlavor::CIF::Rings
,
OEIFlavor::CIF::OccFilterOneHalf
,
OEIFlavor::CIF::RemovePBCImages
,
OEIFlavor::CIF::NormalizeHydPos
,
OEIFlavor::CIF::BondHydToClosest
,
OEIFlavor::CIF::RemoveQuestionMarkInLabel
flags.
OEIFlavor::CSV¶
This namespace contains constants for modifying the behavior of
how the OEFormat::CSV
file format is read.
See also
Header
Attempt to interpret the first line of the .csv
file
as a header line containing column names. The header line
will be parsed into an OECSVHeader object and
stored on the oemolistream object,
retrievable by oemolstreambase::GetCSVHeader
.
If this flag is not set, or the first column of the first line does not contain a valid SMILES string, the OECSVHeader will be initialized with the number of columns present in the first line of the file. The columns will then be named and parsed in the following order: SMILES, TITLE, OE_CSV_COLUMN_1, OE_CSV_COLUMN_2, …, OE_CSV_COLUMN_N.
Default
Set to include OEIFlavor::CSV::Header
to
automatically parse column names into molecule SD data.
DEFAULT
Synonym for OEIFlavor::CSV::Default
.
OEIFlavor::CXSMILES¶
The OEIFlavor::CXSMILES
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the OEParseSmiles
function.
This namespace contains the following constants:
Strict
Canon
DEFAULT
Default
Same as the OEIFlavor::CXSMILES::DEFAULT
constant.
OEIFlavor::FASTA¶
The OEIFlavor::FASTA
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::FASTA
format file by
the OEReadFASTAFile
low-level file reader.
This namespace contains the following constants:
Default
No special options are selected.
CustomResidues
Instructs the FASTA reader to interpret values surrounded in
square brackets, [foo]
, as custom residues. The structure
of these custom residues must first be registered with the
OEAddCustomFASTAResidue
function.
Warning
This is an experimental API that may change in the future.
EmbeddedSMILES
Instructs the FASTA reader to interpret values surrounded in
square brackets, [CCC]
, as SMILES strings if a
custom residue with the same name can not be found. The
SMILES can optionally contain attachment points back
to the peptide backbone. For example, [[R1]CSC[R15]]
specifies to attach to the first and fifteenth residue in the
peptide, effectively cyclizing the peptide.
Warning
This is an experimental API that may change in the future.
OEIFlavor::Generic¶
This namespace contains the following constants:
OEAroModelDaylight
This flag indicates that the Daylight aromaticity model is
used to perceive aromaticity by the
OEReadMolecule
high-level reader.
OEAroModelOpenEye
This flag indicates that the OpenEye aromaticity model is
used to perceive aromaticity by the
OEReadMolecule
high-level reader.
OEAroModelTripos
This flag indicates that the Tripos aromaticity model is
used to perceive aromaticity by the
OEReadMolecule
high-level reader.
OEAroModelMMFF
This flag indicates that the MMFF aromaticity model is
used to perceive aromaticity by the
OEReadMolecule
high-level reader.
OEAroModelMDL
This flag indicates that the MDL aromaticity model is
used to perceive aromaticity by the
OEReadMolecule
high-level reader.
AroMask
Combination of
OEIFlavor::Generic::OEAroModelDaylight
,
OEIFlavor::Generic::OEAroModelOpenEye
,
OEIFlavor::Generic::OEAroModelTripos
,
OEIFlavor::Generic::OEAroModelMMFF
and
OEIFlavor::Generic::OEAroModelMDL
flags.
See also
Aromaticity Models in OEChem TK section
OEAssignAromaticFlags
function
Rings
This flag indicates the rings are perceived by the
OEReadMolecule
high-level reader by calling
the OEFindRingAtomsAndBonds
function.
GenericMask
Combination of
OEIFlavor::Generic::AroMask
and
OEIFlavor::Generic::Rings
flags.
SpecificMask
DEFAULT
Default
Same as the OEIFlavor::Generic::DEFAULT
constant.
OEIFlavor::ISM¶
The OEIFlavor::ISM
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::ISM
format file by
the OEParseSmiles
function.
This namespace contains the following constants:
Default
OEIFlavor::MDL¶
The OEIFlavor::MDL
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::MDL
format file by
the OEReadMDLFile
low-level file reader.
This namespace contains the following constants:
Default
FixBondMarks
This input flavor requests that the reader make an attempt
to repair stereocenters that throw warnings due to issues identified
by OEMDLStereoFromBondStereo
. With this input flavor
enabled, the warnings will continue to be issued to allow user inspection and verification
of the modified bond marks. However, once the corrected structure(s) have been rewritten
with any format that captures bond stereomark information, a re-read of the structure(s)
should emit fewer stereocenter correction warnings. This input flavor is not
enabled for the default OEFormat::MDL
setting.
SuppressEmptyMolSkip
This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.
SuppressImp2ExpENHSTE
This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format molfile.
ForceImp2ExpENHSTE
This input flavor performs implicit to explicit enhanced stereogroup conversions for an input V2000 or V3000 format molfile.
OEIFlavor::MMCIF¶
The OEIFlavor::MMCIF
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::MMCIF
format file by
the OEReadCIFFile
low-level file reader.
This namespace contains the following constants.
NoAltLoc
This flavor is obsolete. The MMCIF reader does not by default read alternate locations any longer
ALTLOC
This flavor is required when OEAltLocationFactory is to be used so that all the atoms with alternate location codes are retained.
DEFAULT
Same as the OEIFlavor::MMCIF::DEFAULT
constant.
SpruceDefault
Combination of
OEIFlavor::MMCIF::DEFAULT
,
OEIFlavor::MMCIF::ALTLOC
, flags.
OEIFlavor::MMOD¶
The OEIFlavor::MMOD
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::MMOD
format file by
the OEReadMacroModelFile
low-level file reader.
This namespace contains the following constants:
FormalCrg
This flag indicates that high-level molecular file reader calls
the OEAssignFormalCharges
function to
assign formal charges.
Default
OEIFlavor::MOL2¶
The OEIFlavor::MOL2
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecular file reader
OEReadMolecule
after reading
OEFormat::MOL2
format file by
the OEReadMol2File
low-level file reader.
This namespace contains the following constants:
M2H
Forcefield
This specifies a commonly used variant of MOL2, wherein the chemical element for an atom is inferred from the atom name, but not using the second character if it is capitalized. For example:
CA specifies carbon (not calcium)
Ca specifies calcium
NA specifies nitrogen (not sodium)
Na specifies sodium
HA specifies hydrogen (not hahnium)
Exceptions were made for atom names beginning with FE, ZN, MG, MN, CL, and BR: these specify chemical elements Fe, Zn, Mg, Mn, Cl, and Br, respectively.
AllMask
DEFAULT
Default
OEIFlavor::PDB¶
This namespace contains the following constants:
TER
Controls whether the PDB file reader should separate consecutive molecules at TER records.
OEIFlavor::PDB::TER
is off by default.
END
Controls whether the PDB file reader should separate consecutive molecules at END records.
OEIFlavor::PDB::END
is on by default.
ENDM
Controls whether the PDB file reader should separate consecutive molecules at ENDM records.
OEIFlavor::PDB::ENDM
is on by default.TerMask
Combination of
OEIFlavor::PDB::TER
andOEIFlavor::PDB::END
andOEIFlavor::PDB::ENDM
flags.Note
By default, only
OEIFlavor::PDB::END
andOEIFlavor::PDB::ENDM
are on (andOEIFlavor::PDB::TER
is off), meaning that different chains are read into the a single molecule, but different NMR models and concatenated PDB files are treated as sequential molecules.If
OEIFlavor::PDB::END
,OEIFlavor::PDB::ENDM
andOEIFlavor::PDB::TER
are all turned off, all of the ATOM and HETATM records in a single PDB file will be read into a single OEMolBase.ALL
It used to indicate that all of the atom records in the input file should be read into the OEMolBase. By default, the
OEReadPDBFile
function ignores/omits ATOM and HETATM records that represent pseudo or dummy atoms and/or alternate conformations. WithoutOEIFlavor::PDB::ALL
, the PDB file reader ignores all atoms whose alternate location indicator is other than' '
,A
or1
, all atoms with atom names beginning" Q"
, all atoms with residue name"DUM"
, and all atoms with coordinates 9999.000, 9999.000, 9999.000 (as used by XPLOR/CNS to represent dummy atoms).ALTLOC
Combination of
OEIFlavor::Generic::Default
,OEIFlavor::PDB::Default
, andOEIFlavor::PDB::ALL
flags. This flavor is required when OEAltLocationFactory is to be used so that all the atoms with alternate location codes are retained.DATA
Controls whether the PDB file reader preserves the PDB file’s header data as OEMolBase’s generic data. This option currently preserves the following records:
Title section: AUTHOR, CAVEAT, COMPND, EXPDTA, HEADER, JRNL, KEYWDS, OBSLTE, SOURCE, SPRSDE, REMARK, REVDAT, TITLE
Connectivity annotation section: SSBOND
Miscellaneous features section: SITE
Crystallographic and coordinate transformation section: CRYST1, MTRIX1, MTRIX2, MTRIX3, ORIGX1, ORIGX2, ORIGX3, SCALE1, SCALE2, SCALE3
Note
Modifying or reordering the molecule may invalidate the atom serial numbers used in some of the PDB records.
CHARGE
It is used to indicate that the contents of the b-factor column in the input PDB file contains a partial charge, and should be stored in the ‘partial charge’ property of an atom, instead of the ‘b-factor’ property. This value can then be retrieved using the
OEAtomBase::GetPartialCharge
method.RADIUS
It is used to indicate that the contents of the occupancy column in the input PDB file contains a radius, and should be stored in the ‘radius’ property of an atom instead of the ‘occupancy’ property. This value can then be retrieved using the
OEAtomBase::GetRadius
method.DELPHI
Combinations of
OEIFlavor::PDB::CHARGE
andOEIFlavor::PDB::RADIUS
flags.FORMALCHARGE
Controls whether the PDB reader should read the formal charge of the atom from column 79 and 80. Note, this flavor is not turned on default because the subsequent perception steps would not take this formal charge into account when filling valences with bonds and hydrogens.
SecStruct
This controls whether protein secondary structure is perceived after a file is read in. It is on by default. If this flavor is off, secondary structure information specified in the input PDB file (through SHEET and HELIX records) is retained.
BasicMask
FormalCrg
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.ImplicitH
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.BondOrder
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.Rings
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.Connect
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.ExtraMask
Note
This flag is only relevant in the high-level file reader (
OEReadMolecule
) and not inOEReadPDBFile
.AllMask
DEFAULT
Combination of
OEIFlavor::PDB::Connect
,OEIFlavor::PDB::Rings
,OEIFlavor::PDB::BondOrder
,OEIFlavor::PDB::ImplicitH
,OEIFlavor::PDB::FormalCrg
,OEIFlavor::PDB::ENDM
andOEIFlavor::PDB::END
OEIFlavor::PDB::DATA
, flags.Default
Same as the
OEIFlavor::PDB::DEFAULT
constant.SpruceDefault
Combination of
OEIFlavor::PDB::DEFAULT
,OEIFlavor::PDB::ALTLOC
, flags.
OEIFlavor::RDF¶
The OEIFlavor::RDF
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::RDF
format file by
the OEReadMDLFile
low-level file reader.
Default
FixBondMarks
This input flavor requests that the reader make an attempt
to repair stereocenters that throw warnings due to issues identified
by OEMDLStereoFromBondStereo
. With this input flavor
enabled, the warnings will continue to be issued to allow user inspection and verification
of the modified bond marks. However, once the corrected structure(s) have been rewritten
with any format that captures bond stereomark information, a re-read of the structure(s)
should emit fewer stereocenter correction warnings. This input flavor is not
enabled for the default OEFormat::RDF
setting.
SuppressEmptyMolSkip
This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.
SuppressImp2ExpENHSTE
This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format RDFile.
ForceImp2ExpENHSTE
This input flavor performs implicit to explicit enhanced stereogroup conversions for an input RDFile.
OEIFlavor::SDF¶
The OEIFlavor::SDF
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::SDF
format file by
the OEReadMDLFile
low-level file reader.
This namespace contains the following constants:
Default
FixBondMarks
This input flavor requests that the reader make an attempt
to repair stereocenters that throw warnings due to issues identified
by OEMDLStereoFromBondStereo
. With this input flavor
enabled, the warnings will continue to be issued to allow user inspection and verification
of the modified bond marks. However, once the corrected structure(s) have been rewritten
with any format that captures bond stereomark information, a re-read of the structure(s)
should emit fewer stereocenter correction warnings. This input flavor is not
enabled for the default OEFormat::SDF
setting.
SuppressEmptyMolSkip
This input flavor suppresses the default action of skipping empty molecules in the input stream. This may be important in order to recover SDData stored on empty molecule records.
SuppressImp2ExpENHSTE
This input flavor suppresses any enhanced stereogroup implicit to explicit conversions for an input V3000 format SDFile.
ForceImp2ExpENHSTE
This input flavor performs implicit to explicit enhanced stereogroup conversions for an input V2000 or V3000 format SDFile.
OEIFlavor::SKC¶
The OEIFlavor::SKC
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::SKC
format file by
the OEReadSketchFile
low-level file reader.
This namespace contains the following constants:
Default
OEIFlavor::SMI¶
The OEIFlavor::SMI
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the OEParseSmiles
function.
This namespace contains the following constants:
Strict
Canon
DEFAULT
Default
Same as the OEIFlavor::SMI::DEFAULT
constant.
OEIFlavor::USM¶
The OEIFlavor::USM
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the OEParseSmiles
function.
This namespace contains constants.
Default
OEIFlavor::XYZ¶
The OEIFlavor::XYZ
namespace encodes symbolic
constants used as bit-masks that control the processing
performed by the high-level molecule file reader
OEReadMolecule
after reading
OEFormat::XYZ
format file by
the OEReadXYZFile
low-level file reader.
This namespace contains the following constants:
FormalCrg
ImplicitH
BondOrder
Rings
Connect
ExtraMask
Default