• RE: Peculiar behaviour of remove_hydrogens

    Hi David,

    I'm afraid you have unearthed a genuine bug.  Discussions are underway here to see if it can be fixed in the forthcoming API version 1.3 release.

    In the meantime you can work around the problem by using the internal API:

    mol = Molecule.from_string(...)

    mol = Molecule(mol.identifier, _molecule=mol._molecule.create_editable_molecule())

    Sorry about this.

     

    Richard

  • RE: Querying the CSD by substructure created "from scratch"

    Thanks, Christian.

    Please carry on raising any difficulties you have, and making suggestions for ways in which we may improve the API.

    Cheers

    Richard

     

  • RE: Void space in crystal structures

    Here's the slightly modified script, testing for 3D coordinates.

    Richard

  • RE: Void space in crystal structures

    I've attached a table of spacegroup, average void space (as a percentage of the unit cell volume), number of observations from the 673,606 structures of CSD V536 with 3D coordinates.  I'll leave it to the crystallographically adept to extract any meaning there is in the table.

    Richard

  • RE: Void space in crystal structures

    I've attached a script which will do this over the whole CSD.  It's running on my desktop at the moment, but I don't expect the results until tomorrow - the void calculation is computationally fairly heavy.

    I'll let you know the results when I get them.

    Richard

  • RE: Querying the CSD by substructure created "from scratch"

    Hi Christian,

    I've looked a little further into the discrepancy between ConQuest searching and API searching.  Once the difference between ConQuest and the API's notion of error is straightened out, ConQuest finds nine extra structures:

    'BUDFEN', 'DENLOY', 'HOHKIA', 'HOHLEX', 'HOHLIB', 'IYEXUF', 'MURCIL', 'MURCOR', 'YAFRAY01'

    These structures all contain 'Unknown' bond types between a Cu and an O, and so do not get selected by the API search.  The structures will be found by the ConQuest search which, in the absence of a 3D parameter will perform a 2D search.

    There is a good case to be made that a 2D search mode would be useful in the API.  I shall raise this for consideration for the next API release.

    In the meantime, if you want these structures, use ConQuest; if you can live without them continue to use the API.

     

    Best wishes

    Richard

  • RE: Querying the CSD by substructure created "from scratch"

    Hi Christian,

    You are certainly going about things the right way.

    The API currently has different criteria for identifying error flagged structures compared to ConQuest - this is something we intend to review and fix in the next release. The error flag used by ConQuest is much more appropriate for regular use.

    The max_r_factor value should be expressed in percent, i.e.

    settings.max_r_factor = 5.0

    I'll change the documentation to reflect this.

    There do seem to be some discrepancies between the ConQuest and the API results, even when errors are ignored:  I shall look further into this.

    Cheers

    Richard