The Cambridge Structural Database (CSD) is fast approaching 1 million structures and below you can see some statistics on this highly valuable collection of curated data. Based on the current rate of growth of the CSD, the millionth structure is predicted to added by July 2019 and you can read more about the countdown and how we define a million structures in our Blog Post

These statistics are generated weekly using the CSD Python API, and give a summary of the data available through our online Access Structures service. A daily update on the Countdown to CSD 1 Million is shown on the front page of the website. Additional statistics produced as an annual snapshot of the CSD are available from the Documentation and Resources page.

The number 'CSD Structures' refers to the number of unique crystallographic data collections deposited at the CCDC. The number of CSD entries is a slightly larger value, as each CSD entry corresponds to a published reference to a dataset.

CSD Growth

Number of Entries in CSD

983,585

Unique Structures in CSD

967,398

Unique Author Names

383,724

Unique Publications

478,313

Top Journals This Year

CSD Communications

This Year
2,723
To Date
26,509

Number of Refcode Families

894,329

Number of Polymorph Families

10,627

Structures with Melting Points

168,089

Milestone Structures

Number Refcode
200,000th VAVFAZ
250,000th IBEZUK
300,000th EHUFUI
500,000th EFEMUX01
750,000th ZOYBIA
800,000th TUWMOP
900,000th PATXEQ

Top Authors This Year