Chemistry Development Kit

From Wikipedia, the free encyclopedia
Developer(s): The CDK Project
Stable release: 1.4.18 (May 11, 2013)
Preview release: 1.5.2 (April 19, 2013)
Written in: Java
Operating system: Cross-platform
Type: Chemoinformatics/Molecular modelling/Bioinformatics
License: GNU Lesser General Public License
Website: http://cdk.sourceforge.net/

The Chemistry Development Kit (CDK) is an open-source Java library for chemoinformatics and bioinformatics.[1] It is available for Windows, Unix, and Mac OS, and is distributed under the GNU LGPL.

History

The CDK was created on 27–29 September 2000 at the University of Notre Dame by Christoph Steinbeck, Egon Willighagen and Dan Gezelter, at that time the developers of Jmol and JChemPaint, to provide a common code base. Since then, more than 75 people have contributed to the project,[2] leading to a rich set of functionality, as listed below.

Between 2004 and 2007, CDK News was the project's newsletter; all of its articles are available from a public archive.[3] The newsletter was put on hold for lack of a steady stream of contributions.

In 2012, the project became a supporter of the InChI Trust, to encourage continued development. The library uses JNI-InChI[4] to generate InChIs.[5]

Library

The CDK itself is a library rather than an end-user program. However, it has been integrated into various environments to make its functionality available. The CDK is used in several applications, including the R programming language,[6] CDK-Taverna (a Taverna workbench plugin),[7] Bioclipse, and Cinfony.[8] Additionally, CDK extensions exist for KNIME and for Excel; the Excel extension is called LICSS (excel-cdk).[9]

In 2008, some GPL-licensed code was removed from the library. Although that code was independent of the main CDK library and no copyleft obligations were involved, the ChemoJava project was created to reduce confusion among users.

Major features

Chemoinformatics

Bioinformatics

  • protein active site detection
  • cognate ligand detection[12]
  • metabolite identification[13]
  • pathway databases

General

See also

References

  1. Steinbeck, C.; Han, Y.Q.; Kuhn, S.; Horlacher, O.; Luttmann, E.; Willighagen, E.L. (2003). "The Chemistry Development Kit (CDK): An open-source Java library for chemo- and bioinformatics". Journal of Chemical Information and Computer Sciences 43 (2): 493–500. doi:10.1021/ci025584y. PMID 12653513.
  2. https://github.com/cdk/cdk/blob/master/AUTHORS
  3. https://sourceforge.net/projects/cdk/files/CDK%20News/
  4. http://jni-inchi.sourceforge.net/
  5. Spjuth, O.; Berg, A.; Adams, S.; Willighagen, E.L. (2013). "Applications of the InChI in cheminformatics with the CDK and Bioclipse". Journal of Cheminformatics 5 (1): 14. doi:10.1186/1758-2946-5-14. PMID 23497723.
  6. Guha, R. (2007). Journal of Statistical Software 18: 1–16.
  7. Kuhn, T.; Willighagen, E.L.; Zielesny, A.; Steinbeck, C. (2010). "CDK-Taverna: an open workflow environment for cheminformatics". BMC Bioinformatics 11: 159. doi:10.1186/1471-2105-11-159. PMC 2862046. PMID 20346188.
  8. O'Boyle, N.M.; Hutchison, G.R. (2008). Chemistry Central Journal 2.
  9. Lawson, K.R.; Lawson, J. (2012). "LICSS - a chemical spreadsheet in Microsoft Excel". Journal of Cheminformatics 4 (1): 3. doi:10.1186/1758-2946-4-3. PMC 3310842. PMID 22301088.
  10. Steinbeck, C.; Hoppe, C.; Kuhn, S.; Floris, M.; Guha, R.; Willighagen, E.L. (2006). "Recent developments of the Chemistry Development Kit (CDK) - an open-source Java library for chemo- and bioinformatics". Curr. Pharm. Des. 12 (17): 2111–20. doi:10.2174/138161206777585274. PMID 16796559.
    Guangli, M.; Yiyu, C. (2006). "Predicting Caco-2 permeability using support vector machine and chemistry development kit". J Pharm Pharm Sci 9 (2): 210–21. PMID 16959190.
  11. Peironcely, J.E.; Rojas-Chertó, M.; Fichera, D.; Reijmers, T.; Coulier, L.; Faulon, J.L.; Hankemeier, T. (2012). "OMG: Open molecule generator". Journal of Cheminformatics 4 (1): 21. doi:10.1186/1758-2946-4-21. PMID 22985496.
  12. Bashton, M.; Nobeli, I.; Thornton, J.M. (2006). "Cognate Ligand Domain Mapping for Enzymes". Journal of Molecular Biology 364 (4): 836. doi:10.1016/j.jmb.2006.09.041. PMID 17034815.
  13. Rojas-Cherto, M.; Kasper, P.T.; Willighagen, E.L.; Vreeken, R.J.; Hankemeier, T.; Reijmers, T.H. (2011). "Elemental composition determination based on MSn". Bioinformatics 27 (17): 2376–2383. doi:10.1093/bioinformatics/btr409. PMID 21757467.

External links