CATH | DHS | Gene3D | Impala | FTP |
Search
 PDB Code
 CATH Code
 General Text

Goto
SSAP Server
GRATH Server
DHS
Gene3D
Navigation
Home
Top of hierarchy
Home > Top  

CATH Protein Structure Classification

Version 3.0.0: Released May 2006

CATH Group

Dr. Lesley Greene, Dr. Frances M.G. Pearl, Dr. Ian Sillitoe, Dr. Mark Dibley, Mr. Tony Lewis, Mr. Oliver Redfern, Dr. Alison Cuff

Contributors to the CATH Version 3.0.0 Release

Dr. Rekha Nambudiry, Dr. Azara Janmohamed, Dr. Janet Moloney, Dr. Kanchan Phadwal, Dr. Corin Yeats, Ms. Sarah Addou, Mr. Tim Dallman, Mr. Adam Reid, Ms. Elisabeth Rideal, Dr. Russell L. Marsden, Dr. David Lee, Prof. Janet Thornton, Prof. Christine A. Orengo

Links

Introduction

CATH is a hierarchical classification of protein domain structures, which clusters proteins at four major levels, Class(C), Architecture(A), Topology(T) and Homologous superfamily (H).

Class, derived from secondary structure content, is assigned for more than 90% of protein structures automatically. Architecture, which describes the gross orientation of secondary structures, independent of connectivities, is currently assigned manually. The topology level clusters structures into fold groups according to their topological connections and numbers of secondary structures. The homologous superfamilies cluster proteins with highly similar structures and functions. The assignments of structures to fold groups and homologous superfamilies are made by sequence and structure comparisons.

The boundaries and assignments for each protein domain are determined using a combination of automated and manual procedures. These include computational techniques, empirical and statistical evidence, literature review and expert analysis.

> Click here for a more detailed explanation

References

Protein families and their evolution - A structural perspective. Orengo CA, Thornton JM. (2005)
Annual Review of Biochemistry. Vol 74. p. 867-900. Review.
Assessing strategies for improved superfamily recognition. Sillitoe I, Dibley M, Bray J, Addou S. and Orengo C. (2005)
Protein Science. Vol 14. No 7. p. 1800-1810.
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Pearl F, Todd A, Sillitoe I, Dibley M, Redfern O, Lewis T, Bennett C, Marsden R, Grant A, Lee D, Akpor A, Maibaum M, Harrison A, Dallman T, Reeves G, Diboun I, Addou S, Lise S, Johnston C, Sillero A, Thornton J, Orengo C. (2005)
Nucleic Acids Research. Vol. 33 Database Issue D247-D251
The CATH database: an extended protein family resource for structural and functional genomics. Pearl FM, Bennett CF, Bray JE, Harrison AP, Martin N, Shepherd A, Sillitoe I, Thornton J, Orengo CA. (2003)
Nucleic Acids Research. Vol. 31, No. 1., p. 452-455.
Recognizing the fold of a protein structure. Harrison A, Pearl F, Sillitoe I, Slidel T, Mott R, Thornton J, Orengo C. (2003)
Bioinformatics. Vol 19. No 14. p.1748-1759.
Quantifying the similarities within fold space. Harrison A, Pearl F, Mott R, Thornton J, Orengo C. (2002)
Journal of Molecular Biology. Vol 323. No 5. p.909-926.
CATH- A Hierarchic Classification of Protein Domain Structures. Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., and Thornton, J.M. (1997)
Structure. Vol 5. No 8. p.1093-1108.

Past Members/Contributors

  • Prof. Willie Taylor
  • Prof. David T. Jones
  • Dr. James E. Bray
  • Dr. Adrian J. Shepherd
  • Dr. Daniel W. A. Buchan
  • Dr. Annabel E. Todd
  • Dr. Gabrielle A. Reeves
  • Dr. Andrew Harrison
  • Mr. Adrian Akpor
  • Dr. Chris Bennett