CATH / Gene3D v4.4

151 million protein domains classified into 6,573 superfamilies

Putative CATH annotations for predicted structural domains in AlphaFold DB are available in The Encyclopedia of Domains (TED). Annotations for the AlphaFold (v2) predicted structures of 21 model organisms are available to download (doi:10.1038/s42003-023-04488-9). Core classification files for the latest version of CATH-Plus (v4.4) are available to download. Daily updates of our latest classifications are also available.

3D Structure

Find out what 3D structure your protein adopts

Protein Evolution

Learn about a particular protein family and how it evolved

Protein Function

Investigate the function of your protein

Conserved Sites

Look at protein sites that are highly conserved and implicated in function

Download Data

Download data files and query CATH via webservices

Learn more

Find out how CATH is created and maintained, how to link to CATH and more


What is CATH-Gene3D?

CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains into superfamilies when there is sufficient evidence they have diverged from a common ancestor.
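
The superfamily is the "H" level of the four-level CATH hierarchy (Class, Architecture, Topology, Homologous superfamily), and a superfamily is conventionally written as four dot-separated numbers. The following is a minimal Python sketch of handling such identifiers, assuming that dotted form; the names CathId, parse_cath_id and same_superfamily are hypothetical helpers for illustration and are not part of any CATH software.

```python
from typing import NamedTuple


class CathId(NamedTuple):
    """The four levels of a dotted CATH superfamily identifier."""
    class_: int        # C: Class
    architecture: int  # A: Architecture
    topology: int      # T: Topology
    homology: int      # H: Homologous superfamily


def parse_cath_id(text: str) -> CathId:
    """Split an identifier such as '3.40.50.300' into its C, A, T, H levels."""
    parts = text.strip().split(".")
    if len(parts) != 4 or not all(p.isdigit() for p in parts):
        raise ValueError(f"expected four dot-separated integers, got {text!r}")
    return CathId(*(int(p) for p in parts))


def same_superfamily(a: str, b: str) -> bool:
    """Two identifiers name the same superfamily only if all four levels match."""
    return parse_cath_id(a) == parse_cath_id(b)
```

Two domains that agree on the first three levels but differ in the fourth have a similar fold but are not grouped as having diverged from a common ancestor, which is why the comparison checks all four numbers.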

If you have any questions, comments or suggestions please get in touch via Twitter, ask a question in our online forum or visit our support page.

Latest Release Statistics

PDB Release: 01-09-2024

                 CATH-Plus 4.4.0   CATH (daily snapshot)
Domains          500238            601493
Superfamilies    6573              6631
Annotated PDBs   150885            204239

Gene3D v21
Protein Sequences          82,665,384
CATH Domain Predictions    151,013,797


Citing this resource

If you find the information in this resource useful, please consider using the following citations:

CATH: increased structural coverage of functional space.
Sillitoe I, Bordin N, Dawson N, Waman VP, Ashford P, Scholes HM, Pang CSM, Woodridge L, Rauer C, Sen N, Abbasian M, Le Cornu S, Lam SD, Berka K, Varekova IH, Svobodova R, Lees J, Orengo CA.
Nucleic Acids Res. 2021 Jan
CATH 2024: CATH-AlphaFlow Doubles the Number of Structures in CATH and Reveals Nearly 200 New Folds
Waman VP, Bordin N, Alcraft R, Vickerstaff R, Rauer C, Chan Q, Sillitoe I, Yamamori H, Orengo C.
J Mol Biol.
CATH--a hierarchic classification of protein domain structures.
Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM.
Structure
Gene3D: Extensive prediction of globular domains in proteins.
Lewis TE, Sillitoe I, Dawson N, Lam SD, Clarke T, Orengo CA, Lees JG.
Nucleic Acids Res. 2018 Jan

Funding

The CATH and Gene3D resources have enjoyed generous funding from a number of research councils.

BBSRC, MRC, NIH, Wellcome, ERC

CATH-Gene3D is a Global Biodata Core Resource.