CATH / Gene3D v4.4
151 million protein domains classified into 6,573 superfamilies
3D Structure
Find out what 3D structure your protein adopts
Protein Evolution
Learn about a particular protein family and how it evolved
Protein Function
Investigate the function of your protein
Conserved Sites
Look at protein sites that are highly conserved and implicated in function
Download Data
Download data files and query CATH via webservices
Learn more
Find out how CATH is created and maintained, how to link to CATH and more
What is CATH-Gene3D?
CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains into superfamilies when there is sufficient evidence they have diverged from a common ancestor.
Gene3D uses the information in CATH to predict the locations of structural domains on millions of protein sequences available in public databases. This allows us to include additional annotations to the CATH-Gene3D database such as functional information and active site residues.
If you have any questions, comments or suggestions please get in touch via Twitter, ask a question in our online forum or visit our support page.
Latest Release Statistics Info
CATH-Plus 4.4.0 | CATH (daily snapshot) | |
---|---|---|
PDB Release | 01-09-2024 | |
Domains | 500238 | 536613 |
Superfamilies | 6573 | 6631 |
Annotated PDBs | 150885 | 199335 |
Gene3D v21 | |
---|---|
Protein Sequences | 82,665,384 |
CATH Domain Predictions | 151,013,797 |
Citing this resource
If you find the information in this resource useful, please consider using the following citations:
Funding
The CATH and Gene3D resources have enjoyed generous funding from a number of research councils.