NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|21536371|ref|NP_009041|]
View 

telomerase protein component 1 isoform 1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
TROVE pfam05731
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ...
226-676 9.27e-152

TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding.


:

Pssm-ID: 461724  Cd Length: 361  Bit Score: 475.34  E-value: 9.27e-152
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    226 TSGDSESHPEPTDHVLQEKKMALLSLLCST---LVSEVNMNNTS------DPTLAAIFEICREL-----ALLEPEFILKA 291
Cdd:pfam05731    1 VSNDSGGYPEPTDDVLQEKRFLLLGLLCGTyytLASEVTMDNAQaikiieDGTGASILETLRELsaagrAPKEPEFILKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    292 SLYARQQLNVRNVANNILAIAAFLPACRphlrryfcaivqLPSDWIQVAELYQSLAEGDKNKLVPLPACLRTAMTD---- 367
Cdd:pfam05731   81 ALYARQQLNIRDVANHVLAIAAVLPVCR------------LPTDLFEVAEYCEELAEGDEKKLTGWGRCLRRAMTDwyts 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    368 KFAQFDEYQLAKYNPRKHRAKRHPRRPPRspgmePPFSHR---CFPRYIGFLREEQRKFEKAGDTVSEKKNPPRFTLKKL 444
Cdd:pfam05731  149 KFAEFLAYQLTKYNTRKHWSHKDPFRLPH-----PPKFSEtslELKGLFRYATKEQRKFEKAYGAVPEKKESKRLTLKKL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    445 VQRLHIHKPAQHVQALLGYRYpsnlqlfsrsRLpgpwdssragkrmklsrpeTWERELSLRGNKASVWEELIENgKLPFM 524
Cdd:pfam05731  224 VQRLHISEPAEHVQALIGKRY----------RL-------------------TWEREPSLRGNSAEVWEELIDS-KLPMM 273
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    525 AMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrnek 603
Cdd:pfam05731  274 AMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH--------------------------------- 320
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 21536371    604 nrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 676
Cdd:pfam05731  321 -----------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
DUF5920 pfam19334
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ...
687-889 6.87e-138

Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.


:

Pssm-ID: 466045  Cd Length: 203  Bit Score: 428.81  E-value: 6.87e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    687 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 766
Cdd:pfam19334    1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    767 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 846
Cdd:pfam19334   81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 21536371    847 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 889
Cdd:pfam19334  161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
WD40 COG2319
WD40 repeat [General function prediction only];
1849-2268 8.20e-70

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 241.35  E-value: 8.20e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1849 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1928
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1929 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 2006
Cdd:COG2319   80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2007 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 2084
Cdd:COG2319  157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2085 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 2161
Cdd:COG2319  228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2162 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 2240
Cdd:COG2319  304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
                        410       420
                 ....*....|....*....|....*...
gi 21536371 2241 VRAAAVSETSGLMLTASEDGSVRLWQVP 2268
Cdd:COG2319  375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1680-1958 1.84e-48

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:COG2319  124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1837
Cdd:COG2319  204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1838 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1917
Cdd:COG2319  284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 21536371 1918 P----RGHLGslslsPALSVALSPDGDRVAVGYRADGIRIYKISS 1958
Cdd:COG2319  364 LlrtlTGHTG-----AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
1162-1337 2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


:

Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 21536371   1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
DUF4062 pfam13271
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ...
900-1008 1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.


:

Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 21536371    980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
1-29 8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


:

Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 21536371      1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
91-119 3.67e-14

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


:

Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 21536371     91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
61-89 1.51e-13

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


:

Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 21536371     61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
31-59 6.92e-12

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


:

Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 21536371     31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2231-2391 2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200    2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200   82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 21536371 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200  161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
 
Name Accession Description Interval E-value
TROVE pfam05731
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ...
226-676 9.27e-152

TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding.


Pssm-ID: 461724  Cd Length: 361  Bit Score: 475.34  E-value: 9.27e-152
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    226 TSGDSESHPEPTDHVLQEKKMALLSLLCST---LVSEVNMNNTS------DPTLAAIFEICREL-----ALLEPEFILKA 291
Cdd:pfam05731    1 VSNDSGGYPEPTDDVLQEKRFLLLGLLCGTyytLASEVTMDNAQaikiieDGTGASILETLRELsaagrAPKEPEFILKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    292 SLYARQQLNVRNVANNILAIAAFLPACRphlrryfcaivqLPSDWIQVAELYQSLAEGDKNKLVPLPACLRTAMTD---- 367
Cdd:pfam05731   81 ALYARQQLNIRDVANHVLAIAAVLPVCR------------LPTDLFEVAEYCEELAEGDEKKLTGWGRCLRRAMTDwyts 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    368 KFAQFDEYQLAKYNPRKHRAKRHPRRPPRspgmePPFSHR---CFPRYIGFLREEQRKFEKAGDTVSEKKNPPRFTLKKL 444
Cdd:pfam05731  149 KFAEFLAYQLTKYNTRKHWSHKDPFRLPH-----PPKFSEtslELKGLFRYATKEQRKFEKAYGAVPEKKESKRLTLKKL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    445 VQRLHIHKPAQHVQALLGYRYpsnlqlfsrsRLpgpwdssragkrmklsrpeTWERELSLRGNKASVWEELIENgKLPFM 524
Cdd:pfam05731  224 VQRLHISEPAEHVQALIGKRY----------RL-------------------TWEREPSLRGNSAEVWEELIDS-KLPMM 273
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    525 AMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrnek 603
Cdd:pfam05731  274 AMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH--------------------------------- 320
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 21536371    604 nrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 676
Cdd:pfam05731  321 -----------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
DUF5920 pfam19334
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ...
687-889 6.87e-138

Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.


Pssm-ID: 466045  Cd Length: 203  Bit Score: 428.81  E-value: 6.87e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    687 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 766
Cdd:pfam19334    1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    767 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 846
Cdd:pfam19334   81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 21536371    847 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 889
Cdd:pfam19334  161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
WD40 COG2319
WD40 repeat [General function prediction only];
1849-2268 8.20e-70

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 241.35  E-value: 8.20e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1849 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1928
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1929 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 2006
Cdd:COG2319   80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2007 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 2084
Cdd:COG2319  157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2085 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 2161
Cdd:COG2319  228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2162 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 2240
Cdd:COG2319  304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
                        410       420
                 ....*....|....*....|....*...
gi 21536371 2241 VRAAAVSETSGLMLTASEDGSVRLWQVP 2268
Cdd:COG2319  375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1972-2308 1.70e-55

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 196.02  E-value: 1.70e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1972 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 2045
Cdd:cd00200   12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2046 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 2125
Cdd:cd00200   80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2126 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 2202
Cdd:cd00200  154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2203 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 2282
Cdd:cd00200  209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
                        330       340
                 ....*....|....*....|....*.
gi 21536371 2283 VTAVAWAPDGSMAVSGNQAGELILWQ 2308
Cdd:cd00200  264 VTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1680-1958 1.84e-48

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:COG2319  124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1837
Cdd:COG2319  204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1838 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1917
Cdd:COG2319  284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 21536371 1918 P----RGHLGslslsPALSVALSPDGDRVAVGYRADGIRIYKISS 1958
Cdd:COG2319  364 LlrtlTGHTG-----AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1680-1955 1.96e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.96e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:cd00200   13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1835
Cdd:cd00200   93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1836 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1915
Cdd:cd00200  171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|.
gi 21536371 1916 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1955
Cdd:cd00200  251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
1162-1337 2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 21536371   1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
DUF4062 pfam13271
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ...
900-1008 1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.


Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 21536371    980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
1-29 8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 21536371      1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
91-119 3.67e-14

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 21536371     91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
61-89 1.51e-13

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 21536371     61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
31-59 6.92e-12

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 21536371     31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2231-2391 2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200    2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200   82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 21536371 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200  161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2050-2089 1.79e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.23  E-value: 1.79e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 21536371    2050 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PTZ00421 PTZ00421
coronin; Provisional
1962-2199 4.25e-07

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 55.28  E-value: 4.25e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  1962 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 2041
Cdd:PTZ00421   73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  2042 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 2119
Cdd:PTZ00421  120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  2120 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 2192
Cdd:PTZ00421  182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258

                  ....*..
gi 21536371  2193 CAAAMEP 2199
Cdd:PTZ00421  259 SSALFIP 265
WD40 pfam00400
WD domain, G-beta repeat;
2053-2089 6.13e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 6.13e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 21536371   2053 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1756-1787 2.06e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 40.76  E-value: 2.06e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 21536371    1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
YcjX COG3106
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ...
1149-1199 1.01e-03

Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];


Pssm-ID: 442340  Cd Length: 467  Bit Score: 44.41  E-value: 1.01e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|.
gi 21536371 1149 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 1199
Cdd:COG3106   11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
WD40 pfam00400
WD domain, G-beta repeat;
1756-1787 6.22e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 6.22e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 21536371   1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:pfam00400    8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Name Accession Description Interval E-value
TROVE pfam05731
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ...
226-676 9.27e-152

TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding.


Pssm-ID: 461724  Cd Length: 361  Bit Score: 475.34  E-value: 9.27e-152
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    226 TSGDSESHPEPTDHVLQEKKMALLSLLCST---LVSEVNMNNTS------DPTLAAIFEICREL-----ALLEPEFILKA 291
Cdd:pfam05731    1 VSNDSGGYPEPTDDVLQEKRFLLLGLLCGTyytLASEVTMDNAQaikiieDGTGASILETLRELsaagrAPKEPEFILKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    292 SLYARQQLNVRNVANNILAIAAFLPACRphlrryfcaivqLPSDWIQVAELYQSLAEGDKNKLVPLPACLRTAMTD---- 367
Cdd:pfam05731   81 ALYARQQLNIRDVANHVLAIAAVLPVCR------------LPTDLFEVAEYCEELAEGDEKKLTGWGRCLRRAMTDwyts 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    368 KFAQFDEYQLAKYNPRKHRAKRHPRRPPRspgmePPFSHR---CFPRYIGFLREEQRKFEKAGDTVSEKKNPPRFTLKKL 444
Cdd:pfam05731  149 KFAEFLAYQLTKYNTRKHWSHKDPFRLPH-----PPKFSEtslELKGLFRYATKEQRKFEKAYGAVPEKKESKRLTLKKL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    445 VQRLHIHKPAQHVQALLGYRYpsnlqlfsrsRLpgpwdssragkrmklsrpeTWERELSLRGNKASVWEELIENgKLPFM 524
Cdd:pfam05731  224 VQRLHISEPAEHVQALIGKRY----------RL-------------------TWEREPSLRGNSAEVWEELIDS-KLPMM 273
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    525 AMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrnek 603
Cdd:pfam05731  274 AMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH--------------------------------- 320
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 21536371    604 nrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 676
Cdd:pfam05731  321 -----------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
DUF5920 pfam19334
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ...
687-889 6.87e-138

Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.


Pssm-ID: 466045  Cd Length: 203  Bit Score: 428.81  E-value: 6.87e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    687 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 766
Cdd:pfam19334    1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    767 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 846
Cdd:pfam19334   81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 21536371    847 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 889
Cdd:pfam19334  161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
WD40 COG2319
WD40 repeat [General function prediction only];
1849-2268 8.20e-70

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 241.35  E-value: 8.20e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1849 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1928
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1929 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 2006
Cdd:COG2319   80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2007 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 2084
Cdd:COG2319  157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2085 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 2161
Cdd:COG2319  228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2162 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 2240
Cdd:COG2319  304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
                        410       420
                 ....*....|....*....|....*...
gi 21536371 2241 VRAAAVSETSGLMLTASEDGSVRLWQVP 2268
Cdd:COG2319  375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1683-2092 1.03e-60

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 215.16  E-value: 1.03e-60
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1683 AFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDD-TLFLTAFDGLLELWDLQHGCRVLQTKAHQYQ 1761
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGaRLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1762 ITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHT-YPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGA 1840
Cdd:COG2319   81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1841 PGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRG 1920
Cdd:COG2319  161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLR 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1921 HLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALK 1998
Cdd:COG2319  241 TLTGHS-GSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAF-SPdgKLLASGSDDGTVRLWDLA 318
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1999 ecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSL 2076
Cdd:COG2319  319 --TGKLLRTLTGHTGAVRSVAFSPdgKTLASGSDDGTVRLW---------DLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
                        410
                 ....*....|....*.
gi 21536371 2077 ATGGRDRSLLCWDVRT 2092
Cdd:COG2319  388 ASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
1725-2137 2.37e-60

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 214.00  E-value: 2.37e-60
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1725 LFLSDDTLFLTAFDGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHTYPKS-L 1803
Cdd:COG2319    2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAaV 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1804 NCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAH 1883
Cdd:COG2319   82 LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGH 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1884 HGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGDRVAVGYRADGIRIYKISSG 1959
Cdd:COG2319  162 SGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlrtlTGHTGAVR-----SVAFSPDGKLLASGSADGTVRLWDLATG 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1960 SQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQ 2035
Cdd:COG2319  237 KLLRTLTGHSGSVRSVAF-SPdgRLLASGSADGTVRLWDLA--TGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVR 313
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2036 LWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRDWVTGCAW 2115
Cdd:COG2319  314 LW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE---LLRTLTG-HTGAVTSVAF 380
                        410       420
                 ....*....|....*....|...
gi 21536371 2116 TKD-NLLISCSSDGSVGLWDPES 2137
Cdd:COG2319  381 SPDgRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1972-2308 1.70e-55

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 196.02  E-value: 1.70e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1972 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 2045
Cdd:cd00200   12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2046 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 2125
Cdd:cd00200   80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2126 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 2202
Cdd:cd00200  154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2203 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 2282
Cdd:cd00200  209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
                        330       340
                 ....*....|....*....|....*.
gi 21536371 2283 VTAVAWAPDGSMAVSGNQAGELILWQ 2308
Cdd:cd00200  264 VTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1975-2391 2.94e-53

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 193.59  E-value: 2.94e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1975 LAWLSPKVLVSGAEDGSLQGWALKECSLQSLWLLSRFQKPVLGLATSQELLASASEDFTVQLWPRQLLTRPHkaedfpcg 2054
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA-------- 72
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2055 tELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKTPVLIHSfpacHRDWVTGCAWTKD-NLLISCSSDGSVGLW 2133
Cdd:COG2319   73 -TLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG----HTGAVRSVAFSPDgKTLASGSADGTVRLW 147
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2134 DPESGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmepraagqPGSELL 2210
Cdd:COG2319  148 DLATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS--------PDGKLL 219
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2211 VvTVGLDGATRLWHPLLVCQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 2290
Cdd:COG2319  220 A-SGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSP 298
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2291 DGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSA-HTFFVLSADEKISEWQvkLRKGSAPGNLSLHLNRIl 2366
Cdd:COG2319  299 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGavrSVAFSPDgKTLASGSDDGTVRLWD--LATGELLRTLTGHTGAV- 375
                        410       420
                 ....*....|....*....|....*
gi 21536371 2367 qedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:COG2319  376 -------TSVAFSPDGRTLASGSAD 393
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2056-2346 5.72e-53

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 188.70  E-value: 5.72e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2056 ELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVrtpKTPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:cd00200    4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDL---ETGELLRTLKG-HTGPVRDVAASADgTYLASGSSDKTIRLWD 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2135 PESGQRLGQFLGHQSAVSAVAAVEEH--VVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAamepraagqPGSELLV 2211
Cdd:cd00200   80 LETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAF---------SPDGTFV 150
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2212 VTVGLDGATRLWHP-LLVCQtHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 2290
Cdd:cd00200  151 ASSSQDGTIKLWDLrTGKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP 229
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2291 DGSMAVSGNQAGELILWQEAKAVATAQAPGH---IGALIWS-SAHTFFVLSADEKISEWQ 2346
Cdd:cd00200  230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHtnsVTSLAWSpDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1680-1958 1.84e-48

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:COG2319  124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1837
Cdd:COG2319  204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1838 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1917
Cdd:COG2319  284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 21536371 1918 P----RGHLGslslsPALSVALSPDGDRVAVGYRADGIRIYKISS 1958
Cdd:COG2319  364 LlrtlTGHTG-----AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1799-2174 5.08e-44

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 162.89  E-value: 5.08e-44
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1799 YPKSLNCVAFHPEGQVIATGSWagsisffqvdglkvtkdlgapgasirtlafnvpggvvavgrlDSMVELWAWREGARLA 1878
Cdd:cd00200    8 HTGGVTCVAFSPDGKLLATGSG------------------------------------------DGTIKVWDLETGELLR 45
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1879 AFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGdRVAVGYRADG-IRI 1953
Cdd:cd00200   46 TLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECvrtlTGHTSYVS-----SVAFSPDG-RILSSSSRDKtIKV 119
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1954 YKISSGS-----QGAQGQALDVAVSalawlspkvlvsgaedgslqgwalkecslqslwllsrfqkpvlglaTSQELLASA 2028
Cdd:cd00200  120 WDVETGKclttlRGHTDWVNSVAFS----------------------------------------------PDGTFVASS 153
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2029 SEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRD 2108
Cdd:cd00200  154 SQDGTIKLW---------DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK---CLGTLRG-HEN 220
                        330       340       350       360       370       380
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 21536371 2109 WVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVAAVEE--HVVSVSRDGTLKVWD 2174
Cdd:cd00200  221 GVNSVAFSPDGyLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1680-1955 1.96e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.96e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:cd00200   13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1835
Cdd:cd00200   93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1836 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1915
Cdd:cd00200  171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|.
gi 21536371 1916 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1955
Cdd:cd00200  251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1719-1996 1.98e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.98e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1719 DGISACLFLSDDTLFLTAF-DGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLA--- 1794
Cdd:cd00200   10 GGVTCVAFSPDGKLLATGSgDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVrtl 89
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1795 FQHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREG 1874
Cdd:cd00200   90 TGHT--SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1875 ARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLslspaLSVALSPDGDRVAVGYRADG 1950
Cdd:cd00200  168 KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClgtlRGHENGV-----NSVAFSPDGYLLASGSEDGT 242
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*...
gi 21536371 1951 IRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWA 1996
Cdd:cd00200  243 IRVWDLRTGECVQTLSGHTNSVTSLAW-SPdgKRLASGSADGTIRIWD 289
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
1162-1337 2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 21536371   1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
DUF4062 pfam13271
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ...
900-1008 1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.


Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371    900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 21536371    980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
1-29 8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 21536371      1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
91-119 3.67e-14

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 21536371     91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
61-89 1.51e-13

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 21536371     61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
31-59 6.92e-12

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 21536371     31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2231-2391 2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200    2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200   82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 21536371 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200  161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2050-2089 1.79e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.23  E-value: 1.79e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 21536371    2050 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PTZ00421 PTZ00421
coronin; Provisional
1962-2199 4.25e-07

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 55.28  E-value: 4.25e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  1962 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 2041
Cdd:PTZ00421   73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  2042 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 2119
Cdd:PTZ00421  120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  2120 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 2192
Cdd:PTZ00421  182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258

                  ....*..
gi 21536371  2193 CAAAMEP 2199
Cdd:PTZ00421  259 SSALFIP 265
WD40 pfam00400
WD domain, G-beta repeat;
2053-2089 6.13e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 6.13e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 21536371   2053 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2232-2265 4.25e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.38  E-value: 4.25e-06
                            10        20        30
                    ....*....|....*....|....*....|....
gi 21536371    2232 HTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 2265
Cdd:smart00320    6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2095-2134 6.55e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.00  E-value: 6.55e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 21536371    2095 TPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
1972-2147 8.48e-06

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 51.24  E-value: 8.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  1972 VSALAWLS--PKVLVSGAEDGSLQGWALKECSLQSLwlLSRFQKPVLGLATSQE---LLASASEDFTVQLWPRQlltrph 2046
Cdd:PLN00181  535 LSGICWNSyiKSQVASSNFEGVVQVWDVARSQLVTE--MKEHEKRVWSIDYSSAdptLLASGSDDGSVKLWSIN------ 606
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371  2047 kaEDFPCGTelRGHEGPVSCCSFSTDGG-SLATGGRDRSLLCWDVRTPKTPVlihsfpaC----HRDWVTGCAWTKDNLL 2121
Cdd:PLN00181  607 --QGVSIGT--IKTKANICCVQFPSESGrSLAFGSADHKVYYYDLRNPKLPL-------CtmigHSKTVSYVRFVDSSTL 675
                         170       180       190
                  ....*....|....*....|....*....|..
gi 21536371  2122 ISCSSDGSVGLWD---PESG---QRLGQFLGH 2147
Cdd:PLN00181  676 VSSSTDNTLKLWDlsmSISGineTPLHSFMGH 707
WD40 pfam00400
WD domain, G-beta repeat;
2230-2265 1.17e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 1.17e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 21536371   2230 QTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 2265
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 pfam00400
WD domain, G-beta repeat;
2098-2134 1.45e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.18  E-value: 1.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 21536371   2098 LIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:pfam00400    3 LLKTLEG-HTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1756-1787 2.06e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 40.76  E-value: 2.06e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 21536371    1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2137-2174 4.47e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 4.47e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 21536371    2137 SGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 2174
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
AAA_16 pfam13191
AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the ...
1147-1280 6.90e-04

AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily.


Pssm-ID: 433025 [Multi-domain]  Cd Length: 167  Bit Score: 42.88  E-value: 6.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1147 RLLQDTVQRLMLPHGRLSLVTGQSGQGKTAFLASLVSALqAPDGAKVASLVFFHFSGARP--DQGLALTLLRRLCT---- 1220
Cdd:pfam13191   10 EQLLDALDRVRSGRPPSVLLTGEAGTGKTTLLRELLRAL-ERDGGYFLRGKCDENLPYSPllEALTREGLLRQLLDeles 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 21536371   1221 --------YLRGQLKEPGALPSTYRSLVWELQQRLLPKSAESLHPgqtQVLIIDGADRLVDQNGQLIS 1280
Cdd:pfam13191   89 slleawraALLEALAPVPELPGDLAERLLDLLLRLLDLLARGERP---LVLVLDDLQWADEASLQLLA 153
WD40 pfam00400
WD domain, G-beta repeat;
2138-2174 8.18e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 39.25  E-value: 8.18e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 21536371   2138 GQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 2174
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAfsPDGKLLASGSDDGTVKVWD 39
YcjX COG3106
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ...
1149-1199 1.01e-03

Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];


Pssm-ID: 442340  Cd Length: 467  Bit Score: 44.41  E-value: 1.01e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|.
gi 21536371 1149 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 1199
Cdd:COG3106   11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
AAA_22 pfam13401
AAA domain;
1160-1271 1.81e-03

AAA domain;


Pssm-ID: 379165 [Multi-domain]  Cd Length: 129  Bit Score: 40.79  E-value: 1.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371   1160 HGRLSLVTGQSGQGKTAFLASLVSALQAPDgakvASLVFFHFSGarpdqglaLTLLRRLCTYLRGQLKEPGALPSTYRSL 1239
Cdd:pfam13401    4 GAGILVLTGESGTGKTTLLRRLLEQLPEVR----DSVVFVDLPS--------GTSPKDLLRALLRALGLPLSGRLSKEEL 71
                           90       100       110
                   ....*....|....*....|....*....|..
gi 21536371   1240 VWELQQRLlpksaesLHPGQTQVLIIDGADRL 1271
Cdd:pfam13401   72 LAALQQLL-------LALAVAVVLIIDEAQHL 96
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1874-1912 6.08e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 6.08e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 21536371    1874 GARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWS 1912
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1756-1787 6.22e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 6.22e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 21536371   1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:pfam00400    8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
ExeA COG3267
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ...
1160-1271 9.35e-03

Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 442498 [Multi-domain]  Cd Length: 261  Bit Score: 40.54  E-value: 9.35e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 21536371 1160 HGRLSLVTGQSGQGKTAFLASLVSALqaPDGAKVASLVFFHFSgarpdqglALTLLRRLCTYLRGQLKepgalPSTYRSL 1239
Cdd:COG3267   42 GGGFVVLTGEVGTGKTTLLRRLLERL--PDDVKVAYIPNPQLS--------PAELLRAIADELGLEPK-----GASKADL 106
                         90       100       110
                 ....*....|....*....|....*....|..
gi 21536371 1240 VWELQQRLLPKSAESLHPgqtqVLIIDGADRL 1271
Cdd:COG3267  107 LRQLQEFLLELAAAGRRV----VLIIDEAQNL 134
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH