|
Name |
Accession |
Description |
Interval |
E-value |
| DUF5920 |
pfam19334 |
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ... |
164-366 |
2.84e-138 |
|
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains. :
Pssm-ID: 466045 Cd Length: 203 Bit Score: 428.81 E-value: 2.84e-138
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 164 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 243
Cdd:pfam19334 1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 244 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 323
Cdd:pfam19334 81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 767981392 324 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 366
Cdd:pfam19334 161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1326-1745 |
8.97e-68 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 235.19 E-value: 8.97e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1326 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1405
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1406 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 1483
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1484 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 1561
Cdd:COG2319 157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1562 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 1638
Cdd:COG2319 228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1639 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 1717
Cdd:COG2319 304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
|
410 420
....*....|....*....|....*...
gi 767981392 1718 VRAAAVSETSGLMLTASEDGSVRLWQVP 1745
Cdd:COG2319 375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1157-1435 |
3.04e-47 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 175.48 E-value: 3.04e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1157 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1235
Cdd:COG2319 124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1236 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1314
Cdd:COG2319 204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1315 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1394
Cdd:COG2319 284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 767981392 1395 PRGHLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISS 1435
Cdd:COG2319 364 LLRTLTGHT-GAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| TROVE super family |
cl05344 |
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ... |
1-153 |
5.59e-34 |
|
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding. The actual alignment was detected with superfamily member pfam05731:
Pssm-ID: 461724 Cd Length: 361 Bit Score: 135.59 E-value: 5.59e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1 MAMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrne 79
Cdd:pfam05731 273 MAMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH-------------------------------- 320
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767981392 80 knrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 153
Cdd:pfam05731 321 ------------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
|
|
| NACHT |
pfam05729 |
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ... |
639-814 |
3.16e-32 |
|
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931. :
Pssm-ID: 428606 [Multi-domain] Cd Length: 166 Bit Score: 123.95 E-value: 3.16e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 639 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 718
Cdd:pfam05729 1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 719 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 787
Cdd:pfam05729 77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
|
170 180
....*....|....*....|....*..
gi 767981392 788 QgahVLALGPLEASARARLVREELALY 814
Cdd:pfam05729 143 R---YLEVRGFSESDRKQYVRKYFSDE 166
|
|
| DUF4062 |
pfam13271 |
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ... |
377-485 |
8.30e-16 |
|
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif. :
Pssm-ID: 463823 Cd Length: 78 Bit Score: 74.16 E-value: 8.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 377 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 456
Cdd:pfam13271 1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
|
90 100
....*....|....*....|....*....
gi 767981392 457 IPpsynlpdhphfhwaqqyPSGRSVTEME 485
Cdd:pfam13271 67 ID-----------------PDGISYTELE 78
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1708-1868 |
3.64e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 66.20 E-value: 3.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1708 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 1757
Cdd:cd00200 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1758 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 1820
Cdd:cd00200 82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 767981392 1821 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 1868
Cdd:cd00200 161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| DUF5920 |
pfam19334 |
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ... |
164-366 |
2.84e-138 |
|
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.
Pssm-ID: 466045 Cd Length: 203 Bit Score: 428.81 E-value: 2.84e-138
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 164 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 243
Cdd:pfam19334 1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 244 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 323
Cdd:pfam19334 81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 767981392 324 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 366
Cdd:pfam19334 161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1326-1745 |
8.97e-68 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 235.19 E-value: 8.97e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1326 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1405
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1406 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 1483
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1484 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 1561
Cdd:COG2319 157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1562 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 1638
Cdd:COG2319 228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1639 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 1717
Cdd:COG2319 304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
|
410 420
....*....|....*....|....*...
gi 767981392 1718 VRAAAVSETSGLMLTASEDGSVRLWQVP 1745
Cdd:COG2319 375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1449-1785 |
7.44e-55 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 193.71 E-value: 7.44e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1449 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 1522
Cdd:cd00200 12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1523 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 1602
Cdd:cd00200 80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1603 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 1679
Cdd:cd00200 154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1680 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 1759
Cdd:cd00200 209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
|
330 340
....*....|....*....|....*.
gi 767981392 1760 VTAVAWAPDGSMAVSGNQAGELILWQ 1785
Cdd:cd00200 264 VTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1157-1435 |
3.04e-47 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 175.48 E-value: 3.04e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1157 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1235
Cdd:COG2319 124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1236 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1314
Cdd:COG2319 204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1315 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1394
Cdd:COG2319 284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 767981392 1395 PRGHLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISS 1435
Cdd:COG2319 364 LLRTLTGHT-GAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1157-1432 |
4.58e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 4.58e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1157 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1235
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1236 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1312
Cdd:cd00200 93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1313 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1392
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 767981392 1393 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1432
Cdd:cd00200 251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
|
|
| TROVE |
pfam05731 |
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ... |
1-153 |
5.59e-34 |
|
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding.
Pssm-ID: 461724 Cd Length: 361 Bit Score: 135.59 E-value: 5.59e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1 MAMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrne 79
Cdd:pfam05731 273 MAMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH-------------------------------- 320
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767981392 80 knrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 153
Cdd:pfam05731 321 ------------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
|
|
| NACHT |
pfam05729 |
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ... |
639-814 |
3.16e-32 |
|
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
Pssm-ID: 428606 [Multi-domain] Cd Length: 166 Bit Score: 123.95 E-value: 3.16e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 639 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 718
Cdd:pfam05729 1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 719 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 787
Cdd:pfam05729 77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
|
170 180
....*....|....*....|....*..
gi 767981392 788 QgahVLALGPLEASARARLVREELALY 814
Cdd:pfam05729 143 R---YLEVRGFSESDRKQYVRKYFSDE 166
|
|
| DUF4062 |
pfam13271 |
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ... |
377-485 |
8.30e-16 |
|
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.
Pssm-ID: 463823 Cd Length: 78 Bit Score: 74.16 E-value: 8.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 377 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 456
Cdd:pfam13271 1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
|
90 100
....*....|....*....|....*....
gi 767981392 457 IPpsynlpdhphfhwaqqyPSGRSVTEME 485
Cdd:pfam13271 67 ID-----------------PDGISYTELE 78
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1708-1868 |
3.64e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 66.20 E-value: 3.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1708 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 1757
Cdd:cd00200 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1758 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 1820
Cdd:cd00200 82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 767981392 1821 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 1868
Cdd:cd00200 161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1527-1566 |
1.80e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 49.23 E-value: 1.80e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 767981392 1527 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 1566
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
1439-1676 |
2.68e-07 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 55.67 E-value: 2.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1439 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 1518
Cdd:PTZ00421 73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1519 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 1596
Cdd:PTZ00421 120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1597 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 1669
Cdd:PTZ00421 182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258
|
....*..
gi 767981392 1670 CAAAMEP 1676
Cdd:PTZ00421 259 SSALFIP 265
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1530-1566 |
6.40e-07 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 47.73 E-value: 6.40e-07
10 20 30
....*....|....*....|....*....|....*..
gi 767981392 1530 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 1566
Cdd:pfam00400 3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1233-1264 |
1.95e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.76 E-value: 1.95e-04
10 20 30
....*....|....*....|....*....|..
gi 767981392 1233 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1264
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| YcjX |
COG3106 |
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ... |
626-676 |
8.35e-04 |
|
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];
Pssm-ID: 442340 Cd Length: 467 Bit Score: 44.41 E-value: 8.35e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 767981392 626 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 676
Cdd:COG3106 11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1233-1264 |
5.95e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 5.95e-03
10 20 30
....*....|....*....|....*....|..
gi 767981392 1233 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1264
Cdd:pfam00400 8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| DUF5920 |
pfam19334 |
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ... |
164-366 |
2.84e-138 |
|
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.
Pssm-ID: 466045 Cd Length: 203 Bit Score: 428.81 E-value: 2.84e-138
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 164 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 243
Cdd:pfam19334 1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 244 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 323
Cdd:pfam19334 81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 767981392 324 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 366
Cdd:pfam19334 161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1326-1745 |
8.97e-68 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 235.19 E-value: 8.97e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1326 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1405
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1406 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 1483
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1484 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 1561
Cdd:COG2319 157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1562 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 1638
Cdd:COG2319 228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1639 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 1717
Cdd:COG2319 304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
|
410 420
....*....|....*....|....*...
gi 767981392 1718 VRAAAVSETSGLMLTASEDGSVRLWQVP 1745
Cdd:COG2319 375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1160-1569 |
4.68e-59 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 210.15 E-value: 4.68e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1160 AFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDD-TLFLTAFDGLLELWDLQHGCRVLQTKAHQYQ 1238
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGaRLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1239 ITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHT-YPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGA 1317
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1318 PGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRG 1397
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLR 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1398 HLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALK 1475
Cdd:COG2319 241 TLTGHS-GSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAF-SPdgKLLASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1476 ecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSL 1553
Cdd:COG2319 319 --TGKLLRTLTGHTGAVRSVAFSPdgKTLASGSDDGTVRLW---------DLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
410
....*....|....*.
gi 767981392 1554 ATGGRDRSLLCWDVRT 1569
Cdd:COG2319 388 ASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1202-1614 |
1.38e-58 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 208.61 E-value: 1.38e-58
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1202 LFLSDDTLFLTAFDGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHTYPKS-L 1280
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAaV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1281 NCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAH 1360
Cdd:COG2319 82 LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGH 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1361 HGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGDRVAVGYRADGIRIYKISSG 1436
Cdd:COG2319 162 SGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlrtlTGHTGAVR-----SVAFSPDGKLLASGSADGTVRLWDLATG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1437 SQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQ 1512
Cdd:COG2319 237 KLLRTLTGHSGSVRSVAF-SPdgRLLASGSADGTVRLWDLA--TGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1513 LWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRDWVTGCAW 1592
Cdd:COG2319 314 LW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE---LLRTLTG-HTGAVTSVAF 380
|
410 420
....*....|....*....|...
gi 767981392 1593 TKD-NLLISCSSDGSVGLWDPES 1614
Cdd:COG2319 381 SPDgRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1449-1785 |
7.44e-55 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 193.71 E-value: 7.44e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1449 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 1522
Cdd:cd00200 12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1523 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 1602
Cdd:cd00200 80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1603 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 1679
Cdd:cd00200 154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1680 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 1759
Cdd:cd00200 209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
|
330 340
....*....|....*....|....*.
gi 767981392 1760 VTAVAWAPDGSMAVSGNQAGELILWQ 1785
Cdd:cd00200 264 VTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1533-1823 |
2.05e-52 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 186.77 E-value: 2.05e-52
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1533 ELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVrtpKTPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 1611
Cdd:cd00200 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDL---ETGELLRTLKG-HTGPVRDVAASADgTYLASGSSDKTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1612 PESGQRLGQFLGHQSAVSAVAAVEEH--VVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAamepraagqPGSELLV 1688
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAF---------SPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1689 VTVGLDGATRLWHP-LLVCQtHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 1767
Cdd:cd00200 151 ASSSQDGTIKLWDLrTGKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP 229
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1768 DGSMAVSGNQAGELILWQEAKAVATAQAPGH---IGALIWS-SAHTFFVLSADEKISEWQ 1823
Cdd:cd00200 230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHtnsVTSLAWSpDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1452-1868 |
1.12e-51 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 188.58 E-value: 1.12e-51
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1452 LAWLSPKVLVSGAEDGSLQGWALKECSLQSLWLLSRFQKPVLGLATSQELLASASEDFTVQLWPRQLLTRPHkaedfpcg 1531
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA-------- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1532 tELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKTPVLIHSfpacHRDWVTGCAWTKD-NLLISCSSDGSVGLW 1610
Cdd:COG2319 73 -TLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG----HTGAVRSVAFSPDgKTLASGSADGTVRLW 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1611 DPESGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmepraagqPGSELL 1687
Cdd:COG2319 148 DLATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS--------PDGKLL 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1688 VvTVGLDGATRLWHPLLVCQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 1767
Cdd:COG2319 220 A-SGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSP 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1768 DGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSA-HTFFVLSADEKISEWQvkLRKGSAPGNLSLHLNRIl 1843
Cdd:COG2319 299 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGavrSVAFSPDgKTLASGSDDGTVRLWD--LATGELLRTLTGHTGAV- 375
|
410 420
....*....|....*....|....*
gi 767981392 1844 qedlgvlTSLDWAPDGHFLILAKAD 1868
Cdd:COG2319 376 -------TSVAFSPDGRTLASGSAD 393
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1157-1435 |
3.04e-47 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 175.48 E-value: 3.04e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1157 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1235
Cdd:COG2319 124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1236 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1314
Cdd:COG2319 204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1315 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1394
Cdd:COG2319 284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 767981392 1395 PRGHLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISS 1435
Cdd:COG2319 364 LLRTLTGHT-GAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1276-1651 |
2.07e-43 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 160.58 E-value: 2.07e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1276 YPKSLNCVAFHPEGQVIATGSWagsisffqvdglkvtkdlgapgasirtlafnvpggvvavgrlDSMVELWAWREGARLA 1355
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSG------------------------------------------DGTIKVWDLETGELLR 45
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1356 AFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGdRVAVGYRADG-IRI 1430
Cdd:cd00200 46 TLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECvrtlTGHTSYVS-----SVAFSPDG-RILSSSSRDKtIKV 119
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1431 YKISSGS-----QGAQGQALDVAVSalawlspkvlvsgaedgslqgwalkecslqslwllsrfqkpvlglaTSQELLASA 1505
Cdd:cd00200 120 WDVETGKclttlRGHTDWVNSVAFS----------------------------------------------PDGTFVASS 153
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1506 SEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRD 1585
Cdd:cd00200 154 SQDGTIKLW---------DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK---CLGTLRG-HEN 220
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767981392 1586 WVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVAAVEE--HVVSVSRDGTLKVWD 1651
Cdd:cd00200 221 GVNSVAFSPDGyLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1157-1432 |
4.58e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 4.58e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1157 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1235
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1236 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1312
Cdd:cd00200 93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1313 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1392
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 767981392 1393 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1432
Cdd:cd00200 251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1196-1473 |
4.75e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 4.75e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1196 DGISACLFLSDDTLFLTAF-DGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLA--- 1271
Cdd:cd00200 10 GGVTCVAFSPDGKLLATGSgDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVrtl 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1272 FQHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREG 1351
Cdd:cd00200 90 TGHT--SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1352 ARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLslspaLSVALSPDGDRVAVGYRADG 1427
Cdd:cd00200 168 KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClgtlRGHENGV-----NSVAFSPDGYLLASGSEDGT 242
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 767981392 1428 IRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWA 1473
Cdd:cd00200 243 IRVWDLRTGECVQTLSGHTNSVTSLAW-SPdgKRLASGSADGTIRIWD 289
|
|
| TROVE |
pfam05731 |
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding ... |
1-153 |
5.59e-34 |
|
TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, that are RNA-binding components of Telomerase, Ro and Vault RNPs. This domain has been named TROVE, (after Telomerase, Ro and Vault). This domain is probably RNA-binding.
Pssm-ID: 461724 Cd Length: 361 Bit Score: 135.59 E-value: 5.59e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1 MAMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrne 79
Cdd:pfam05731 273 MAMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH-------------------------------- 320
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767981392 80 knrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 153
Cdd:pfam05731 321 ------------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
|
|
| NACHT |
pfam05729 |
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ... |
639-814 |
3.16e-32 |
|
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
Pssm-ID: 428606 [Multi-domain] Cd Length: 166 Bit Score: 123.95 E-value: 3.16e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 639 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 718
Cdd:pfam05729 1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 719 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 787
Cdd:pfam05729 77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
|
170 180
....*....|....*....|....*..
gi 767981392 788 QgahVLALGPLEASARARLVREELALY 814
Cdd:pfam05729 143 R---YLEVRGFSESDRKQYVRKYFSDE 166
|
|
| DUF4062 |
pfam13271 |
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ... |
377-485 |
8.30e-16 |
|
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.
Pssm-ID: 463823 Cd Length: 78 Bit Score: 74.16 E-value: 8.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 377 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 456
Cdd:pfam13271 1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
|
90 100
....*....|....*....|....*....
gi 767981392 457 IPpsynlpdhphfhwaqqyPSGRSVTEME 485
Cdd:pfam13271 67 ID-----------------PDGISYTELE 78
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1708-1868 |
3.64e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 66.20 E-value: 3.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1708 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 1757
Cdd:cd00200 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1758 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 1820
Cdd:cd00200 82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 767981392 1821 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 1868
Cdd:cd00200 161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1527-1566 |
1.80e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 49.23 E-value: 1.80e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 767981392 1527 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 1566
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
1439-1676 |
2.68e-07 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 55.67 E-value: 2.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1439 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 1518
Cdd:PTZ00421 73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1519 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 1596
Cdd:PTZ00421 120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1597 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 1669
Cdd:PTZ00421 182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258
|
....*..
gi 767981392 1670 CAAAMEP 1676
Cdd:PTZ00421 259 SSALFIP 265
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1530-1566 |
6.40e-07 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 47.73 E-value: 6.40e-07
10 20 30
....*....|....*....|....*....|....*..
gi 767981392 1530 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 1566
Cdd:pfam00400 3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1709-1742 |
4.31e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 4.31e-06
10 20 30
....*....|....*....|....*....|....
gi 767981392 1709 HTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 1742
Cdd:smart00320 6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
1449-1624 |
6.65e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 6.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1449 VSALAWLS--PKVLVSGAEDGSLQGWALKECSLQSLwlLSRFQKPVLGLATSQE---LLASASEDFTVQLWPRQlltrph 1523
Cdd:PLN00181 535 LSGICWNSyiKSQVASSNFEGVVQVWDVARSQLVTE--MKEHEKRVWSIDYSSAdptLLASGSDDGSVKLWSIN------ 606
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 1524 kaEDFPCGTelRGHEGPVSCCSFSTDGG-SLATGGRDRSLLCWDVRTPKTPVlihsfpaC----HRDWVTGCAWTKDNLL 1598
Cdd:PLN00181 607 --QGVSIGT--IKTKANICCVQFPSESGrSLAFGSADHKVYYYDLRNPKLPL-------CtmigHSKTVSYVRFVDSSTL 675
|
170 180 190
....*....|....*....|....*....|..
gi 767981392 1599 ISCSSDGSVGLWD---PESG---QRLGQFLGH 1624
Cdd:PLN00181 676 VSSSTDNTLKLWDlsmSISGineTPLHSFMGH 707
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1572-1611 |
6.84e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.61 E-value: 6.84e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 767981392 1572 TPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 1611
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1707-1742 |
1.21e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.87 E-value: 1.21e-05
10 20 30
....*....|....*....|....*....|....*.
gi 767981392 1707 QTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 1742
Cdd:pfam00400 3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1575-1611 |
1.50e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.79 E-value: 1.50e-04
10 20 30
....*....|....*....|....*....|....*...
gi 767981392 1575 LIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 1611
Cdd:pfam00400 3 LLKTLEG-HTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1233-1264 |
1.95e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.76 E-value: 1.95e-04
10 20 30
....*....|....*....|....*....|..
gi 767981392 1233 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1264
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1614-1651 |
4.36e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 4.36e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 767981392 1614 SGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 1651
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1615-1651 |
8.06e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 8.06e-04
10 20 30
....*....|....*....|....*....|....*....
gi 767981392 1615 GQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 1651
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAfsPDGKLLASGSDDGTVKVWD 39
|
|
| AAA_16 |
pfam13191 |
AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the ... |
624-757 |
8.13e-04 |
|
AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily.
Pssm-ID: 433025 [Multi-domain] Cd Length: 167 Bit Score: 42.11 E-value: 8.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 624 RLLQDTVQRLMLPHGRLSLVTGQSGQGKTAFLASLVSALqAPDGAKVASLVFFHFSGARP--DQGLALTLLRRLCT---- 697
Cdd:pfam13191 10 EQLLDALDRVRSGRPPSVLLTGEAGTGKTTLLRELLRAL-ERDGGYFLRGKCDENLPYSPllEALTREGLLRQLLDeles 88
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767981392 698 --------YLRGQLKEPGALPSTYRSLVWELQQRLLPKSAESLHPgqtQVLIIDGADRLVDQNGQLIS 757
Cdd:pfam13191 89 slleawraALLEALAPVPELPGDLAERLLDLLLRLLDLLARGERP---LVLVLDDLQWADEASLQLLA 153
|
|
| YcjX |
COG3106 |
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ... |
626-676 |
8.35e-04 |
|
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];
Pssm-ID: 442340 Cd Length: 467 Bit Score: 44.41 E-value: 8.35e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 767981392 626 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 676
Cdd:COG3106 11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
|
|
| AAA_22 |
pfam13401 |
AAA domain; |
637-748 |
1.75e-03 |
|
AAA domain;
Pssm-ID: 379165 [Multi-domain] Cd Length: 129 Bit Score: 40.40 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 637 HGRLSLVTGQSGQGKTAFLASLVSALQAPDgakvASLVFFHFSGarpdqglaLTLLRRLCTYLRGQLKEPGALPSTYRSL 716
Cdd:pfam13401 4 GAGILVLTGESGTGKTTLLRRLLEQLPEVR----DSVVFVDLPS--------GTSPKDLLRALLRALGLPLSGRLSKEEL 71
|
90 100 110
....*....|....*....|....*....|..
gi 767981392 717 VWELQQRLlpksaesLHPGQTQVLIIDGADRL 748
Cdd:pfam13401 72 LAALQQLL-------LALAVAVVLIIDEAQHL 96
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1351-1389 |
5.70e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 5.70e-03
10 20 30
....*....|....*....|....*....|....*....
gi 767981392 1351 GARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWS 1389
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1233-1264 |
5.95e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 5.95e-03
10 20 30
....*....|....*....|....*....|..
gi 767981392 1233 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1264
Cdd:pfam00400 8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| ExeA |
COG3267 |
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ... |
637-748 |
7.26e-03 |
|
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 442498 [Multi-domain] Cd Length: 261 Bit Score: 40.54 E-value: 7.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767981392 637 HGRLSLVTGQSGQGKTAFLASLVSALqaPDGAKVASLVFFHFSgarpdqglALTLLRRLCTYLRGQLKepgalPSTYRSL 716
Cdd:COG3267 42 GGGFVVLTGEVGTGKTTLLRRLLERL--PDDVKVAYIPNPQLS--------PAELLRAIADELGLEPK-----GASKADL 106
|
90 100 110
....*....|....*....|....*....|..
gi 767981392 717 VWELQQRLLPKSAESLHPgqtqVLIIDGADRL 748
Cdd:COG3267 107 LRQLQEFLLELAAAGRRV----VLIIDEAQNL 134
|
|
|