NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|156938257|ref|NP_612369|]
View 

capping protein, Arp2/3 and myosin-I linker protein 3 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.36e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


:

Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.36e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


:

Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257    31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 156938257   111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 super family cl34950
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.99e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


The actual alignment was detected with superfamily member COG5238:

Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.99e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238    88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238   168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238   241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238   319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238   366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1122-1372 1.31e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247 2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1195 DDAGPGSwKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHSCQSPSPASQDGEEEKE-GTLFPERTLPARNAKLQ-DPA 1272
Cdd:PHA03247 2762 TTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALpPAASPAGPLPPPTSAQPtAPP 2840
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1273 LAPWPPKPVAVPRGRQPPQEPgVREEAEAGDAAPGVNKPR----LRLSSQQDQEEPEVQG-PPDPGRR----TAPLKPKR 1343
Cdd:PHA03247 2841 PPPGPPPPSLPLGGSVAPGGD-VRRRPPSRSPAAKPAAPArppvRRLARPAVSRSTESFAlPPDQPERppqpQAPPPPQP 2919
                         250       260
                  ....*....|....*....|....*....
gi 156938257 1344 TRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
 
Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.36e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.36e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257    31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 156938257   111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.99e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.99e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238    88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238   168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238   241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238   319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238   366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
339-600 6.89e-13

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 71.23  E-value: 6.89e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116    28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116    99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116   177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
                         250       260       270
                  ....*....|....*....|....*....|....
gi 156938257  568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116   248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
PHA03247 PHA03247
large tegument protein UL36; Provisional
1122-1372 1.31e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247 2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1195 DDAGPGSwKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHSCQSPSPASQDGEEEKE-GTLFPERTLPARNAKLQ-DPA 1272
Cdd:PHA03247 2762 TTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALpPAASPAGPLPPPTSAQPtAPP 2840
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1273 LAPWPPKPVAVPRGRQPPQEPgVREEAEAGDAAPGVNKPR----LRLSSQQDQEEPEVQG-PPDPGRR----TAPLKPKR 1343
Cdd:PHA03247 2841 PPPGPPPPSLPLGGSVAPGGD-VRRRPPSRSPAAKPAAPArppvRRLARPAVSRSTESFAlPPDQPERppqpQAPPPPQP 2919
                         250       260
                  ....*....|....*....|....*....
gi 156938257 1344 TRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1238-1369 1.21e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839  359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156938257 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1369
Cdd:NF033839  437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1236-1371 5.96e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.91  E-value: 5.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839  324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1371
Cdd:NF033839  404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
 
Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.36e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.36e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257   935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257    31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 156938257   111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.99e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.99e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238    88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238   168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238   241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238   319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238   366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
189-477 8.18e-14

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 75.21  E-value: 8.18e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  189 REFNLLDFSHLESRDLALMVAALAYNQWFTKLYCKDLRLGSEVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFG 268
Cdd:COG5238   154 NAVHLLGLAARLGLLAAISMAKALQNNSVETVYLGCNQIGDEGIEELAEALTQNTTVTTLWLKRNPIGDEGAEILAEALK 233
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  269 ENGScvLHALTLSHNPIEDKGFLSLSqQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPgll 348
Cdd:COG5238   234 GNKS--LTTLDLSNNQIGDEGVIALA-EALKNNTTVETLYLSGNQIGAEGAIALAKALQGNT----TLTSLDLSVNR--- 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  349 ATDE-ANALYSFLAQPNALVHLDLS----GTDCVIdLLLGALLHGccSHLTYLNLARNscshRKGREAPPAFKQFFSSAY 423
Cdd:COG5238   304 IGDEgAIALAEGLQGNKTLHTLNLAyngiGAQGAI-ALAKALQEN--TTLHSLDLSDN----QIGDEGAIALAKYLEGNT 376
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 156938257  424 TLSHVNLSATKLPLEALRALLQGLSLNShlsdLH-LDLSSCELRSAGAQALQEQL 477
Cdd:COG5238   377 TLRELNLGKNNIGKQGAEALIDALQTNR----LHtLILDGNLIGAEAQQRLEQLL 427
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
339-600 6.89e-13

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 71.23  E-value: 6.89e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116    28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116    99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116   177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
                         250       260       270
                  ....*....|....*....|....*....|....
gi 156938257  568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116   248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
428-628 7.57e-11

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 65.07  E-value: 7.57e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  428 VNLSATKLPLEALRALlqGLSLNSHLSDLHLDLSSCELRS--AGAQALQEQLGAVTCVGSLDLSDNGFDSDLLTLVPALG 505
Cdd:cd00116    28 LRLEGNTLGEEAAKAL--ASALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLL 105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  506 KNKSLKHLFLGKN------FNVKAKTLEEILHKLVQLIQE-----------------EDCSLQSLSVADSRLKLR-TSIL 561
Cdd:cd00116   106 RSSSLQELKLNNNglgdrgLRLLAKGLKDLPPALEKLVLGrnrlegascealakalrANRDLKELNLANNGIGDAgIRAL 185
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156938257  562 INALGSNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALES-NHTLR 628
Cdd:cd00116   186 AEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSpNISLL 253
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
334-663 1.32e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 1.32e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  334 SSLRYLDLSKNPGLlatdeanalysflAQPNALVHLDLSGTDCV-IDLLLGALlhgccSHLTYLNLARNSCShrkgrEAP 412
Cdd:COG4886    96 TNLTELDLSGNEEL-------------SNLTNLESLDLSGNQLTdLPEELANL-----TNLKELDLSNNQLT-----DLP 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  413 PAFKQFFSsaytLSHVNLSATKL-----PLEALRALlQGLSL-NSHLSDLHLDLSSCelrsagaQALQEqlgavtcvgsL 486
Cdd:COG4886   153 EPLGNLTN----LKSLDLSNNQLtdlpeELGNLTNL-KELDLsNNQITDLPEPLGNL-------TNLEE----------L 210
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  487 DLSDNGFDSdlltLVPALGKNKSLKHLFLGKNfnvKAKTLEEILhklvQLIqeedcSLQSLSVADSRLKLrtsilINALG 566
Cdd:COG4886   211 DLSGNQLTD----LPEPLANLTNLETLDLSNN---QLTDLPELG----NLT-----NLEELDLSNNQLTD-----LPPLA 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  567 SNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNHTLRFMSFPVSDISQAYRSAPE 646
Cdd:COG4886   270 NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLAL 349
                         330
                  ....*....|....*..
gi 156938257  647 RTEDVWQKIQWCLVRNN 663
Cdd:COG4886   350 LTLLLLLNLLSLLLTLL 366
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
210-493 1.58e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 54.67  E-value: 1.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  210 ALAYNQWFTKLYCKDLRLGS--EVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFGengSCVLHALTLSHNPIED 287
Cdd:cd00116    46 ALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLR---SSSLQELKLNNNGLGD 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  288 KGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPglLATDEANALYSFLAQPNALV 367
Cdd:cd00116   123 RGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANR----DLKELNLANNG--IGDAGIRALAEGLKANCNLE 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  368 HLDLSgtDCVID-----LLLGALLHGCCshLTYLNLARNSCSHRKGREAPPAFKqffSSAYTLSHVNLSATKLPLEALRA 442
Cdd:cd00116   197 VLDLN--NNGLTdegasALAETLASLKS--LEVLNLGDNNLTDAGAAALASALL---SPNISLLTLSLSCNDITDDGAKD 269
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 156938257  443 LLQGLSLNSHLsdLHLDLSSCELRSAGAQALQEQLGAVTC-VGSLDLSDNGF 493
Cdd:cd00116   270 LAEVLAEKESL--LELDLRGNKFGEEGAQLLAESLLEPGNeLESLWVKDDSF 319
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
185-501 3.50e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 53.90  E-value: 3.50e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  185 AEDNREFNLLDFS--HLESRDLALMVAALAYnQWFTKLYCKDLR------LGSEVLEQVLHtlskSGSLEELVLDNAGLK 256
Cdd:cd00116    47 LRPQPSLKELCLSlnETGRIPRGLQSLLQGL-TKGCGLQELDLSdnalgpDGCGVLESLLR----SSSLQELKLNNNGLG 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  257 TDFVQKLAGVFGENgSCVLHALTLSHNPIEDKGFLSLSQqLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANpafaSSL 336
Cdd:cd00116   122 DRGLRLLAKGLKDL-PPALEKLVLGRNRLEGASCEALAK-ALRANRDLKELNLANNGIGDAGIRALAEGLKAN----CNL 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  337 RYLDLSKNpgLLATDEANALYSFLAQPNALVHLDLSgtDCVIDLLlgallhgCCSHLtylnlarnscshrkgreappafk 416
Cdd:cd00116   196 EVLDLNNN--GLTDEGASALAETLASLKSLEVLNLG--DNNLTDA-------GAAAL----------------------- 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  417 qffssaytlshvnlsatklpLEALRALLQGLslnshlsdLHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGFDSD 496
Cdd:cd00116   242 --------------------ASALLSPNISL--------LTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEE 293

                  ....*
gi 156938257  497 LLTLV 501
Cdd:cd00116   294 GAQLL 298
PHA03247 PHA03247
large tegument protein UL36; Provisional
1122-1372 1.31e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247 2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1195 DDAGPGSwKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHSCQSPSPASQDGEEEKE-GTLFPERTLPARNAKLQ-DPA 1272
Cdd:PHA03247 2762 TTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALpPAASPAGPLPPPTSAQPtAPP 2840
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1273 LAPWPPKPVAVPRGRQPPQEPgVREEAEAGDAAPGVNKPR----LRLSSQQDQEEPEVQG-PPDPGRR----TAPLKPKR 1343
Cdd:PHA03247 2841 PPPGPPPPSLPLGGSVAPGGD-VRRRPPSRSPAAKPAAPArppvRRLARPAVSRSTESFAlPPDQPERppqpQAPPPPQP 2919
                         250       260
                  ....*....|....*....|....*....
gi 156938257 1344 TRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1121-1337 5.86e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 5.86e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1121 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1200
Cdd:PRK07764  597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1201 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1280
Cdd:PRK07764  672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 1281 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1337
Cdd:PRK07764  750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
PHA03169 PHA03169
hypothetical protein; Provisional
1122-1336 3.53e-05

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 48.04  E-value: 3.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMGTEGSEPGEGGPAPGTA--QQPRVHGVALPGLERAKGWSF----DGKREGPGPDQEGSTQAWQKRRSSD 1195
Cdd:PHA03169   34 GRRRGTAARAAKPAPPAPTTSGPQVRAVaeQGHRQTESDTETAEESRHGEKeergQGGPSGSGSESVGSPTPSPSGSAEE 113
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1196 DAGPGSWKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHScQSPSPASQ-DGEEEKEGTLFPERTLPARNAKLQDPALA 1274
Cdd:PHA03169  114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES-HNPSPNQQpSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 156938257 1275 PWPPKPVAVPRGRQPPQEPGvREEAEAGDAAPGVNKPRlRLSSQQDQEEPEVQGPPDPGRRT 1336
Cdd:PHA03169  193 PQSETPTSSPPPQSPPDEPG-EPQSPTPQQAPSPNTQQ-AVEHEDEPTEPEREGPPFPGHRS 252
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
239-576 7.50e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 46.85  E-value: 7.50e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  239 LSKSGSLEELVLDNAGLKT--DFVQKLAGvfgengscvLHALTLSHNPIEDkgflsLSQQLLCFPSgLTKLCLAKTAIS- 315
Cdd:COG4886   109 LSNLTNLESLDLSGNQLTDlpEELANLTN---------LKELDLSNNQLTD-----LPEPLGNLTN-LKSLDLSNNQLTd 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  316 -PRGLQALgqtfganpafaSSLRYLDLSKNPgllATDEANAlysfLAQPNALVHLDLSGTD-CVIDLLLGALlhgccSHL 393
Cdd:COG4886   174 lPEELGNL-----------TNLKELDLSNNQ---ITDLPEP----LGNLTNLEELDLSGNQlTDLPEPLANL-----TNL 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  394 TYLNLARNscshrKGREAPpafkqFFSSAYTLSHVNLSATKlplealralLQGLSLNSHLSDL-HLDLSSCELRSAGAQA 472
Cdd:COG4886   231 ETLDLSNN-----QLTDLP-----ELGNLTNLEELDLSNNQ---------LTDLPPLANLTNLkTLDLSNNQLTDLKLKE 291
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257  473 LQE-------QLGAVTCVGSLDLSDNGFDSDLLTLVPALGKNKSLKHLFLGKNFNVKAKTLEEILHKLVQLIQEEDCSLQ 545
Cdd:COG4886   292 LELllglnslLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGL 371
                         330       340       350
                  ....*....|....*....|....*....|.
gi 156938257  546 SLSVADSRLKLRTSILINALGSNTCLAKVDL 576
Cdd:COG4886   372 LGLLEATLLTLALLLLTLLLLLLTTTAGVLL 402
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1238-1369 1.21e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839  359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156938257 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1369
Cdd:NF033839  437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
PHA03247 PHA03247
large tegument protein UL36; Provisional
1119-1367 1.77e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1119 GFGGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKgwsfdgkreGPGPDQEGSTQAWQKRRSSDDAG 1198
Cdd:PHA03247  262 GEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLAL---------PAPPDPPPPAPAGDAEEEDDEDG 332
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1199 PGSWKPPPPPQSTKPSFSAMRRAEATW----HIAEESAPNHSCQSPSPASQDGEEEKEG-TLFPERTLPARNAKLQDPAL 1273
Cdd:PHA03247  333 AMEVVSPLPRPRQHYPLGFPKRRRPTWtppsSLEDLSAGRHHPKRASLPTRKRRSARHAaTPFARGPGGDDQTRPAAPVP 412
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1274 APWP-PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVnkprlrlssqqdqeEPEVQGPPDPGRRTAPLKPKRTRRAqscdk 1352
Cdd:PHA03247  413 ASVPtPAPTPVPASAPPPPATPLPSAEPGSDDGPAP--------------PPERQPPAPATEPAPDDPDDATRKA----- 473
                         250
                  ....*....|....*.
gi 156938257 1353 LEPDR-RRPPDPTGTS 1367
Cdd:PHA03247  474 LDALReRRPPEPPGAD 489
PRK11633 PRK11633
cell division protein DedD; Provisional
1280-1369 7.62e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 7.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1280 PVAVPRGRQPPqePGVREEAEAGDAAPGVNKP-RLRLSSQQDQEEPEVQGPPDPGRRTAPlKPKRTRRAQSCDKLEPDRR 1358
Cdd:PRK11633   58 AATQALPTQPP--EGAAEAVRAGDAAAPSLDPaTVAPPNTPVEPEPAPVEPPKPKPVEKP-KPKPKPQQKVEAPPAPKPE 134
                          90
                  ....*....|.
gi 156938257 1359 RPPDPTGTSEP 1369
Cdd:PRK11633  135 PKPVVEEKAAP 145
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1104-1372 3.00e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 3.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1104 SSPCWSPEEESSLLPGFGGGRGPSFRRK---MGTEGSEPGEGGPAP-GTAQQPRVHGVALPGLER-AKGWSFDGKREGPG 1178
Cdd:PHA03307  206 PPRRSSPISASASSPAPAPGRSAADDAGassSDSSSSESSGCGWGPeNECPLPRPAPITLPTRIWeASGWNGPSSRPGPA 285
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1179 PDQEGStqawqkRRSSDDAGPGSwkpppppqSTKPSFSAMRRAeatwhiAEESAPNHSCQSPSPASQDgeeekegtlFPE 1258
Cdd:PHA03307  286 SSSSSP------RERSPSPSPSS--------PGSGPAPSSPRA------SSSSSSSRESSSSSTSSSS---------ESS 336
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1259 RTLPARNAKLQDPALAPWPPKPVAV---PRGRQPPQEPGVREEAEAGDAAP-GVNKPRLRLSSQQDQeepevqgppdPGR 1334
Cdd:PHA03307  337 RGAAVSPGPSPSRSPSPSRPPPPADpssPRKRPRPSRAPSSPAASAGRPTRrRARAAVAGRARRRDA----------TGR 406
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 156938257 1335 RTAPLKPKRTRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03307  407 FPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPWPGSP 444
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1134-1372 4.06e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 4.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1134 TEGSEPGEGGPA--PGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPGSWKPPPPPQST 1211
Cdd:PRK07003  363 TGGGAPGGGVPArvAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDA 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1212 KPSFSAMRRAEATWHIAEESAPNHSCQSPS-------------PASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPP 1278
Cdd:PRK07003  443 ADGDAPVPAKANARASADSRCDERDAQPPAdsgsasapasdapPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAP 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1279 KPVAVPRGRQPPQEPGV-REEAEAGDAAPGVNKPR---LRLSSQQDQE---------EPEVQGPPDPGRRTAPLKPKRTR 1345
Cdd:PRK07003  523 AAAAPPAPEARPPTPAAaAPAARAGGAAAALDVLRnagMRVSSDRGARaaaaakpaaAPAAAPKPAAPRVAVQVPTPRAR 602
                         250       260
                  ....*....|....*....|....*....
gi 156938257 1346 RAQSCDKLEPDRRRPP--DPTGTSEPGTD 1372
Cdd:PRK07003  603 AATGDAPPNGAARAEQaaESRGAPPPWED 631
PHA03378 PHA03378
EBNA-3B; Provisional
1145-1369 4.55e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 4.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1145 APGTAQQP--RVHGVALPGlerakgwsfDGKREGPGPDQEGSTQaWQKRRSSDDAGPGSWKPPPPPQSTKPSFSAMRRAE 1222
Cdd:PHA03378  589 APSYAQTPwpVPHPSQTPE---------PPTTQSHIPETSAPRQ-WPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVE 658
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1223 ATWHIAEESAPNHSCQSPSPASQDgeeekegTLFPERTLPARnakLQDPALAPWPPKPVAVPRGR-QPPQEPGVREEAEA 1301
Cdd:PHA03378  659 ITPYKPTWTQIGHIPYQPSPTGAN-------TMLPIQWAPGT---MQPPPRAPTPMRPPAAPPGRaQRPAAATGRARPPA 728
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1302 GdaAPGVNKPRlrlssqqdqeepevQGPPDPGRRTAPlKPKRTRRAQSCdklePDRRRPPD-----PTGTSEP 1369
Cdd:PHA03378  729 A--APGRARPP--------------AAAPGRARPPAA-APGRARPPAAA----PGRARPPAaapgaPTPQPPP 780
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1236-1371 5.96e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.91  E-value: 5.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839  324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1371
Cdd:NF033839  404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH