NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217298817|ref|XP_047287837|]
View 

capping protein, Arp2/3 and myosin-I linker protein 3 isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.45e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


:

Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.45e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


:

Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817   31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 2217298817  111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 super family cl34950
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.97e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


The actual alignment was detected with superfamily member COG5238:

Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.97e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238     88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238    168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238    241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238    319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238    366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
1121-1337 6.12e-06

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 6.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1121 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1200
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1201 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1280
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1281 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1337
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
 
Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.45e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.45e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817   31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 2217298817  111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.97e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.97e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238     88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238    168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238    241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238    319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238    366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
339-600 6.89e-13

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 71.23  E-value: 6.89e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116     28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116     99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116    177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2217298817  568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116    248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1121-1337 6.12e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 6.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1121 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1200
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1201 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1280
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1281 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1337
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1238-1360 1.64e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.92  E-value: 1.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217298817 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP 1360
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKP 489
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1236-1370 9.15e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.52  E-value: 9.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839   324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217298817 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRPP-DPTAGTSEP 1370
Cdd:NF033839   404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPEvKPQPETPKP 462
 
Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1066 6.45e-84

CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 276.65  E-value: 6.45e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000  241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817   31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 2217298817  111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 1.97e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 1.97e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238     88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238    168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238    241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238    319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238    366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
189-477 8.26e-14

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 75.21  E-value: 8.26e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  189 REFNLLDFSHLESRDLALMVAALAYNQWFTKLYCKDLRLGSEVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFG 268
Cdd:COG5238    154 NAVHLLGLAARLGLLAAISMAKALQNNSVETVYLGCNQIGDEGIEELAEALTQNTTVTTLWLKRNPIGDEGAEILAEALK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  269 ENGScvLHALTLSHNPIEDKGFLSLSqQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPgll 348
Cdd:COG5238    234 GNKS--LTTLDLSNNQIGDEGVIALA-EALKNNTTVETLYLSGNQIGAEGAIALAKALQGNT----TLTSLDLSVNR--- 303
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  349 ATDE-ANALYSFLAQPNALVHLDLS----GTDCVIdLLLGALLHGccSHLTYLNLARNscshRKGREAPPAFKQFFSSAY 423
Cdd:COG5238    304 IGDEgAIALAEGLQGNKTLHTLNLAyngiGAQGAI-ALAKALQEN--TTLHSLDLSDN----QIGDEGAIALAKYLEGNT 376
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217298817  424 TLSHVNLSATKLPLEALRALLQGLSLNShlsdLH-LDLSSCELRSAGAQALQEQL 477
Cdd:COG5238    377 TLRELNLGKNNIGKQGAEALIDALQTNR----LHtLILDGNLIGAEAQQRLEQLL 427
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
339-600 6.89e-13

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 71.23  E-value: 6.89e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116     28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116     99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116    177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2217298817  568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116    248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
428-628 7.58e-11

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 65.07  E-value: 7.58e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  428 VNLSATKLPLEALRALlqGLSLNSHLSDLHLDLSSCELRS--AGAQALQEQLGAVTCVGSLDLSDNGFDSDLLTLVPALG 505
Cdd:cd00116     28 LRLEGNTLGEEAAKAL--ASALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  506 KNKSLKHLFLGKN------FNVKAKTLEEILHKLVQLIQE-----------------EDCSLQSLSVADSRLKLR-TSIL 561
Cdd:cd00116    106 RSSSLQELKLNNNglgdrgLRLLAKGLKDLPPALEKLVLGrnrlegascealakalrANRDLKELNLANNGIGDAgIRAL 185
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217298817  562 INALGSNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALES-NHTLR 628
Cdd:cd00116    186 AEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSpNISLL 253
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
334-663 1.25e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 62.26  E-value: 1.25e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  334 SSLRYLDLSKNPGLlatdeanalysflAQPNALVHLDLSGTDCV-IDLLLGALlhgccSHLTYLNLARNSCShrkgrEAP 412
Cdd:COG4886     96 TNLTELDLSGNEEL-------------SNLTNLESLDLSGNQLTdLPEELANL-----TNLKELDLSNNQLT-----DLP 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  413 PAFKQFFSsaytLSHVNLSATKL-----PLEALRALlQGLSL-NSHLSDLHLDLSSCelrsagaQALQEqlgavtcvgsL 486
Cdd:COG4886    153 EPLGNLTN----LKSLDLSNNQLtdlpeELGNLTNL-KELDLsNNQITDLPEPLGNL-------TNLEE----------L 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  487 DLSDNGFDSdlltLVPALGKNKSLKHLFLGKNfnvKAKTLEEILhklvQLIqeedcSLQSLSVADSRLKLrtsilINALG 566
Cdd:COG4886    211 DLSGNQLTD----LPEPLANLTNLETLDLSNN---QLTDLPELG----NLT-----NLEELDLSNNQLTD-----LPPLA 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  567 SNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNHTLRFMSFPVSDISQAYRSAPE 646
Cdd:COG4886    270 NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLAL 349
                          330
                   ....*....|....*..
gi 2217298817  647 RTEDVWQKIQWCLVRNN 663
Cdd:COG4886    350 LTLLLLLNLLSLLLTLL 366
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
210-493 1.60e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 54.67  E-value: 1.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  210 ALAYNQWFTKLYCKDLRLGS--EVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFGengSCVLHALTLSHNPIED 287
Cdd:cd00116     46 ALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLR---SSSLQELKLNNNGLGD 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  288 KGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPglLATDEANALYSFLAQPNALV 367
Cdd:cd00116    123 RGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANR----DLKELNLANNG--IGDAGIRALAEGLKANCNLE 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  368 HLDLSgtDCVID-----LLLGALLHGCCshLTYLNLARNSCSHRKGREAPPAFKqffSSAYTLSHVNLSATKLPLEALRA 442
Cdd:cd00116    197 VLDLN--NNGLTdegasALAETLASLKS--LEVLNLGDNNLTDAGAAALASALL---SPNISLLTLSLSCNDITDDGAKD 269
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217298817  443 LLQGLSLNSHLsdLHLDLSSCELRSAGAQALQEQLGAVTC-VGSLDLSDNGF 493
Cdd:cd00116    270 LAEVLAEKESL--LELDLRGNKFGEEGAQLLAESLLEPGNeLESLWVKDDSF 319
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
185-501 3.50e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 53.90  E-value: 3.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  185 AEDNREFNLLDFS--HLESRDLALMVAALAYnQWFTKLYCKDLR------LGSEVLEQVLHtlskSGSLEELVLDNAGLK 256
Cdd:cd00116     47 LRPQPSLKELCLSlnETGRIPRGLQSLLQGL-TKGCGLQELDLSdnalgpDGCGVLESLLR----SSSLQELKLNNNGLG 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  257 TDFVQKLAGVFGENgSCVLHALTLSHNPIEDKGFLSLSQqLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANpafaSSL 336
Cdd:cd00116    122 DRGLRLLAKGLKDL-PPALEKLVLGRNRLEGASCEALAK-ALRANRDLKELNLANNGIGDAGIRALAEGLKAN----CNL 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  337 RYLDLSKNpgLLATDEANALYSFLAQPNALVHLDLSgtDCVIDLLlgallhgCCSHLtylnlarnscshrkgreappafk 416
Cdd:cd00116    196 EVLDLNNN--GLTDEGASALAETLASLKSLEVLNLG--DNNLTDA-------GAAAL----------------------- 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  417 qffssaytlshvnlsatklpLEALRALLQGLslnshlsdLHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGFDSD 496
Cdd:cd00116    242 --------------------ASALLSPNISL--------LTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEE 293

                   ....*
gi 2217298817  497 LLTLV 501
Cdd:cd00116    294 GAQLL 298
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1121-1337 6.12e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 6.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1121 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1200
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1201 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1280
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217298817 1281 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1337
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
PHA03169 PHA03169
hypothetical protein; Provisional
1122-1336 3.56e-05

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 48.04  E-value: 3.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1122 GGRGPSFRRKMGTEGSEPGEGGPAPGTA--QQPRVHGVALPGLERAKGWSF----DGKREGPGPDQEGSTQAWQKRRSSD 1195
Cdd:PHA03169    34 GRRRGTAARAAKPAPPAPTTSGPQVRAVaeQGHRQTESDTETAEESRHGEKeergQGGPSGSGSESVGSPTPSPSGSAEE 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1196 DAGPGSWKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHScQSPSPASQ-DGEEEKEGTLFPERTLPARNAKLQDPALA 1274
Cdd:PHA03169   114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES-HNPSPNQQpSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217298817 1275 PWPPKPVAVPRGRQPPQEPGvREEAEAGDAAPGVNKPRlRLSSQQDQEEPEVQGPPDPGRRT 1336
Cdd:PHA03169   193 PQSETPTSSPPPQSPPDEPG-EPQSPTPQQAPSPNTQQ-AVEHEDEPTEPEREGPPFPGHRS 252
PHA03247 PHA03247
large tegument protein UL36; Provisional
1122-1372 4.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247  2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1195 DDAGPGSWK------PPPPPQSTKPsfsamrrAEATWHIAEESAPNHSCQSPSPASQDGeeekegtlfPERTLPARnakl 1268
Cdd:PHA03247  2762 TTAGPPAPAppaapaAGPPRRLTRP-------AVASLSESRESLPSPWDPADPPAAVLA---------PAAALPPA---- 2821
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1269 QDPAlAPWPPKPVAVPRGRQPPQEPGVREEAEAGDAAPGVNKPRlRLSSQQDQEEPEVQGPPdPGRRTAPLKPKRTRRAQ 1348
Cdd:PHA03247  2822 ASPA-GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR-RPPSRSPAAKPAAPARP-PVRRLARPAVSRSTESF 2898
                          250       260
                   ....*....|....*....|....
gi 2217298817 1349 SCDKLEPDRRRPPDPTAGTSEPGT 1372
Cdd:PHA03247  2899 ALPPDQPERPPQPQAPPPPQPQPQ 2922
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
239-576 7.00e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 46.85  E-value: 7.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  239 LSKSGSLEELVLDNAGLKT--DFVQKLAGvfgengscvLHALTLSHNPIEDkgflsLSQQLLCFPSgLTKLCLAKTAIS- 315
Cdd:COG4886    109 LSNLTNLESLDLSGNQLTDlpEELANLTN---------LKELDLSNNQLTD-----LPEPLGNLTN-LKSLDLSNNQLTd 173
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  316 -PRGLQALgqtfganpafaSSLRYLDLSKNPgllATDEANAlysfLAQPNALVHLDLSGTD-CVIDLLLGALlhgccSHL 393
Cdd:COG4886    174 lPEELGNL-----------TNLKELDLSNNQ---ITDLPEP----LGNLTNLEELDLSGNQlTDLPEPLANL-----TNL 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  394 TYLNLARNscshrKGREAPpafkqFFSSAYTLSHVNLSATKlplealralLQGLSLNSHLSDL-HLDLSSCELRSAGAQA 472
Cdd:COG4886    231 ETLDLSNN-----QLTDLP-----ELGNLTNLEELDLSNNQ---------LTDLPPLANLTNLkTLDLSNNQLTDLKLKE 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817  473 LQE-------QLGAVTCVGSLDLSDNGFDSDLLTLVPALGKNKSLKHLFLGKNFNVKAKTLEEILHKLVQLIQEEDCSLQ 545
Cdd:COG4886    292 LELllglnslLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGL 371
                          330       340       350
                   ....*....|....*....|....*....|.
gi 2217298817  546 SLSVADSRLKLRTSILINALGSNTCLAKVDL 576
Cdd:COG4886    372 LGLLEATLLTLALLLLTLLLLLLTTTAGVLL 402
PHA03247 PHA03247
large tegument protein UL36; Provisional
1119-1366 1.42e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1119 GFGGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKgwsfdgkreGPGPDQEGSTQAWQKRRSSDDAG 1198
Cdd:PHA03247   262 GEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLAL---------PAPPDPPPPAPAGDAEEEDDEDG 332
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1199 PGSWKPPPPPQSTKPSFSAMRRAEATW----HIAEESAPNHSCQSPSPASQDGEEEKEG-TLFPERTLPARNAKLQDPAL 1273
Cdd:PHA03247   333 AMEVVSPLPRPRQHYPLGFPKRRRPTWtppsSLEDLSAGRHHPKRASLPTRKRRSARHAaTPFARGPGGDDQTRPAAPVP 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1274 APWP-PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVnkprlrlssqqdqeEPEVQGPPDPGRRTAPLKPKRTRRAqscdk 1352
Cdd:PHA03247   413 ASVPtPAPTPVPASAPPPPATPLPSAEPGSDDGPAP--------------PPERQPPAPATEPAPDDPDDATRKA----- 473
                          250
                   ....*....|....*
gi 2217298817 1353 LEPDR-RRPPDPTAG 1366
Cdd:PHA03247   474 LDALReRRPPEPPGA 488
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1238-1360 1.64e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.92  E-value: 1.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217298817 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP 1360
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKP 489
PHA03378 PHA03378
EBNA-3B; Provisional
1145-1370 2.41e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 2.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1145 APGTAQQP--RVHGVALPGlerakgwsfDGKREGPGPDQEGSTQaWQKRRSSDDAGPGSWKPPPPPQSTKPSFSAMRRAE 1222
Cdd:PHA03378   589 APSYAQTPwpVPHPSQTPE---------PPTTQSHIPETSAPRQ-WPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVE 658
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1223 ATWHIAEESAPNHSCQSPSPASQDgeeekegTLFPERTLPARnakLQDPALAPWPPKPVAVPRGR-QPPQEPGVREEAEA 1301
Cdd:PHA03378   659 ITPYKPTWTQIGHIPYQPSPTGAN-------TMLPIQWAPGT---MQPPPRAPTPMRPPAAPPGRaQRPAAATGRARPPA 728
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217298817 1302 GdaAPGVNKPRlrlssqqdqeepevQGPPDPGRRTAPlKPKRTRRAQSCdklePDRRRPPDPTAGTSEP 1370
Cdd:PHA03378   729 A--APGRARPP--------------AAAPGRARPPAA-APGRARPPAAA----PGRARPPAAAPGAPTP 776
PRK11633 PRK11633
cell division protein DedD; Provisional
1280-1363 1.91e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 41.53  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1280 PVAVPRGRQPPqePGVREEAEAGDAAPGVNKP-RLRLSSQQDQEEPEVQGPPDPGRRTAPlKPKRTRRAQSCDKLEPdrr 1358
Cdd:PRK11633    58 AATQALPTQPP--EGAAEAVRAGDAAAPSLDPaTVAPPNTPVEPEPAPVEPPKPKPVEKP-KPKPKPQQKVEAPPAP--- 131

                   ....*
gi 2217298817 1359 rPPDP 1363
Cdd:PRK11633   132 -KPEP 135
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1236-1370 9.15e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.52  E-value: 9.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839   324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217298817 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRPP-DPTAGTSEP 1370
Cdd:NF033839   404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPEvKPQPETPKP 462
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1142-1365 9.72e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.22  E-value: 9.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1142 GGPAPGTAQQPRVHGvALPGLERAKGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPGSWKPPPPPQSTKPSFSAMRRA 1221
Cdd:PRK07003   364 GGGAPGGGVPARVAG-AVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDA 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217298817 1222 EATWHIAEESAPNHSCQSPSPASQDGEeekegtlfpertlPARNAKlqdPALAPWPPKPVAVPRgrqppqEPGVREEAEA 1301
Cdd:PRK07003   443 ADGDAPVPAKANARASADSRCDERDAQ-------------PPADSG---SASAPASDAPPDAAF------EPAPRAAAPS 500
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217298817 1302 GDAAPGVNKPRlRLSSQQDQEEPEVQGPPDPgrRTAPLKPKRTRRAQSCD--------------KLEPDRRRPPDPTA 1365
Cdd:PRK07003   501 AATPAAVPDAR-APAAASREDAPAAAAPPAP--EARPPTPAAAAPAARAGgaaaaldvlrnagmRVSSDRGARAAAAA 575
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH