NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462588866|ref|XP_054202043|]
View 

target of Nesh-SH3 isoform X52 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
478-914 1.07e-16

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.53  E-value: 1.07e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  781 TTTTKRTRRPHPKPKTTPHPEVpqtklapKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 860
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPV-------RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  861 DVLLPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETrGIPFIPMISPSPSQE 914
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRV-AVPRFRVPQPAPSRE 2987
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1155-1246 2.60e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1155 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1232
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 2462588866 1233 LGEGPVSNTVAFST 1246
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
778-1087 1.45e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKT----------------SPRPRIPQTQPvPKVPQRVT 841
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasddagDPPPPLPPAAP-PAAPDRSV 2568
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  842 AKPKTSPSP-EVSYTTPAPKdvllPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETRGIPfiPMISPSPSQEELQTTL 920
Cdd:PHA03247  2569 PPPRPAPRPsEPAVTSRARR----PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP--PPPSPSPAANEPDPHP 2642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  921 EETDQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQA 997
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLP 2722
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  998 PRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIE 1077
Cdd:PHA03247  2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESR 2795
                          330
                   ....*....|
gi 2462588866 1078 TDIKQPTVPA 1087
Cdd:PHA03247  2796 ESLPSPWDPA 2805
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 2.07e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


:

Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.45  E-value: 2.07e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866   124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 2462588866   200 GVK 202
Cdd:smart00060   73 RVR 75
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
317-602 4.77e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.57  E-value: 4.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  317 ALPAESKTPEVEKISArPTTVTPETvPRSTKPTTSSALDVSETTLVLSKRTPETLQTIlipqfelPLSTLAPKSLPEFPE 396
Cdd:pfam17823  129 SLPAAIAALPSEAFSA-PRAAACRA-NASAAPRAAIAAASAPHAASPAPRTAASSTTA-------ASSTTAASSAPTTAA 199
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  397 AKTPFPFEKPRGTLASS----EKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTS 472
Cdd:pfam17823  200 SSAPATLTPARGISTAAtatgHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARR 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  473 RTLEQPRATLAPSETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEP 551
Cdd:pfam17823  280 LSPAKHMPSDTMARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462588866  552 TwTTPAPgktqfislKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 602
Cdd:pfam17823  359 S-ASPVP--------VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-914 1.07e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.53  E-value: 1.07e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  781 TTTTKRTRRPHPKPKTTPHPEVpqtklapKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 860
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPV-------RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  861 DVLLPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETrGIPFIPMISPSPSQE 914
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRV-AVPRFRVPQPAPSRE 2987
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1155-1246 2.60e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1155 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1232
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 2462588866 1233 LGEGPVSNTVAFST 1246
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1156-1236 4.46e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.85  E-value: 4.46e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  1156 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1233
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 2462588866  1234 GEG 1236
Cdd:smart00060   81 GEG 83
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1087 1.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKT----------------SPRPRIPQTQPvPKVPQRVT 841
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasddagDPPPPLPPAAP-PAAPDRSV 2568
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  842 AKPKTSPSP-EVSYTTPAPKdvllPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETRGIPfiPMISPSPSQEELQTTL 920
Cdd:PHA03247  2569 PPPRPAPRPsEPAVTSRARR----PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP--PPPSPSPAANEPDPHP 2642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  921 EETDQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQA 997
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLP 2722
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  998 PRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIE 1077
Cdd:PHA03247  2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESR 2795
                          330
                   ....*....|
gi 2462588866 1078 TDIKQPTVPA 1087
Cdd:PHA03247  2796 ESLPSPWDPA 2805
fn3 pfam00041
Fibronectin type III domain;
1156-1239 1.54e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 44.71  E-value: 1.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1156 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1232
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 2462588866 1233 LGEGPVS 1239
Cdd:pfam00041   79 GGEGPPS 85
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
972-1251 9.55e-05

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 9.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  972 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1051
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1052 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1128
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1129 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1207
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462588866 1208 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1251
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1150-1294 1.30e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1150 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1227
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588866 1228 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1294
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
444-865 1.47e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  444 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 515
Cdd:pfam05109  310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  516 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 585
Cdd:pfam05109  390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  586 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 664
Cdd:pfam05109  469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  665 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 742
Cdd:pfam05109  549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  743 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 821
Cdd:pfam05109  626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 2462588866  822 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 865
Cdd:pfam05109  706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 2.07e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.45  E-value: 2.07e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866   124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 2462588866   200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 2.11e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.25  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 2462588866  200 GVK 202
Cdd:pfam00041   72 RVQ 74
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
317-602 4.77e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.57  E-value: 4.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  317 ALPAESKTPEVEKISArPTTVTPETvPRSTKPTTSSALDVSETTLVLSKRTPETLQTIlipqfelPLSTLAPKSLPEFPE 396
Cdd:pfam17823  129 SLPAAIAALPSEAFSA-PRAAACRA-NASAAPRAAIAAASAPHAASPAPRTAASSTTA-------ASSTTAASSAPTTAA 199
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  397 AKTPFPFEKPRGTLASS----EKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTS 472
Cdd:pfam17823  200 SSAPATLTPARGISTAAtatgHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARR 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  473 RTLEQPRATLAPSETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEP 551
Cdd:pfam17823  280 LSPAKHMPSDTMARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462588866  552 TwTTPAPgktqfislKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 602
Cdd:pfam17823  359 S-ASPVP--------VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 8.64e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.79  E-value: 8.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 2462588866  201 VK 202
Cdd:cd00063     74 VR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-830 1.09e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  597 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 672
Cdd:NF033839   284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  673 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 752
Cdd:NF033839   362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588866  753 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 830
Cdd:NF033839   428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
790-860 3.59e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 3.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588866  790 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 860
Cdd:NF033838   418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-914 1.07e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.53  E-value: 1.07e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  781 TTTTKRTRRPHPKPKTTPHPEVpqtklapKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 860
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPV-------RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  861 DVLLPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETrGIPFIPMISPSPSQE 914
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRV-AVPRFRVPQPAPSRE 2987
PHA03247 PHA03247
large tegument protein UL36; Provisional
635-1078 1.36e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.36e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQ--------PGSKPPKQLLPKPQTTAEPDMPPTKSV 702
Cdd:PHA03247  2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSarprapvdDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  703 SEPVPFETEAPSMTIVPttdiEPVTVRTEATVTTLA-PKTSQRTRTRRPRPKHKTTPRPETLQTkldfgPITPGTSSA-- 779
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVP----PPERPRDDPAPGRVSrPRRARRLGRAAQASSPPQRPRRRAARP-----TVGSLTSLAdp 2701
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  780 PTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPR-------PRIPQTQPVPKVPQRVT--AKPKTSPSP 850
Cdd:PHA03247  2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpggPARPARPPTTAGPPAPAppAAPAAGPPR 2781
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  851 EVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLE---TRGIPFIPMISPSPSQEELQTtleETDQST 927
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLpppTSAQPTAPPPPPGPPPPSLPL---GGSVAP 2858
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  928 QEPFTTKIPRTTELAKTTQAPH-RFYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPSGNgvgTGVKQAPRPSGADRn 1006
Cdd:PHA03247  2859 GGDVRRRPPSRSPAAKPAAPARpPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ---PQPPPPPQPQPPPP- 2934
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462588866 1007 vsvdsTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGII-----SSGPITTPPLRSTPRPTGTPLERIET 1078
Cdd:PHA03247  2935 -----PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPASSTPPLTGHSLSRVSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
382-899 4.11e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.04  E-value: 4.11e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247  2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247  2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPevPQTKLAPKQTPRAPP---KPKTSPRP-RIPQTQPVPKVP 837
Cdd:PHA03247  2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP--LTGHSLSRVSSWASSlalHEETDPPPvSLKQTLWPPDDT 3034
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462588866  838 QRVTAKPKTSPSPEVSYTT---PAPKDVLLP--HKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETR 899
Cdd:PHA03247  3035 EDSDADSLFDSDSERSDLEaldPLPPEPHDPfaHEPDPATPEAGARESPSSQFGPPPLSANAALSRR 3101
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1155-1246 2.60e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 2.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1155 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1232
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 2462588866 1233 LGEGPVSNTVAFST 1246
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1156-1236 4.46e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.85  E-value: 4.46e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  1156 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1233
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 2462588866  1234 GEG 1236
Cdd:smart00060   81 GEG 83
PRK10263 PRK10263
DNA translocase FtsK; Provisional
701-895 7.08e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.85  E-value: 7.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  701 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 780
Cdd:PRK10263   315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  781 TTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYttpAPK 860
Cdd:PRK10263   383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF---APQ 459
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2462588866  861 DVLLPHKPYPE-VSQSEPVLQPvtfRFEPPKTTIAP 895
Cdd:PRK10263   460 STYQTEQTYQQpAAQEPLYQQP---QPVEQQPVVEP 492
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
533-986 1.12e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 50.07  E-value: 1.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  533 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVTHTKPAP-EPQTLLPSQSTIGPETPGTKPST 606
Cdd:PTZ00449   533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEFPKDPKHPkDPEEPKKPKRPRSAQRPTRPKSP 612
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  607 TLaprktkrpgrrprprprpktTPSPEVPKSKPALEPATIQPEPLVPTtaskpseRPKTTHRPDAPQIQPGSKPPKQllP 686
Cdd:PTZ00449   613 KL--------------------PELLDIPKSPKRPESPKSPKRPPPPQ-------RPSSPERPEGPKIIKSPKPPKS--P 663
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  687 KPqttaepdmpptksvsepvPFEteaPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPrpetLQTK 766
Cdd:PTZ00449   664 KP------------------PFD---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP----FTTP 718
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  767 LDFGPITPGTSSAPTTTTKRTRRPHPKP-KTTPHPEVPQTKLapKQTPRAPPKPK-TSPRPRIPQTQPVPKVPQRVTAKP 844
Cdd:PTZ00449   719 RPLPPKLPRDEEFPFEPIGDPDAEQPDDiEFFTPPEEERTFF--HETPADTPLPDiLAEEFKEEDIHAETGEPDEAMKRP 796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  845 KtSPSpevSYTTPAPKDvllpHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPlETRGIPFipMISPSPSQEELQTTLEETD 924
Cdd:PTZ00449   797 D-SPS---EHEDKPPGD----HPSLPKKRHRLDGLALSTTDLESDAGRIAK-DASGKIV--KLKRSKSFDDLTTVEEAEE 865
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  925 QSTQEPFTTKIPRTTELAKTTQAP--HRFYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS 986
Cdd:PTZ00449   866 MGAEARKIVVDDDGTEADDEDTHPpeEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPS 929
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1087 1.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKT----------------SPRPRIPQTQPvPKVPQRVT 841
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasddagDPPPPLPPAAP-PAAPDRSV 2568
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  842 AKPKTSPSP-EVSYTTPAPKdvllPHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETRGIPfiPMISPSPSQEELQTTL 920
Cdd:PHA03247  2569 PPPRPAPRPsEPAVTSRARR----PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP--PPPSPSPAANEPDPHP 2642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  921 EETDQSTQEPFTTKIPRTTELAKTTQAPHR-FYTTVRPRTSDKPHIRPVLNRTTT--RPTRPKPSGMPSGNGVGTGVKQA 997
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRaAQASSPPQRPRRRAARPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLP 2722
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  998 PRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPpnnvTGKPGSAGiiSSGPITTPPlRSTPRPTGTPLERIE 1077
Cdd:PHA03247  2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT----AGPPAPAP--PAAPAAGPP-RRLTRPAVASLSESR 2795
                          330
                   ....*....|
gi 2462588866 1078 TDIKQPTVPA 1087
Cdd:PHA03247  2796 ESLPSPWDPA 2805
fn3 pfam00041
Fibronectin type III domain;
1156-1239 1.54e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 44.71  E-value: 1.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1156 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1232
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 2462588866 1233 LGEGPVS 1239
Cdd:pfam00041   79 GGEGPPS 85
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
790-876 6.26e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 6.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  790 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 869
Cdd:PRK14950   366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445

                   ....*..
gi 2462588866  870 PEVSQSE 876
Cdd:PRK14950   446 PPAPPKE 452
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
972-1251 9.55e-05

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 9.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  972 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1051
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1052 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1128
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1129 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1207
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462588866 1208 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1251
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
632-850 1.24e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.46  E-value: 1.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  632 PEVPKSKPALEPATIQPEPLVPTTASKPSER--PK---TTHRPDAPQIQ-PGSKPPKQLLPKPQTTAEPDMPPTKSVSEP 705
Cdd:PLN03209   328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEepPQpkaVVPRPLSPYTAyEDLKPPTSPIPTPPSSSPASSKSVDAVAKP 407
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  706 VPFETEAPSMTIVPTTDIEPVTVRTE--------ATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTklDFGPITPGTS 777
Cdd:PLN03209   408 AEPDVVPSPGSASNVPEVEPAQVEAKktrplspyARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP--DTAPATAATD 485
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  778 SA-PTTTTKRTRRPHP-----KPKTTPHPEVPQTKLAPKQTPRAPP----KPKTSPRPRIPQTQPVPK----VPQRVTAK 843
Cdd:PLN03209   486 AAaPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKvgnsAPPTALADEQHHAQPKPRplspYTMYEDLK 565

                   ....*..
gi 2462588866  844 PKTSPSP 850
Cdd:PLN03209   566 PPTSPTP 572
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
754-849 1.26e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 46.60  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  754 HKTTPRPETLQTKldfgpitpGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPkqtprAPPKPKTSPR--PRIPQTQ 831
Cdd:PRK14959   394 AATIPTPGTQGPQ--------GTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPP-----APPRSGIPPRpaPRMPEAS 460
                           90
                   ....*....|....*...
gi 2462588866  832 PVPKVPQRVTAKPKTSPS 849
Cdd:PRK14959   461 PVPGAPDSVASASDAPPT 478
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1150-1294 1.30e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866 1150 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1227
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588866 1228 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1294
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
444-865 1.47e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  444 PTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAP-------SETPF-VPQKLEIFTSPEMQPTTPAPQQTTSI 515
Cdd:pfam05109  310 PASQDMPTNTTDITYVGDNATYSVPMVTSEDANSPNVTVTAfwawpnnTETDFkCKWTLTSGTPSGCENISGAFASNRTF 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  516 PSTPKRRPRPKPPRTKPERTTSAGTITPKI--SKSPEPTWTTPAPGKTQFISLKPKIPLsPEVTH-----TKPA---PEP 585
Cdd:pfam05109  390 DITVSGLGTAPKTLIITRTATNATTTTHKVifSKAPESTTTSPTLNTTGFAAPNTTTGL-PSSTHvptnlTAPAstgPTV 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  586 QTLLPSQSTIGPETPGTKPST-TLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPK 664
Cdd:pfam05109  469 STADVTSPTPAGTTSGASPVTpSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV 548
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  665 TTHRPDAPQIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTV--RTEATVTTLAPKTS 742
Cdd:pfam05109  549 TTPTPNATSPTPAVTTPT---PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNA 625
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  743 QRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAP-PKPKT 821
Cdd:pfam05109  626 TSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPaPRPGT 705
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 2462588866  822 SPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLP 865
Cdd:pfam05109  706 TSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVP 749
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 2.07e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.45  E-value: 2.07e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866   124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 2462588866   200 GVK 202
Cdd:smart00060   73 RVR 75
fn3 pfam00041
Fibronectin type III domain;
123-202 2.11e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.25  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 2462588866  200 GVK 202
Cdd:pfam00041   72 RVQ 74
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
768-853 2.16e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.04  E-value: 2.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  768 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPriPQTQPVPKVPQRVTAKPKTS 847
Cdd:PRK12270    35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA--AAAPAAPPAAAAAAAPAAAA 112

                   ....*.
gi 2462588866  848 PSPEVS 853
Cdd:PRK12270   113 VEDEVT 118
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
565-892 2.85e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.53  E-value: 2.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  565 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 643
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  644 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPT--------KSVSEPVPFE----TE 711
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMphslqtgpSHMQHPVPPQpfplTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  712 APSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPH 791
Cdd:pfam03154  303 QSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPS 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  792 PKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQR--VTAKPKTSPSPEVSYTTPAPKDVLLPHKPY 869
Cdd:pfam03154  383 PFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          330       340
                   ....*....|....*....|...
gi 2462588866  870 PEVSQSEPVLQPVTFRFEPPKTT 892
Cdd:pfam03154  463 PQHPFVPGGPPPITPPSGPPTST 485
PHA03247 PHA03247
large tegument protein UL36; Provisional
319-713 3.64e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 3.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  319 PAESKTPEvekisARPttvtPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTILIPQFELPLSTLAPKSLPEfpeak 398
Cdd:PHA03247  2702 PPPPPTPE-----PAP----HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP----- 2767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  399 TPFPFEKPRGTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPeisdsytaTSDRILDSIPPKTSRTLEQP 478
Cdd:PHA03247  2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------PAASPAGPLPPPTSAQPTAP 2839
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  479 RATLAPSETPFVPQKLEIFTSPEMQptTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTItPKISKSPEPTWTTPAP 558
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSVAPGGDVRR--RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-PPDQPERPPQPQAPPP 2916
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  559 GKTQFISLKPKIPLSPevthtkPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSK 638
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPP------PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588866  639 PALEPATIQPEPLVPTTASKPSERPKTTHRPDApqiqpgskpPKQLLPKPQTTAEPDmPPTKSVSEPVPFETEAP 713
Cdd:PHA03247  2991 SSTPPLTGHSLSRVSSWASSLALHEETDPPPVS---------LKQTLWPPDDTEDSD-ADSLFDSDSERSDLEAL 3055
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
317-602 4.77e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.57  E-value: 4.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  317 ALPAESKTPEVEKISArPTTVTPETvPRSTKPTTSSALDVSETTLVLSKRTPETLQTIlipqfelPLSTLAPKSLPEFPE 396
Cdd:pfam17823  129 SLPAAIAALPSEAFSA-PRAAACRA-NASAAPRAAIAAASAPHAASPAPRTAASSTTA-------ASSTTAASSAPTTAA 199
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  397 AKTPFPFEKPRGTLASS----EKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTS 472
Cdd:pfam17823  200 SSAPATLTPARGISTAAtatgHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARR 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  473 RTLEQPRATLAPSETPFVPQKLEIfTSPEMQPTTPAP-QQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEP 551
Cdd:pfam17823  280 LSPAKHMPSDTMARNPAAPMGAQA-QGPIIQVSTDQPvHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462588866  552 TwTTPAPgktqfislKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGT 602
Cdd:pfam17823  359 S-ASPVP--------VLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA 400
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
639-707 5.15e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.42  E-value: 5.15e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588866  639 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 707
Cdd:PRK14950   362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
677-911 5.30e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 5.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  677 GSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtivpttdiepvtvrteatvttlAPKTSQRTRTRRPRPKHKT 756
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPA------------------------AAPAAAAAARAVAAAPARR 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  757 TPRPETLQTKLDFGPITPGTSSAPTtttkrtrrphPKPKTTPHPEVPqtklAPKQTPRAPPKPKTSPRPR---IPQTQPV 833
Cdd:PRK12323   427 SPAPEALAAARQASARGPGGAPAPA----------PAPAAAPAAAAR----PAAAGPRPVAAAAAAAPARaapAAAPAPA 492
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588866  834 PKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPvlqpvTFRFEPPKTTIAPLETRGIPFIPMISPSP 911
Cdd:PRK12323   493 DDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDD-----AFETLAPAPAAAPAPRAAAATEPVVAPRP 565
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
469-836 7.31e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.30  E-value: 7.31e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  469 PKTSRTLEQPRATLAP-----SETPFVPQKLEIFTSPE-----MQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSA 538
Cdd:PTZ00449   597 PKRPRSAQRPTRPKSPklpelLDIPKSPKRPESPKSPKrppppQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFY 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  539 GTITPKISKSPEpTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPepqtlLPSQSTIGPETPGTKPSTTLAPRktkrpgr 618
Cdd:PTZ00449   677 DDYLDAAAKSKE-TKTTVVLDESFESILKETLPETPGTPFTTPRP-----LPPKLPRDEEFPFEPIGDPDAEQ------- 743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  619 rprprPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPqTTAEPDMPP 698
Cdd:PTZ00449   744 -----PDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS--PSEHEDKP-PGDHPSLPK 815
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  699 TKSVSE-----PVPFETEAPSMTIVPTTdiEPVTVRTEATVTTLApktsqrtrtrrprpkhkttprpeTLQTKLDFGPIT 773
Cdd:PTZ00449   816 KRHRLDglalsTTDLESDAGRIAKDASG--KIVKLKRSKSFDDLT-----------------------TVEEAEEMGAEA 870
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462588866  774 PGTSSAPTTTTKRTRRPHPkPKTTPHPEVPQTKlaPKQTPRAPPKPKTSPRPRIPQTQPVPKV 836
Cdd:PTZ00449   871 RKIVVDDDGTEADDEDTHP-PEEKHKSEVRRRR--PPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 8.64e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.79  E-value: 8.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 2462588866  201 VK 202
Cdd:cd00063     74 VR 75
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
467-825 9.34e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.99  E-value: 9.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  467 IPPKTSRTLEQPRATLAP-SETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKI 545
Cdd:pfam03154  182 SPPSPPPPGTTQAATAGPtPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV 261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  546 SKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAprKTKRPGRRPRPRPR 625
Cdd:pfam03154  262 SPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI--HTPPSQSQLQSQQP 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  626 PKTTPSPEVPKSKPALEPATIQPEPLVPTTASKpsERPKTTHRPDAPQIQPGSKPPKQLlpKPQTTAEPDMPPTksvSEP 705
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSH--KHPPHLSGPSPFQMNSNLPPPPAL--KPLSSLSTHHPPS---AHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  706 VPFETEAPSMTIVPTTDIEPVTVRTEATVTTLA----PKTSQRTRTRRPRPKHKTTPRPETLQTKldfgPITPGTSSAPT 781
Cdd:pfam03154  413 PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAshppTSGLHQVPSQSPFPQHPFVPGGPPPITP----PSGPPTSTSSA 488
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  782 TTTKRTRRPHPKPKTTPHPEVPQTKLAPKQT----------PRAPPKPKTSPRP 825
Cdd:pfam03154  489 MPGIQPPSSASVSSSGPVPAAVSCPLPPVQIkeealdeaeePESPPPPPRSPSP 542
PRK11633 PRK11633
cell division protein DedD; Provisional
771-860 9.98e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.30  E-value: 9.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  771 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPqtkLAPKQTPRAPPKPKtsPRPRiPQTQPVPKVPQRVTAKPKTSPSP 850
Cdd:PRK11633    64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP---VEPEPAPVEPPKPK--PVEK-PKPKPKPQQKVEAPPAPKPEPKP 137
                           90
                   ....*....|
gi 2462588866  851 EVSyTTPAPK 860
Cdd:PRK11633   138 VVE-EKAAPT 146
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-830 1.09e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  597 PETPGTK----PSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPseRPKTTHRPDAP 672
Cdd:NF033839   284 PKEPGNKkpsaPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETP--KPEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  673 QIQPGSKPPKqllPKPQTTAEPDMPPTKSVSEPvpfETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKtsqrtrtrRPRP 752
Cdd:NF033839   362 KPEVKPQPEK---PKPEVKPQPETPKPEVKPQP---EKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ--------PEKP 427
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462588866  753 KHKTTPRPETLQTKLDFGPITPGTSSAPTTTTkrtrrphPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQT 830
Cdd:NF033839   428 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET-------PKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST 498
PRK10263 PRK10263
DNA translocase FtsK; Provisional
767-915 1.68e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.15  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  767 LDFGPITP--GTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSP-RPRIPQTQPV----PKVPQR 839
Cdd:PRK10263   736 LDDGPHEPlfTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPqQPVAPQPQYQqpqqPVAPQP 815
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462588866  840 VTAKPKTSPSPEVSYTTPAPkdvllPHKPYPEvsqsEPVLQPVTFRFEPPKTTIAPleTRGIPFIPMISPSPSQEE 915
Cdd:PRK10263   816 QYQQPQQPVAPQPQYQQPQQ-----PVAPQPQ----DTLLHPLLMRNGDSRPLHKP--TTPLPSLDLLTPPPSEVE 880
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
568-794 2.66e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 2.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  568 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 647
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  648 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 727
Cdd:PRK12323   439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462588866  728 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 794
Cdd:PRK12323   519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PHA03378 PHA03378
EBNA-3B; Provisional
638-848 2.78e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  638 KPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQ---PGSKPPKQ----LLPKPQTTAE-------------PDMP 697
Cdd:PHA03378   575 QPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQshiPETSAPRQwpmpLRPIPMRPLRmqpitfnvlvfptPHQP 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  698 PTKSVSEPVPFETEAPSMTIVPTT---------DIEPVTVRTEATVTT-LAPKTSQRTRTRRPRPKHKTTPRPETLQTKL 767
Cdd:PHA03378   655 PQVEITPYKPTWTQIGHIPYQPSPtgantmlpiQWAPGTMQPPPRAPTpMRPPAAPPGRAQRPAAATGRARPPAAAPGRA 734
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  768 DFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTS 847
Cdd:PHA03378   735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAA 814

                   .
gi 2462588866  848 P 848
Cdd:PHA03378   815 P 815
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
771-859 3.01e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.10  E-value: 3.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  771 PITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTklAPKQTPRAPPKPKTSPRPRiPQTQPVPKVPQRVTAKPKTSPSP 850
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPR-PVAPPVPHTPESAPKLTRAAIPV 438

                   ....*....
gi 2462588866  851 EVSYTTPAP 859
Cdd:PRK14950   439 DEKPKYTPP 447
PRK10263 PRK10263
DNA translocase FtsK; Provisional
540-774 3.25e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 3.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  540 TITPKIskSPEPTWTTPAPGKTQfislkpkiplsPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRR 619
Cdd:PRK10263   368 TGEPVI--APAPEGYPQQSQYAQ-----------PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  620 PRPRPRPKTTPSPEVPKSKPALEP-ATIQPEPLV--PTTASKPSERPKTTHRPDAPQIQPG------SKPP--------- 681
Cdd:PRK10263   435 APAPEQPVAGNAWQAEEQQSTFAPqSTYQTEQTYqqPAAQEPLYQQPQPVEQQPVVEPEPVveetkpARPPlyyfeevee 514
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  682 ------KQLLPKPQTTAEPDMPPtksvsEPVPFETEAPSMTIVPTTDIEPVTVRTEATV--TTLAPKTSQRTRTRRPRPK 753
Cdd:PRK10263   515 krarerEQLAAWYQPIPEPVKEP-----EPIKSSLKAPSVAAVPPVEAAAAVSPLASGVkkATLATGAAATVAAPVFSLA 589
                          250       260
                   ....*....|....*....|.
gi 2462588866  754 HKTTPRPetlQTKLDFGPITP 774
Cdd:PRK10263   590 NSGGPRP---QVKEGIGPQLP 607
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
790-860 3.59e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 3.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588866  790 PHPKPKTTPHPEVPqtklAPKqtpraPPKPKTSPRPRIPQTQ--------PVPKVPQRVTAKPktSPSPEVSYTTPAPK 860
Cdd:NF033838   418 EQPQPAPAPQPEKP----APK-----PEKPAEQPKAEKPADQqaeedyarRSEEEYNRLTQQQ--PPKTEKPAQPSTPK 485
PRK10905 PRK10905
cell division protein DamX; Validated
642-743 4.33e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 41.08  E-value: 4.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  642 EPATIQP---EPLVPTTASKPSERPKTTHRPDAPQIQPGSKppkqllpKPQTTAE-PDMPPTKSVSEPVPFETEAPSMTI 717
Cdd:PRK10905   126 EPATVAPvrnGNASRQTAKTQTAERPATTRPARKQAVIEPK-------KPQATAKtEPKPVAQTPKRTEPAAPVASTKAP 198
                           90       100
                   ....*....|....*....|....*.
gi 2462588866  718 VPTTDIEPVTVRTEATVTTLAPKTSQ 743
Cdd:PRK10905   199 AATSTPAPKETATTAPVQTASPAQTT 224
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
803-881 4.48e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.47  E-value: 4.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  803 PQTKLAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYT----TPAPKDVLLPHKPYPEVSQSEP 877
Cdd:PRK14954   376 NDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPASAPTpeqqPPVARSAPLPPSPQASAPRNVA 455

                   ....
gi 2462588866  878 VLQP 881
Cdd:PRK14954   456 SGKP 459
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
656-1070 4.65e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 4.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  656 ASKPSERPKTTHRPDAPQIQPGSkpPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPS--MTIVPTTDIEPVTVRTEAT 733
Cdd:PHA03307    17 GGEFFPRPPATPGDAADDLLSGS--QGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPgpGTEAPANESRSTPTWSLST 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  734 VTTLAPKTSQRTRTRRPRPKhKTTPRPETlqtkldfgPITPGTSSAPttttkrtrrPHPKPKTTPHPEVPQTKLAPKQTP 813
Cdd:PHA03307    95 LAPASPAREGSPTPPGPSSP-DPPPPTPP--------PASPPPSPAP---------DLSEMLRPVGSPGPPPAASPPAAG 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  814 RAPPKPKTSPRPRIPQTQPVPKVPQrvTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVtfrfePPKTTI 893
Cdd:PHA03307   157 ASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA-----PGRSAA 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  894 APLETRGIPfipmISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELAKTtqaphrfyttvrPRTSDKPHIRPvlnRTT 973
Cdd:PHA03307   230 DDAGASSSD----SSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWN------------GPSSRPGPASS---SSS 290
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  974 TRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGIIS 1053
Cdd:PHA03307   291 PRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPR 370
                          410
                   ....*....|....*..
gi 2462588866 1054 SGPITTPPLRSTPRPTG 1070
Cdd:PHA03307   371 PSRAPSSPAASAGRPTR 387
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
631-825 6.80e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 6.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  631 SPEVPKSKPALEPATIQPEPLVPTTASKPsERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVS-EPVPFE 709
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDgWPAKAG 674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  710 TEAPSMTIVPTTDIEPVTVRTEATvttlapktsqrtrtRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRR 789
Cdd:PRK07764   675 GAAPAAPPPAPAPAAPAAPAGAAP--------------AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2462588866  790 PHPKPKTTPHPEVPQTKLAPKQTPRAPPKPKTSPRP 825
Cdd:PRK07764   741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
790-921 7.06e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 7.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  790 PHPKPKTTPHPEVPQtklAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK--DVLLPHK 867
Cdd:PRK14951   373 AAPAEKKTPARPEAA---APAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAapAAVALAP 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462588866  868 PYPEVSQSEPVLQPVTFRFEPPKTTIApletrgipfiPMISPSPSQEELQTTLE 921
Cdd:PRK14951   450 APPAQAAPETVAIPVRVAPEPAVASAA----------PAPAAAPAAARLTPTEE 493
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
444-832 9.19e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.67  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  444 PTTSDEPEISDSYTATSDriLDSIPPKTSRTLEQPratLAPSETPF---VPQKLEIFTSPEMQPTTPAPQQTTSIPSTPK 520
Cdd:pfam05109  455 PTNLTAPASTGPTVSTAD--VTSPTPAGTTSGASP---VTPSPSPRdngTESKAPDMTSPTSAVTTPTPNATSPTPAVTT 529
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  521 RRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTqFISLKPKIPLSPEVTHTKPAPEPQTLLPS-QSTIGPET 599
Cdd:pfam05109  530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPTSAVTTPTPNATSPTVGETSpQANTTNHT 608
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  600 PGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKP---SERPktTHRPDAPQIQP 676
Cdd:pfam05109  609 LGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHP--TGGENITQVTP 686
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  677 GSKPPKQL-----LPKPQTTAEPDMPPTKSVS-EPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRP 750
Cdd:pfam05109  687 ASTSTHHVstsspAPRPGTTSQASGPGNSSTStKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKH 766
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588866  751 RPKH--KTTPRPETlqtklDFGpitpGTSSAPTTTTKRtrrphpkpkTTPHPEVPQTKLAPKQTPRAPP-KPKTSPRPRI 827
Cdd:pfam05109  767 TTGHgaRTSTEPTT-----DYG----GDSTTPRTRYNA---------TTYLPPSTSSKLRPRWTFTSPPvTTAQATVPVP 828

                   ....*
gi 2462588866  828 PQTQP 832
Cdd:pfam05109  829 PTSQP 833
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH