| Name | Accession | Description | Interval | E-value |
| PHA03307 super family | cl33723 | transcriptional regulator ICP4; Provisional | 517-949 | 1.27e-12 |
|
transcriptional regulator ICP4; Provisional. The actual alignment was detected with superfamily member PHA03307:
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 73.28 E-value: 1.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307 31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307 110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307 168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307 318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 148922288 912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307 393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1537-1628 | 4.90e-12 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 63.67 E-value: 4.90e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063 3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| PHA03307 super family | cl33723 | transcriptional regulator ICP4; Provisional | 809-1238 | 2.63e-09 |
|
transcriptional regulator ICP4; Provisional. The actual alignment was detected with superfamily member PHA03307:
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 62.50 E-value: 2.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307 158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307 232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307 300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307 346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307 411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 209-302 | 2.80e-09 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 55.97 E-value: 2.80e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063 2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 287 QGERDGKWSTSVFQRT 302
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 3-68 | 4.30e-06 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 46.45 E-value: 4.30e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288 3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060 17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
|
|
| PHA03378 super family | cl33729 | EBNA-3B; Provisional | 331-643 | 2.37e-03 |
|
EBNA-3B; Provisional. The actual alignment was detected with superfamily member PHA03378:
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.75 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378 424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378 504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378 583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378 663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288 578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378 741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 106-192 | 4.66e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 38.25 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
|
90
....*....|
gi 148922288 183 MNSQGRSQPV 192
Cdd:cd00063 77 VNGGGESPPS 86
|
|
|
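Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` and a query interval such as `517-949`. As a minimal sketch, assuming that layout holds for every hit, the fields could be extracted in Python with a regular expression. The names `HitSummary`, `parse_summary`, and `interval_length` are illustrative only and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the summary lines in this listing.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if the layout differs."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


hit = parse_summary(
    "Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 73.28 E-value: 1.27e-12"
)
```

Applied to the first hit, `interval_length("517-949")` gives 433 query residues. Sorting parsed hits by `e_value` would reproduce the order in which the tables list them.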
| Name | Accession | Description | Interval | E-value |
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 517-949 | 1.27e-12 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 73.28 E-value: 1.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307 31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307 110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307 168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307 318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 148922288 912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307 393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1537-1628 | 4.90e-12 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 63.67 E-value: 4.90e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063 3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 809-1238 | 2.63e-09 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 62.50 E-value: 2.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307 158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307 232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307 300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307 346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307 411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 209-302 | 2.80e-09 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 55.97 E-value: 2.80e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063 2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 287 QGERDGKWSTSVFQRT 302
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 1537-1618 | 6.82e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 6.82e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEGchSFVIVDWDKAtPGDVVTGYLV-YSASYEDfIRNKWSTQASSV--THLPIENLKPNTRYYFKVQAQN 1613
Cdd:smart00060 3 PPSNLRVTDVTS--TSVTLSWEPP-PDDGITGYIVgYRVEYRE-EGSEWKEVNVTPssTSYTLTGLKPGTEYEFRVRAVN 78
|
....*
gi 148922288 1614 PHGYG 1618
Cdd:smart00060 79 GAGEG 83
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 1537-1621 | 5.44e-07 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.95 E-value: 5.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDKATPGD-VVTGYLVYSASYEDFIRNKWSTQASSVTHLPIENLKPNTRYYFKVQAQNPH 1615
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPPDGNgPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79
|
....*.
gi 148922288 1616 GYGPIS 1621
Cdd:pfam00041 80 GEGPPS 85
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 209-295 | 1.23e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGELARWDYKQIAN--RRVLIENLIPDTVYEFAVRIS 286
Cdd:pfam00041 1 SAPSNLTVTDVTSTSLTVSW-TPPPDGNGPIT---GYEVEYRPKNSGEPWNEITVPGttTSVTLTGLKPGTEYEVRVQAV 76
|
....*....
gi 148922288 287 QGERDGKWS 295
Cdd:pfam00041 77 NGGGEGPPS 85
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 3-68 | 4.30e-06 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 46.45 E-value: 4.30e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288 3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060 17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 1531-1633 | 1.61e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 49.62 E-value: 1.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1531 DLPPQHAPRNITVVAVEGchSFVIVDWDKATPGDVvTGYLVYSASYEDFIRNKWSTQASSVTHLpIENLKPNTRYYFKVQ 1610
Cdd:COG3401 323 DLTPPAAPSGLTATAVGS--SSITLSWTASSDADV-TGYNVYRSTSGGGTYTKIAETVTTTSYT-DTGLTPGTTYYYKVT 398
|
90 100
....*....|....*....|....
gi 148922288 1611 AQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401 399 AVDAAGnESAPSEEVSATTASAAS 422
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 211-284 | 1.73e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 44.91 E-value: 1.73e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148922288 211 PDDISVRVMSSQSVLVSWVDPVLEKQKKvvasrqYTVRYREKGELARWDYKQI----ANRRVLIENLIPDTVYEFAVR 284
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSWEPPPDDGITG------YIVGYRVEYREEGSEWKEVnvtpSSTSYTLTGLKPGTEYEFRVR 75
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 3-75 | 2.23e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 44.79 E-value: 2.23e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288 3 LKVTWDPPKDATSrPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRPVYRAE 75
Cdd:cd00063 17 VTLSWTPPEDDGG-PITGYVVEYREkgSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVT 90
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 3-70 | 5.37e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 40.48 E-value: 5.37e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 3 LKVTWDPPKDAtSRPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRP 70
Cdd:pfam00041 16 LTVSWTPPPDG-NGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPP 84
|
|
| PHA03378 | PHA03378 | EBNA-3B; Provisional | 331-643 | 2.37e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.75 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378 424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378 504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378 583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378 663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288 578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378 741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 106-192 | 4.66e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 38.25 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
|
90
....*....|
gi 148922288 183 MNSQGRSQPV 192
Cdd:cd00063 77 VNGGGESPPS 86
|
|
| Treacle | pfam03546 | Treacher Collins syndrome protein Treacle; | 511-972 | 9.19e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 40.83 E-value: 9.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 511 TQGTSHRPSLPASLNDNDLVDSDEDERAVGSL---HPKGAFAQPRPALSPSRQSPSSvlrdrssvhpGAKPASPARRTPH 587
Cdd:pfam03546 13 TQAKAGKPEEDSESSSEEESDSEEETPAAKTPlqaKPSGKTPQVRAASAPAKESPRK----------GAPPVPPGKTGPA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 588 SGAA-----EEDSSASAPPSRlSPPHGGSSRLLPTQPHLSSPLSKGGK-DGEDAPATNSNAPSRSTMSSSVSSHLSSRTQ 661
Cdd:pfam03546 83 AAQAqagkpEEDSESSSEESD-SDGETPAAATLTTSPAQVKPLGKNSQvRPASTVGKGPSGKGANPAPPGKAGSAAPLVQ 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 662 VSEGAEASDG---ESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSS------ 732
Cdd:pfam03546 162 VGKKEEDSESsseESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSeessds 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 733 ---SPQSTVPSRAHPRVPS-HSDSHPKLSSGIHGDEEDEKPL---------------PATVVNDHVPSSSRQPisrgwED 793
Cdd:pfam03546 242 eeeAPAAATPAQAKPALKTpQTKASPRKGTPITPTSAKVPPVrvgtpapwkagtvtsPACASSPAVARGAQRP-----EE 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 794 LRRSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPK-AQPGSTDRHASPARPPAARSQ 872
Cdd:pfam03546 317 DSSSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAvAQVKAEAQEDSESSEEESDSE 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 873 QHPSVPRRMTP-GRAPEQQ--PPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGR-PRPTSQGRSHSSSDPYTASSRG- 947
Cdd:pfam03546 397 EAAATPAQVKAsGKTPQAKanPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKpPARTPQNSAISVRGQASVPAVGk 476
|
490 500
....*....|....*....|....*
gi 148922288 948 MLPTALQNQDEDAQGSYDDDSTEVE 972
Cdd:pfam03546 477 AVATAAQAQKGPVGGPQEEDSESSE 501
|
|
|
| Name | Accession | Description | Interval | E-value |
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 517-949 | 1.27e-12 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 73.28 E-value: 1.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307 31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307 110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307 168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307 318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 148922288 912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307 393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1537-1628 | 4.90e-12 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 63.67 E-value: 4.90e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063 3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 362-1015 | 2.05e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 66.50 E-value: 2.05e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 362 PSSPSPRAPASSQHPSVPAS---PQGRNAKDLLLDLKNKILANGGAPRKPQLRAKKAEELdlqsTEITGEEELGSRED-- 436
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAapdPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML----TWIRGLEELASDDAgd 2550
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 437 -----SPMSPSDTQDQKrtlRPPSRHGHSVVAPGRTAvRARMPALPRRegvdkpgfslATQPR-PGAPPSASASPAHHAS 510
Cdd:PHA03247 2551 pppplPPAAPPAAPDRS---VPPPRPAPRPSEPAVTS-RARRPDAPPQ----------SARPRaPVDDRGDPRGPAPPSP 2616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 511 TQGTSHRPSLPASLNdndlvDSDEDERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRS-SVHPGAKPASPARRTPHSG 589
Cdd:PHA03247 2617 LPPDTHAPDPPPPSP-----SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGrAAQASSPPQRPRRRAARPT 2691
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 590 AAEEDSSASAPPsrlsPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSvsshlssrTQVSEGAEAS 669
Cdd:PHA03247 2692 VGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA--------TPGGPARPAR 2759
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 670 DGESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTV--PSRAHPRVP 747
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppPTSAQPTAP 2839
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 748 SHSDSHPKLSSGIHGDEEDEKPL----PATVVNDHVPSSSRQPISR-GWEDLRRSPQrgaSLHRKEPIPENPKSTGADTH 822
Cdd:PHA03247 2840 PPPPGPPPPSLPLGGSVAPGGDVrrrpPSRSPAAKPAAPARPPVRRlARPAVSRSTE---SFALPPDQPERPPQPQAPPP 2916
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 823 PQgkysslaskaqdVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATsqhhp 902
Cdd:PHA03247 2917 PQ------------PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV----- 2979
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 903 gPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRS---HSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAp 979
Cdd:PHA03247 2980 -PQPAPSREAPASSTPPLTGHSLSRVSSWASSlalHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDP- 3057
|
650 660 670
....*....|....*....|....*....|....*.
gi 148922288 980 ahaarAKEAAASLPKHQQVESPTGAGAGGDHRSQRG 1015
Cdd:PHA03247 3058 -----LPPEPHDPFAHEPDPATPEAGARESPSSQFG 3088
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
696-1200 |
6.99e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.57 E-value: 6.99e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 696 RPASGHFhllrhkPFAANG-RSPsrfsiGRGPRLQPSSSPQSTVPSRA-HPRVPSHSDSHPKLSSGIHGDEE---DEKPL 770
Cdd:PHA03247 2482 RPAEARF------PFAAGAaPDP-----GGGGPPDPDAPPAPSRLAPAiLPDEPVGEPVHPRMLTWIRGLEElasDDAGD 2550
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 771 PATVVndhvPSSSRQPisrgwedlrrSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSlaskaqdvqqstdadTEGHSP 850
Cdd:PHA03247 2551 PPPPL----PPAAPPA----------APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQS---------------ARPRAP 2601
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 851 KAQPGSTDRHASPARPPAArsqqhPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAgrsPSQPRLSLTQagrpRPTS 930
Cdd:PHA03247 2602 VDDRGDPRGPAPPSPLPPD-----THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD---PAPGRVSRPR----RARR 2669
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 931 QGRSHSSSDPytasSRGMLPTALQnqdeDAQGSYDDDSTEVEAQDVRAPAHAarakeaaaslPKHQQVESPTGAGAGGDH 1010
Cdd:PHA03247 2670 LGRAAQASSP----PQRPRRRAAR----PTVGSLTSLADPPPPPPTPEPAPH----------ALVSATPLPPGPAAARQA 2731
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1011 RSQRGHA----ASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEeeedagffkggkedlLSSSVP 1086
Cdd:PHA03247 2732 SPALPAApappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS---------------LSESRE 2796
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1087 KWPSSSTPRggkdadgslakeerEPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRPPPRSAATVSPVAGTHPWPRY 1166
Cdd:PHA03247 2797 SLPSPWDPA--------------DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862
|
490 500 510
....*....|....*....|....*....|....
gi 148922288 1167 TTRAPPGHFSTTPMLSLRQRMMHARfRNPLSRQP 1200
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRST 2895
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
809-1238 |
2.63e-09 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 62.50 E-value: 2.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307 158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307 232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307 300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307 346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307 411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
209-302 |
2.80e-09 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 55.97 E-value: 2.80e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063 2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
|
90
....*....|....*.
gi 148922288 287 QGERDGKWSTSVFQRT 302
Cdd:cd00063 78 NGGGESPPSESVTVTT 93
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
684-1170 |
3.95e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 58.80 E-value: 3.95e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 684 RQAEATAQTLRARPASghfhllrhKPFAANGRSPsrfsigRGPRLQPSSSPQST-VPSRAHPRVPSHSDSHPKLSSGIHG 762
Cdd:PHA03247 2576 RPSEPAVTSRARRPDA--------PPQSARPRAP------VDDRGDPRGPAPPSpLPPDTHAPDPPPPSPSPAANEPDPH 2641
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 763 DEedeKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEP----IPENPKSTGADTHPQGKY----------- 827
Cdd:PHA03247 2642 PP---PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraarPTVGSLTSLADPPPPPPTpepaphalvsa 2718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 828 SSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPaarSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSR 907
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA---RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 908 DAGRSPSQPrlsltqagrPRPTSQGRSHSSSDPYTASSRGMLPtalqnqdedaqgsydddsteveaqdvraPAHAARAKE 987
Cdd:PHA03247 2796 ESLPSPWDP---------ADPPAAVLAPAAALPPAASPAGPLP----------------------------PPTSAQPTA 2838
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 988 AAASLPKHQQVESPTGAGAGGDHRSQRGHAASParPSRPGGPqSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEEE 1067
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP--AAKPAAP-ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP 2915
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1068 EDAGffkggKEDLLSSSVPKWPSSSTPRggkdADGSLAKEErEPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRPP 1147
Cdd:PHA03247 2916 PPQP-----QPQPPPPPQPQPPPPPPPR----PQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
|
490 500
....*....|....*....|...
gi 148922288 1148 PRSAATVSPVAGTHPWPRYTTRA 1170
Cdd:PHA03247 2986 REAPASSTPPLTGHSLSRVSSWA 3008
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1537-1618 |
6.82e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.46 E-value: 6.82e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEGchSFVIVDWDKAtPGDVVTGYLV-YSASYEDfIRNKWSTQASSV--THLPIENLKPNTRYYFKVQAQN 1613
Cdd:smart00060 3 PPSNLRVTDVTS--TSVTLSWEPP-PDDGITGYIVgYRVEYRE-EGSEWKEVNVTPssTSYTLTGLKPGTEYEFRVRAVN 78
|
....*
gi 148922288 1614 PHGYG 1618
Cdd:smart00060 79 GAGEG 83
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
357-929 |
2.21e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.49 E-value: 2.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 357 PEKPEPSSPSP-RAPASSQHPSVPASPQ-GRNAKDLLLDlknkilANGGAPRKPQLRakkaeeldlqsTEITGEEELGSR 434
Cdd:PHA03247 2484 AEARFPFAAGAaPDPGGGGPPDPDAPPApSRLAPAILPD------EPVGEPVHPRML-----------TWIRGLEELASD 2546
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 435 ED-------SPMSPSDTQDQKRtlrPPSRHGHSVVAPGRTAvRARMPALPRREgvdkpgfslaTQPR-PGAPPSASASPA 506
Cdd:PHA03247 2547 DAgdpppplPPAAPPAAPDRSV---PPPRPAPRPSEPAVTS-RARRPDAPPQS----------ARPRaPVDDRGDPRGPA 2612
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 507 HHASTQGTSHRPSLPASLNDndlvdSDEDERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRS-SVHPGAKPASPARRT 585
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPS-----PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGrAAQASSPPQRPRRRA 2687
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 586 PHSGAAEEDSSASAPPsrlsPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMsssvsshlssrTQVSEG 665
Cdd:PHA03247 2688 ARPTVGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-----------GPATPG 2752
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 666 AEASdgeshgdgdredggrqaeataqtlRARPASghfhllrhkPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHPR 745
Cdd:PHA03247 2753 GPAR------------------------PARPPT---------TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLP 2799
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 746 VPSHSDSHPKLSSgihgdeedekPLPATVVNDHVPSSSRQPISrgwedlrrSPQRGASLHRKEPIPEnPKSTGADTHPQG 825
Cdd:PHA03247 2800 SPWDPADPPAAVL----------APAAALPPAASPAGPLPPPT--------SAQPTAPPPPPGPPPP-SLPLGGSVAPGG 2860
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 826 KYSSLASKAQDVQQSTdadTEGHSPkaqpgsTDRHASPARPPAARSQ-QHPSVPRRMTPGRAPEQQPPPPVATSQHHPGP 904
Cdd:PHA03247 2861 DVRRRPPSRSPAAKPA---APARPP------VRRLARPAVSRSTESFaLPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
570 580
....*....|....*....|....*
gi 148922288 905 QSRDAGRSPSQPRLSLTQAGRPRPT 929
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPS 2956
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1537-1621 |
5.44e-07 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.95 E-value: 5.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDKATPGD-VVTGYLVYSASYEDFIRNKWSTQASSVTHLPIENLKPNTRYYFKVQAQNPH 1615
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPPDGNgPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79
|
....*.
gi 148922288 1616 GYGPIS 1621
Cdd:pfam00041 80 GEGPPS 85
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
209-295 |
1.23e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGELARWDYKQIAN--RRVLIENLIPDTVYEFAVRIS 286
Cdd:pfam00041 1 SAPSNLTVTDVTSTSLTVSW-TPPPDGNGPIT---GYEVEYRPKNSGEPWNEITVPGttTSVTLTGLKPGTEYEVRVQAV 76
|
....*....
gi 148922288 287 QGERDGKWS 295
Cdd:pfam00041 77 NGGGEGPPS 85
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
355-734 |
2.54e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.87 E-value: 2.54e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 355 GKPEKPEPSSPSPRAPassqhPSVPASPQGRNAKDLLLDLKnkilANGGAPRKPQLRAKKAEELDLQSTEITGEeelgSR 434
Cdd:PHA03307 104 GSPTPPGPSSPDPPPP-----TPPPASPPPSPAPDLSEMLR----PVGSPGPPPAASPPAAGASPAAVASDAAS----SR 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 435 EDSPMSPSdTQDQKRTLRPPSRHGHSVVAPGRTAVRARMPALPRREGVDKPGFSLATQPRPGAPPSASASPAHHASTQGT 514
Cdd:PHA03307 171 QAALPLSS-PEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGW 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 515 SHRPSLPASLNDNDLVDSDEDERAVGSLHPkgafaqPRPALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAEED 594
Cdd:PHA03307 250 GPENECPLPRPAPITLPTRIWEASGWNGPS------SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 595 SSASAPPSRLSPPHGGSSRllPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRTQVSEGAEASDGesh 674
Cdd:PHA03307 324 SSSSSTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRA--- 398
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 675 gdgdredggRQAEATAQTLRARPASGhfhllrhkPFAANGRSPSRFSigRGPRLQPSSSP 734
Cdd:PHA03307 399 ---------RRRDATGRFPAGRPRPS--------PLDAGAASGAFYA--RYPLLTPSGEP 439
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
3-68 |
4.30e-06 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 46.45 E-value: 4.30e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288 3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060 17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1531-1633 |
1.61e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 49.62 E-value: 1.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1531 DLPPQHAPRNITVVAVEGchSFVIVDWDKATPGDVvTGYLVYSASYEDFIRNKWSTQASSVTHLpIENLKPNTRYYFKVQ 1610
Cdd:COG3401 323 DLTPPAAPSGLTATAVGS--SSITLSWTASSDADV-TGYNVYRSTSGGGTYTKIAETVTTTSYT-DTGLTPGTTYYYKVT 398
|
90 100
....*....|....*....|....
gi 148922288 1611 AQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401 399 AVDAAGnESAPSEEVSATTASAAS 422
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
211-284 |
1.73e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 44.91 E-value: 1.73e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148922288 211 PDDISVRVMSSQSVLVSWVDPVLEKQKKvvasrqYTVRYREKGELARWDYKQI----ANRRVLIENLIPDTVYEFAVR 284
Cdd:smart00060 4 PSNLRVTDVTSTSVTLSWEPPPDDGITG------YIVGYRVEYREEGSEWKEVnvtpSSTSYTLTGLKPGTEYEFRVR 75
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
852-1051 |
2.16e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 2.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 852 AQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQ 931
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 932 GrshSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAAslPKHQQVESPTGAGAGGDHR 1011
Cdd:PRK07764 666 G---DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP--PQAAQGASAPSPAADDPVP 740
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 148922288 1012 SQRGHAASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRP 1051
Cdd:PRK07764 741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
3-75 |
2.23e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 44.79 E-value: 2.23e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288 3 LKVTWDPPKDATSrPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRPVYRAE 75
Cdd:cd00063 17 VTLSWTPPEDDGG-PITGYVVEYREkgSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVT 90
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
852-1207 |
4.35e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 48.44 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 852 AQPGSTDRHASPARPPAARSQQHPSVPrrmtpgrAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQ 931
Cdd:PRK07764 409 APAPAAAAPAAAAAPAPAAAPQPAPAP-------APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 932 GRSHSSSDPYTASSRGMLPTALQNQDEDAQ--GSYDDDSTEVEaQDVRAPAHAARAKEAAASLPKHQQV---ESPTGAGA 1006
Cdd:PRK07764 482 PAPPAAPAPAAAPAAPAAPAAPAGADDAATlrERWPEILAAVP-KRSRKTWAILLPEATVLGVRGDTLVlgfSTGGLARR 560
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1007 ------------------GGDHR--SQRGHAASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEE 1066
Cdd:PRK07764 561 faspgnaevlvtalaeelGGDWQveAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAS 640
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1067 EEDAGFFKGGKEdlLSSSVPKWPSSSTPRGGKDADGSLAKEEREPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRP 1146
Cdd:PRK07764 641 AAPAPGVAAPEH--HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1147 --PPRSAATVSPVAGT-------HPWPRYTTRAPPGHFSTTPMLSLRQRMMHARFRNPLSRQPARPSYRQ 1207
Cdd:PRK07764 719 aqPPQAAQGASAPSPAaddpvplPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
809-1019 |
2.04e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 46.13 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 809 PIPENPKSTGADTHPQGkYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PRK07764 601 PAPASSGPPEEAARPAA-PAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 889 QQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSRGMLPTALQNQDEDAQGSYDDds 968
Cdd:PRK07764 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ-- 757
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 148922288 969 teveAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGAGGDHRSQRGHAAS 1019
Cdd:PRK07764 758 ----PPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEV 804
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
798-979 |
2.65e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 45.75 E-value: 2.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 798 PQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTdADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSV 877
Cdd:PRK07764 627 PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP-AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 878 PRRMTPGRAPEQQP-PPPVATSQHHPGPQSRDAGRSPSQPRLsltQAGRPRPTSQGRSHSSSDPYTASSRGMLPTALqnq 956
Cdd:PRK07764 706 AATPPAGQADDPAAqPPQAAQGASAPSPAADDPVPLPPEPDD---PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP--- 779
|
170 180
....*....|....*....|...
gi 148922288 957 dEDAQGSYDDDSTEVEAQDVRAP 979
Cdd:PRK07764 780 -SEEEEMAEDDAPSMDDEDRRDA 801
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
3-70 |
5.37e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 40.48 E-value: 5.37e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 3 LKVTWDPPKDAtSRPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRP 70
Cdd:pfam00041 16 LTVSWTPPPDG-NGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPP 84
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1533-1633 |
8.60e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 44.22 E-value: 8.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1533 PPQhAPRNITVVAVEgcHSFVIVDWDKATPGDVvTGYLVYSASYEDfirNKWS----TQASSVTHlpiENLKPNTRYYFK 1608
Cdd:COG3401 232 PPS-APTGLTATADT--PGSVTLSWDPVTESDA-TGYRVYRSNSGD---GPFTkvatVTTTSYTD---TGLTNGTTYYYR 301
|
90 100
....*....|....*....|....*.
gi 148922288 1609 VQAQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401 302 VTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
536-912 |
9.04e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 9.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 536 ERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRSSVHPGAKPAsPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLL 615
Cdd:PRK07764 391 AGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPA-PAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 616 PTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRT---QVSEGAE------------------------- 667
Cdd:PRK07764 470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRErwpEILAAVPkrsrktwaillpeatvlgvrgdtlv 549
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 668 -----ASDGESHGDGDREDGGRQA--EATAQTLRARPASGHfhllrHKPFAANGRSPSRFSIGRGPRL-QPSSSPQSTVP 739
Cdd:PRK07764 550 lgfstGGLARRFASPGNAEVLVTAlaEELGGDWQVEAVVGP-----APGAAGGEGPPAPASSGPPEEAaRPAAPAAPAAP 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 740 SRAHPR-VPSHSDSHPKLSSGIHGDEED-EKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKST 817
Cdd:PRK07764 625 AAPAPAgAAAAPAEASAAPAPGVAAPEHhPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPA 704
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 818 GADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE--QQPPPPV 895
Cdd:PRK07764 705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPpsPPSEEEE 784
|
410
....*....|....*..
gi 148922288 896 ATSQHHPGPQSRDAGRS 912
Cdd:PRK07764 785 MAEDDAPSMDDEDRRDA 801
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
853-1056 |
9.59e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 44.10 E-value: 9.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 853 QPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPP--PPVATSQHHPGPQSRDAGRSPSQPRLSLTQAgrpRPTS 930
Cdd:PRK12323 364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPaaPAAAPAAAAAARAVAAAPARRSPAPEALAAA---RQAS 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 931 QGRSHSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESP---TGAGAG 1007
Cdd:PRK12323 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpDAAPAG 520
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 148922288 1008 GDHRSQRGHAASPARPSRPGGPQSR--ARVPSRAAPGKSEPPSKRPLSSKS 1056
Cdd:PRK12323 521 WVAESIPDPATADPDDAFETLAPAPaaAPAPRAAAATEPVVAPRPPRASAS 571
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
331-643 |
2.37e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.75 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378 424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378 504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378 583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378 663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288 578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378 741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
723-1158 |
2.49e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 2.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 723 GRGPRLQPSSSPQSTVPSRAHPRVPSHSDS---HPKLSSGIHGdEEDEKPLPAtvvNDHVPSS----SRQPisrgwedlr 795
Cdd:PTZ00449 513 GPEASGLPPKAPGDKEGEEGEHEDSKESDEpkeGGKPGETKEG-EVGKKPGPA---KEHKPSKiptlSKKP--------- 579
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 796 RSPQRGASLHRKEPiPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKaQPGSTDRHASPARPPAARSQQHP 875
Cdd:PTZ00449 580 EFPKDPKHPKDPEE-PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPK-RPPPPQRPSSPERPEGPKIIKSP 657
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 876 SVPRRMTPGRAPE----------QQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASS 945
Cdd:PTZ00449 658 KPPKSPKPPFDPKfkekfyddylDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIG 737
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 946 RgmlPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAkeaaaSLPKHQQVESPTGAGAGGDHRSQRGHAASPARP-S 1024
Cdd:PTZ00449 738 D---PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILA-----EEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPgD 809
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1025 RPGGPQSRARVPSRA--------APGK-SEPPSKRPLSSKSQQSVsaedeeeedagffkggkEDLLSSSVPKWPSSSTPR 1095
Cdd:PTZ00449 810 HPSLPKKRHRLDGLAlsttdlesDAGRiAKDASGKIVKLKRSKSF-----------------DDLTTVEEAEEMGAEARK 872
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288 1096 GGKDADGSLAK-EEREPaialaPRGGSLAPVKRPLPPPPGSSPRASHVPSRP-PPRSAATVSPVA 1158
Cdd:PTZ00449 873 IVVDDDGTEADdEDTHP-----PEEKHKSEVRRRRPPKKPSKPKKPSKPKKPkKPDSAFIPSIIA 932
|
|
| Pur_ac_phosph_N | pfam16656 | Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple ... | 1553-1628 | 2.50e-03 |
|
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple acid phosphatase proteins.
Pssm-ID: 465220 [Multi-domain] Cd Length: 93 Bit Score: 38.93 E-value: 2.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1553 VIVDWdkATPGDVVTGYLVYSASYEDfirNKWSTQASSVT------------HLPIENLKPNTRYYFKVQAQNphgyGPI 1620
Cdd:pfam16656 15 MTVSW--VTPSAVTSPVVQYGTSSSA---LTSTATATSSTyttgdggtgyihRATLTGLEPGTTYYYRVGDDN----GGW 85
|
....*...
gi 148922288 1621 SPSVSFVT 1628
Cdd:pfam16656 86 SEVYSFTT 93
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 734-904 | 2.60e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 2.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 734 PQSTVPSRAHPRVPSHSDShpklSSGIHGDEEDEKPlpatvvndhvPSSSRQPISRGWEDLRRSPQRgASLHRKEPIP-- 811
Cdd:PTZ00449 511 PEGPEASGLPPKAPGDKEG----EEGEHEDSKESDE----------PKEGGKPGETKEGEVGKKPGP-AKEHKPSKIPtl 575
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 812 -ENPKSTGADTHPQGKYSSLASKAQdvqQSTDADTEGHSPKaQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQ 890
Cdd:PTZ00449 576 sKKPEFPKDPKHPKDPEEPKKPKRP---RSAQRPTRPKSPK-LPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGP 651
|
170
....*....|....
gi 148922288 891 PPPPVATSQHHPGP 904
Cdd:PTZ00449 652 KIIKSPKPPKSPKP 665
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 991-1236 | 4.32e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 4.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 991 SLPKHQQVESPTGAGAGGdhRSQRGHA-ASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLssksqqsvsaedeeeed 1069
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTS--RARRPDApPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP----------------- 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1070 agffkggkedllsssvpkwPSSSTPRGGKDADGSLAKEEREPAIALAPRGGSLAPVKRPLPPPPGSspRASHVPSRPPPR 1149
Cdd:PHA03247 2628 -------------------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA--QASSPPQRPRRR 2686
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1150 SA-ATVSPV---AGTHPWPRYTTRAPPGHFSTTPMLSLRQRMMHARFRNPLSRQParpsyrqgyngRPNVEGKVLPGSNG 1225
Cdd:PHA03247 2687 AArPTVGSLtslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-----------PAVPAGPATPGGPA 2755
|
250
....*....|.
gi 148922288 1226 KPNGQRIINGP 1236
Cdd:PHA03247 2756 RPARPPTTAGP 2766
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 106-192 | 4.66e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between two strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 38.25 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
|
90
....*....|
gi 148922288 183 MNSQGRSQPV 192
Cdd:cd00063 77 VNGGGESPPS 86
|
|
| COG3979 | COG3979 | Chitodextrinase [Carbohydrate transport and metabolism]; | 1537-1650 | 5.24e-03 |
|
Chitodextrinase [Carbohydrate transport and metabolism];
Pssm-ID: 443178 [Multi-domain] Cd Length: 369 Bit Score: 41.30 E-value: 5.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEGchSFVIVDWDKATPGDVVTGYLVYSASyedfirNKWSTQASSVTHLpIENLKPNTRYYFKVQAQNPHG 1616
Cdd:COG3979 5 APTGLTASNVTS--SSVSLSWDASTDNVGVTGYDVYRGG------DQVATVTGLTAWT-VTGLTPGTEYTFTVGACDAAG 75
|
90 100 110
....*....|....*....|....*....|....
gi 148922288 1617 YGPISPSVSFVTESDNPLLVVRPPGGEPIWIPFA 1650
Cdd:COG3979 76 NVSAASGTSTAMFGGSSTTLGSAEGVADTSGNLA 109
|
|
| PRK07003 | PRK07003 | DNA polymerase III subunit gamma/tau; | 847-1039 | 5.39e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 41.76 E-value: 5.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 847 GHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSrdagrspSQPRLSLTQAGRP 926
Cdd:PRK07003 370 GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA-------PAPPATADRGDDA 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 927 RPTSQGRSHSSSDPytassrgmLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGA 1006
Cdd:PRK07003 443 ADGDAPVPAKANAR--------ASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA 514
|
170 180 190
....*....|....*....|....*....|...
gi 148922288 1007 GGDHRSQRGHAASPARPSRPGGPQSrARVPSRA 1039
Cdd:PRK07003 515 AASREDAPAAAAPPAPEARPPTPAA-AAPAARA 546
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 461-697 | 6.56e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.51 E-value: 6.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 461 VVAPGRTAVRARMPALPRREGVDKPGFSLATQPRPGappsasaspAHHASTQGTSHRPSLPASLNDNDLVDSDEDERAVG 540
Cdd:PRK07764 587 VVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPA---------APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 541 SLHPKGAFAQPRPALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPH 620
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 621 LSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRTQVSEGAEA--SDGESHGDGDREDGGRQA-EATAQTLRARP 697
Cdd:PRK07764 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMaeDDAPSMDDEDRRDAEEVAmELLEEELGAKK 817
|
|
| PRK12323 | PRK12323 | DNA polymerase III subunit gamma/tau; | 861-1042 | 7.05e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.01 E-value: 7.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 861 ASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGrSHSSSDP 940
Cdd:PRK12323 397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAG-PRPVAAA 475
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 941 YTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGAggdhrSQRGHAASP 1020
Cdd:PRK12323 476 AAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETL-----APAPAAAPA 550
|
170 180
....*....|....*....|..
gi 148922288 1021 ARPSRPGGPQSRARVPSRAAPG 1042
Cdd:PRK12323 551 PRAAAATEPVVAPRPPRASASG 572
|
|
| PHA03378 | PHA03378 | EBNA-3B; Provisional | 679-1058 | 7.92e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.21 E-value: 7.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 679 REDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRfsigrGPRLQPSSSPQSTVPSRAHPRVPSHSDSHPKLSS 758
Cdd:PHA03378 427 EEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTG-----PLSVQAPLEPWQPLPHPQVTPVILHQPPAQGVQA 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 759 giHGD-----EEDEKPLPATVVNDHVPSSSRQPisrgwedlrRSPQRGASLHRKE-PIPENPKSTGADTHPQGKYSSLAS 832
Cdd:PHA03378 502 --HGSmldllEKDDEDMEQRVMATLLPPSPPQP---------RAGRRAPCVYTEDlDIESDEPASTEPVHDQLLPAPGLG 570
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 833 KAQdvQQSTDADTEGHSPKAQPGSTDR-----HASP-ARPPAARSQqhpsVPRRMTPGRAPEQQPPPPVATSQHHPGPQS 906
Cdd:PHA03378 571 PLQ--IQPLTSPTTSQLASSAPSYAQTpwpvpHPSQtPEPPTTQSH----IPETSAPRQWPMPLRPIPMRPLRMQPITFN 644
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 907 RDAGRSPSQPrlSLTQAGRPRPTSQGRSHSSSDPYTASSRGMLPtaLQNQDEDAQgsydddsteveaQDVRAPAHAARAK 986
Cdd:PHA03378 645 VLVFPTPHQP--PQVEITPYKPTWTQIGHIPYQPSPTGANTMLP--IQWAPGTMQ------------PPPRAPTPMRPPA 708
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148922288 987 EAAASLPKHQQVESPTGAGAGGDHRSQRGhAASPARPSRPGGPQSRARVPSrAAPGKSEPPSKRPLSSKSQQ 1058
Cdd:PHA03378 709 APPGRAQRPAAATGRARPPAAAPGRARPP-AAAPGRARPPAAAPGRARPPA-AAPGRARPPAAAPGAPTPQP 778
|
|
| Treacle | pfam03546 | Treacher Collins syndrome protein Treacle; | 511-972 | 9.19e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 40.83 E-value: 9.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 511 TQGTSHRPSLPASLNDNDLVDSDEDERAVGSL---HPKGAFAQPRPALSPSRQSPSSvlrdrssvhpGAKPASPARRTPH 587
Cdd:pfam03546 13 TQAKAGKPEEDSESSSEEESDSEEETPAAKTPlqaKPSGKTPQVRAASAPAKESPRK----------GAPPVPPGKTGPA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 588 SGAA-----EEDSSASAPPSRlSPPHGGSSRLLPTQPHLSSPLSKGGK-DGEDAPATNSNAPSRSTMSSSVSSHLSSRTQ 661
Cdd:pfam03546 83 AAQAqagkpEEDSESSSEESD-SDGETPAAATLTTSPAQVKPLGKNSQvRPASTVGKGPSGKGANPAPPGKAGSAAPLVQ 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 662 VSEGAEASDG---ESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSS------ 732
Cdd:pfam03546 162 VGKKEEDSESsseESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSeessds 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 733 ---SPQSTVPSRAHPRVPS-HSDSHPKLSSGIHGDEEDEKPL---------------PATVVNDHVPSSSRQPisrgwED 793
Cdd:pfam03546 242 eeeAPAAATPAQAKPALKTpQTKASPRKGTPITPTSAKVPPVrvgtpapwkagtvtsPACASSPAVARGAQRP-----EE 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 794 LRRSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPK-AQPGSTDRHASPARPPAARSQ 872
Cdd:pfam03546 317 DSSSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAvAQVKAEAQEDSESSEEESDSE 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 873 QHPSVPRRMTP-GRAPEQQ--PPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGR-PRPTSQGRSHSSSDPYTASSRG- 947
Cdd:pfam03546 397 EAAATPAQVKAsGKTPQAKanPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKpPARTPQNSAISVRGQASVPAVGk 476
|
490 500
....*....|....*....|....*
gi 148922288 948 MLPTALQNQDEDAQGSYDDDSTEVE 972
Cdd:pfam03546 477 AVATAAQAQKGPVGGPQEEDSESSE 501
|
|
|