Conserved domains on  [gi|148922288|gb|AAI46784|]

FNDC1 protein [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
PHA03307 super family  cl33723  transcriptional regulator ICP4; Provisional  517-949  1.27e-12

The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 73.28  E-value: 1.27e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307   31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307  110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307  168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307  242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307  318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 148922288  912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307  393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
FN3  cd00063  Fibronectin type 3 domain  1537-1628  4.90e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 63.67  E-value: 4.90e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063     3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
PHA03307 super family  cl33723  transcriptional regulator ICP4; Provisional  809-1238  2.63e-09

The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.50  E-value: 2.63e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307   78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307  158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307  232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307  300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307  346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307  411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
FN3  cd00063  Fibronectin type 3 domain  209-302  2.80e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 55.97  E-value: 2.80e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063     2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288  287 QGERDGKWSTSVFQRT 302
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
FN3  smart00060  Fibronectin type 3 domain  3-68  4.30e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 46.45  E-value: 4.30e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288      3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060   17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
PHA03378 super family  cl33729  EBNA-3B; Provisional  331-643  2.37e-03

The actual alignment was detected with superfamily member PHA03378:

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378  424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378  504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378  583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378  663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288  578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378  741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
FN3  cd00063  Fibronectin type 3 domain  106-192  4.66e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.25  E-value: 4.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063     1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
                          90
                  ....*....|
gi 148922288  183 MNSQGRSQPV 192
Cdd:cd00063    77 VNGGGESPPS 86
 
Name Accession Description Interval E-value
PHA03307  PHA03307  transcriptional regulator ICP4; Provisional  517-949  1.27e-12

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 73.28  E-value: 1.27e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307   31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307  110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307  168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307  242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307  318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 148922288  912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307  393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
FN3  cd00063  Fibronectin type 3 domain  1537-1628  4.90e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 63.67  E-value: 4.90e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063     3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
PHA03307  PHA03307  transcriptional regulator ICP4; Provisional  809-1238  2.63e-09

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.50  E-value: 2.63e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307   78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307  158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307  232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307  300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307  346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307  411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
FN3  cd00063  Fibronectin type 3 domain  209-302  2.80e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 55.97  E-value: 2.80e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063     2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288  287 QGERDGKWSTSVFQRT 302
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
FN3  smart00060  Fibronectin type 3 domain  1537-1618  6.82e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.46  E-value: 6.82e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   1537 APRNITVVAVEGchSFVIVDWDKAtPGDVVTGYLV-YSASYEDfIRNKWSTQASSV--THLPIENLKPNTRYYFKVQAQN 1613
Cdd:smart00060    3 PPSNLRVTDVTS--TSVTLSWEPP-PDDGITGYIVgYRVEYRE-EGSEWKEVNVTPssTSYTLTGLKPGTEYEFRVRAVN 78

                    ....*
gi 148922288   1614 PHGYG 1618
Cdd:smart00060   79 GAGEG 83
fn3  pfam00041  Fibronectin type III domain;  1537-1621  5.44e-07

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.95  E-value: 5.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  1537 APRNITVVAVEgcHSFVIVDWDKATPGD-VVTGYLVYSASYEDFIRNKWSTQASSVTHLPIENLKPNTRYYFKVQAQNPH 1615
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPPDGNgPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79

                   ....*.
gi 148922288  1616 GYGPIS 1621
Cdd:pfam00041   80 GEGPPS 85
fn3  pfam00041  Fibronectin type III domain;  209-295  1.23e-06

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGELARWDYKQIAN--RRVLIENLIPDTVYEFAVRIS 286
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSW-TPPPDGNGPIT---GYEVEYRPKNSGEPWNEITVPGttTSVTLTGLKPGTEYEVRVQAV 76

                   ....*....
gi 148922288   287 QGERDGKWS 295
Cdd:pfam00041   77 NGGGEGPPS 85
FN3  smart00060  Fibronectin type 3 domain  3-68  4.30e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 46.45  E-value: 4.30e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288      3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060   17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
FN3  COG3401  Fibronectin type 3 domain [General function prediction only];  1531-1633  1.61e-05

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 49.62  E-value: 1.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1531 DLPPQHAPRNITVVAVEGchSFVIVDWDKATPGDVvTGYLVYSASYEDFIRNKWSTQASSVTHLpIENLKPNTRYYFKVQ 1610
Cdd:COG3401   323 DLTPPAAPSGLTATAVGS--SSITLSWTASSDADV-TGYNVYRSTSGGGTYTKIAETVTTTSYT-DTGLTPGTTYYYKVT 398
                          90       100
                  ....*....|....*....|....
gi 148922288 1611 AQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401   399 AVDAAGnESAPSEEVSATTASAAS 422
FN3  smart00060  Fibronectin type 3 domain  211-284  1.73e-05

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 44.91  E-value: 1.73e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148922288    211 PDDISVRVMSSQSVLVSWVDPVLEKQKKvvasrqYTVRYREKGELARWDYKQI----ANRRVLIENLIPDTVYEFAVR 284
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSWEPPPDDGITG------YIVGYRVEYREEGSEWKEVnvtpSSTSYTLTGLKPGTEYEFRVR 75
FN3  cd00063  Fibronectin type 3 domain  3-75  2.23e-05

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.79  E-value: 2.23e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288    3 LKVTWDPPKDATSrPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRPVYRAE 75
Cdd:cd00063    17 VTLSWTPPEDDGG-PITGYVVEYREkgSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVT 90
fn3  pfam00041  Fibronectin type III domain;  3-70  5.37e-04

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.37e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288     3 LKVTWDPPKDAtSRPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRP 70
Cdd:pfam00041   16 LTVSWTPPPDG-NGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPP 84
PHA03378  PHA03378  EBNA-3B; Provisional  331-643  2.37e-03

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378  424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378  504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378  583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378  663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288  578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378  741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
FN3  cd00063  Fibronectin type 3 domain  106-192  4.66e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.25  E-value: 4.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063     1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
                          90
                  ....*....|
gi 148922288  183 MNSQGRSQPV 192
Cdd:cd00063    77 VNGGGESPPS 86
Treacle  pfam03546  Treacher Collins syndrome protein Treacle;  511-972  9.19e-03

Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   511 TQGTSHRPSLPASLNDNDLVDSDEDERAVGSL---HPKGAFAQPRPALSPSRQSPSSvlrdrssvhpGAKPASPARRTPH 587
Cdd:pfam03546   13 TQAKAGKPEEDSESSSEEESDSEEETPAAKTPlqaKPSGKTPQVRAASAPAKESPRK----------GAPPVPPGKTGPA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   588 SGAA-----EEDSSASAPPSRlSPPHGGSSRLLPTQPHLSSPLSKGGK-DGEDAPATNSNAPSRSTMSSSVSSHLSSRTQ 661
Cdd:pfam03546   83 AAQAqagkpEEDSESSSEESD-SDGETPAAATLTTSPAQVKPLGKNSQvRPASTVGKGPSGKGANPAPPGKAGSAAPLVQ 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   662 VSEGAEASDG---ESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSS------ 732
Cdd:pfam03546  162 VGKKEEDSESsseESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSeessds 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   733 ---SPQSTVPSRAHPRVPS-HSDSHPKLSSGIHGDEEDEKPL---------------PATVVNDHVPSSSRQPisrgwED 793
Cdd:pfam03546  242 eeeAPAAATPAQAKPALKTpQTKASPRKGTPITPTSAKVPPVrvgtpapwkagtvtsPACASSPAVARGAQRP-----EE 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   794 LRRSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPK-AQPGSTDRHASPARPPAARSQ 872
Cdd:pfam03546  317 DSSSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAvAQVKAEAQEDSESSEEESDSE 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   873 QHPSVPRRMTP-GRAPEQQ--PPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGR-PRPTSQGRSHSSSDPYTASSRG- 947
Cdd:pfam03546  397 EAAATPAQVKAsGKTPQAKanPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKpPARTPQNSAISVRGQASVPAVGk 476
                          490       500
                   ....*....|....*....|....*
gi 148922288   948 MLPTALQNQDEDAQGSYDDDSTEVE 972
Cdd:pfam03546  477 AVATAAQAQKGPVGGPQEEDSESSE 501
 
Name Accession Description Interval E-value
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
517-949 1.27e-12

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 73.28  E-value: 1.27e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  517 RPSLPASLNDNDLVDSDEDERAVGSLHPKGAFAQPR----PALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAe 592
Cdd:PHA03307   31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEpptgPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP- 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  593 eDSSASAPPSRLSPPHGGSSrllPTQPHLSSPLSKGGKDGEDAPATNSNAPsrstmsssvsshlssrtqVSEGAEASDGE 672
Cdd:PHA03307  110 -GPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVGSPGPPPAASPPAAG------------------ASPAAVASDAA 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  673 SHGDGdredgGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHpRVPSHSDS 752
Cdd:PHA03307  168 SSRQA-----ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG-ASSSDSSS 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  753 HPKLSSGIHGDEEDEKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKSTGADTHPqgkySSLAS 832
Cdd:PHA03307  242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSS 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  833 KAQDVQQSTDADTEGHSPKAQPGStdrhaSPARPPA-ARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGR 911
Cdd:PHA03307  318 SSSSRESSSSSTSSSSESSRGAAV-----SPGPSPSrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 148922288  912 SPSQPRL-----SLTQAGRPRPTSQGRSHSSSDPYTASSRGML 949
Cdd:PHA03307  393 AVAGRARrrdatGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1537-1628 4.90e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 63.67  E-value: 4.90e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEgcHSFVIVDWDK-ATPGDVVTGYLVYsasYEDFIRNKW---STQASSVTHLPIENLKPNTRYYFKVQAQ 1612
Cdd:cd00063     3 PPTNLRVTDVT--STSVTLSWTPpEDDGGPITGYVVE---YREKGSGDWkevEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288 1613 NPHGYGPISPSVSFVT 1628
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
362-1015 2.05e-10

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 2.05e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  362 PSSPSPRAPASSQHPSVPAS---PQGRNAKDLLLDLKNKILANGGAPRKPQLRAKKAEELdlqsTEITGEEELGSRED-- 436
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAapdPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML----TWIRGLEELASDDAgd 2550
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  437 -----SPMSPSDTQDQKrtlRPPSRHGHSVVAPGRTAvRARMPALPRRegvdkpgfslATQPR-PGAPPSASASPAHHAS 510
Cdd:PHA03247 2551 pppplPPAAPPAAPDRS---VPPPRPAPRPSEPAVTS-RARRPDAPPQ----------SARPRaPVDDRGDPRGPAPPSP 2616
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  511 TQGTSHRPSLPASLNdndlvDSDEDERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRS-SVHPGAKPASPARRTPHSG 589
Cdd:PHA03247 2617 LPPDTHAPDPPPPSP-----SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGrAAQASSPPQRPRRRAARPT 2691
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  590 AAEEDSSASAPPsrlsPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSvsshlssrTQVSEGAEAS 669
Cdd:PHA03247 2692 VGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA--------TPGGPARPAR 2759
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  670 DGESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSSSPQSTV--PSRAHPRVP 747
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppPTSAQPTAP 2839
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  748 SHSDSHPKLSSGIHGDEEDEKPL----PATVVNDHVPSSSRQPISR-GWEDLRRSPQrgaSLHRKEPIPENPKSTGADTH 822
Cdd:PHA03247 2840 PPPPGPPPPSLPLGGSVAPGGDVrrrpPSRSPAAKPAAPARPPVRRlARPAVSRSTE---SFALPPDQPERPPQPQAPPP 2916
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  823 PQgkysslaskaqdVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATsqhhp 902
Cdd:PHA03247 2917 PQ------------PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV----- 2979
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  903 gPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRS---HSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAp 979
Cdd:PHA03247 2980 -PQPAPSREAPASSTPPLTGHSLSRVSSWASSlalHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDP- 3057
                         650       660       670
                  ....*....|....*....|....*....|....*.
gi 148922288  980 ahaarAKEAAASLPKHQQVESPTGAGAGGDHRSQRG 1015
Cdd:PHA03247 3058 -----LPPEPHDPFAHEPDPATPEAGARESPSSQFG 3088
PHA03247 PHA03247
large tegument protein UL36; Provisional
696-1200 6.99e-10

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 6.99e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  696 RPASGHFhllrhkPFAANG-RSPsrfsiGRGPRLQPSSSPQSTVPSRA-HPRVPSHSDSHPKLSSGIHGDEE---DEKPL 770
Cdd:PHA03247 2482 RPAEARF------PFAAGAaPDP-----GGGGPPDPDAPPAPSRLAPAiLPDEPVGEPVHPRMLTWIRGLEElasDDAGD 2550
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  771 PATVVndhvPSSSRQPisrgwedlrrSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSlaskaqdvqqstdadTEGHSP 850
Cdd:PHA03247 2551 PPPPL----PPAAPPA----------APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQS---------------ARPRAP 2601
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  851 KAQPGSTDRHASPARPPAArsqqhPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAgrsPSQPRLSLTQagrpRPTS 930
Cdd:PHA03247 2602 VDDRGDPRGPAPPSPLPPD-----THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD---PAPGRVSRPR----RARR 2669
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  931 QGRSHSSSDPytasSRGMLPTALQnqdeDAQGSYDDDSTEVEAQDVRAPAHAarakeaaaslPKHQQVESPTGAGAGGDH 1010
Cdd:PHA03247 2670 LGRAAQASSP----PQRPRRRAAR----PTVGSLTSLADPPPPPPTPEPAPH----------ALVSATPLPPGPAAARQA 2731
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1011 RSQRGHA----ASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEeeedagffkggkedlLSSSVP 1086
Cdd:PHA03247 2732 SPALPAApappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS---------------LSESRE 2796
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1087 KWPSSSTPRggkdadgslakeerEPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRPPPRSAATVSPVAGTHPWPRY 1166
Cdd:PHA03247 2797 SLPSPWDPA--------------DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862
                         490       500       510
                  ....*....|....*....|....*....|....
gi 148922288 1167 TTRAPPGHFSTTPMLSLRQRMMHARfRNPLSRQP 1200
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRST 2895
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
809-1238 2.63e-09

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.50  E-value: 2.63e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  809 PIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PHA03307   78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  889 qqPPPPVATSQHHPG---------PQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSrgmlPTALQNQDED 959
Cdd:PHA03307  158 --SPAAVASDAASSRqaalplsspEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA----PAPGRSAADD 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  960 AQGSYDDDSTEVEAQDVRAPAHAARakeaaasLPKHQQVESPT--GAGAGGDHRSQRghaASPARPSrpGGPQSRARVPS 1037
Cdd:PHA03307  232 AGASSSDSSSSESSGCGWGPENECP-------LPRPAPITLPTriWEASGWNGPSSR---PGPASSS--SSPRERSPSPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1038 RAAPGKSEPPSKRPLSSKSQQSvsaedeeeedagffkggkedllsssvpkwPSSSTPRGGKDADGSlakeeREPAIALAP 1117
Cdd:PHA03307  300 PSSPGSGPAPSSPRASSSSSSS-----------------------------RESSSSSTSSSSESS-----RGAAVSPGP 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1118 RGGSLAPVKRPLPPPPGSSPRashvpSRPPPRSAATVSPVAGTHPWPRYTTRAPPGhfsttpmlslRQRMMHARFRNPLS 1197
Cdd:PHA03307  346 SPSRSPSPSRPPPPADPSSPR-----KRPRPSRAPSSPAASAGRPTRRRARAAVAG----------RARRRDATGRFPAG 410
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 148922288 1198 RQPARPSYRQGYNGRPNVE-------GKVLPGSNGKPNGqRIINGPQG 1238
Cdd:PHA03307  411 RPRPSPLDAGAASGAFYARyplltpsGEPWPGSPPPPPG-RVRYGGLG 457
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
209-302 2.80e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 55.97  E-value: 2.80e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGE--LARWDYKQIANRRVLIENLIPDTVYEFAVRIS 286
Cdd:cd00063     2 SPPTNLRVTDVTSTSVTLSW-TPPEDDGGPIT---GYVVEYREKGSgdWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77
                          90
                  ....*....|....*.
gi 148922288  287 QGERDGKWSTSVFQRT 302
Cdd:cd00063    78 NGGGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
684-1170 3.95e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 3.95e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  684 RQAEATAQTLRARPASghfhllrhKPFAANGRSPsrfsigRGPRLQPSSSPQST-VPSRAHPRVPSHSDSHPKLSSGIHG 762
Cdd:PHA03247 2576 RPSEPAVTSRARRPDA--------PPQSARPRAP------VDDRGDPRGPAPPSpLPPDTHAPDPPPPSPSPAANEPDPH 2641
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  763 DEedeKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEP----IPENPKSTGADTHPQGKY----------- 827
Cdd:PHA03247 2642 PP---PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraarPTVGSLTSLADPPPPPPTpepaphalvsa 2718
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  828 SSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPaarSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSR 907
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA---RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  908 DAGRSPSQPrlsltqagrPRPTSQGRSHSSSDPYTASSRGMLPtalqnqdedaqgsydddsteveaqdvraPAHAARAKE 987
Cdd:PHA03247 2796 ESLPSPWDP---------ADPPAAVLAPAAALPPAASPAGPLP----------------------------PPTSAQPTA 2838
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  988 AAASLPKHQQVESPTGAGAGGDHRSQRGHAASParPSRPGGPqSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEEE 1067
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP--AAKPAAP-ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP 2915
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1068 EDAGffkggKEDLLSSSVPKWPSSSTPRggkdADGSLAKEErEPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRPP 1147
Cdd:PHA03247 2916 PPQP-----QPQPPPPPQPQPPPPPPPR----PQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         490       500
                  ....*....|....*....|...
gi 148922288 1148 PRSAATVSPVAGTHPWPRYTTRA 1170
Cdd:PHA03247 2986 REAPASSTPPLTGHSLSRVSSWA 3008
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1537-1618 6.82e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.46  E-value: 6.82e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   1537 APRNITVVAVEGchSFVIVDWDKAtPGDVVTGYLV-YSASYEDfIRNKWSTQASSV--THLPIENLKPNTRYYFKVQAQN 1613
Cdd:smart00060    3 PPSNLRVTDVTS--TSVTLSWEPP-PDDGITGYIVgYRVEYRE-EGSEWKEVNVTPssTSYTLTGLKPGTEYEFRVRAVN 78

                    ....*
gi 148922288   1614 PHGYG 1618
Cdd:smart00060   79 GAGEG 83
PHA03247 PHA03247
large tegument protein UL36; Provisional
357-929 2.21e-07

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.49  E-value: 2.21e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  357 PEKPEPSSPSP-RAPASSQHPSVPASPQ-GRNAKDLLLDlknkilANGGAPRKPQLRakkaeeldlqsTEITGEEELGSR 434
Cdd:PHA03247 2484 AEARFPFAAGAaPDPGGGGPPDPDAPPApSRLAPAILPD------EPVGEPVHPRML-----------TWIRGLEELASD 2546
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  435 ED-------SPMSPSDTQDQKRtlrPPSRHGHSVVAPGRTAvRARMPALPRREgvdkpgfslaTQPR-PGAPPSASASPA 506
Cdd:PHA03247 2547 DAgdpppplPPAAPPAAPDRSV---PPPRPAPRPSEPAVTS-RARRPDAPPQS----------ARPRaPVDDRGDPRGPA 2612
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  507 HHASTQGTSHRPSLPASLNDndlvdSDEDERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRS-SVHPGAKPASPARRT 585
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPS-----PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGrAAQASSPPQRPRRRA 2687
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  586 PHSGAAEEDSSASAPPsrlsPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMsssvsshlssrTQVSEG 665
Cdd:PHA03247 2688 ARPTVGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-----------GPATPG 2752
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  666 AEASdgeshgdgdredggrqaeataqtlRARPASghfhllrhkPFAANGRSPSRFSIGRGPRLQPSSSPQSTVPSRAHPR 745
Cdd:PHA03247 2753 GPAR------------------------PARPPT---------TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLP 2799
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  746 VPSHSDSHPKLSSgihgdeedekPLPATVVNDHVPSSSRQPISrgwedlrrSPQRGASLHRKEPIPEnPKSTGADTHPQG 825
Cdd:PHA03247 2800 SPWDPADPPAAVL----------APAAALPPAASPAGPLPPPT--------SAQPTAPPPPPGPPPP-SLPLGGSVAPGG 2860
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  826 KYSSLASKAQDVQQSTdadTEGHSPkaqpgsTDRHASPARPPAARSQ-QHPSVPRRMTPGRAPEQQPPPPVATSQHHPGP 904
Cdd:PHA03247 2861 DVRRRPPSRSPAAKPA---APARPP------VRRLARPAVSRSTESFaLPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
                         570       580
                  ....*....|....*....|....*
gi 148922288  905 QSRDAGRSPSQPRLSLTQAGRPRPT 929
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPS 2956
fn3 pfam00041
Fibronectin type III domain;
1537-1621 5.44e-07

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.95  E-value: 5.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  1537 APRNITVVAVEgcHSFVIVDWDKATPGD-VVTGYLVYSASYEDFIRNKWSTQASSVTHLPIENLKPNTRYYFKVQAQNPH 1615
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPPDGNgPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79

                   ....*.
gi 148922288  1616 GYGPIS 1621
Cdd:pfam00041   80 GEGPPS 85
fn3 pfam00041
Fibronectin type III domain;
209-295 1.23e-06

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   209 DVPDDISVRVMSSQSVLVSWvDPVLEKQKKVVasrQYTVRYREKGELARWDYKQIAN--RRVLIENLIPDTVYEFAVRIS 286
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSW-TPPPDGNGPIT---GYEVEYRPKNSGEPWNEITVPGttTSVTLTGLKPGTEYEVRVQAV 76

                   ....*....
gi 148922288   287 QGERDGKWS 295
Cdd:pfam00041   77 NGGGEGPPS 85
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
355-734 2.54e-06

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 2.54e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  355 GKPEKPEPSSPSPRAPassqhPSVPASPQGRNAKDLLLDLKnkilANGGAPRKPQLRAKKAEELDLQSTEITGEeelgSR 434
Cdd:PHA03307  104 GSPTPPGPSSPDPPPP-----TPPPASPPPSPAPDLSEMLR----PVGSPGPPPAASPPAAGASPAAVASDAAS----SR 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  435 EDSPMSPSdTQDQKRTLRPPSRHGHSVVAPGRTAVRARMPALPRREGVDKPGFSLATQPRPGAPPSASASPAHHASTQGT 514
Cdd:PHA03307  171 QAALPLSS-PEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGW 249
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  515 SHRPSLPASLNDNDLVDSDEDERAVGSLHPkgafaqPRPALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAEED 594
Cdd:PHA03307  250 GPENECPLPRPAPITLPTRIWEASGWNGPS------SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  595 SSASAPPSRLSPPHGGSSRllPTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRTQVSEGAEASDGesh 674
Cdd:PHA03307  324 SSSSSTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRA--- 398
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  675 gdgdredggRQAEATAQTLRARPASGhfhllrhkPFAANGRSPSRFSigRGPRLQPSSSP 734
Cdd:PHA03307  399 ---------RRRDATGRFPAGRPRPS--------PLDAGAASGAFYA--RYPLLTPSGEP 439
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
3-68 4.30e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 46.45  E-value: 4.30e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 148922288      3 LKVTWDPPKDATSR-PVEHYNIAYGKSLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVS 68
Cdd:smart00060   17 VTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1531-1633 1.61e-05

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 49.62  E-value: 1.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1531 DLPPQHAPRNITVVAVEGchSFVIVDWDKATPGDVvTGYLVYSASYEDFIRNKWSTQASSVTHLpIENLKPNTRYYFKVQ 1610
Cdd:COG3401   323 DLTPPAAPSGLTATAVGS--SSITLSWTASSDADV-TGYNVYRSTSGGGTYTKIAETVTTTSYT-DTGLTPGTTYYYKVT 398
                          90       100
                  ....*....|....*....|....
gi 148922288 1611 AQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401   399 AVDAAGnESAPSEEVSATTASAAS 422
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
211-284 1.73e-05

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 44.91  E-value: 1.73e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148922288    211 PDDISVRVMSSQSVLVSWVDPVLEKQKKvvasrqYTVRYREKGELARWDYKQI----ANRRVLIENLIPDTVYEFAVR 284
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSWEPPPDDGITG------YIVGYRVEYREEGSEWKEVnvtpSSTSYTLTGLKPGTEYEFRVR 75
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
852-1051 2.16e-05

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 2.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  852 AQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQ 931
Cdd:PRK07764  586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  932 GrshSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAAslPKHQQVESPTGAGAGGDHR 1011
Cdd:PRK07764  666 G---DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP--PQAAQGASAPSPAADDPVP 740
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 148922288 1012 SQRGHAASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRP 1051
Cdd:PRK07764  741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
3-75 2.23e-05

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.79  E-value: 2.23e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288    3 LKVTWDPPKDATSrPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRPVYRAE 75
Cdd:cd00063    17 VTLSWTPPEDDGG-PITGYVVEYREkgSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVT 90
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
852-1207 4.35e-05

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 4.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  852 AQPGSTDRHASPARPPAARSQQHPSVPrrmtpgrAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQ 931
Cdd:PRK07764  409 APAPAAAAPAAAAAPAPAAAPQPAPAP-------APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  932 GRSHSSSDPYTASSRGMLPTALQNQDEDAQ--GSYDDDSTEVEaQDVRAPAHAARAKEAAASLPKHQQV---ESPTGAGA 1006
Cdd:PRK07764  482 PAPPAAPAPAAAPAAPAAPAAPAGADDAATlrERWPEILAAVP-KRSRKTWAILLPEATVLGVRGDTLVlgfSTGGLARR 560
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1007 ------------------GGDHR--SQRGHAASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLSSKSQQSVSAEDEE 1066
Cdd:PRK07764  561 faspgnaevlvtalaeelGGDWQveAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAS 640
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1067 EEDAGFFKGGKEdlLSSSVPKWPSSSTPRGGKDADGSLAKEEREPAIALAPRGGSLAPVKRPLPPPPGSSPRASHVPSRP 1146
Cdd:PRK07764  641 AAPAPGVAAPEH--HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1147 --PPRSAATVSPVAGT-------HPWPRYTTRAPPGHFSTTPMLSLRQRMMHARFRNPLSRQPARPSYRQ 1207
Cdd:PRK07764  719 aqPPQAAQGASAPSPAaddpvplPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
809-1019 2.04e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 2.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  809 PIPENPKSTGADTHPQGkYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE 888
Cdd:PRK07764  601 PAPASSGPPEEAARPAA-PAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  889 QQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASSRGMLPTALQNQDEDAQGSYDDds 968
Cdd:PRK07764  680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ-- 757
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 148922288  969 teveAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGAGGDHRSQRGHAAS 1019
Cdd:PRK07764  758 ----PPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEV 804
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
798-979 2.65e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 2.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  798 PQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTdADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSV 877
Cdd:PRK07764  627 PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP-AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  878 PRRMTPGRAPEQQP-PPPVATSQHHPGPQSRDAGRSPSQPRLsltQAGRPRPTSQGRSHSSSDPYTASSRGMLPTALqnq 956
Cdd:PRK07764  706 AATPPAGQADDPAAqPPQAAQGASAPSPAADDPVPLPPEPDD---PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP--- 779
                         170       180
                  ....*....|....*....|...
gi 148922288  957 dEDAQGSYDDDSTEVEAQDVRAP 979
Cdd:PRK07764  780 -SEEEEMAEDDAPSMDDEDRRDA 801
fn3 pfam00041
Fibronectin type III domain;
3-70 5.37e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.37e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288     3 LKVTWDPPKDAtSRPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRP 70
Cdd:pfam00041   16 LTVSWTPPPDG-NGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPP 84
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1533-1633 8.60e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 8.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1533 PPQhAPRNITVVAVEgcHSFVIVDWDKATPGDVvTGYLVYSASYEDfirNKWS----TQASSVTHlpiENLKPNTRYYFK 1608
Cdd:COG3401   232 PPS-APTGLTATADT--PGSVTLSWDPVTESDA-TGYRVYRSNSGD---GPFTkvatVTTTSYTD---TGLTNGTTYYYR 301
                          90       100
                  ....*....|....*....|....*.
gi 148922288 1609 VQAQNPHG-YGPISPSVSFVTESDNP 1633
Cdd:COG3401   302 VTAVDAAGnESAPSNVVSVTTDLTPP 327
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
536-912 9.04e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 9.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  536 ERAVGSLHPKGAFAQPRPALSPSRQSPSSVLRDRSSVHPGAKPAsPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLL 615
Cdd:PRK07764  391 AGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPA-PAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  616 PTQPHLSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRT---QVSEGAE------------------------- 667
Cdd:PRK07764  470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRErwpEILAAVPkrsrktwaillpeatvlgvrgdtlv 549
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  668 -----ASDGESHGDGDREDGGRQA--EATAQTLRARPASGHfhllrHKPFAANGRSPSRFSIGRGPRL-QPSSSPQSTVP 739
Cdd:PRK07764  550 lgfstGGLARRFASPGNAEVLVTAlaEELGGDWQVEAVVGP-----APGAAGGEGPPAPASSGPPEEAaRPAAPAAPAAP 624
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  740 SRAHPR-VPSHSDSHPKLSSGIHGDEED-EKPLPATVVNDHVPSSSRQPISRGWEDLRRSPQRGASLHRKEPIPENPKST 817
Cdd:PRK07764  625 AAPAPAgAAAAPAEASAAPAPGVAAPEHhPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPA 704
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  818 GADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPE--QQPPPPV 895
Cdd:PRK07764  705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPpsPPSEEEE 784
                         410
                  ....*....|....*..
gi 148922288  896 ATSQHHPGPQSRDAGRS 912
Cdd:PRK07764  785 MAEDDAPSMDDEDRRDA 801
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
853-1056 9.59e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.10  E-value: 9.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  853 QPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPP--PPVATSQHHPGPQSRDAGRSPSQPRLSLTQAgrpRPTS 930
Cdd:PRK12323  364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPaaPAAAPAAAAAARAVAAAPARRSPAPEALAAA---RQAS 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  931 QGRSHSSSDPYTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESP---TGAGAG 1007
Cdd:PRK12323  441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpDAAPAG 520
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 148922288 1008 GDHRSQRGHAASPARPSRPGGPQSR--ARVPSRAAPGKSEPPSKRPLSSKS 1056
Cdd:PRK12323  521 WVAESIPDPATADPDDAFETLAPAPaaAPAPRAAAATEPVVAPRPPRASAS 571
PHA03378 PHA03378
EBNA-3B; Provisional
331-643 2.37e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  331 DALPETEGKVKASKADVQQNTEDNGKP-----EKPEPSSPSPRAPASSQHPSVPASP----------------QGRNAKD 389
Cdd:PHA03378  424 KAIEEEHRKKKAARTEQPRATPHSQAPtvvlhRPPTQPLEGPTGPLSVQAPLEPWQPlphpqvtpvilhqppaQGVQAHG 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  390 LLLDL--------KNKILANGGAPRKPQLRAKKA------EELDLQSTEITGEEE----------LGSREDSPMSpSDTQ 445
Cdd:PHA03378  504 SMLDLlekddedmEQRVMATLLPPSPPQPRAGRRapcvytEDLDIESDEPASTEPvhdqllpapgLGPLQIQPLT-SPTT 582
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  446 DQKRTLRP-----PSRHGHSVVAPGRTAVRARMPA--------LPRREGVDKP------GFSLATQPRPGAPPSASASPA 506
Cdd:PHA03378  583 SQLASSAPsyaqtPWPVPHPSQTPEPPTTQSHIPEtsaprqwpMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPY 662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  507 HHASTQgTSHRPSLPASLNDNDLV-------DSDEDERAVGSLHPKGAFAQP--RPALSPSRQSPSSVLRDRSSVHPGAk 577
Cdd:PHA03378  663 KPTWTQ-IGHIPYQPSPTGANTMLpiqwapgTMQPPPRAPTPMRPPAAPPGRaqRPAAATGRARPPAAAPGRARPPAAA- 740
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148922288  578 pasPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPHLSSPLSKGGKDGEDAPATNSNAP 643
Cdd:PHA03378  741 ---PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
723-1158 2.49e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 2.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  723 GRGPRLQPSSSPQSTVPSRAHPRVPSHSDS---HPKLSSGIHGdEEDEKPLPAtvvNDHVPSS----SRQPisrgwedlr 795
Cdd:PTZ00449  513 GPEASGLPPKAPGDKEGEEGEHEDSKESDEpkeGGKPGETKEG-EVGKKPGPA---KEHKPSKiptlSKKP--------- 579
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  796 RSPQRGASLHRKEPiPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPKaQPGSTDRHASPARPPAARSQQHP 875
Cdd:PTZ00449  580 EFPKDPKHPKDPEE-PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPK-RPPPPQRPSSPERPEGPKIIKSP 657
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  876 SVPRRMTPGRAPE----------QQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGRSHSSSDPYTASS 945
Cdd:PTZ00449  658 KPPKSPKPPFDPKfkekfyddylDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIG 737
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  946 RgmlPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAkeaaaSLPKHQQVESPTGAGAGGDHRSQRGHAASPARP-S 1024
Cdd:PTZ00449  738 D---PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILA-----EEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPgD 809
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1025 RPGGPQSRARVPSRA--------APGK-SEPPSKRPLSSKSQQSVsaedeeeedagffkggkEDLLSSSVPKWPSSSTPR 1095
Cdd:PTZ00449  810 HPSLPKKRHRLDGLAlsttdlesDAGRiAKDASGKIVKLKRSKSF-----------------DDLTTVEEAEEMGAEARK 872
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 148922288 1096 GGKDADGSLAK-EEREPaialaPRGGSLAPVKRPLPPPPGSSPRASHVPSRP-PPRSAATVSPVA 1158
Cdd:PTZ00449  873 IVVDDDGTEADdEDTHP-----PEEKHKSEVRRRRPPKKPSKPKKPSKPKKPkKPDSAFIPSIIA 932
Pur_ac_phosph_N pfam16656
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple ...
1553-1628 2.50e-03

Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple acid phosphatase proteins.


Pssm-ID: 465220 [Multi-domain]  Cd Length: 93  Bit Score: 38.93  E-value: 2.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  1553 VIVDWdkATPGDVVTGYLVYSASYEDfirNKWSTQASSVT------------HLPIENLKPNTRYYFKVQAQNphgyGPI 1620
Cdd:pfam16656   15 MTVSW--VTPSAVTSPVVQYGTSSSA---LTSTATATSSTyttgdggtgyihRATLTGLEPGTTYYYRVGDDN----GGW 85

                   ....*...
gi 148922288  1621 SPSVSFVT 1628
Cdd:pfam16656   86 SEVYSFTT 93
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
734-904 2.60e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 2.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  734 PQSTVPSRAHPRVPSHSDShpklSSGIHGDEEDEKPlpatvvndhvPSSSRQPISRGWEDLRRSPQRgASLHRKEPIP-- 811
Cdd:PTZ00449  511 PEGPEASGLPPKAPGDKEG----EEGEHEDSKESDE----------PKEGGKPGETKEGEVGKKPGP-AKEHKPSKIPtl 575
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  812 -ENPKSTGADTHPQGKYSSLASKAQdvqQSTDADTEGHSPKaQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQ 890
Cdd:PTZ00449  576 sKKPEFPKDPKHPKDPEEPKKPKRP---RSAQRPTRPKSPK-LPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGP 651
                         170
                  ....*....|....
gi 148922288  891 PPPPVATSQHHPGP 904
Cdd:PTZ00449  652 KIIKSPKPPKSPKP 665
PHA03247 PHA03247
large tegument protein UL36; Provisional
991-1236 4.32e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 4.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  991 SLPKHQQVESPTGAGAGGdhRSQRGHA-ASPARPSRPGGPQSRARVPSRAAPGKSEPPSKRPLssksqqsvsaedeeeed 1069
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTS--RARRPDApPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP----------------- 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1070 agffkggkedllsssvpkwPSSSTPRGGKDADGSLAKEEREPAIALAPRGGSLAPVKRPLPPPPGSspRASHVPSRPPPR 1149
Cdd:PHA03247 2628 -------------------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA--QASSPPQRPRRR 2686
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1150 SA-ATVSPV---AGTHPWPRYTTRAPPGHFSTTPMLSLRQRMMHARFRNPLSRQParpsyrqgyngRPNVEGKVLPGSNG 1225
Cdd:PHA03247 2687 AArPTVGSLtslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-----------PAVPAGPATPGGPA 2755
                         250
                  ....*....|.
gi 148922288 1226 KPNGQRIINGP 1236
Cdd:PHA03247 2756 RPARPPTTAGP 2766
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
106-192 4.66e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.25  E-value: 4.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  106 PNKPLRVRVRS-SDDRLSVAWKAPRLSGAksprRSRGFLLGYGESGRK--MNYVPLTRDERTHEIKKLASESVYVVSLQS 182
Cdd:cd00063     1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGG----PITGYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
                          90
                  ....*....|
gi 148922288  183 MNSQGRSQPV 192
Cdd:cd00063    77 VNGGGESPPS 86
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1537-1650 5.24e-03

Chitodextrinase [Carbohydrate transport and metabolism];


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 41.30  E-value: 5.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288 1537 APRNITVVAVEGchSFVIVDWDKATPGDVVTGYLVYSASyedfirNKWSTQASSVTHLpIENLKPNTRYYFKVQAQNPHG 1616
Cdd:COG3979     5 APTGLTASNVTS--SSVSLSWDASTDNVGVTGYDVYRGG------DQVATVTGLTAWT-VTGLTPGTEYTFTVGACDAAG 75
                          90       100       110
                  ....*....|....*....|....*....|....
gi 148922288 1617 YGPISPSVSFVTESDNPLLVVRPPGGEPIWIPFA 1650
Cdd:COG3979    76 NVSAASGTSTAMFGGSSTTLGSAEGVADTSGNLA 109
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
847-1039 5.39e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 5.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  847 GHSPKAQPGSTDRHASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSrdagrspSQPRLSLTQAGRP 926
Cdd:PRK07003  370 GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA-------PAPPATADRGDDA 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  927 RPTSQGRSHSSSDPytassrgmLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGA 1006
Cdd:PRK07003  443 ADGDAPVPAKANAR--------ASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA 514
                         170       180       190
                  ....*....|....*....|....*....|...
gi 148922288 1007 GGDHRSQRGHAASPARPSRPGGPQSrARVPSRA 1039
Cdd:PRK07003  515 AASREDAPAAAAPPAPEARPPTPAA-AAPAARA 546
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
461-697 6.56e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 6.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  461 VVAPGRTAVRARMPALPRREGVDKPGFSLATQPRPGappsasaspAHHASTQGTSHRPSLPASLNDNDLVDSDEDERAVG 540
Cdd:PRK07764  587 VVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPA---------APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  541 SLHPKGAFAQPRPALSPSRQSPSSVLRDRSSVHPGAKPASPARRTPHSGAAEEDSSASAPPSRLSPPHGGSSRLLPTQPH 620
Cdd:PRK07764  658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  621 LSSPLSKGGKDGEDAPATNSNAPSRSTMSSSVSSHLSSRTQVSEGAEA--SDGESHGDGDREDGGRQA-EATAQTLRARP 697
Cdd:PRK07764  738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMaeDDAPSMDDEDRRDAEEVAmELLEEELGAKK 817
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
861-1042 7.05e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  861 ASPARPPAARSQQHPSVPRRMTPGRAPEQQPPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGRPRPTSQGrSHSSSDP 940
Cdd:PRK12323  397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAG-PRPVAAA 475
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  941 YTASSRGMLPTALQNQDEDAQGSYDDDSTEVEAQDVRAPAHAARAKEAAASLPKHQQVESPTGAGAggdhrSQRGHAASP 1020
Cdd:PRK12323  476 AAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETL-----APAPAAAPA 550
                         170       180
                  ....*....|....*....|..
gi 148922288 1021 ARPSRPGGPQSRARVPSRAAPG 1042
Cdd:PRK12323  551 PRAAAATEPVVAPRPPRASASG 572
PHA03378 PHA03378
EBNA-3B; Provisional
679-1058 7.92e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.21  E-value: 7.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  679 REDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRfsigrGPRLQPSSSPQSTVPSRAHPRVPSHSDSHPKLSS 758
Cdd:PHA03378  427 EEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTG-----PLSVQAPLEPWQPLPHPQVTPVILHQPPAQGVQA 501
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  759 giHGD-----EEDEKPLPATVVNDHVPSSSRQPisrgwedlrRSPQRGASLHRKE-PIPENPKSTGADTHPQGKYSSLAS 832
Cdd:PHA03378  502 --HGSmldllEKDDEDMEQRVMATLLPPSPPQP---------RAGRRAPCVYTEDlDIESDEPASTEPVHDQLLPAPGLG 570
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  833 KAQdvQQSTDADTEGHSPKAQPGSTDR-----HASP-ARPPAARSQqhpsVPRRMTPGRAPEQQPPPPVATSQHHPGPQS 906
Cdd:PHA03378  571 PLQ--IQPLTSPTTSQLASSAPSYAQTpwpvpHPSQtPEPPTTQSH----IPETSAPRQWPMPLRPIPMRPLRMQPITFN 644
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288  907 RDAGRSPSQPrlSLTQAGRPRPTSQGRSHSSSDPYTASSRGMLPtaLQNQDEDAQgsydddsteveaQDVRAPAHAARAK 986
Cdd:PHA03378  645 VLVFPTPHQP--PQVEITPYKPTWTQIGHIPYQPSPTGANTMLP--IQWAPGTMQ------------PPPRAPTPMRPPA 708
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148922288  987 EAAASLPKHQQVESPTGAGAGGDHRSQRGhAASPARPSRPGGPQSRARVPSrAAPGKSEPPSKRPLSSKSQQ 1058
Cdd:PHA03378  709 APPGRAQRPAAATGRARPPAAAPGRARPP-AAAPGRARPPAAAPGRARPPA-AAPGRARPPAAAPGAPTPQP 778
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
511-972 9.19e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   511 TQGTSHRPSLPASLNDNDLVDSDEDERAVGSL---HPKGAFAQPRPALSPSRQSPSSvlrdrssvhpGAKPASPARRTPH 587
Cdd:pfam03546   13 TQAKAGKPEEDSESSSEEESDSEEETPAAKTPlqaKPSGKTPQVRAASAPAKESPRK----------GAPPVPPGKTGPA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   588 SGAA-----EEDSSASAPPSRlSPPHGGSSRLLPTQPHLSSPLSKGGK-DGEDAPATNSNAPSRSTMSSSVSSHLSSRTQ 661
Cdd:pfam03546   83 AAQAqagkpEEDSESSSEESD-SDGETPAAATLTTSPAQVKPLGKNSQvRPASTVGKGPSGKGANPAPPGKAGSAAPLVQ 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   662 VSEGAEASDG---ESHGDGDREDGGRQAEATAQTLRARPASGHFHLLRHKPFAANGRSPSRFSIGRGPRLQPSS------ 732
Cdd:pfam03546  162 VGKKEEDSESsseESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSeessds 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   733 ---SPQSTVPSRAHPRVPS-HSDSHPKLSSGIHGDEEDEKPL---------------PATVVNDHVPSSSRQPisrgwED 793
Cdd:pfam03546  242 eeeAPAAATPAQAKPALKTpQTKASPRKGTPITPTSAKVPPVrvgtpapwkagtvtsPACASSPAVARGAQRP-----EE 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   794 LRRSPQRGASLHRKEPIPENPKSTGADTHPQGKYSSLASKAQDVQQSTDADTEGHSPK-AQPGSTDRHASPARPPAARSQ 872
Cdd:pfam03546  317 DSSSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAvAQVKAEAQEDSESSEEESDSE 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148922288   873 QHPSVPRRMTP-GRAPEQQ--PPPPVATSQHHPGPQSRDAGRSPSQPRLSLTQAGR-PRPTSQGRSHSSSDPYTASSRG- 947
Cdd:pfam03546  397 EAAATPAQVKAsGKTPQAKanPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKpPARTPQNSAISVRGQASVPAVGk 476
                          490       500
                   ....*....|....*....|....*
gi 148922288   948 MLPTALQNQDEDAQGSYDDDSTEVE 972
Cdd:pfam03546  477 AVATAAQAQKGPVGGPQEEDSESSE 501
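The paired rows in the alignments above (query `gi 148922288` over a `Cdd:` model row) can be compared column by column. The following is a minimal Python sketch, not part of the CD-Search output. It assumes the usual display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and `-` marks a gap. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, subject_row):
    """Count identical and aligned columns for one pair of alignment rows.

    Assumption: only columns where BOTH rows carry an uppercase letter
    count as aligned; lowercase residues and '-' gaps are skipped.
    Returns a tuple (identical_columns, aligned_columns).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Sequence strings copied from the pfam00041 hit (query residues 3-70).
query = "LKVTWDPPKDAtSRPVEHYNIAYGK--SLKSLKYIKVNAETYSFLIEDVEPGVVYFVLLTAENHSGVSRP"
model = "LTVSWTPPPDG-NGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPP"
identical, aligned = aligned_identity(query, model)
print(f"{identical} identical of {aligned} aligned columns")
```

Only the residue strings are passed in; the row labels and start/end coordinates must be stripped first. For hits that span several 80-column blocks, the per-block rows would be concatenated before the call.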
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
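
Each hit in the list carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. As an illustration only, the Python sketch below parses such lines and keeps the hits at or below an E-value cutoff. The names `parse_summary` and `hits_below` are hypothetical, and treating the 0.01 threshold as inclusive is an assumption rather than documented CD-Search behavior.

```python
import re

# Matches a per-hit summary line such as:
# "Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.79  E-value: 2.23e-05"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def hits_below(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]


# Two summary lines copied from the hit list, plus one unrelated line.
sample = [
    "Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.79  E-value: 2.23e-05",
    "DNA polymerase III subunits gamma and tau; Validated",
    "Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.19e-03",
]
for hit in hits_below(sample, threshold=1e-3):
    print(hit["pssm_id"], hit["e_value"])
```

With the stricter `1e-3` cutoff used in the usage lines, only the first sample hit is retained; at the page's own 0.01 threshold both parsed hits pass.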

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.