NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1344544774|ref|NP_001347810|]
View 

transcription elongation regulator 1 isoform 3 [Mus musculus]

Protein Classification

WW domain-containing protein( domain architecture ID 13629023)

WW domain-containing protein; the WW domain mediates protein-protein interaction via proline-rich motifs, such as PPxY

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRP40 super family cl34905
Splicing factor [RNA processing and modification];
402-999 1.51e-25

Splicing factor [RNA processing and modification];


The actual alignment was detected with superfamily member COG5104:

Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 112.87  E-value: 1.51e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  402 ASPATLAGATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQEL--KEKEKLDEkikepikeaseeplpmeteeedpkeep 479
Cdd:COG5104      3 AALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPKELlkGSEEDLDV--------------------------- 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  480 vkeikeepkeeemteeekaaqkakpvattpipgTPWCVVWTGDERVFFYNPTTRLSMWDRPDDligRADVDKIIQEpphK 559
Cdd:COG5104     56 ---------------------------------DPWKECRTADGKVYYYNSITRESRWKIPPE---RKKVEPIAEQ---K 96
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  560 KGLEDMKKLRHPAPTMLSIQKWQFSmsaiKEEQELMEEMNEDEPIKAKKRKrddnkdidsekeaameaeikaareraivP 639
Cdd:COG5104     97 HDERSMIGGNGNDMAITDHETSEPK----YLLGRLMSQYGITSTKDAVYRL----------------------------T 144
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  640 LEARMKQFKDMLLERGVSAFSTWEKELHKIVfDPRYLLL--NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMM 717
Cdd:COG5104    145 KEEAEKEFITMLKENQVDSTWPIFRAIEELR-DPRYWMVdtDPLWRKDLFKKYFENQEKDQREEEENKQRKYINEFCKML 223
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  718 E-EAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWS 796
Cdd:COG5104    224 AgNSHIKYYTDWFTFKSIFSKHPYYSSVVNEKTKRQTFQKYKDKLGCYEKYVGKHMGGTALGRLEEVLRSLGSETFIIWL 303
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  797 KVKDKVESDPRYKAvdSSSM----REDLFKQYIeKIAKNLdsekekelerqarieaslrerEREVQKARSEQTKEIDReR 872
Cdd:COG5104    304 LNHYVFDSVVRYLK--NKEMkpldRKDILFSFI-RYVRRL---------------------EKELLSAIEERKAAAAQ-N 358
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  873 EQHKREeaiqNFKALLSDMVRSSDVS----WSDTRRTLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDE 948
Cdd:COG5104    359 ARHHRD----EFRTLLRKLYSEGKIYyrmkWKNAYPLIKDDPRFLNLLGRTGSSPLDLFFDFIVDLENMYGFARRSYERE 434
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774  949 TSaITLTSTW--KEVKKIIKEDPRciKFSSSDRKKQREFEE---YIRDKYITAKAD 999
Cdd:COG5104    435 TR-TGQISPTdrRAVDEIFEAIAE--KKEEGEIKFDKVDKEdisLIVDGLIKQRNE 487
WW pfam00397
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ...
137-162 6.02e-08

WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.


:

Pssm-ID: 459800 [Multi-domain]  Cd Length: 30  Bit Score: 49.43  E-value: 6.02e-08
                           10        20
                   ....*....|....*....|....*.
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKP 162
Cdd:pfam00397    5 WEERWDPDGRVYYYNHETGETQWEKP 30
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
258-346 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  258 VGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTtatsvAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTaTPVQTVPQP 337
Cdd:pfam05109  513 VTTPTPNATSPTPAVTTPTPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKT-SPTSAVTTP 586

                   ....*....
gi 1344544774  338 HPQTLPPAV 346
Cdd:pfam05109  587 TPNATSPTV 595
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
996-1055 2.38e-04

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


:

Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 39.75  E-value: 2.38e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  996 AKADFRTLLKETKfITYRSkkliqesdqHLKDVEKILQNDKRYLVLDcVPEERRKLIVAY 1055
Cdd:pfam01846    2 AREAFKELLKEHK-ITPYS---------TWSEIKKKIENDPRYKALL-DGSEREELFEDY 50
 
Name Accession Description Interval E-value
PRP40 COG5104
Splicing factor [RNA processing and modification];
402-999 1.51e-25

Splicing factor [RNA processing and modification];


Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 112.87  E-value: 1.51e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  402 ASPATLAGATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQEL--KEKEKLDEkikepikeaseeplpmeteeedpkeep 479
Cdd:COG5104      3 AALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPKELlkGSEEDLDV--------------------------- 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  480 vkeikeepkeeemteeekaaqkakpvattpipgTPWCVVWTGDERVFFYNPTTRLSMWDRPDDligRADVDKIIQEpphK 559
Cdd:COG5104     56 ---------------------------------DPWKECRTADGKVYYYNSITRESRWKIPPE---RKKVEPIAEQ---K 96
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  560 KGLEDMKKLRHPAPTMLSIQKWQFSmsaiKEEQELMEEMNEDEPIKAKKRKrddnkdidsekeaameaeikaareraivP 639
Cdd:COG5104     97 HDERSMIGGNGNDMAITDHETSEPK----YLLGRLMSQYGITSTKDAVYRL----------------------------T 144
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  640 LEARMKQFKDMLLERGVSAFSTWEKELHKIVfDPRYLLL--NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMM 717
Cdd:COG5104    145 KEEAEKEFITMLKENQVDSTWPIFRAIEELR-DPRYWMVdtDPLWRKDLFKKYFENQEKDQREEEENKQRKYINEFCKML 223
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  718 E-EAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWS 796
Cdd:COG5104    224 AgNSHIKYYTDWFTFKSIFSKHPYYSSVVNEKTKRQTFQKYKDKLGCYEKYVGKHMGGTALGRLEEVLRSLGSETFIIWL 303
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  797 KVKDKVESDPRYKAvdSSSM----REDLFKQYIeKIAKNLdsekekelerqarieaslrerEREVQKARSEQTKEIDReR 872
Cdd:COG5104    304 LNHYVFDSVVRYLK--NKEMkpldRKDILFSFI-RYVRRL---------------------EKELLSAIEERKAAAAQ-N 358
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  873 EQHKREeaiqNFKALLSDMVRSSDVS----WSDTRRTLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDE 948
Cdd:COG5104    359 ARHHRD----EFRTLLRKLYSEGKIYyrmkWKNAYPLIKDDPRFLNLLGRTGSSPLDLFFDFIVDLENMYGFARRSYERE 434
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774  949 TSaITLTSTW--KEVKKIIKEDPRciKFSSSDRKKQREFEE---YIRDKYITAKAD 999
Cdd:COG5104    435 TR-TGQISPTdrRAVDEIFEAIAE--KKEEGEIKFDKVDKEdisLIVDGLIKQRNE 487
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
775-824 1.55e-13

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 65.94  E-value: 1.55e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1344544774  775 KIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQY 824
Cdd:pfam01846    1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
936-991 1.23e-09

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 54.89  E-value: 1.23e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774   936 KKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCiKFSSSDRKKQREFEEYIRD 991
Cdd:smart00441    1 EEAKEAFKELLKEHEVITPDTTWSEARKKLKNDPRY-KALLSESEREQLFEDHIEE 55
WW cd00201
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ...
414-443 1.04e-08

Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.


Pssm-ID: 238122 [Multi-domain]  Cd Length: 31  Bit Score: 51.76  E-value: 1.04e-08
                           10        20        30
                   ....*....|....*....|....*....|
gi 1344544774  414 SEWTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:cd00201      2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
WW pfam00397
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ...
137-162 6.02e-08

WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.


Pssm-ID: 459800 [Multi-domain]  Cd Length: 30  Bit Score: 49.43  E-value: 6.02e-08
                           10        20
                   ....*....|....*....|....*.
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKP 162
Cdd:pfam00397    5 WEERWDPDGRVYYYNHETGETQWEKP 30
WW smart00456
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ...
132-164 6.59e-08

Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.


Pssm-ID: 197736 [Multi-domain]  Cd Length: 33  Bit Score: 49.52  E-value: 6.59e-08
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1344544774   132 PTEEIWVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:smart00456    1 PLPPGWEERKDPDGRPYYYNHETKETQWEKPRE 33
WW cd00201
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ...
137-164 1.70e-07

Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.


Pssm-ID: 238122 [Multi-domain]  Cd Length: 31  Bit Score: 48.29  E-value: 1.70e-07
                           10        20
                   ....*....|....*....|....*...
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:cd00201      4 WEERWDPDGRVYYYNHNTKETQWEDPRE 31
PTZ00121 PTZ00121
MAEBL; Provisional
590-1031 2.12e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.53  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  590 EEQELMEEMNEDEPIKAKKRKRDDNKDIDSEKEAAMEAEIKAARERAIVPLEARMKQFKDMLLErgvsafstwekelhki 669
Cdd:PTZ00121  1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE---------------- 1409
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  670 vfdpryllLNPKERKQVFDQYVKTRAEEERR--EKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIE 745
Cdd:PTZ00121  1410 --------LKKAAAAKKKADEAKKKAEEKKKadEAKKKAEEAKkaDEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  746 KMKDREAlfNEFVAAARKKEKEDSKTRGEKIKSDffELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYI 825
Cdd:PTZ00121  1482 AKKADEA--KKKAEEAKKKADEAKKAAEAKKKAD--EAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  826 EKIAKNLDSEKEKELERQARIEASLREREREVQKARSEQTKEIDREREQHKREEAiqnfKALLSDMVRSSDVSWSDTRRt 905
Cdd:PTZ00121  1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA----KKAEEAKIKAEELKKAEEEK- 1632
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  906 lRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDETSAitltstwKEVKKIIKEDPRCIKFSSSDRKKQREF 985
Cdd:PTZ00121  1633 -KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA-------EEAKKAEEDEKKAAEALKKEAEEAKKA 1704
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1344544774  986 EEyIRDKYITAKADFRTLLKETKFITYRSKKLIQESDQHLKDVEKI 1031
Cdd:PTZ00121  1705 EE-LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEA 1749
PRP40 COG5104
Splicing factor [RNA processing and modification];
124-173 1.56e-05

Splicing factor [RNA processing and modification];


Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 48.92  E-value: 1.56e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1344544774  124 APGAPALPPTEEIWVENKTPDGKVYYYNARTRESAWTKPDgvKVIQQSEL 173
Cdd:COG5104      4 ALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPK--ELLKGSEE 51
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
258-346 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  258 VGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTtatsvAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTaTPVQTVPQP 337
Cdd:pfam05109  513 VTTPTPNATSPTPAVTTPTPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKT-SPTSAVTTP 586

                   ....*....
gi 1344544774  338 HPQTLPPAV 346
Cdd:pfam05109  587 TPNATSPTV 595
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
996-1055 2.38e-04

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 39.75  E-value: 2.38e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  996 AKADFRTLLKETKfITYRSkkliqesdqHLKDVEKILQNDKRYLVLDcVPEERRKLIVAY 1055
Cdd:pfam01846    2 AREAFKELLKEHK-ITPYS---------TWSEIKKKIENDPRYKALL-DGSEREELFEDY 50
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
994-1058 2.59e-04

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 39.86  E-value: 2.59e-04
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1344544774   994 ITAKADFRTLLKETKFITYrskkliqesDQHLKDVEKILQNDKRYLVLDcVPEERRKLIVAYVDD 1058
Cdd:smart00441    1 EEAKEAFKELLKEHEVITP---------DTTWSEARKKLKNDPRYKALL-SESEREQLFEDHIEE 55
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
299-463 5.22e-04

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 43.91  E-value: 5.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  299 STPTTQDQTPSSAVSVATPTVSVSapaPTATPVQTVPQPHPqTLPPAVPHSVPQPAAAIPAFppvmVPPFRVPLPGMPIP 378
Cdd:TIGR01645  322 AVLGPRAQSPATPSSSLPTDIGNK---AVVSSAKKEAEEVP-PLPQAAPAVVKPGPMEIPTP----VPPPGLAIPSLVAP 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  379 LPGVLPGMAPPIV------PMIHPQVAIAASP--ATLAGATAVSEwtEYKTADGKTYYYNNRTLESTWEKPQElKEKEKL 450
Cdd:TIGR01645  394 PGLVAPTEINPSFlasprkKMKREKLPVTFGAldDTLAWKEPSKE--DQTSEDGKMLAIMGEAAAALALEPKK-KKKEKE 470
                          170
                   ....*....|...
gi 1344544774  451 DEKIKEPIKEASE 463
Cdd:TIGR01645  471 GEELQPKLVMNSE 483
PHA03247 PHA03247
large tegument protein UL36; Provisional
260-424 8.21e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 8.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHP 339
Cdd:PHA03247  2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  340 QTLPPAVPHSVPQPAAAIpafppvmvPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQVAIAASPATLAGATAVSEWTEY 419
Cdd:PHA03247  2783 LTRPAVASLSESRESLPS--------PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854

                   ....*
gi 1344544774  420 KTADG 424
Cdd:PHA03247  2855 SVAPG 2859
 
Name Accession Description Interval E-value
PRP40 COG5104
Splicing factor [RNA processing and modification];
402-999 1.51e-25

Splicing factor [RNA processing and modification];


Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 112.87  E-value: 1.51e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  402 ASPATLAGATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQEL--KEKEKLDEkikepikeaseeplpmeteeedpkeep 479
Cdd:COG5104      3 AALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPKELlkGSEEDLDV--------------------------- 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  480 vkeikeepkeeemteeekaaqkakpvattpipgTPWCVVWTGDERVFFYNPTTRLSMWDRPDDligRADVDKIIQEpphK 559
Cdd:COG5104     56 ---------------------------------DPWKECRTADGKVYYYNSITRESRWKIPPE---RKKVEPIAEQ---K 96
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  560 KGLEDMKKLRHPAPTMLSIQKWQFSmsaiKEEQELMEEMNEDEPIKAKKRKrddnkdidsekeaameaeikaareraivP 639
Cdd:COG5104     97 HDERSMIGGNGNDMAITDHETSEPK----YLLGRLMSQYGITSTKDAVYRL----------------------------T 144
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  640 LEARMKQFKDMLLERGVSAFSTWEKELHKIVfDPRYLLL--NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMM 717
Cdd:COG5104    145 KEEAEKEFITMLKENQVDSTWPIFRAIEELR-DPRYWMVdtDPLWRKDLFKKYFENQEKDQREEEENKQRKYINEFCKML 223
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  718 E-EAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWS 796
Cdd:COG5104    224 AgNSHIKYYTDWFTFKSIFSKHPYYSSVVNEKTKRQTFQKYKDKLGCYEKYVGKHMGGTALGRLEEVLRSLGSETFIIWL 303
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  797 KVKDKVESDPRYKAvdSSSM----REDLFKQYIeKIAKNLdsekekelerqarieaslrerEREVQKARSEQTKEIDReR 872
Cdd:COG5104    304 LNHYVFDSVVRYLK--NKEMkpldRKDILFSFI-RYVRRL---------------------EKELLSAIEERKAAAAQ-N 358
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  873 EQHKREeaiqNFKALLSDMVRSSDVS----WSDTRRTLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDE 948
Cdd:COG5104    359 ARHHRD----EFRTLLRKLYSEGKIYyrmkWKNAYPLIKDDPRFLNLLGRTGSSPLDLFFDFIVDLENMYGFARRSYERE 434
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774  949 TSaITLTSTW--KEVKKIIKEDPRciKFSSSDRKKQREFEE---YIRDKYITAKAD 999
Cdd:COG5104    435 TR-TGQISPTdrRAVDEIFEAIAE--KKEEGEIKFDKVDKEdisLIVDGLIKQRNE 487
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
775-824 1.55e-13

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 65.94  E-value: 1.55e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1344544774  775 KIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQY 824
Cdd:pfam01846    1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
708-757 1.60e-11

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 60.16  E-value: 1.60e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1344544774  708 QAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEF 757
Cdd:pfam01846    1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
643-690 1.10e-09

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 54.77  E-value: 1.10e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1344544774  643 RMKQFKDMLLERGVSAFSTWEKELHKIVFDPRYL-LLNPKERKQVFDQY 690
Cdd:pfam01846    2 AREAFKELLKEHKITPYSTWSEIKKKIENDPRYKaLLDGSEREELFEDY 50
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
936-991 1.23e-09

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 54.89  E-value: 1.23e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774   936 KKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCiKFSSSDRKKQREFEEYIRD 991
Cdd:smart00441    1 EEAKEAFKELLKEHEVITPDTTWSEARKKLKNDPRY-KALLSESEREQLFEDHIEE 55
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
774-827 2.61e-09

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 54.12  E-value: 2.61e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 1344544774   774 EKIKSDFFELLSNHHLD-SQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEK 827
Cdd:smart00441    1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKALLSESEREQLFEDHIEE 55
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
879-930 3.69e-09

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 53.23  E-value: 3.69e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1344544774  879 EAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWEsgSLLEREEKEKLFNEH 930
Cdd:pfam01846    1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYK--ALLDGSEREELFEDY 50
WW cd00201
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ...
414-443 1.04e-08

Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.


Pssm-ID: 238122 [Multi-domain]  Cd Length: 31  Bit Score: 51.76  E-value: 1.04e-08
                           10        20        30
                   ....*....|....*....|....*....|
gi 1344544774  414 SEWTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:cd00201      2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
WW pfam00397
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ...
414-441 3.55e-08

WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.


Pssm-ID: 459800 [Multi-domain]  Cd Length: 30  Bit Score: 50.20  E-value: 3.55e-08
                           10        20
                   ....*....|....*....|....*...
gi 1344544774  414 SEWTEYKTADGKTYYYNNRTLESTWEKP 441
Cdd:pfam00397    3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
WW pfam00397
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ...
137-162 6.02e-08

WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.


Pssm-ID: 459800 [Multi-domain]  Cd Length: 30  Bit Score: 49.43  E-value: 6.02e-08
                           10        20
                   ....*....|....*....|....*.
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKP 162
Cdd:pfam00397    5 WEERWDPDGRVYYYNHETGETQWEKP 30
WW smart00456
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ...
132-164 6.59e-08

Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.


Pssm-ID: 197736 [Multi-domain]  Cd Length: 33  Bit Score: 49.52  E-value: 6.59e-08
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1344544774   132 PTEEIWVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:smart00456    1 PLPPGWEERKDPDGRPYYYNHETKETQWEKPRE 33
WW smart00456
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ...
416-443 7.20e-08

Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.


Pssm-ID: 197736 [Multi-domain]  Cd Length: 33  Bit Score: 49.14  E-value: 7.20e-08
                            10        20
                    ....*....|....*....|....*...
gi 1344544774   416 WTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:smart00456    6 WEERKDPDGRPYYYNHETKETQWEKPRE 33
WW cd00201
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ...
137-164 1.70e-07

Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.


Pssm-ID: 238122 [Multi-domain]  Cd Length: 31  Bit Score: 48.29  E-value: 1.70e-07
                           10        20
                   ....*....|....*....|....*...
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:cd00201      4 WEERWDPDGRVYYYNHNTKETQWEDPRE 31
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
707-760 2.04e-07

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 48.72  E-value: 2.04e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 1344544774   707 MQAKEDFKKMMEEAKFN-PRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAA 760
Cdd:smart00441    1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKALLSESEREQLFEDHIEE 55
PTZ00121 PTZ00121
MAEBL; Provisional
590-1031 2.12e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.53  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  590 EEQELMEEMNEDEPIKAKKRKRDDNKDIDSEKEAAMEAEIKAARERAIVPLEARMKQFKDMLLErgvsafstwekelhki 669
Cdd:PTZ00121  1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE---------------- 1409
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  670 vfdpryllLNPKERKQVFDQYVKTRAEEERR--EKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIE 745
Cdd:PTZ00121  1410 --------LKKAAAAKKKADEAKKKAEEKKKadEAKKKAEEAKkaDEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  746 KMKDREAlfNEFVAAARKKEKEDSKTRGEKIKSDffELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYI 825
Cdd:PTZ00121  1482 AKKADEA--KKKAEEAKKKADEAKKAAEAKKKAD--EAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  826 EKIAKNLDSEKEKELERQARIEASLREREREVQKARSEQTKEIDREREQHKREEAiqnfKALLSDMVRSSDVSWSDTRRt 905
Cdd:PTZ00121  1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA----KKAEEAKIKAEELKKAEEEK- 1632
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  906 lRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDETSAitltstwKEVKKIIKEDPRCIKFSSSDRKKQREF 985
Cdd:PTZ00121  1633 -KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA-------EEAKKAEEDEKKAAEALKKEAEEAKKA 1704
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1344544774  986 EEyIRDKYITAKADFRTLLKETKFITYRSKKLIQESDQHLKDVEKI 1031
Cdd:PTZ00121  1705 EE-LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEA 1749
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
937-988 3.61e-07

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 47.84  E-value: 3.61e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1344544774  937 KKREHFRQLLDETSaITLTSTWKEVKKIIKEDPRCIKFSSSDRKKQrEFEEY 988
Cdd:pfam01846    1 KAREAFKELLKEHK-ITPYSTWSEIKKKIENDPRYKALLDGSEREE-LFEDY 50
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
878-933 3.90e-06

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 44.87  E-value: 3.90e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1344544774   878 EEAIQNFKALLSDMVRS-SDVSWSDTRRTLRKDHRWESgsLLEREEKEKLFNEHIEA 933
Cdd:smart00441    1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKA--LLSESEREQLFEDHIEE 55
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
641-692 1.01e-05

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 43.72  E-value: 1.01e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 1344544774   641 EARMKQFKDMLLERGVS-AFSTWEKELHKIVFDPRY-LLLNPKERKQVFDQYVK 692
Cdd:smart00441    1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYkALLSESEREQLFEDHIE 54
WW smart00456
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ...
514-542 1.14e-05

Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.


Pssm-ID: 197736 [Multi-domain]  Cd Length: 33  Bit Score: 42.97  E-value: 1.14e-05
                            10        20
                    ....*....|....*....|....*....
gi 1344544774   514 PWCVVWTGDERVFFYNPTTRLSMWDRPDD 542
Cdd:smart00456    5 GWEERKDPDGRPYYYNHETKETQWEKPRE 33
PRP40 COG5104
Splicing factor [RNA processing and modification];
124-173 1.56e-05

Splicing factor [RNA processing and modification];


Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 48.92  E-value: 1.56e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1344544774  124 APGAPALPPTEEIWVENKTPDGKVYYYNARTRESAWTKPDgvKVIQQSEL 173
Cdd:COG5104      4 ALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPK--ELLKGSEE 51
WW cd00201
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ...
513-542 1.97e-05

Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.


Pssm-ID: 238122 [Multi-domain]  Cd Length: 31  Bit Score: 42.13  E-value: 1.97e-05
                           10        20        30
                   ....*....|....*....|....*....|
gi 1344544774  513 TPWCVVWTGDERVFFYNPTTRLSMWDRPDD 542
Cdd:cd00201      2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
682-932 2.38e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 48.58  E-value: 2.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  682 ERKQVfDQYVKTRAEEERREKKNKIMQAKEdfKKMMEEAKFNPRATFSEFAAKHAKDSRFkAIEKMKDREALFNEfvaaa 761
Cdd:pfam17380  286 ERQQQ-EKFEKMEQERLRQEKEEKAREVER--RRKLEEAEKARQAEMDRQAAIYAEQERM-AMERERELERIRQE----- 356
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  762 rKKEKEDSKTRGEKIKSDFFEL--LSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMRE-DLFKQYIEKIaknldsEKEK 838
Cdd:pfam17380  357 -ERKRELERIRQEEIAMEISRMreLERLQMERQQKNERVRQELEAARKVKILEEERQRKiQQQKVEMEQI------RAEQ 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  839 ELERQARIEASLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWESGSLL 918
Cdd:pfam17380  430 EEARQREVRRLEEERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMI 509
                          250
                   ....*....|....
gi 1344544774  919 EREEKEKLFNEHIE 932
Cdd:pfam17380  510 EEERKRKLLEKEME 523
PTZ00121 PTZ00121
MAEBL; Provisional
418-1010 7.29e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 47.06  E-value: 7.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  418 EYKTADGKTYYYNNRTLESTWEKPQELKEKEKLDEKIKEPIKEASEepLPMETEEEDPKEEPVKEIKEEPKEEEMTEEEK 497
Cdd:PTZ00121  1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE--LKKAAAAKKKADEAKKKAEEKKKADEAKKKAE 1441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  498 AAQKAKPVATtpipgtpwcvvwTGDERVFFYNPTTRLSMWDRPDDLIGRADVDKIIQEPphKKGLEDMKKLRHPAptmls 577
Cdd:PTZ00121  1442 EAKKADEAKK------------KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA--KKKAEEAKKKADEA----- 1502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  578 iQKWQFSMSAIKEEQELMEEMNEDEPIKAKKRKRDDNKDIDSEKEAAMEA----EIKAARERAIVPLEARMKQFKDMLLE 653
Cdd:PTZ00121  1503 -KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELkkaeELKKAEEKKKAEEAKKAEEDKNMALR 1581
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  654 RGVSAFSTWEKELhkivfdprylllnpKERKQVFDQYVKTRAEEERREKKNKImqAKEDFKKMMEEAKfnpratFSEFAA 733
Cdd:PTZ00121  1582 KAEEAKKAEEARI--------------EEVMKLYEEEKKMKAEEAKKAEEAKI--KAEELKKAEEEKK------KVEQLK 1639
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  734 KHAKDSRFKAIEKMKDREAlfNEFVAAARKKEKEDSKTRGEKIKSDffellsnhhlDSQSRWSKVKDKVESDPRYKAvds 813
Cdd:PTZ00121  1640 KKEAEEKKKAEELKKAEEE--NKIKAAEEAKKAEEDKKKAEEAKKA----------EEDEKKAAEALKKEAEEAKKA--- 1704
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  814 ssmrEDLFKQYIEKIAKNLDSEKEKElERQARIEASLREREREVQKARSEQTKEIDREREQH-------KREEAIQNFKA 886
Cdd:PTZ00121  1705 ----EELKKKEAEEKKKAEELKKAEE-ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHlkkeeekKAEEIRKEKEA 1779
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  887 LLSDMVRSSDVSWSDTRRTLRKDHRWESGSLLEREEKEKLFnehiealTKKKREHFRQLLDETsAITLTSTWKEVKKIIK 966
Cdd:PTZ00121  1780 VIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLV-------INDSKEMEDSAIKEV-ADSKNMQLEEADAFEK 1851
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....
gi 1344544774  967 EDPRCIKFSSSDRKKQREFEeyiRDKYItaKADFRTLLKETKFI 1010
Cdd:PTZ00121  1852 HKFNKNNENGEDGNKEADFN---KEKDL--KEDDEEEIEEADEI 1890
PTZ00121 PTZ00121
MAEBL; Provisional
590-963 9.97e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 46.67  E-value: 9.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  590 EEQELMEEM---NEDEPIKAKKRKRDDNKDidseKEAAMEAEIKAARERAIVPLEARMKQFKdmlleRGVSAFSTWEKEl 666
Cdd:PTZ00121  1209 EEERKAEEArkaEDAKKAEAVKKAEEAKKD----AEEAKKAEEERNNEEIRKFEEARMAHFA-----RRQAAIKAEEAR- 1278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  667 hkivfdpRYLLLNPKERKQVFDQYVKTRAEEERREKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAI 744
Cdd:PTZ00121  1279 -------KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKkaDEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  745 EKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRwskvKDKVESDPRYKAVDSSSMREDLfKQY 824
Cdd:PTZ00121  1352 AEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE----EDKKKADELKKAAAAKKKADEA-KKK 1426
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  825 IEKIAKNLDSEKEKELERQARieaSLREREREVQKARSEQTKEIDREREQHKREEAIQNFKAllSDMVRSSDVSWSDTRR 904
Cdd:PTZ00121  1427 AEEKKKADEAKKKAEEAKKAD---EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADE 1501
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1344544774  905 TLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDETSAITLTSTwKEVKK 963
Cdd:PTZ00121  1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKA-EELKK 1559
PRP40 COG5104
Splicing factor [RNA processing and modification];
137-172 9.99e-05

Splicing factor [RNA processing and modification];


Pssm-ID: 227435 [Multi-domain]  Cd Length: 590  Bit Score: 46.23  E-value: 9.99e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1344544774  137 WVENKTPDGKVYYYNARTRESAWTKPDGVKVIQQSE 172
Cdd:COG5104     58 WKECRTADGKVYYYNSITRESRWKIPPERKKVEPIA 93
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
258-346 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  258 VGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTtatsvAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTaTPVQTVPQP 337
Cdd:pfam05109  513 VTTPTPNATSPTPAVTTPTPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKT-SPTSAVTTP 586

                   ....*....
gi 1344544774  338 HPQTLPPAV 346
Cdd:pfam05109  587 TPNATSPTV 595
WW pfam00397
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ...
513-540 2.01e-04

WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.


Pssm-ID: 459800 [Multi-domain]  Cd Length: 30  Bit Score: 39.41  E-value: 2.01e-04
                           10        20
                   ....*....|....*....|....*...
gi 1344544774  513 TPWCVVWTGDERVFFYNPTTRLSMWDRP 540
Cdd:pfam00397    3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
FF pfam01846
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ...
996-1055 2.38e-04

FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.


Pssm-ID: 426471 [Multi-domain]  Cd Length: 50  Bit Score: 39.75  E-value: 2.38e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  996 AKADFRTLLKETKfITYRSkkliqesdqHLKDVEKILQNDKRYLVLDcVPEERRKLIVAY 1055
Cdd:pfam01846    2 AREAFKELLKEHK-ITPYS---------TWSEIKKKIENDPRYKALL-DGSEREELFEDY 50
FF smart00441
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ...
994-1058 2.59e-04

Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.


Pssm-ID: 128718 [Multi-domain]  Cd Length: 55  Bit Score: 39.86  E-value: 2.59e-04
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1344544774   994 ITAKADFRTLLKETKFITYrskkliqesDQHLKDVEKILQNDKRYLVLDcVPEERRKLIVAYVDD 1058
Cdd:smart00441    1 EEAKEAFKELLKEHEVITP---------DTTWSEARKKLKNDPRYKALL-SESEREQLFEDHIEE 55
PHA02682 PHA02682
ORF080 virion core protein; Provisional
294-423 2.84e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 44.08  E-value: 2.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  294 VAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHPQTLP-PAVPHSVPQpaaaipafppvmvPPFRVPL 372
Cdd:PHA02682    82 LAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPaPARPAPACP-------------PSTRQCP 148
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1344544774  373 PGMPIPLPGVLPGMAPPIV-PMIHPQVAIAASPATLAGATAVSEWTEYKTAD 423
Cdd:PHA02682   149 PAPPLPTPKPAPAAKPIFLhNQLPPPDYPAASCPTIETAPAASPVLEPRIPD 200
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
299-463 5.22e-04

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 43.91  E-value: 5.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  299 STPTTQDQTPSSAVSVATPTVSVSapaPTATPVQTVPQPHPqTLPPAVPHSVPQPAAAIPAFppvmVPPFRVPLPGMPIP 378
Cdd:TIGR01645  322 AVLGPRAQSPATPSSSLPTDIGNK---AVVSSAKKEAEEVP-PLPQAAPAVVKPGPMEIPTP----VPPPGLAIPSLVAP 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  379 LPGVLPGMAPPIV------PMIHPQVAIAASP--ATLAGATAVSEwtEYKTADGKTYYYNNRTLESTWEKPQElKEKEKL 450
Cdd:TIGR01645  394 PGLVAPTEINPSFlasprkKMKREKLPVTFGAldDTLAWKEPSKE--DQTSEDGKMLAIMGEAAAALALEPKK-KKKEKE 470
                          170
                   ....*....|...
gi 1344544774  451 DEKIKEPIKEASE 463
Cdd:TIGR01645  471 GEELQPKLVMNSE 483
PTZ00121 PTZ00121
MAEBL; Provisional
678-1050 9.25e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 9.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  678 LNPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMMEEAKfnpratFSEFAAKHAKDSRfKAIEKMKDREALFNEf 757
Cdd:PTZ00121  1072 LKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEAR------KAEEAKKKAEDAR-KAEEARKAEDARKAE- 1143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  758 vaAARKKEkEDSKTRGEKIKSDFFELLSNHHLDSQSRWSKVKDKVE---SDPRYKAVDSSSMREDLFKQYIEKIAKNLDS 834
Cdd:PTZ00121  1144 --EARKAE-DAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEvrkAEELRKAEDARKAEAARKAEEERKAEEARKA 1220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  835 EKEKELERQARIEaSLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsdtrrTLRK-DHRWE 913
Cdd:PTZ00121  1221 EDAKKAEAVKKAE-EAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKAD--------ELKKaEEKKK 1291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  914 SGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDET--SAITLTSTWKEVKKIIKEDPRCIKFSSSDRKKQREFEEYIRD 991
Cdd:PTZ00121  1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAkkKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1344544774  992 KYITAKADFRTLLK--ETKFITYRSKKLIQESDQHLKDVEKILQNDKRYLVLDCVPEERRK 1050
Cdd:PTZ00121  1372 KKEEAKKKADAAKKkaEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
260-345 1.04e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.02  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVS-VSAPAPTATPVQTVPQPH 338
Cdd:pfam17823  165 ASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGtALAAVGNSSPAAGTVTAA 244

                   ....*..
gi 1344544774  339 PQTLPPA 345
Cdd:pfam17823  245 VGTVTPA 251
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
697-1051 1.75e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.36  E-value: 1.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  697 EERREKKNKIMQAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSRF-----KAIEKMKDREALFNEFVAAARKKEKEDSKT 771
Cdd:PRK03918   175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELreeleKLEKEVKELEELKEEIEELEKELESLEGSK 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  772 RGEKIKsdffelLSNhhldSQSRWSKVKDKVEsDPRYKAVDSSSMREDLfKQYIEkiaknLDSEKEKELERQARIE---A 848
Cdd:PRK03918   255 RKLEEK------IRE----LEERIEELKKEIE-ELEEKVKELKELKEKA-EEYIK-----LSEFYEEYLDELREIEkrlS 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  849 SLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsDTRRTLRKDHRWESGslLEREEKEKLFN 928
Cdd:PRK03918   318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE----EAKAKKEELERLKKR--LTGLTPEKLEK 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  929 EhIEALTKKKREHFRQLLdetsaiTLTSTWKEVKKIIKEDPRCIKFSSSDRKK----QREFEEYIRDKYITA-KADFRTL 1003
Cdd:PRK03918   392 E-LEELEKAKEEIEEEIS------KITARIGELKKEIKELKKAIEELKKAKGKcpvcGRELTEEHRKELLEEyTAELKRI 464
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1344544774 1004 LKETKFITYRSKKLIQEsdqhLKDVEKILQNDKRYLVLDCVPEERRKL 1051
Cdd:PRK03918   465 EKELKEIEEKERKLRKE----LRELEKVLKKESELIKLKELAEQLKEL 508
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
298-414 2.12e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 2.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  298 VSTPTTQDQTPSSAVSVATPTVSvSAPAPTatpvqtvPQPHPQTLPPAVPHSVPQPAAAIPAFPPVMVP-PFRVPLPGMP 376
Cdd:pfam17823  296 AAPMGAQAQGPIIQVSTDQPVHN-TAGEPT-------PSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKePSASPVPVLH 367
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1344544774  377 ---IP---------LPGVLP---GMAPPIVPMIHPQVAIAASPATL-AGATAVS 414
Cdd:pfam17823  368 tsmIPeveatspttQPSPLLptqGAAGPGILLAPEQVATEATAGTAsAGPTPRS 421
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
679-1071 2.82e-03

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 41.88  E-value: 2.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  679 NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDfkkmmeeakfnpratfsefaakhakdsrfKAIEKMKDREALFNEFV 758
Cdd:pfam02463  151 KPERRLEIEEEAAGSRLKRKKKEALKKLIEETEN-----------------------------LAELIIDLEELKLQELK 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  759 AAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKavDSSSMREDLFKQYIEKIAKNLDSEKEK 838
Cdd:pfam02463  202 LKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEE--IESSKQEIEKEEEKLAQVLKENKEEEK 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  839 ELERQARIEASLREREREVQKAR--SEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsdtRRTLRKDHRWESGS 916
Cdd:pfam02463  280 EKKLQEEELKLLAKEEEELKSELlkLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEE------LEKELKELEIKREA 353
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  917 LLEREE----KEKLFNEHIEALTKKKREHFRQLLDETSAITLTSTWKEVKKIIkedprcikfsSSDRKKQREFEEYIRDK 992
Cdd:pfam02463  354 EEEEEEelekLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKE----------AQLLLELARQLEDLLKE 423
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1344544774  993 YITAKADFrtLLKETKFITYRSKKLIQESDqHLKDVEKILQNDKRYLVLDCVPEERRKLIVAYVDDLDRRGPPPPPTAS 1071
Cdd:pfam02463  424 EKKEELEI--LEEEEESIELKQGKLTEEKE-ELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERS 499
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
694-976 3.08e-03

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 41.48  E-value: 3.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  694 RAEEERREKKNKIMQAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSrfKAIEKMKDREALFNEFVAAARKKEKEDSKTRG 773
Cdd:COG5185    257 KLVEQNTDLRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYT--KSIDIKKATESLEEQLAAAEAEQELEESKRET 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  774 EKIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSekekelerqarIEASLRER 853
Cdd:COG5185    335 ETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVELSKSSEELDSFKDTIESTKESLDE-----------IPQNQRGY 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  854 EREVQKARSEQTKEIDREREQHKR---------EEAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWESGSLLEREEKE 924
Cdd:COG5185    404 AQEILATLEDTLKAADRQIEELQRqieqatssnEEVSKLLNELISELNKVMREADEESQSRLEEAYDEINRSVRSKKEDL 483
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774  925 --------------KLFNEHIEALTKKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCIKFSS 976
Cdd:COG5185    484 neeltqiesrvstlKATLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILALENLIPASE 549
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
260-397 3.73e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.29  E-value: 3.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAP------------APT 327
Cdd:pfam03154  176 AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQrlpsphpplqpmTQP 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  328 ATPVQTVPQPHP---------------QTLPPAVPHSV------------------------------PQPAAAIPAFPP 362
Cdd:pfam03154  256 PPPSQVSPQPLPqpslhgqmppmphslQTGPSHMQHPVppqpfpltpqssqsqvppgpspaapgqsqqRIHTPPSQSQLQ 335
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1344544774  363 VMVPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQ 397
Cdd:pfam03154  336 SQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQ 370
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
817-948 6.04e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 40.67  E-value: 6.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  817 REDLFKQYIEKIAKNLDSEKEKELERQARIEAsLREREREVQKARSEQ--------TKEIDR-EREQHKREEAIQNFKAL 887
Cdd:COG4913    289 RLELLEAELEELRAELARLEAELERLEARLDA-LREELDELEAQIRGNggdrleqlEREIERlERELEERERRRARLEAL 367
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1344544774  888 LSDMvrssDVSWSDTRRTLRKDHRwESGSLLER--EEKEKLFNEHIEALTKKK--REHFRQLLDE 948
Cdd:COG4913    368 LAAL----GLPLPASAEEFAALRA-EAAALLEAleEELEALEEALAEAEAALRdlRRELRELEAE 427
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
589-1031 7.74e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 40.43  E-value: 7.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  589 KEEQELMEEMNEDEPIKAKKRKRDDNKDIDSEKEAAMEAEIKAARERaIVPLEARMKQFKDmlLERGVSAFSTWEKELHK 668
Cdd:PRK03918   228 KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE-IEELEEKVKELKE--LKEKAEEYIKLSEFYEE 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  669 IVFDPRYLllnpKERKQVFDQYVKTRAE--EERREKKNKIMQAKEDFKKMMEE-AKFNPRA-TFSEFAAKHAKDSRFKAI 744
Cdd:PRK03918   305 YLDELREI----EKRLSRLEEEINGIEEriKELEEKEERLEELKKKLKELEKRlEELEERHeLYEEAKAKKEELERLKKR 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  745 EKMKDREALFNEFVAAARKKEK--EDSKTRGEKIKSdfFELLSNHHLDSQSRWSKVKDKVesdPRYKAVDSSSMREDLFK 822
Cdd:PRK03918   381 LTGLTPEKLEKELEELEKAKEEieEEISKITARIGE--LKKEIKELKKAIEELKKAKGKC---PVCGRELTEEHRKELLE 455
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  823 QYIEKIAKnldseKEKELERQARIEASLREREREVQKARS------------EQTKEIDREREQHKREEAIQNFKALLSD 890
Cdd:PRK03918   456 EYTAELKR-----IEKELKEIEEKERKLRKELRELEKVLKkeseliklkelaEQLKELEEKLKKYNLEELEKKAEEYEKL 530
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  891 MVRSsdvswsdtrRTLRKDHRwesgSLLEREEKEKLFNEHIEALTKKKREHFRQLldetsaitltstwKEVKKIIKEdpr 970
Cdd:PRK03918   531 KEKL---------IKLKGEIK----SLKKELEKLEELKKKLAELEKKLDELEEEL-------------AELLKELEE--- 581
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1344544774  971 cIKFSSSD--RKKQREFEEYIRdKYIT---AKADFRTLLKETKFITYRSKKLIQESDQHLKDVEKI 1031
Cdd:PRK03918   582 -LGFESVEelEERLKELEPFYN-EYLElkdAEKELEREEKELKKLEEELDKAFEELAETEKRLEEL 645
PHA03247 PHA03247
large tegument protein UL36; Provisional
260-424 8.21e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 8.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHP 339
Cdd:PHA03247  2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  340 QTLPPAVPHSVPQPAAAIpafppvmvPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQVAIAASPATLAGATAVSEWTEY 419
Cdd:PHA03247  2783 LTRPAVASLSESRESLPS--------PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854

                   ....*
gi 1344544774  420 KTADG 424
Cdd:PHA03247  2855 SVAPG 2859
PLN02316 PLN02316
synthase/transferase
814-899 9.63e-03

synthase/transferase


Pssm-ID: 215180 [Multi-domain]  Cd Length: 1036  Bit Score: 40.24  E-value: 9.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1344544774  814 SSMREDLFKQYiekiaknLDSEKEKELERQARIEASlREREREVQKARSEQTKEIDREREQHKREEAIQNFKA--LLSDM 891
Cdd:PLN02316   239 GGMDEHSFEDF-------LLEEKRRELEKLAKEEAE-RERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLqnLLKKA 310

                   ....*...
gi 1344544774  892 VRSSDVSW 899
Cdd:PLN02316   311 SRSADNVW 318
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH