Conserved domains on  [gi|2065208831|ref|NP_001382389|]
teneurin-2 isoform 5 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are encoded by 'pair-rule' genes and are involved in tissue patterning, probably neural patterning specifically. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. There it probably carries out some transcriptional regulatory activity. The length of this region and its conservation suggest that there may be two structural domains here (personal obs: C Yeats).



Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 651.27  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLIHRESDEFPRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  169 RPIPPTSSPSLlPSAQLPSshnPPP--VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQS 246
Cdd:pfam06484  161 PPIPPSSSSSS-PVEQHSP---PPPslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNHS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  247 TLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTS 323
Cdd:pfam06484  237 RLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTG 316
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2065208831  324 SGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  317 TGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1240-1570 1.34e-48

NHL repeat unit of beta-propeller proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have catalytic activity in peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat; this interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 177.34  E-value: 1.34e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 1313
Cdd:cd14953     25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1314 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1391
Cdd:cd14953    104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1392 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1469
Cdd:cd14953    170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1549
Cdd:cd14953    237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                          330       340
                   ....*....|....*....|.
gi 2065208831 1550 AVAPDGTIYIADLGNIRIRAV 1570
Cdd:cd14953    303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2690-2767 4.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.



Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 4.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2065208831 2690 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2767
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1531-2467 3.80e-31

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];



Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 134.88  E-value: 3.80e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1531 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1610
Cdd:COG3209    107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1611 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1689
Cdd:COG3209    187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1690 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1769
Cdd:COG3209    267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1770 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1849
Cdd:COG3209    347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1850 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1929
Cdd:COG3209    427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1930 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 2001
Cdd:COG3209    507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2002 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 2081
Cdd:COG3209    587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2082 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 2161
Cdd:COG3209    667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2162 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 2235
Cdd:COG3209    744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2236 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 2315
Cdd:COG3209    824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2316 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 2395
Cdd:COG3209    893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                          890       900       910       920       930       940       950
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2065208831 2396 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2467
Cdd:COG3209    953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
843-873 3.60e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.



Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2065208831  843 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 873
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 super family cl44670
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
575-734 2.14e-07

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


The actual alignment was detected with superfamily member pfam19232:

Pssm-ID: 437064  Cd Length: 265  Bit Score: 55.01  E-value: 2.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  575 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 725
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 2065208831  726 ------------DPN---WMGPDC 734
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 super family cl49352
MXAN_6577-like cysteine-rich domain;
739-820 4.36e-04

MXAN_6577-like cysteine-rich domain;


The actual alignment was detected with superfamily member NF041328:

Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  739 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 808
Cdd:NF041328    45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                           90
                   ....*....|..
gi 2065208831  809 GCPDLCNGNGRC 820
Cdd:NF041328   115 ACVDLATDPLHC 126
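
Each hit above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) followed by paired alignment rows for the query and the domain model. The following is a minimal Python sketch of how such a record might be read programmatically. It is not part of CD-Search: the names `HitStats`, `parse_stats`, and `column_identity` are hypothetical, the regular expression assumes the statistics line is laid out exactly as on this page, and ignoring letter case when comparing residues is a simplifying assumption of the sketch.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    """Fields of one 'Pssm-ID: ...' statistics line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, copied from the page text:
# "Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08"
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([0-9.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Return the parsed statistics, or None if the line does not match."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


def column_identity(query_row: str, model_row: str) -> float:
    """Fraction of gap-free columns in which both rows show the same residue.

    Columns where either row has '-' are skipped; case is ignored
    (an assumption of this sketch, not a CD-Search rule).
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    aligned = identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return identical / aligned if aligned else 0.0


# Example using the acid_disulf_rpt (NF033662) hit shown above.
stats = parse_stats(
    "Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08"
)
query = "AMETSCADNKDNEGDGLVDCLDPDCCLQSAC"
model = "ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC"
print(stats)
print(f"identity over aligned columns: {column_identity(query, model):.2f}")
```

Only the residue strings are passed to `column_identity`; the leading labels and coordinates of each alignment row would need to be stripped first.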
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are encoded by 'pair-rule' genes and are involved in tissue patterning, probably neural patterning specifically. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. There it probably carries out some transcriptional regulatory activity. The length of this region and its conservation suggest that there may be two structural domains here (personal obs: C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 651.27  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLIHRESDEFPRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  169 RPIPPTSSPSLlPSAQLPSshnPPP--VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQS 246
Cdd:pfam06484  161 PPIPPSSSSSS-PVEQHSP---PPPslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNHS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  247 TLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTS 323
Cdd:pfam06484  237 RLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTG 316
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2065208831  324 SGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  317 TGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1240-1570 1.34e-48

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 177.34  E-value: 1.34e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 1313
Cdd:cd14953     25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1314 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1391
Cdd:cd14953    104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1392 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1469
Cdd:cd14953    170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1549
Cdd:cd14953    237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                          330       340
                   ....*....|....*....|.
gi 2065208831 1550 AVAPDGTIYIADLGNIRIRAV 1570
Cdd:cd14953    303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2690-2767 4.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 4.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2065208831 2690 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2767
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1531-2467 3.80e-31

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 134.88  E-value: 3.80e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1531 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1610
Cdd:COG3209    107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1611 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1689
Cdd:COG3209    187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1690 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1769
Cdd:COG3209    267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1770 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1849
Cdd:COG3209    347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1850 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1929
Cdd:COG3209    427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1930 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 2001
Cdd:COG3209    507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2002 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 2081
Cdd:COG3209    587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2082 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 2161
Cdd:COG3209    667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2162 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 2235
Cdd:COG3209    744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2236 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 2315
Cdd:COG3209    824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2316 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 2395
Cdd:COG3209    893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                          890       900       910       920       930       940       950
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2065208831 2396 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2467
Cdd:COG3209    953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1240-1570 1.29e-12

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 70.82  E-value: 1.29e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 1315
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1316 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1392
Cdd:COG4257     86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1393 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1469
Cdd:COG4257    139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1549
Cdd:COG4257    184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                          330       340
                   ....*....|....*....|.
gi 2065208831 1550 AVAPDGTIYIADLGNIRIRAV 1570
Cdd:COG4257    237 AVDGDGRVWFAESGANRIVRF 257
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2391-2467 2.46e-09

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It occurs occasionally in the Archaea (Methanosarcina barkeri) but is common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 55.97  E-value: 2.46e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2391 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2466
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   .
gi 2065208831 2467 S 2467
Cdd:TIGR03696   70 N 70
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
843-873 3.60e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2065208831  843 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 873
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
575-734 2.14e-07

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 55.01  E-value: 2.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  575 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 725
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 2065208831  726 ------------DPN---WMGPDC 734
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
644-790 2.81e-07

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 52.07  E-value: 2.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  644 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328    13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  714 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 787
Cdd:NF041328    75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139

                   ...
gi 2065208831  788 REG 790
Cdd:NF041328   140 RGG 142
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1291-1568 1.67e-06

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 54.09  E-value: 1.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1291 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1365
Cdd:PLN02919   573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1366 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1438
Cdd:PLN02919   635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1439 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1513
Cdd:PLN02919   700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2065208831 1514 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1568
Cdd:PLN02919   778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1682-1718 6.44e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.20  E-value: 6.44e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2065208831 1682 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1718
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
739-820 4.36e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  739 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 808
Cdd:NF041328    45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                           90
                   ....*....|..
gi 2065208831  809 GCPDLCNGNGRC 820
Cdd:NF041328   115 ACVDLATDPLHC 126
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
756-798 1.65e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 38.37  E-value: 1.65e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 2065208831  756 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 798
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
168-360 2.03e-03

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 43.58  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  168 GRPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDE---EFSPNSYLLRACSgpqqasssgppnHHS 244
Cdd:COG5099    202 FNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVEnniILNSSSSINELTS------------IYG 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  245 QSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNrRSQIHAPAPAPNDLATTPESVQLQDSwvLNSNVPLETRHFLFkTSS 324
Cdd:COG5099    270 SVPSIRNLRGLNSALVSFLNVSSSSLAFSALNG-KEVSPTGSPSTRSFARVLPKSSPNNL--LTEILTTGVNPPQS-LPS 345
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2065208831  325 GSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 360
Cdd:COG5099    346 LLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSL 381
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
671-700 2.58e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found at the N-terminus of particular EGF-like domains; calcium binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges, as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.58e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2065208831  671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054      4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
583-755 7.81e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 39.36  E-value: 7.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  583 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 656
Cdd:NF041328    29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  657 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 734
Cdd:NF041328    89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
                          170       180
                   ....*....|....*....|.
gi 2065208831  735 sveVCSVDCGTHGVCIGGACR 755
Cdd:NF041328   128 ---ACGVACDPGESCRGGACT 145
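
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, assuming that layout stays exactly as shown on this page, of how such a line could be read into a record. The names `SUMMARY_RE` and `parse_summary` are hypothetical and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this
# listing, for example:
# "Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08"
# The "[Multi-domain]" flag is optional; some hits omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Summary line copied from the acid_disulf_rpt (NF033662) hit above.
    print(parse_summary(
        "Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08"
    ))
```

Sorting the parsed records by `e_value` would reproduce the order used in the lists on this page, which run from the most to the least significant hit.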
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out some transcriptional regulatory activity. The length of this region and the conservation suggest that there may be two structural domains here (personal obs: C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 651.27  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLIHRESDEFPRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831   90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  169 RPIPPTSSPSLlPSAQLPSshnPPP--VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQS 246
Cdd:pfam06484  161 PPIPPSSSSSS-PVEQHSP---PPPslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNHS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  247 TLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTS 323
Cdd:pfam06484  237 RLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTG 316
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2065208831  324 SGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  317 TGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1240-1570 1.34e-48

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 177.34  E-value: 1.34e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 1313
Cdd:cd14953     25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1314 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1391
Cdd:cd14953    104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1392 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1469
Cdd:cd14953    170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1549
Cdd:cd14953    237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                          330       340
                   ....*....|....*....|.
gi 2065208831 1550 AVAPDGTIYIADLGNIRIRAV 1570
Cdd:cd14953    303 AVDAAGNLYVADTGNNRIRKI 323
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1291-1571 9.22e-41

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 154.61  E-value: 9.22e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1291 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGEqclpfdEARCGDGGKAidATLMSPRGIAVDKNGLMY 1370
Cdd:cd14953     28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1371 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1446
Cdd:cd14953     92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1447 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1526
Cdd:cd14953    165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2065208831 1527 ncncYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1571
Cdd:cd14953    229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2690-2767 4.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 4.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2065208831 2690 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2767
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1328-1571 3.46e-32

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 129.57  E-value: 3.46e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1328 VVAGTGeqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1400
Cdd:cd14953      3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1401 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1478
Cdd:cd14953     72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1479 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdcdckndvncNCYSGDDAyATDAILNSPSSLAVAPDGTIY 1558
Cdd:cd14953    136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGG----------AGYAGDGP-ATAAQFNNPTGVAVDAAGNLY 201
                          250
                   ....*....|...
gi 2065208831 1559 IADLGNIRIRAVS 1571
Cdd:cd14953    202 VADRGNHRIRKIT 214
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1531-2467 3.80e-31

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 134.88  E-value: 3.80e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1531 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1610
Cdd:COG3209    107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1611 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1689
Cdd:COG3209    187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1690 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1769
Cdd:COG3209    267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1770 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1849
Cdd:COG3209    347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1850 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1929
Cdd:COG3209    427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1930 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 2001
Cdd:COG3209    507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2002 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 2081
Cdd:COG3209    587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2082 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 2161
Cdd:COG3209    667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2162 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 2235
Cdd:COG3209    744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2236 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 2315
Cdd:COG3209    824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2316 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 2395
Cdd:COG3209    893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                          890       900       910       920       930       940       950
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2065208831 2396 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2467
Cdd:COG3209    953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1210-1380 2.00e-18

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 89.13  E-value: 2.00e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1210 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 1282
Cdd:cd14953    163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1283 ---NNPahkYYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeQCLPfdearcGDGGKAIDATLMSPR 1359
Cdd:cd14953    239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
                          170       180
                   ....*....|....*....|...
gi 2065208831 1360 GIAVDKNGLMYFVDAT--MIRKV 1380
Cdd:cd14953    301 GVAVDAAGNLYVADTGnnRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1291-1589 4.85e-18

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 86.60  E-value: 4.85e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1291 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMY 1370
Cdd:cd05819     13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1371 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1446
Cdd:cd05819     70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1447 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1526
Cdd:cd05819    135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2065208831 1527 ncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1589
Cdd:cd05819    187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1235-1568 1.46e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 85.45  E-value: 1.46e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1235 NKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYR 1312
Cdd:cd05819      5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1313 VkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 1390
Cdd:cd05819     81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1391 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1468
Cdd:cd05819    139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1469 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncncysgdDAYATDAILNSPSS 1548
Cdd:cd05819    194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
                          330       340
                   ....*....|....*....|
gi 2065208831 1549 LAVAPDGTIYIADLGNIRIR 1568
Cdd:cd05819    248 LAVDSDGNLYVADTGNNRIQ 267
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1229-1440 3.77e-16

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 81.21  E-value: 3.77e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1229 NGLAEGNkLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSIL---ELRNKEFkhsNNPahkYYLAVDPvSGSLYVS 1303
Cdd:cd05819     94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQF---NGP---TGVAVDS-DGNIYVA 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1304 DTNSRRIYRVKSlsgtkdlagNSEVVAGTGEQCLPfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDATM--IRKVD 1381
Cdd:cd05819    166 DTGNHRIQVFDP---------DGNFLTTFGSTGTG-------------PGQFNYPTGIAVDSDGNIYVADSGNnrVQVFD 223
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2065208831 1382 QNGIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1440
Cdd:cd05819    224 PDGAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1236-1501 5.49e-15

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 77.74  E-value: 5.49e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1236 KLLAPVALAVGIDGSLYVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRI 1310
Cdd:cd05819     53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1311 YRVkslsgtkDLAGNSEVVAGTGEQClpfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 1388
Cdd:cd05819    126 QKF-------DPDGEFLTTFGSGGSG---------------PGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1389 LLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1466
Cdd:cd05819    184 TFGSTGTG--------------PGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 2065208831 1467 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1501
Cdd:cd05819    235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1352-1583 7.34e-15

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 77.36  E-value: 7.34e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1352 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1429
Cdd:cd05819      4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1430 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1507
Cdd:cd05819     69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2065208831 1508 GEICLLAGAASDCDCKndvncncysgddayatdaiLNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1583
Cdd:cd05819    132 GEFLTTFGSGGSGPGQ-------------------FNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1240-1570 1.29e-12


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 70.82  E-value: 1.29e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 1315
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1316 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1392
Cdd:COG4257     86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1393 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1469
Cdd:COG4257    139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1549
Cdd:COG4257    184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                          330       340
                   ....*....|....*....|.
gi 2065208831 1550 AVAPDGTIYIADLGNIRIRAV 1570
Cdd:COG4257    237 AVDGDGRVWFAESGANRIVRF 257
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1235-1510 6.48e-10


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 62.73  E-value: 6.48e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1235 NKLLAPVALAVGIDGSLYVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIYR 1312
Cdd:COG4257     56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1313 VkslsgtkDLAGNsEVVAGTgeqcLPFDEARcgdggkaidatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 1389
Cdd:COG4257    128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1390 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1467
Cdd:COG4257    182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2065208831 1468 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1510
Cdd:COG4257    224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1289-1600 2.41e-09


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.80  E-value: 2.41e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1289 YYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAgnsevvagTGEqclpFDEARCGDGGkaidatlmSPRGIAVDKNGL 1368
Cdd:COG4257     20 RDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS--------GPHGIAVDPDGN 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1369 MYFVDAT--MIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRIT-E 1442
Cdd:COG4257     72 LWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtdQGGNRIGRLDpA 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1443 NHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGEIcllagaasdcd 1521
Cdd:COG4257    132 TGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGTL----------- 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1522 ckndvncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPGEQELY--VFNAD 1599
Cdd:COG4257    179 -------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGGGARPYgvAVDGD 241

                   .
gi 2065208831 1600 G 1600
Cdd:COG4257    242 G 242
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2391-2467 2.46e-09

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 55.97  E-value: 2.46e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 2391 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2466
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   .
gi 2065208831 2467 S 2467
Cdd:TIGR03696   70 N 70
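Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be turned into typed fields, assuming the layout seen in this listing; the names `SUMMARY_RE` and `parse_summary` are hypothetical helpers and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional; some hits omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),  # float() accepts forms such as "0e+00"
    }


hit = parse_summary(
    "Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 55.97  E-value: 2.46e-09"
)
print(hit)
```

Parsing the E-value as a float makes it straightforward to sort or filter hits numerically rather than as text.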
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
843-873 3.60e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 51.36  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2065208831  843 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 873
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
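As a rough illustration of how the two rows of such an alignment can be compared, the sketch below counts identical residues over aligned columns for the short acid_disulf_rpt hit above. It assumes that "-" marks a gap and that lowercase letters mark unaligned residues, which is how these rows appear to be laid out; `aligned_identity` is a hypothetical helper, not an NCBI function.

```python
def aligned_identity(query, subject):
    """Return (identical, aligned) column counts for two equal-width alignment rows."""
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    aligned = identical = 0
    for q, s in zip(query, subject):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the acid_disulf_rpt (NF033662) hit above.
query = "AMETSCADNKDNEGDGLVDCLDPDCCLQSAC"
subject = "ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC"
identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} identical columns")
```

This is only a column-wise identity count; the bit score and E-value reported for each hit come from position-specific scoring and are not derived from it.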
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1291-1567 4.63e-08

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 56.83  E-value: 4.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1291 LAVDPvSGSLYVSDTNSRRIYRvkslsgtkdLAgnsevvAGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 1369
Cdd:cd14952     15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1370 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1447
Cdd:cd14952     66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1448 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVN 1527
Cdd:cd14952    122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTTQTVLPFTGLNSPSGVAVDTA 188
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2065208831 1528 CNCYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1567
Cdd:cd14952    189 GNVYVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
575-734 2.14e-07

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 55.01  E-value: 2.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  575 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 725
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 2065208831  726 ------------DPN---WMGPDC 734
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
644-790 2.81e-07


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 52.07  E-value: 2.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  644 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328    13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  714 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 787
Cdd:NF041328    75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139

                   ...
gi 2065208831  788 REG 790
Cdd:NF041328   140 RGG 142
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1291-1568 1.67e-06


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 54.09  E-value: 1.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1291 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1365
Cdd:PLN02919   573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1366 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1438
Cdd:PLN02919   635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1439 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1513
Cdd:PLN02919   700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2065208831 1514 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1568
Cdd:PLN02919   778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1240-1568 1.85e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 52.27  E-value: 1.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIyRVK 1314
Cdd:cd14957     20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1315 SLSGTKDLAgnsevVAGTGEQCLPFDEarcgdggkaidatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 1392
Cdd:cd14957     92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1393 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1469
Cdd:cd14957    151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1470 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncncySGDDAYA------TDAIL 1543
Cdd:cd14957    201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
                          330       340
                   ....*....|....*....|....*
gi 2065208831 1544 NSPSSLAVAPDGTIYIADLGNIRIR 1568
Cdd:cd14957    253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1355-1634 2.97e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 51.50  E-value: 2.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1355 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1432
Cdd:cd14957     17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1433 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1509
Cdd:cd14957     82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1510 icllagaasdcdckndvncNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1589
Cdd:cd14957    144 -------------------FSYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 2065208831 1590 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYSTDndvtelIDNNGN 1634
Cdd:cd14957    184 -----VFTSSGTFQYTFgSSGSGPGQFSDPYGIA------VDSDGN 218
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1237-1431 4.93e-06

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 50.67  E-value: 4.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1237 LLAPVALAVGIDGSLYVGDFNYIR--RIFPSRNVTSILElrnkeFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYRVK 1314
Cdd:cd14952     51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLP-----FTGLNDPTG---VAVDA-AGNVYVADTGNNRVLKLA 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1315 S------------LSGTKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKN 1366
Cdd:cd14952    122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2065208831 1367 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1431
Cdd:cd14952    189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1350-1570 1.10e-05


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 49.63  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1350 AIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTllgsndltavrplscdssmdVAQVRLEWPTDLAVNPmD 1426
Cdd:COG4257     11 PVPAPGSGPRDVAVDPDGAVWFTDQGggRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-D 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1427 NSLYVLE--NNVILRIT-ENHQVSIIAGrpmhcqvPGIDYSLSKLAIHSAlesasaiaishtGVLYITETDEKKINRLRq 1503
Cdd:COG4257     70 GNLWFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNPHGIAFDPD------------GNLWFTDQGGNRIGRLD- 129
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2065208831 1504 vTTNGEIcllagaasdcdckndvncncySGDDAYATDAilnSPSSLAVAPDGTIYIADLGNIRIRAV 1570
Cdd:COG4257    130 -PATGEV---------------------TEFPLPTGGA---GPYGIAVDPDGNLWVTDFGANAIGRI 171
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
170-349 1.75e-05

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 49.43  E-value: 1.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  170 PIPPTSSPSLLPSaqlPSSHNPPPVScQMPLLDSNTSHQIMDTNPDEEF----SPNSYLLRACSGPQQ---ASSSGPPNH 242
Cdd:pfam15279  115 PLISVASSSKLLA---PKPHEPPSLP-PPPLPPKKGRRHRPGLHPPLGRppgsPPMSMTPRGLLGKPQqhpPPSPLPAFM 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  243 HSQSTLRPPLPPPhnhtlsHHHSSANSlnrnSLTNRRSQIHAPAPAP-NDLATTPEsvqlqdswvlnsnvPLEtRHFLFK 321
Cdd:pfam15279  191 EPSSMPPPFLRPP------PSIPQPNS----PLSNPMLPGIGPPPKPpRNLGPPSN--------------PMH-RPPFSP 245
                          170       180
                   ....*....|....*....|....*...
gi 2065208831  322 TSSGSTPLFSSSSPGYPLTSGTVYTPPP 349
Cdd:pfam15279  246 HHPPPPPTPPGPPPGLPPPPPRGFTPPF 273
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1532-1573 1.94e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 49.45  E-value: 1.94e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 2065208831 1532 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1573
Cdd:cd14953     11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
NHL_like_4 cd14955
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1292-1567 3.38e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271325 [Multi-domain]  Cd Length: 279  Bit Score: 48.34  E-value: 3.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1292 AVDPvSGSLYVSDTNSRRIYRVKSlSGTkdlagnseVVAGTGeqclpfdeaRCGDGgkaiDATLMSPRGIAVDKNGLMYF 1371
Cdd:cd14955     69 AVDS-DGNVYVADTGNHRIQKFDS-TGT--------FLTKWG---------SSGSG----DGQFNSPSGIAVDSAGNVYV 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1372 VDAT--MIRKVDQNGIISTLLGSNDltavrplSCDSSMDvaqvrleWPTDLAVnpmDNS--LYVLEnnvilriTENHQV- 1446
Cdd:cd14955    126 TDSGnnRIQKFDSSGTFITKWGSFG-------SGDGQFN-------SPTGIAV---DSAgnVYVAD-------TGNNRIq 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1447 ------SIIAGRpmhcQVPGIDyslsklaiHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdc 1520
Cdd:cd14955    182 kftstgTFLTKW----GSEGSG--------DGQFNAPYGIAVDSAGNVYVADTGN---NRIQKFDSSGTFITKWGSEG-- 244
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 2065208831 1521 dckndvncncySGDDAYatdailNSPSSLAVAPDGTIYIADLGNIRI 1567
Cdd:cd14955    245 -----------SGDGQF------NSPSGIAVDSAGNVYVADSGNNRI 274
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1682-1718 6.44e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.20  E-value: 6.44e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2065208831 1682 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1718
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
667-785 2.19e-04

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 44.40  E-value: 2.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  667 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCelarvqCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTH 746
Cdd:pfam01500    4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC------CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQT 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2065208831  747 GVCIGGACRCEEGWTGAA----CDQRVCHPRCIEHGTCKDGKC 785
Cdd:pfam01500   76 SSCQTGCGGIGYGQEGSSgavsSRTRWCRPDCRVEGTCLPPCC 118
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1243-1403 2.72e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway.


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 45.27  E-value: 2.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1243 LAVGIDGSLYVGDFNY------IRRIFPSRNVTSILElrnkEFKHSNNpahkyyLAVDPVSGSLYVSDTNSRRIYRVkSL 1316
Cdd:COG3386     98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF-DL 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1317 SGTKDLaGNSEVVAgtgeqclpfdEARCGDGGkaidatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 1394
Cdd:COG3386    167 DADGTL-GNRRVFA----------DLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223

                   ....*....
gi 2065208831 1395 LTAVRPLSC 1403
Cdd:COG3386    224 LPERRPTNV 232
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
645-667 3.82e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 39.64  E-value: 3.82e-04
                           10        20
                   ....*....|....*....|....*
gi 2065208831  645 CGGHGSCID--GNCVCSAGYKGEHC 667
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
580-602 3.93e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 39.64  E-value: 3.93e-04
                           10        20
                   ....*....|....*....|....*
gi 2065208831  580 CHGNGECVS--GVCHCFPGFLGADC 602
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
739-820 4.36e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  739 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 808
Cdd:NF041328    45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                           90
                   ....*....|..
gi 2065208831  809 GCPDLCNGNGRC 820
Cdd:NF041328   115 ACVDLATDPLHC 126
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
710-734 7.66e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 7.66e-04
                           10        20
                   ....*....|....*....|....*
gi 2065208831  710 CSGHGTYLPDTGLCSCDPNWMGPDC 734
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1230-1394 8.95e-04

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 43.82  E-value: 8.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1230 GLAEGnKLLAPVALAVGIDGSLYVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNNPAHkyyLAVDpvS 1297
Cdd:cd14963     49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1298 GSLYVSDTNSRRIYrvkslsgtkdlagnseVVAGTGEQCLPFDEARCGDGgkaidaTLMSPRGIAVDKNGLMYFVDATMI 1377
Cdd:cd14963    112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
                          170       180
                   ....*....|....*....|
gi 2065208831 1378 R-KV-DQNG-IISTLLGSND 1394
Cdd:cd14963    170 RiQVfDKNGkFIKELNGSPD 189
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1240-1373 9.38e-04

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 43.74  E-value: 9.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1240 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNNPahkYYLAVDPvSGSLYVSDTNSRRIYRVKSLS 1317
Cdd:cd14952     96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVLP-----FTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKLAAGS 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1318 GTK----------------DLAG--------NSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 1369
Cdd:cd14952    167 TTQtvlpftglnspsgvavDTAGnvyvtdhgNNRVLklaAGSTTPTvLPFTG-------------LNGPLGVAVDAAGNV 233

                   ....
gi 2065208831 1370 YFVD 1373
Cdd:cd14952    234 YVAD 237
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
756-798 1.65e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 38.37  E-value: 1.65e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 2065208831  756 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 798
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
168-360 2.03e-03

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 43.58  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  168 GRPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDE---EFSPNSYLLRACSgpqqasssgppnHHS 244
Cdd:COG5099    202 FNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVEnniILNSSSSINELTS------------IYG 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  245 QSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNrRSQIHAPAPAPNDLATTPESVQLQDSwvLNSNVPLETRHFLFkTSS 324
Cdd:COG5099    270 SVPSIRNLRGLNSALVSFLNVSSSSLAFSALNG-KEVSPTGSPSTRSFARVLPKSSPNNL--LTEILTTGVNPPQS-LPS 345
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2065208831  325 GSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 360
Cdd:COG5099    346 LLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSL 381
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
671-700 2.58e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.58e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2065208831  671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054      4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1682-1724 3.53e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 37.57  E-value: 3.53e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2065208831 1682 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1724
Cdd:TIGR01643    1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
638-668 4.44e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.85  E-value: 4.44e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2065208831  638 NQCIDPS-CGGHGSCIDG----NCVCSAGYKGEHCE 668
Cdd:cd00054      3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1230-1373 4.89e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 41.51  E-value: 4.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1230 GLAEGNkLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHS--NNPAHkyyLAVDPvSGSLYVSDTNS 1307
Cdd:cd14963    141 GSEPGE-LSYPNGIAVDEDGNIYVADSGNGRIQVFDKNGKFIKELNGSPDGKSgfVNPRG---IAVDP-DGNLYVVDNLS 215
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2065208831 1308 RRIYrVKSLSGTKDLagnseVVAGTGEqclpfdearcgdggkaIDATLMSPRGIAVDKNGLMYFVD 1373
Cdd:cd14963    216 HRVY-VFDEQGKELF-----TFGGRGK----------------DDGQFNLPNGLFIDDDGRLYVTD 259
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
583-755 7.81e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 39.36  E-value: 7.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  583 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 656
Cdd:NF041328    29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831  657 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 734
Cdd:NF041328    89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
                          170       180
                   ....*....|....*....|.
gi 2065208831  735 sveVCSVDCGTHGVCIGGACR 755
Cdd:NF041328   128 ---ACGVACDPGESCRGGACT 145
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1485-1570 8.33e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized. The NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 41.02  E-value: 8.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2065208831 1485 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncncySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1559
Cdd:cd14951    206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
                           90
                   ....*....|.
gi 2065208831 1560 ADLGNIRIRAV 1570
Cdd:cd14951    264 ADTNNHRIRRL 274
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
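
The per-hit summary lines in this listing ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") can be checked against the E-value threshold above with a short script. The following is a minimal sketch, not part of any NCBI tool: the names `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical, the line layout is assumed to match the text shown on this page, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for the summary lines as they appear in this listing.
# The "[Multi-domain]" tag is optional (for example, the EGF_2 hits lack it).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is <= the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    # Two summary lines copied from the hits listed on this page.
    lines = [
        "Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 49.45  E-value: 1.94e-05",
        "Pssm-ID: 400365  Cd Length: 26  Bit Score: 39.64  E-value: 3.82e-04",
    ]
    for line in lines:
        hit = parse_summary(line)
        print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

Lines that do not match the assumed layout return None rather than raising, so the same function can be run over the whole page text line by line.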

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.