NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|530381130|ref|XP_005266009|]
View 

teneurin-2 isoform X23 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
844-1174 5.08e-48

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 175.41  E-value: 5.08e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 917
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  918 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 995
Cdd:cd14953   104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  996 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1073
Cdd:cd14953   170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1153
Cdd:cd14953   237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                         330       340
                  ....*....|....*....|.
gi 530381130 1154 AVAPDGTIYIADLGNIRIRAV 1174
Cdd:cd14953   303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2294-2371 3.60e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 3.60e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530381130  2294 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2371
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1135-2071 1.04e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 132.96  E-value: 1.04e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1135 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1214
Cdd:COG3209   107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1215 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1293
Cdd:COG3209   187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1294 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1373
Cdd:COG3209   267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1374 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1453
Cdd:COG3209   347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1454 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1533
Cdd:COG3209   427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1534 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 1605
Cdd:COG3209   507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1606 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 1685
Cdd:COG3209   587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1686 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 1765
Cdd:COG3209   667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1766 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 1839
Cdd:COG3209   744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1840 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 1919
Cdd:COG3209   824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1920 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 1999
Cdd:COG3209   893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530381130 2000 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2071
Cdd:COG3209   953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
447-477 3.31e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


:

Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 3.31e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 530381130  447 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 477
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 super family cl44670
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
179-338 5.36e-08

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


The actual alignment was detected with superfamily member pfam19232:

Pssm-ID: 437064  Cd Length: 265  Bit Score: 56.55  E-value: 5.36e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   179 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 228
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   229 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 277
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   278 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 329
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 530381130   330 ------------DPN---WMGPDC 338
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 super family cl49352
MXAN_6577-like cysteine-rich domain;
343-424 4.50e-04

MXAN_6577-like cysteine-rich domain;


The actual alignment was detected with superfamily member NF041328:

Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  343 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 412
Cdd:NF041328   45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                          90
                  ....*....|..
gi 530381130  413 GCPDLCNGNGRC 424
Cdd:NF041328  115 ACVDLATDPLHC 126
 
Name Accession Description Interval E-value
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
844-1174 5.08e-48

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 175.41  E-value: 5.08e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 917
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  918 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 995
Cdd:cd14953   104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  996 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1073
Cdd:cd14953   170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1153
Cdd:cd14953   237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                         330       340
                  ....*....|....*....|.
gi 530381130 1154 AVAPDGTIYIADLGNIRIRAV 1174
Cdd:cd14953   303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2294-2371 3.60e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 3.60e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530381130  2294 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2371
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1135-2071 1.04e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 132.96  E-value: 1.04e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1135 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1214
Cdd:COG3209   107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1215 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1293
Cdd:COG3209   187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1294 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1373
Cdd:COG3209   267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1374 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1453
Cdd:COG3209   347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1454 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1533
Cdd:COG3209   427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1534 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 1605
Cdd:COG3209   507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1606 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 1685
Cdd:COG3209   587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1686 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 1765
Cdd:COG3209   667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1766 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 1839
Cdd:COG3209   744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1840 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 1919
Cdd:COG3209   824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1920 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 1999
Cdd:COG3209   893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530381130 2000 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2071
Cdd:COG3209   953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
844-1174 1.52e-12

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 70.43  E-value: 1.52e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 919
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  920 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 996
Cdd:COG4257    86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  997 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1073
Cdd:COG4257   139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1153
Cdd:COG4257   184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                         330       340
                  ....*....|....*....|.
gi 530381130 1154 AVAPDGTIYIADLGNIRIRAV 1174
Cdd:COG4257   237 AVDGDGRVWFAESGANRIVRF 257
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1995-2071 2.35e-09

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 55.97  E-value: 2.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  1995 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2070
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   .
gi 530381130  2071 S 2071
Cdd:TIGR03696   70 N 70
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
447-477 3.31e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 3.31e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 530381130  447 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 477
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
179-338 5.36e-08

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 56.55  E-value: 5.36e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   179 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 228
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   229 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 277
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   278 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 329
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 530381130   330 ------------DPN---WMGPDC 338
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
248-394 3.17e-07

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 51.68  E-value: 3.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  248 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 317
Cdd:NF041328   13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  318 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 391
Cdd:NF041328   75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139

                  ...
gi 530381130  392 REG 394
Cdd:NF041328  140 RGG 142
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
895-1172 1.41e-06

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 54.09  E-value: 1.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  895 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 969
Cdd:PLN02919  573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  970 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1042
Cdd:PLN02919  635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1043 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1117
Cdd:PLN02919  700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 530381130 1118 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1172
Cdd:PLN02919  778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1286-1322 6.46e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 41.82  E-value: 6.46e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530381130  1286 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1322
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
343-424 4.50e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  343 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 412
Cdd:NF041328   45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                          90
                  ....*....|..
gi 530381130  413 GCPDLCNGNGRC 424
Cdd:NF041328  115 ACVDLATDPLHC 126
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
360-402 1.41e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 38.37  E-value: 1.41e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 530381130   360 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 402
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
275-304 2.40e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.40e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 530381130  275 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 304
Cdd:cd00054     4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
187-359 8.37e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 38.97  E-value: 8.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  187 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 260
Cdd:NF041328   29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  261 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 338
Cdd:NF041328   89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
                         170       180
                  ....*....|....*....|.
gi 530381130  339 sveVCSVDCGTHGVCIGGACR 359
Cdd:NF041328  128 ---ACGVACDPGESCRGGACT 145
 
Name Accession Description Interval E-value
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
844-1174 5.08e-48

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 175.41  E-value: 5.08e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 917
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  918 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 995
Cdd:cd14953   104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  996 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1073
Cdd:cd14953   170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1153
Cdd:cd14953   237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                         330       340
                  ....*....|....*....|.
gi 530381130 1154 AVAPDGTIYIADLGNIRIRAV 1174
Cdd:cd14953   303 AVDAAGNLYVADTGNNRIRKI 323
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
895-1175 2.96e-40

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 153.07  E-value: 2.96e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  895 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGEqclpfdEARCGDGGKAidATLMSPRGIAVDKNGLMY 974
Cdd:cd14953    28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  975 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1050
Cdd:cd14953    92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1051 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1130
Cdd:cd14953   165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 530381130 1131 ncncYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1175
Cdd:cd14953   229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2294-2371 3.60e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.05  E-value: 3.60e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530381130  2294 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2371
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
932-1175 9.02e-32

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 128.03  E-value: 9.02e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  932 VVAGTGeqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1004
Cdd:cd14953     3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1005 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1082
Cdd:cd14953    72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1083 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdcdckndvncNCYSGDDAyATDAILNSPSSLAVAPDGTIY 1162
Cdd:cd14953   136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGG----------AGYAGDGP-ATAAQFNNPTGVAVDAAGNLY 201
                         250
                  ....*....|...
gi 530381130 1163 IADLGNIRIRAVS 1175
Cdd:cd14953   202 VADRGNHRIRKIT 214
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1135-2071 1.04e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 132.96  E-value: 1.04e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1135 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1214
Cdd:COG3209   107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1215 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1293
Cdd:COG3209   187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1294 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1373
Cdd:COG3209   267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1374 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1453
Cdd:COG3209   347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1454 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1533
Cdd:COG3209   427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1534 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 1605
Cdd:COG3209   507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1606 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 1685
Cdd:COG3209   587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1686 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 1765
Cdd:COG3209   667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1766 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 1839
Cdd:COG3209   744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1840 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 1919
Cdd:COG3209   824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1920 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 1999
Cdd:COG3209   893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530381130 2000 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2071
Cdd:COG3209   953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
814-984 3.82e-18

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 87.97  E-value: 3.82e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  814 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 886
Cdd:cd14953   163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  887 ---NNPahkYYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeQCLPfdearcGDGGKAIDATLMSPR 963
Cdd:cd14953   239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
                         170       180
                  ....*....|....*....|...
gi 530381130  964 GIAVDKNGLMYFVDAT--MIRKV 984
Cdd:cd14953   301 GVAVDAAGNLYVADTGnnRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
895-1193 1.41e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 85.06  E-value: 1.41e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  895 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMY 974
Cdd:cd05819    13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  975 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1050
Cdd:cd05819    70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1051 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1130
Cdd:cd05819   135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530381130 1131 ncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1193
Cdd:cd05819   187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
839-1172 4.75e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 83.52  E-value: 4.75e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  839 NKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYR 916
Cdd:cd05819     5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  917 VkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 994
Cdd:cd05819    81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  995 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1072
Cdd:cd05819   139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1073 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncncysgdDAYATDAILNSPSS 1152
Cdd:cd05819   194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
                         330       340
                  ....*....|....*....|
gi 530381130 1153 LAVAPDGTIYIADLGNIRIR 1172
Cdd:cd05819   248 LAVDSDGNLYVADTGNNRIQ 267
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
833-1044 1.02e-15

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 79.67  E-value: 1.02e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  833 NGLAEGNkLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPahkYYLAVDPvSGSLYVSDTN 910
Cdd:cd05819    94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFGSGGSGPGQFNGP---TGVAVDS-DGNIYVADTG 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  911 SRRIYRVKSlsgtkdlagNSEVVAGTGEQCLPfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNG 988
Cdd:cd05819   169 NHRIQVFDP---------DGNFLTTFGSTGTG-------------PGQFNYPTGIAVDSDGNIYVADSGNnrVQVFDPDG 226
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 530381130  989 IISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1044
Cdd:cd05819   227 AGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
840-1105 1.39e-14

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 76.20  E-value: 1.39e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  840 KLLAPVALAVGIDGSLYVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRI 914
Cdd:cd05819    53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  915 YRVkslsgtkDLAGNSEVVAGTGEQClpfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 992
Cdd:cd05819   126 QKF-------DPDGEFLTTFGSGGSG---------------PGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  993 LLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1070
Cdd:cd05819   184 TFGST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 530381130 1071 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1105
Cdd:cd05819   235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
956-1187 1.54e-14

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 76.20  E-value: 1.54e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  956 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1033
Cdd:cd05819     4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1034 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1111
Cdd:cd05819    69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530381130 1112 GEICLLAGAASDCDCKndvncncysgddayatdaiLNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1187
Cdd:cd05819   132 GEFLTTFGSGGSGPGQ-------------------FNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
844-1174 1.52e-12

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 70.43  E-value: 1.52e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 919
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  920 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 996
Cdd:COG4257    86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  997 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1073
Cdd:COG4257   139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1153
Cdd:COG4257   184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                         330       340
                  ....*....|....*....|.
gi 530381130 1154 AVAPDGTIYIADLGNIRIRAV 1174
Cdd:COG4257   237 AVDGDGRVWFAESGANRIVRF 257
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
839-1114 7.30e-10

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 62.34  E-value: 7.30e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  839 NKLLAPVALAVGIDGSLYVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIYR 916
Cdd:COG4257    56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  917 VkslsgtkDLAGNsEVVAGTgeqcLPFDEARcgdggkaidatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 993
Cdd:COG4257   128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  994 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1071
Cdd:COG4257   182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 530381130 1072 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1114
Cdd:COG4257   224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1995-2071 2.35e-09

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 55.97  E-value: 2.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  1995 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2070
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   .
gi 530381130  2071 S 2071
Cdd:TIGR03696   70 N 70
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
893-1204 2.98e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.42  E-value: 2.98e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  893 YYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAgnsevvagTGEqclpFDEARCGDGGkaidatlmSPRGIAVDKNGL 972
Cdd:COG4257    20 RDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS--------GPHGIAVDPDGN 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  973 MYFVDAT--MIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRIT-E 1046
Cdd:COG4257    72 LWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtdQGGNRIGRLDpA 131
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1047 NHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGEIcllagaasdcd 1125
Cdd:COG4257   132 TGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGTL----------- 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1126 ckndvncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPGEQELY--VFNAD 1203
Cdd:COG4257   179 -------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGGGARPYgvAVDGD 241

                  .
gi 530381130 1204 G 1204
Cdd:COG4257   242 G 242
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
447-477 3.31e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 3.31e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 530381130  447 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 477
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
895-1171 4.59e-08

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 56.45  E-value: 4.59e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  895 LAVDPvSGSLYVSDTNSRRIYRvkslsgtkdLAgnsevvAGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 973
Cdd:cd14952    15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  974 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1051
Cdd:cd14952    66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1052 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVN 1131
Cdd:cd14952   122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTTQTVLPFTGLNSPSGVAVDTA 188
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 530381130 1132 CNCYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1171
Cdd:cd14952   189 GNVYVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
179-338 5.36e-08

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 56.55  E-value: 5.36e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   179 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 228
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   229 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 277
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   278 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 329
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 530381130   330 ------------DPN---WMGPDC 338
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
248-394 3.17e-07

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 51.68  E-value: 3.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  248 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 317
Cdd:NF041328   13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  318 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 391
Cdd:NF041328   75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139

                  ...
gi 530381130  392 REG 394
Cdd:NF041328  140 RGG 142
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
895-1172 1.41e-06

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 54.09  E-value: 1.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  895 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 969
Cdd:PLN02919  573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  970 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1042
Cdd:PLN02919  635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1043 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1117
Cdd:PLN02919  700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 530381130 1118 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1172
Cdd:PLN02919  778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
844-977 2.76e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 51.50  E-value: 2.76e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGD-FNYIRRIFPSRNVTSIlelrnkEFKH-SNNPAHKYYL---AVDPvSGSLYVSDTNSRRIyRVK 918
Cdd:cd14957   114 PYGIAVDSNGNIYVADtGNHRIQVFTSSGTFSY------SIGSgGTGPGQFNGPqgiAVDS-DGNIYVADTGNHRI-QVF 185
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 530381130  919 SLSGTKDLAgnsevVAGTGEqclpfdearcGDGGkaidatLMSPRGIAVDKNGLMYFVD 977
Cdd:cd14957   186 TSSGTFQYT-----FGSSGS----------GPGQ------FSDPYGIAVDSDGNIYVAD 223
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
844-1172 3.59e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 51.11  E-value: 3.59e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIyRVK 918
Cdd:cd14957    20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  919 SLSGTKDLAgnsevVAGTGEQCLPFDEarcgdggkaidatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 996
Cdd:cd14957    92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  997 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1073
Cdd:cd14957   151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1074 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncncySGDDAYA------TDAIL 1147
Cdd:cd14957   201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
                         330       340
                  ....*....|....*....|....*
gi 530381130 1148 NSPSSLAVAPDGTIYIADLGNIRIR 1172
Cdd:cd14957   253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
959-1238 4.49e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 50.73  E-value: 4.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  959 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1036
Cdd:cd14957    17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1037 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1113
Cdd:cd14957    82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1114 icllagaasdcdckndvncNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1193
Cdd:cd14957   144 -------------------FSYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 530381130 1194 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYSTDndvtelIDNNGN 1238
Cdd:cd14957   184 -----VFTSSGTFQYTFgSSGSGPGQFSDPYGIA------VDSDGN 218
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
841-1035 5.03e-06

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 50.28  E-value: 5.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  841 LLAPVALAVGIDGSLYVGDFNYIR--RIFPSRNVTSILElrnkeFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYRVK 918
Cdd:cd14952    51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLP-----FTGLNDPTG---VAVDA-AGNVYVADTGNNRVLKLA 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  919 S------------LSGTKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKN 970
Cdd:cd14952   122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530381130  971 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1035
Cdd:cd14952   189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
954-1174 1.20e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 49.25  E-value: 1.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  954 AIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTllgsndltavrplscdssmdVAQVRLEWPTDLAVNPmD 1030
Cdd:COG4257    11 PVPAPGSGPRDVAVDPDGAVWFTDQGggRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-D 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1031 NSLYVLE--NNVILRIT-ENHQVSIIAGrpmhcqvPGIDYSLSKLAIHSAlesasaiaishtGVLYITETDEKKINRLRq 1107
Cdd:COG4257    70 GNLWFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNPHGIAFDPD------------GNLWFTDQGGNRIGRLD- 129
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530381130 1108 vTTNGEIcllagaasdcdckndvncncySGDDAYATDAilnSPSSLAVAPDGTIYIADLGNIRIRAV 1174
Cdd:COG4257   130 -PATGEV---------------------TEFPLPTGGA---GPYGIAVDPDGNLWVTDFGANAIGRI 171
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1136-1177 2.10e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 49.07  E-value: 2.10e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 530381130 1136 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1177
Cdd:cd14953    11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
NHL_like_4 cd14955
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
896-1171 6.31e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271325 [Multi-domain]  Cd Length: 279  Bit Score: 47.19  E-value: 6.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  896 AVDPvSGSLYVSDTNSRRIYRVKSlSGTkdlagnseVVAGTGeqclpfdeaRCGDGgkaiDATLMSPRGIAVDKNGLMYF 975
Cdd:cd14955    69 AVDS-DGNVYVADTGNHRIQKFDS-TGT--------FLTKWG---------SSGSG----DGQFNSPSGIAVDSAGNVYV 125
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  976 VDAT--MIRKVDQNGIISTLLGSNDltavrplSCDSSMDvaqvrleWPTDLAVnpmDNS--LYVLEnnvilriTENHQV- 1050
Cdd:cd14955   126 TDSGnnRIQKFDSSGTFITKWGSFG-------SGDGQFN-------SPTGIAV---DSAgnVYVAD-------TGNNRIq 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1051 ------SIIAGRpmhcQVPGIDyslsklaiHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdc 1124
Cdd:cd14955   182 kftstgTFLTKW----GSEGSG--------DGQFNAPYGIAVDSAGNVYVADTGN---NRIQKFDSSGTFITKWGSEG-- 244
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 530381130 1125 dckndvncncySGDDAYatdailNSPSSLAVAPDGTIYIADLGNIRI 1171
Cdd:cd14955   245 -----------SGDGQF------NSPSGIAVDSAGNVYVADSGNNRI 274
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1286-1322 6.46e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 41.82  E-value: 6.46e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530381130  1286 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1322
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
271-389 7.58e-05

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 45.55  E-value: 7.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130   271 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCelarvqCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTH 350
Cdd:pfam01500    4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC------CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQT 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 530381130   351 GVCIGGACRCEEGWTGAA----CDQRVCHPRCIEHGTCKDGKC 389
Cdd:pfam01500   76 SSCQTGCGGIGYGQEGSSgavsSRTRWCRPDCRVEGTCLPPCC 118
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
847-1007 3.31e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 44.88  E-value: 3.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  847 LAVGIDGSLYVGDFNYIR------RIFPSRNVTSILElrnkEFKHSNNpahkyyLAVDPVSGSLYVSDTNSRRIYRVkSL 920
Cdd:COG3386    98 GVVDPDGRLYFTDMGEYLptgalyRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF-DL 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  921 SGTKDLaGNSEVVAgtgeqclpfdEARCGDGGkaidatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 998
Cdd:COG3386   167 DADGTL-GNRRVFA----------DLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223

                  ....*....
gi 530381130  999 LTAVRPLSC 1007
Cdd:COG3386   224 LPERRPTNV 232
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
249-271 3.83e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 39.64  E-value: 3.83e-04
                           10        20
                   ....*....|....*....|....*
gi 530381130   249 CGGHGSCID--GNCVCSAGYKGEHC 271
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
184-206 4.06e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 39.64  E-value: 4.06e-04
                           10        20
                   ....*....|....*....|....*
gi 530381130   184 CHGNGECVS--GVCHCFPGFLGADC 206
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
343-424 4.50e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 4.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  343 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 412
Cdd:NF041328   45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
                          90
                  ....*....|..
gi 530381130  413 GCPDLCNGNGRC 424
Cdd:NF041328  115 ACVDLATDPLHC 126
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
314-338 7.69e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 7.69e-04
                           10        20
                   ....*....|....*....|....*
gi 530381130   314 CSGHGTYLPDTGLCSCDPNWMGPDC 338
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
844-977 9.05e-04

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 43.35  E-value: 9.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  844 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNNPahkYYLAVDPvSGSLYVSDTNSRRIYRVKSLS 921
Cdd:cd14952    96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVLP-----FTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKLAAGS 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  922 GTK----------------DLAG--------NSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 973
Cdd:cd14952   167 TTQtvlpftglnspsgvavDTAGnvyvtdhgNNRVLklaAGSTTPTvLPFTG-------------LNGPLGVAVDAAGNV 233

                  ....
gi 530381130  974 YFVD 977
Cdd:cd14952   234 YVAD 237
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
360-402 1.41e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 38.37  E-value: 1.41e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 530381130   360 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 402
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
834-998 1.56e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 42.66  E-value: 1.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  834 GLAEGnKLLAPVALAVGIDGSLYVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNNPAHkyyLAVDpvS 901
Cdd:cd14963    49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  902 GSLYVSDTNSRRIYrvkslsgtkdlagnseVVAGTGEQCLPFDEARCGDGgkaidaTLMSPRGIAVDKNGLMYFVDATMI 981
Cdd:cd14963   112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
                         170       180
                  ....*....|....*....|
gi 530381130  982 R-KV-DQNG-IISTLLGSND 998
Cdd:cd14963   170 RiQVfDKNGkFIKELNGSPD 189
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
275-304 2.40e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.40e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 530381130  275 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 304
Cdd:cd00054     4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1286-1328 3.37e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 37.18  E-value: 3.37e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 530381130  1286 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1328
Cdd:TIGR01643    1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
242-272 4.16e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.85  E-value: 4.16e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 530381130  242 NQCIDPS-CGGHGSCIDG----NCVCSAGYKGEHCE 272
Cdd:cd00054     3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1089-1174 7.08e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 41.02  E-value: 7.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130 1089 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncncySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1163
Cdd:cd14951   206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
                          90
                  ....*....|.
gi 530381130 1164 ADLGNIRIRAV 1174
Cdd:cd14951   264 ADTNNHRIRRL 274
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
834-977 7.48e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 40.74  E-value: 7.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  834 GLAEGNkLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHS--NNPAHkyyLAVDPvSGSLYVSDTNS 911
Cdd:cd14963   141 GSEPGE-LSYPNGIAVDEDGNIYVADSGNGRIQVFDKNGKFIKELNGSPDGKSgfVNPRG---IAVDP-DGNLYVVDNLS 215
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530381130  912 RRIYrVKSLSGTKDLagnseVVAGTGEqclpfdearcgdggkaIDATLMSPRGIAVDKNGLMYFVD 977
Cdd:cd14963   216 HRVY-VFDEQGKELF-----TFGGRGK----------------DDGQFNLPNGLFIDDDGRLYVTD 259
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
187-359 8.37e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 38.97  E-value: 8.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  187 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 260
Cdd:NF041328   29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530381130  261 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 338
Cdd:NF041328   89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
                         170       180
                  ....*....|....*....|.
gi 530381130  339 sveVCSVDCGTHGVCIGGACR 359
Cdd:NF041328  128 ---ACGVACDPGESCRGGACT 145
I-EGF_1 pfam18372
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ...
215-232 8.69e-03

Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveal an epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains, this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end.


Pssm-ID: 465729  Cd Length: 29  Bit Score: 35.93  E-value: 8.69e-03
                           10
                   ....*....|....*...
gi 530381130   215 CSGNGQYSKGTCQCYSGW 232
Cdd:pfam18372   12 CSGNGTFVCGVCVCNPGY 29
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH