|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
4-212 |
3.56e-112 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 361.99 E-value: 3.56e-112
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 4 QNGRPIPPTSSPSLlPSAQLPSshnPPP--VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHH 81
Cdd:pfam06484 158 ENGPPIPPSSSSSS-PVEQHSP---PPPslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQ 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 82 SQSTLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLF 158
Cdd:pfam06484 234 NHSRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLF 313
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 2217356498 159 KTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 212
Cdd:pfam06484 314 KTGTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1069-1392 |
9.49e-48 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 174.64 E-value: 9.49e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSIL---------------ELrNNPahkYYLAVDPvSGSLYVSDTNSRR 1131
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAgtgtagfadgggaaaQF-NTP---SGVAVDA-AGNLYVADTGNHR 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1132 IYRVkslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIS 1209
Cdd:cd14953 100 IRKI-------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVT 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1210 TLLGsndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSL 1287
Cdd:cd14953 166 TVAG----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSG 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1288 SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNS 1367
Cdd:cd14953 233 DGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNN 298
|
330 340
....*....|....*....|....*
gi 2217356498 1368 PSSLAVAPDGTIYIADLGNIRIRAV 1392
Cdd:cd14953 299 PTGVAVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2512-2589 |
3.93e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 3.93e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217356498 2512 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2589
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1353-2289 |
1.40e-31 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 136.04 E-value: 1.40e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1353 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1432
Cdd:COG3209 107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1433 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1511
Cdd:COG3209 187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1512 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1591
Cdd:COG3209 267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1592 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1671
Cdd:COG3209 347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1672 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1751
Cdd:COG3209 427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1752 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 1823
Cdd:COG3209 507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1824 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 1903
Cdd:COG3209 587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1904 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 1983
Cdd:COG3209 667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1984 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 2057
Cdd:COG3209 744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2058 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 2137
Cdd:COG3209 824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2138 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 2217
Cdd:COG3209 893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217356498 2218 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2289
Cdd:COG3209 953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1069-1392 |
1.27e-13 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 73.52 E-value: 1.27e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDF--NYIRRIFP-SRNVTSILELRNNPAHKyyLAVDPvSGSLYVSDTNSRRIYRVkslsGTKDla 1145
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFTEYPLGGGSGPHG--IAVDP-DGNLWFTDNGNNRIGRI----DPKT-- 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1146 GNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgsndltavrP 1222
Cdd:COG4257 90 GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF----------P 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1223 LSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslsklaiHSALESA 1299
Cdd:COG4257 140 LPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL-------------------PTPGAGP 190
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1300 SAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSLAVAPDGTI 1379
Cdd:COG4257 191 RGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGVAVDGDGRV 244
|
330
....*....|...
gi 2217356498 1380 YIADLGNIRIRAV 1392
Cdd:COG4257 245 WFAESGANRIVRF 257
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2213-2289 |
2.47e-09 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 55.97 E-value: 2.47e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2213 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2288
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
.
gi 2217356498 2289 S 2289
Cdd:TIGR03696 70 N 70
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
672-702 |
2.54e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.75 E-value: 2.54e-08
10 20 30
....*....|....*....|....*....|.
gi 2217356498 672 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 702
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
413-572 |
1.82e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 55.01 E-value: 1.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 413 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 462
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 463 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 511
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 512 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 563
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 2217356498 564 ------------DPN---WMGPDC 572
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
482-628 |
1.95e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 52.45 E-value: 1.95e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 482 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 551
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 552 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 625
Cdd:NF041328 75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 2217356498 626 REG 628
Cdd:NF041328 140 RGG 142
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1113-1390 |
1.55e-06 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 54.09 E-value: 1.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1113 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1187
Cdd:PLN02919 573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1188 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1260
Cdd:PLN02919 635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1261 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1335
Cdd:PLN02919 700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2217356498 1336 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1390
Cdd:PLN02919 778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1504-1540 |
4.81e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.59 E-value: 4.81e-05
10 20 30
....*....|....*....|....*....|....*..
gi 2217356498 1504 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1540
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
594-636 |
1.40e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.76 E-value: 1.40e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 2217356498 594 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 636
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
6-198 |
1.86e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 43.58 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 6 GRPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDE---EFSPNSYLLRACSgpqqasssgppnHHS 82
Cdd:COG5099 202 FNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVEnniILNSSSSINELTS------------IYG 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 83 QSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNrRSQIHAPAPAPNDLATTPESVQLQDSwvLNSNVPLETRHFLFkTSS 162
Cdd:COG5099 270 SVPSIRNLRGLNSALVSFLNVSSSSLAFSALNG-KEVSPTGSPSTRSFARVLPKSSPNNL--LTEILTTGVNPPQS-LPS 345
|
170 180 190
....*....|....*....|....*....|....*.
gi 2217356498 163 GSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 198
Cdd:COG5099 346 LLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSL 381
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
509-538 |
2.15e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 38.00 E-value: 2.15e-03
10 20 30
....*....|....*....|....*....|....*
gi 2217356498 509 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 538
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
577-649 |
3.07e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.51 E-value: 3.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 577 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 644
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 2217356498 645 GNGRC 649
Cdd:NF041328 122 DPLHC 126
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
636-669 |
3.41e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.41e-03
10 20 30
....*....|....*....|....*....|....*.
gi 2217356498 636 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 669
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
421-593 |
5.64e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 39.74 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 421 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 494
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 495 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 572
Cdd:NF041328 89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
|
170 180
....*....|....*....|.
gi 2217356498 573 sveVCSVDCGTHGVCIGGACR 593
Cdd:NF041328 128 ---ACGVACDPGESCRGGACT 145
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
4-212 |
3.56e-112 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 361.99 E-value: 3.56e-112
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 4 QNGRPIPPTSSPSLlPSAQLPSshnPPP--VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHH 81
Cdd:pfam06484 158 ENGPPIPPSSSSSS-PVEQHSP---PPPslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQ 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 82 SQSTLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLF 158
Cdd:pfam06484 234 NHSRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLF 313
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 2217356498 159 KTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 212
Cdd:pfam06484 314 KTGTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1069-1392 |
9.49e-48 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 174.64 E-value: 9.49e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSIL---------------ELrNNPahkYYLAVDPvSGSLYVSDTNSRR 1131
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAgtgtagfadgggaaaQF-NTP---SGVAVDA-AGNLYVADTGNHR 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1132 IYRVkslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIS 1209
Cdd:cd14953 100 IRKI-------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVT 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1210 TLLGsndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSL 1287
Cdd:cd14953 166 TVAG----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSG 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1288 SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNS 1367
Cdd:cd14953 233 DGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNN 298
|
330 340
....*....|....*....|....*
gi 2217356498 1368 PSSLAVAPDGTIYIADLGNIRIRAV 1392
Cdd:cd14953 299 PTGVAVDAAGNLYVADTGNNRIRKI 323
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1113-1393 |
9.43e-41 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 154.61 E-value: 9.43e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1113 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGEqclpfdEARCGDGGKAidATLMSPRGIAVDKNGLMY 1192
Cdd:cd14953 28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1193 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1268
Cdd:cd14953 92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1269 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1348
Cdd:cd14953 165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2217356498 1349 ncncYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1393
Cdd:cd14953 229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2512-2589 |
3.93e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 3.93e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217356498 2512 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2589
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1150-1393 |
3.22e-32 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 129.57 E-value: 3.22e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1150 VVAGTGeqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1222
Cdd:cd14953 3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1223 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1300
Cdd:cd14953 72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1301 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdcdckndvncNCYSGDDAyATDAILNSPSSLAVAPDGTIY 1380
Cdd:cd14953 136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGG----------AGYAGDGP-ATAAQFNNPTGVAVDAAGNLY 201
|
250
....*....|...
gi 2217356498 1381 IADLGNIRIRAVS 1393
Cdd:cd14953 202 VADRGNHRIRKIT 214
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1353-2289 |
1.40e-31 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 136.04 E-value: 1.40e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1353 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNADGIHQYTVSLVT 1432
Cdd:COG3209 107 GLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGT 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1433 GEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMT-YDGNTGLL 1511
Cdd:COG3209 187 GAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATtLGGTTGAG 266
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1512 ATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNSYQ 1591
Cdd:COG3209 267 TGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTT 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1592 LCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSI 1671
Cdd:COG3209 347 TTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGAL 426
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1672 DYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADG 1751
Cdd:COG3209 427 TAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL 506
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1752 KVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYS 1823
Cdd:COG3209 507 GGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTG 586
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1824 DDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSE 1903
Cdd:COG3209 587 GTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTG 666
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1904 EGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGRIK 1983
Cdd:COG3209 667 TGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGT 743
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1984 EVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH-----LL 2057
Cdd:COG3209 744 LTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitvGS 823
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2058 NPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKTNL 2137
Cdd:COG3209 824 GGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RTDG 892
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2138 GHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTAYG 2217
Cdd:COG3209 893 GTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFG 952
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217356498 2218 EIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2289
Cdd:COG3209 953 NLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1039-1202 |
1.24e-18 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 89.51 E-value: 1.24e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1039 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELR------------ 1104
Cdd:cd14953 163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGtagfsgdggata 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1105 ---NNPahkYYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeQCLPfdearcGDGGKAIDATLMSPR 1181
Cdd:cd14953 239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
|
170 180
....*....|....*....|...
gi 2217356498 1182 GIAVDKNGLMYFVDAT--MIRKV 1202
Cdd:cd14953 301 GVAVDAAGNLYVADTGnnRIRKI 323
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1113-1411 |
5.16e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 86.60 E-value: 5.16e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1113 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMY 1192
Cdd:cd05819 13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1193 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1268
Cdd:cd05819 70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1269 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1348
Cdd:cd05819 135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217356498 1349 ncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1411
Cdd:cd05819 187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1064-1390 |
7.10e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 86.22 E-value: 7.10e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1064 NKLLAPVALAVGIDGSLYVGDFNYIR-RIFPSRN--VTSILELRNNPAHKYY---LAVDPvSGSLYVSDTNSRRIYRVks 1137
Cdd:cd05819 5 GELNNPQGIAVDSSGNIYVADTGNNRiQVFDPDGnfITSFGSFGSGDGQFNEpagVAVDS-DGNLYVADTGNHRIQKF-- 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1138 lsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLGSN 1215
Cdd:cd05819 82 -----DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTFGSG 141
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1216 dltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslsklaih 1293
Cdd:cd05819 142 ------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG----------- 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1294 sALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncncysgdDAYATDAILNSPSSLAV 1373
Cdd:cd05819 194 -QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSGLAV 250
|
330
....*....|....*..
gi 2217356498 1374 APDGTIYIADLGNIRIR 1390
Cdd:cd05819 251 DSDGNLYVADTGNNRIQ 267
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1058-1262 |
8.97e-16 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 80.06 E-value: 8.97e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1058 NGLAEGNkLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSIL--------ELrNNPahkYYLAVDPvSGSLYVSDT 1127
Cdd:cd05819 94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsggsgpgQF-NGP---TGVAVDS-DGNIYVADT 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1128 NSRRIYRVKSlsgtkdlagNSEVVAGTGEQCLPfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDATM--IRKVDQN 1205
Cdd:cd05819 168 GNHRIQVFDP---------DGNFLTTFGSTGTG-------------PGQFNYPTGIAVDSDGNIYVADSGNnrVQVFDPD 225
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 2217356498 1206 GIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1262
Cdd:cd05819 226 GAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1174-1405 |
7.58e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 77.36 E-value: 7.58e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1174 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1251
Cdd:cd05819 4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1252 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1329
Cdd:cd05819 69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217356498 1330 GEICLLAGAASDCDCKndvncncysgddayatdaiLNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1405
Cdd:cd05819 132 GEFLTTFGSGGSGPGQ-------------------FNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1065-1323 |
4.19e-14 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 75.05 E-value: 4.19e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1065 KLLAPVALAVGIDGSLYVGDFNYIR-RIFPS--RNVTSILEL------RNNPahkYYLAVDPvSGSLYVSDTNSRRIYRV 1135
Cdd:cd05819 53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgNFLASFGGSgdgdgeFNGP---RGIAVDS-SGNIYVADTGNHRIQKF 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1136 kslsgtkDLAGNSEVVAGTGEQClpfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1213
Cdd:cd05819 129 -------DPDGEFLTTFGSGGSG---------------PGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLTTFG 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1214 SNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslSKLA 1291
Cdd:cd05819 187 ST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------NFLG 237
|
250 260 270
....*....|....*....|....*....|..
gi 2217356498 1292 IHSALESASAIAISHTGVLYITETDEKKINRL 1323
Cdd:cd05819 238 SDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1069-1392 |
1.27e-13 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 73.52 E-value: 1.27e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDF--NYIRRIFP-SRNVTSILELRNNPAHKyyLAVDPvSGSLYVSDTNSRRIYRVkslsGTKDla 1145
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFTEYPLGGGSGPHG--IAVDP-DGNLWFTDNGNNRIGRI----DPKT-- 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1146 GNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgsndltavrP 1222
Cdd:COG4257 90 GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF----------P 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1223 LSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslsklaiHSALESA 1299
Cdd:COG4257 140 LPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL-------------------PTPGAGP 190
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1300 SAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSLAVAPDGTI 1379
Cdd:COG4257 191 RGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGVAVDGDGRV 244
|
330
....*....|...
gi 2217356498 1380 YIADLGNIRIRAV 1392
Cdd:COG4257 245 WFAESGANRIVRF 257
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1064-1332 |
2.99e-11 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 66.58 E-value: 2.99e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1064 NKLLAPVALAVGIDGSLYVGD--FNYIRRIFPSRNVTSILELRNNPAHKYYLAVDPvSGSLYVSDTNSRRIYRVkslsgt 1141
Cdd:COG4257 56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGGSNPHGIAFDP-DGNLWFTDQGGNRIGRL------ 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1142 kDLAGNsEVVAGTgeqcLPFDEARcgdggkaidatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTLLGSNDLT 1218
Cdd:COG4257 129 -DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEYALPTPGA 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1219 AvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSKLAIHSAL 1296
Cdd:COG4257 189 G-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTEYPLPGGG 230
|
250 260 270
....*....|....*....|....*....|....*.
gi 2217356498 1297 ESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1332
Cdd:COG4257 231 ARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2213-2289 |
2.47e-09 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 55.97 E-value: 2.47e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 2213 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2288
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
.
gi 2217356498 2289 S 2289
Cdd:TIGR03696 70 N 70
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1111-1422 |
2.51e-09 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 60.80 E-value: 2.51e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1111 YYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAgnsevvagTGEqclpFDEARCGDGGkaidatlmSPRGIAVDKNGL 1190
Cdd:COG4257 20 RDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS--------GPHGIAVDPDGN 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1191 MYFVD--ATMIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRIT-E 1264
Cdd:COG4257 72 LWFTDngNNRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtdQGGNRIGRLDpA 131
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1265 NHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGEIcllagaasdcd 1343
Cdd:COG4257 132 TGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGTL----------- 178
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1344 ckndvncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPGEQELY--VFNAD 1421
Cdd:COG4257 179 -------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGGGARPYgvAVDGD 241
|
.
gi 2217356498 1422 G 1422
Cdd:COG4257 242 G 242
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
672-702 |
2.54e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.75 E-value: 2.54e-08
10 20 30
....*....|....*....|....*....|.
gi 2217356498 672 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 702
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1113-1389 |
5.95e-08 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 56.06 E-value: 5.95e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1113 LAVDPvSGSLYVSDTNSRRIYRvkslsgtkdLAgnsevvAGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 1191
Cdd:cd14952 15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1192 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1269
Cdd:cd14952 66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1270 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVN 1349
Cdd:cd14952 122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTTQTVLPFTGLNSPSGVAVDTA 188
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2217356498 1350 CNCYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1389
Cdd:cd14952 189 GNVYVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
413-572 |
1.82e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 55.01 E-value: 1.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 413 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 462
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 463 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 511
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 512 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 563
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 2217356498 564 ------------DPN---WMGPDC 572
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
482-628 |
1.95e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 52.45 E-value: 1.95e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 482 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 551
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 552 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 625
Cdd:NF041328 75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 2217356498 626 REG 628
Cdd:NF041328 140 RGG 142
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1069-1195 |
2.88e-07 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 54.58 E-value: 2.88e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGD-FNYIRRIFPSRNVT--SILELRNNPAHKYYL---AVDPvSGSLYVSDTNSRRIyRVKSLSGTK 1142
Cdd:cd14957 114 PYGIAVDSNGNIYVADtGNHRIQVFTSSGTFsySIGSGGTGPGQFNGPqgiAVDS-DGNIYVADTGNHRI-QVFTSSGTF 191
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 2217356498 1143 DLAgnsevVAGTGEqclpfdearcGDGGkaidatLMSPRGIAVDKNGLMYFVD 1195
Cdd:cd14957 192 QYT-----FGSSGS----------GPGQ------FSDPYGIAVDSDGNIYVAD 223
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1066-1253 |
1.11e-06 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 52.21 E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1066 LLAPVALAVGIDGSLYVGDFNYIR--RIFPSRNVTSILELR--NNPAHkyyLAVDPvSGSLYVSDTNSRRIYRVKS---- 1137
Cdd:cd14952 51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLPFTglNDPTG---VAVDA-AGNVYVADTGNNRVLKLAAgsnt 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1138 --------LSGTKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLMYF 1193
Cdd:cd14952 127 qtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTAGNVYV 193
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1194 VDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1253
Cdd:cd14952 194 TDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1113-1390 |
1.55e-06 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 54.09 E-value: 1.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1113 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1187
Cdd:PLN02919 573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1188 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1260
Cdd:PLN02919 635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1261 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1335
Cdd:PLN02919 700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2217356498 1336 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1390
Cdd:PLN02919 778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1177-1456 |
2.82e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 51.50 E-value: 2.82e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1177 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1254
Cdd:cd14957 17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1255 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1331
Cdd:cd14957 82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1332 icllagaasdcdckndvncNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1411
Cdd:cd14957 144 -------------------FSYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 2217356498 1412 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYSTDndvtelIDNNGN 1456
Cdd:cd14957 184 -----VFTSSGTFQYTFgSSGSGPGQFSDPYGIA------VDSDGN 218
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1172-1392 |
1.16e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 49.63 E-value: 1.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1172 AIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTllgsndltavrplscdssmdVAQVRLEWPTDLAVNPmD 1248
Cdd:COG4257 11 PVPAPGSGPRDVAVDPDGAVWFTDQGggRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-D 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1249 NSLYVLE--NNVILRIT-ENHQVSIIAGrpmhcqvPGIDYSLSKLAIHSAlesasaiaishtGVLYITETDEKKINRLRq 1325
Cdd:COG4257 70 GNLWFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNPHGIAFDPD------------GNLWFTDQGGNRIGRLD- 129
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217356498 1326 vTTNGEIcllagaasdcdckndvncncySGDDAYATDAilnSPSSLAVAPDGTIYIADLGNIRIRAV 1392
Cdd:COG4257 130 -PATGEV---------------------TEFPLPTGGA---GPYGIAVDPDGNLWVTDFGANAIGRI 171
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1069-1390 |
1.36e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 49.57 E-value: 1.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDFNYIR-RIF-PSRNVTSILELRNNPAHK----YYLAVDPvSGSLYVSDTNSRRIyRVKSLSGTK 1142
Cdd:cd14957 20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIGSGGTGSGQfnspYGIAVDS-NGNIYVADTDNNRI-QVFNSSGVY 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1143 DLAgnsevVAGTGEQCLPFDEarcgdggkaidatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSndltav 1220
Cdd:cd14957 98 QYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS------ 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1221 rplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLAIHSALE 1297
Cdd:cd14957 151 ------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGSGPGQFS 206
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1298 SASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncncySGDDAYA------TDAILNSPSSL 1371
Cdd:cd14957 207 DPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQFNYPYGI 258
|
330
....*....|....*....
gi 2217356498 1372 AVAPDGTIYIADLGNIRIR 1390
Cdd:cd14957 259 AVDNDGKIYVADSNNNRIQ 277
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
8-187 |
1.63e-05 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 49.43 E-value: 1.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 8 PIPPTSSPSLLPSaqlPSSHNPPPVScQMPLLDSNTSHQIMDTNPDEEF----SPNSYLLRACSGPQQ---ASSSGPPNH 80
Cdd:pfam15279 115 PLISVASSSKLLA---PKPHEPPSLP-PPPLPPKKGRRHRPGLHPPLGRppgsPPMSMTPRGLLGKPQqhpPPSPLPAFM 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 81 HSQSTLRPPLPPPhnhtlsHHHSSANSlnrnSLTNRRSQIHAPAPAP-NDLATTPEsvqlqdswvlnsnvPLEtRHFLFK 159
Cdd:pfam15279 191 EPSSMPPPFLRPP------PSIPQPNS----PLSNPMLPGIGPPPKPpRNLGPPSN--------------PMH-RPPFSP 245
|
170 180
....*....|....*....|....*...
gi 2217356498 160 TSSGSTPLFSSSSPGYPLTSGTVYTPPP 187
Cdd:pfam15279 246 HHPPPPPTPPGPPPGLPPPPPRGFTPPF 273
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1354-1395 |
1.84e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 49.45 E-value: 1.84e-05
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2217356498 1354 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1395
Cdd:cd14953 11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
|
|
| NHL_like_4 |
cd14955 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1114-1389 |
3.48e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271325 [Multi-domain] Cd Length: 279 Bit Score: 48.34 E-value: 3.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1114 AVDPvSGSLYVSDTNSRRIYRVKSlSGTkdlagnseVVAGTGeqclpfdeaRCGDGgkaiDATLMSPRGIAVDKNGLMYF 1193
Cdd:cd14955 69 AVDS-DGNVYVADTGNHRIQKFDS-TGT--------FLTKWG---------SSGSG----DGQFNSPSGIAVDSAGNVYV 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1194 VDAT--MIRKVDQNGIISTLLGSNDltavrplSCDSSMDvaqvrleWPTDLAVnpmDNS--LYVLEnnvilriTENHQV- 1268
Cdd:cd14955 126 TDSGnnRIQKFDSSGTFITKWGSFG-------SGDGQFN-------SPTGIAV---DSAgnVYVAD-------TGNNRIq 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1269 ------SIIAGRpmhcQVPGIDyslsklaiHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdc 1342
Cdd:cd14955 182 kftstgTFLTKW----GSEGSG--------DGQFNAPYGIAVDSAGNVYVADTGN---NRIQKFDSSGTFITKWGSEG-- 244
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 2217356498 1343 dckndvncncySGDDAYatdailNSPSSLAVAPDGTIYIADLGNIRI 1389
Cdd:cd14955 245 -----------SGDGQF------NSPSGIAVDSAGNVYVADSGNNRI 274
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1504-1540 |
4.81e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.59 E-value: 4.81e-05
10 20 30
....*....|....*....|....*....|....*..
gi 2217356498 1504 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1540
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| Keratin_B2 |
pfam01500 |
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ... |
505-623 |
1.67e-04 |
|
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
Pssm-ID: 366678 [Multi-domain] Cd Length: 161 Bit Score: 44.40 E-value: 1.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 505 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCelarvqCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTH 584
Cdd:pfam01500 4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC------CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQT 75
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 2217356498 585 GVCIGGACRCEEGWTGAA----CDQRVCHPRCIEHGTCKDGKC 623
Cdd:pfam01500 76 SSCQTGCGGIGYGQEGSSgavsSRTRWCRPDCRVEGTCLPPCC 118
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
483-505 |
3.08e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 40.02 E-value: 3.08e-04
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
418-440 |
3.21e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 40.02 E-value: 3.21e-04
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1059-1216 |
3.64e-04 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 44.97 E-value: 3.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1059 GLAEGnKLLAPVALAVGIDGSLYVGDFnYIRRI------------FPSRNVTSILElrnNPAHkyyLAVDpvSGSLYVSD 1126
Cdd:cd14963 49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEKKDRVKLI---SPAG---LAID--DGKLYVSD 118
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1127 TNSRRIYrvkslsgtkdlagnseVVAGTGEQCLPFDEARCGDGgkaidaTLMSPRGIAVDKNGLMYFVDATMIR-KV-DQ 1204
Cdd:cd14963 119 VKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNGRiQVfDK 176
|
170
....*....|...
gi 2217356498 1205 NG-IISTLLGSND 1216
Cdd:cd14963 177 NGkFIKELNGSPD 189
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
548-572 |
6.50e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 38.87 E-value: 6.50e-04
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
594-636 |
1.40e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.76 E-value: 1.40e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 2217356498 594 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 636
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1059-1195 |
1.42e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 43.05 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1059 GLAEGNkLLAPVALAVGIDGSLYVGDFN------------YIRRIFPSRNVTSILEL-RNnpahkyyLAVDPvSGSLYVS 1125
Cdd:cd14963 141 GSEPGE-LSYPNGIAVDEDGNIYVADSGngriqvfdkngkFIKELNGSPDGKSGFVNpRG-------IAVDP-DGNLYVV 211
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1126 DTNSRRIYrVKSLSGTKDLagnseVVAGTGEqclpfdearcgdggkaIDATLMSPRGIAVDKNGLMYFVD 1195
Cdd:cd14963 212 DNLSHRVY-VFDEQGKELF-----TFGGRGK----------------DDGQFNLPNGLFIDDDGRLYVTD 259
|
|
| YvrE |
COG3386 |
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ... |
1072-1225 |
1.81e-03 |
|
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway
Pssm-ID: 442613 [Multi-domain] Cd Length: 266 Bit Score: 42.57 E-value: 1.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1072 LAVGIDGSLYVGDFNY------IRRIFPSRNVTSILElrnnpahKYY----LAVDPVSGSLYVSDTNSRRIYRVkSLSGT 1141
Cdd:COG3386 98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD-------GLTfpngIAFSPDGRTLYVADTGAGRIYRF-DLDAD 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1142 KDLaGNSEVVAgtgeqclpfdEARCGDGGkaidatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSNDLTA 1219
Cdd:COG3386 170 GTL-GNRRVFA----------DLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIELPE 226
|
....*.
gi 2217356498 1220 VRPLSC 1225
Cdd:COG3386 227 RRPTNV 232
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
6-198 |
1.86e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 43.58 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 6 GRPIPPTSSPSLLPSAQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDE---EFSPNSYLLRACSgpqqasssgppnHHS 82
Cdd:COG5099 202 FNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVEnniILNSSSSINELTS------------IYG 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 83 QSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNrRSQIHAPAPAPNDLATTPESVQLQDSwvLNSNVPLETRHFLFkTSS 162
Cdd:COG5099 270 SVPSIRNLRGLNSALVSFLNVSSSSLAFSALNG-KEVSPTGSPSTRSFARVLPKSSPNNL--LTEILTTGVNPPQS-LPS 345
|
170 180 190
....*....|....*....|....*....|....*.
gi 2217356498 163 GSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 198
Cdd:COG5099 346 LLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSL 381
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
509-538 |
2.15e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 38.00 E-value: 2.15e-03
10 20 30
....*....|....*....|....*....|....*
gi 2217356498 509 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 538
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1504-1546 |
2.61e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.57 E-value: 2.61e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2217356498 1504 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1546
Cdd:TIGR01643 1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1069-1195 |
2.94e-03 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 41.81 E-value: 2.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1069 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELR--NNPahkYYLAVDPvSGSLYVSDTNSRRIYRVKSLSGTK-- 1142
Cdd:cd14952 96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVLPFTglSNP---DGVAVDG-AGNVYVTDTGNNRVLKLAAGSTTQtv 171
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217356498 1143 --------------DLAG--------NSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLMYFVD 1195
Cdd:cd14952 172 lpftglnspsgvavDTAGnvyvtdhgNNRVLklaAGSTTPTvLPFTG-------------LNGPLGVAVDAAGNVYVAD 237
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
577-649 |
3.07e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.51 E-value: 3.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 577 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 644
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 2217356498 645 GNGRC 649
Cdd:NF041328 122 DPLHC 126
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1102-1275 |
3.25e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 42.18 E-value: 3.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1102 ELRNNPA-HKYY------LAVDPvSGSLYVSDTNS---RRIyrvkSLSGTKDLAGNSEVVAGTGeqcLpFDearCGD-GG 1170
Cdd:cd14951 121 GNRNGPYpHEAWfaqpsgLSLAG-WGELFVADSESsaiRAV----SLKDGGVKTLVGGTRVGTG---L-FD---FGDrDG 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1171 KAIDATLMSPRGIAVDKNGLMYFVDA--TMIRKVD-QNGIISTLLGSNdltavrplscDSSMDVAQVRLEWPTDLAVNPm 1247
Cdd:cd14951 189 PGAEALLQHPLGVAALPDGSVYVADTynHKIKRVDpATGEVSTLAGTG----------KAGYKDLEAQFSEPSGLVVDG- 257
|
170 180 190
....*....|....*....|....*....|...
gi 2217356498 1248 DNSLYVLE--NNVILRI---TENHQVSIIAGRP 1275
Cdd:cd14951 258 DGRLYVADtnNHRIRRLdlpTEALEVLTLAHRT 290
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
636-669 |
3.41e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.41e-03
10 20 30
....*....|....*....|....*....|....*.
gi 2217356498 636 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 669
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
476-506 |
3.80e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.80e-03
10 20 30
....*....|....*....|....*....|....*.
gi 2217356498 476 NQCIDPS-CGGHGSCIDG----NCVCSAGYKGEHCE 506
Cdd:cd00054 3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
421-593 |
5.64e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 39.74 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 421 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 494
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 495 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 572
Cdd:NF041328 89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
|
170 180
....*....|....*....|.
gi 2217356498 573 sveVCSVDCGTHGVCIGGACR 593
Cdd:NF041328 128 ---ACGVACDPGESCRGGACT 145
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1307-1392 |
7.77e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 41.02 E-value: 7.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356498 1307 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncncySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1381
Cdd:cd14951 206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
|
90
....*....|.
gi 2217356498 1382 ADLGNIRIRAV 1392
Cdd:cd14951 264 ADTNNHRIRRL 274
|
|
| I-EGF_1 |
pfam18372 |
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ... |
449-466 |
9.39e-03 |
|
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveal an epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains, this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end.
Pssm-ID: 465729 Cd Length: 29 Bit Score: 35.93 E-value: 9.39e-03
|
|