Conserved domains on  [gi|2258096898|ref|WP_252190561|]

RHS repeat-associated core domain-containing protein [Pseudoalteromonas sp. SiA1]

List of domain hits

Name Accession Description Interval E-value
RHS_core super family cl49306
RHS element core protein;
421-1252 4.66e-58

The actual alignment was detected with superfamily member NF041261:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 220.65  E-value: 4.66e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  421 TYTYQFEYNldagETTCIDSRGHKEhfVHNAQG-----KLVKHTDPNGNVWHYGYNTKGQkiTEIKPDSSEIKYSYTPY- 494
Cdd:NF041261   385 SYRYQYEQD----RITITDSLNRRE--VLHTEGegglkRVVKKEHADGSVTRSGYDAAGR--LTAQTDAAGRRTEYSLNv 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  495 --GQLESITQPDGSVTKFAYNQLGQRVLTTLPDGQTITRKYSVAGLLQSETFGDGRTVLYSYDKFGQLTQHINKD--GQV 570
Cdd:NF041261   457 vsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTTDatGST 536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  571 TKFVWNEQGELLA-KHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQLAQTiafdeKDPEHKQQQhFSYDDAGRL 649
Cdd:NF041261   537 KQMTWSRYGQLLAfTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSV-----KDAQGRETR-YEYNAAGDL 610
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  650 ISSKNSKGDTTEQHFEGLSQPHCVIQpDGSALHLTYDKERNLTAIERSDGHVYRISYDANENPTQVTGFDGTLQQYKYDA 729
Cdd:NF041261   611 TAVITPDGNRSETQYDAWGKAVSTTQ-GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDL 689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  730 CNRLtsvTQSDKRhvkierdklGRVIAQHaslatdthivnnanyysYNLQGKIT-RAHNAQSTLKQQFDKGGRLVRAEQV 808
Cdd:NF041261   690 TGKL---TQSEDE---------GLVTLWH-----------------YDESDRIThRTVNGEPAEQWQYDEHGWLTDISHL 740
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  809 HNQQQAhTLKYSYDDYGR----RQSLTLPDSSKL------NYSYNKfgqlSGIHLQQANSTAVELAALSYDSqGNIQTQK 878
Cdd:NF041261   741 SEGHRV-AVHYGYDDKGRltgeRQTVENPETGELlwqhetGHAYNE----QGLANRVTPDSLPPVEWLTYGS-GYLAGMK 814
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  879 FGNDITLTQQFDVFNRLTQQQLTHPAQALFDTCTYNYDSVNQLIARKEQGVSShNINFEYNSLGQLIQqnlVSSENtKTT 958
Cdd:NF041261   815 LGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPAGQLQSQHLNSLVY-DRDYTWNDNGDLVR---ISGPR-QTR 889
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  959 QYQWDSFGNPVS-QSTVQNNDL-----TEQQTTQLNDDSDASDASNKV--ESNQSESTHNVI-HSEYNDSSVATlasdls 1029
Cdd:NF041261   890 EYGYSATGRLTGvHTTAANLDIripyaTDPAGNRLPDPELHPDSTLTAwpDNRIAEDAHYVYrYDEYGRLTEKT------ 963
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1030 DDVITGT--TDADRLTHfgdsdFHYDE-----FGNQIRETGKGIKTRreynafnqlscfnnngtltqYDYDPLGRRIAKH 1102
Cdd:NF041261   964 DRIPEGVirTDDERTHH-----YHYDSqhrlvFYTRIQHGEPLVESR--------------------YLYDPLGRRMAKR 1018
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1103 TEHGKID---------------YIWDNDQLI----------GEYQHGEYTWYI--------------------------- 1130
Cdd:NF041261  1019 VWRRERDltgwmslsrkpevtwYGWDGDRLTtvqtdttriqTVYQPGSFTPLIrvetengerakaqrrslaetlqqegse 1098
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1131 -------------------------------------------NLPNQFHPlALIKKGEVYYYHLDQLNTPRFVTNNKTE 1167
Cdd:NF041261  1099 nghgvvfpaelvrmldrleeeiradrvseesrawlaqcgltveQMARQVEP-EYTPARKLHLYHCDHRGLPLALISEEGN 1177
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1168 VVWENQADVYGYEESEANtTNSFTQPIRFQGQYLDEESGLHYNRYRYYSPKQQRFINQDPIGLVGGINHYQYAPNPVNWV 1247
Cdd:NF041261  1178 TAWQGEYDEWGNLLNEEN-PHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFI 1256

                   ....*
gi 2258096898 1248 DPFGL 1252
Cdd:NF041261  1257 DPLGL 1261
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
161-246 7.61e-14

Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 67.94  E-value: 7.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  161 DPVSMLSGEEILPLIDFTLAGSKQLIWRRLYRSSHADVcSVMGNGWRHDFMVQLTEHylpppkvgpkqkGTYWLEYQDEH 240
Cdd:pfam20148    2 DPVNVATGNKVLEETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE------------GDGGVVYIDAD 68

                   ....*.
gi 2258096898  241 GAKHRF 246
Cdd:pfam20148   69 GREVTF 74
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
361-396 1.26e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.66  E-value: 1.26e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2258096898  361 YDENKNLVRATNQQGETEHYGYNAANLLTKRTRASG 396
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
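
Each hit in this list carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch, assuming exactly that layout, of reading those fields into a record with Python's standard library. The names `HitSummary` and `parse_hit_summary` are hypothetical and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical record type)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, copied from the summary lines on this page:
# Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match the layout."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )
```

Applied to the RHS_repeat summary line, the sketch yields Pssm-ID 461685, a model length of 37, a bit score of 40.66 and an E-value of 1.26e-04.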
 
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
417-1254 4.77e-41

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 165.31  E-value: 4.77e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  417 GDNNTYTYQFEYNLDAGETTCIDSRGHKEHFVHNAQGKLVKHTDPNGNVWHYGYNTKGQKITEIKPDSSEIKYSYTPYGQ 496
Cdd:COG3209    290 ATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTS 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  497 LESITQPDGSVTKFAYNQLGQRVLTTLPDGQTITRKYSVAGLLQSETFGDGRTVLYSYDKFGQLTQHINKDGQVTKFVWN 576
Cdd:COG3209    370 VGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDAT 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  577 EQGELLAKHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQLAQTIAFDEKDPEHKQQQHFSYDDAGRLISSKNSK 656
Cdd:COG3209    450 TTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTL 529
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  657 GDTTEQHFEGLSQPHCVIQPDGSALHLTYDKERNLTAIERSDGHVYRISYDANENPTQVTGFDGTLQQYKYDACNRLTSV 736
Cdd:COG3209    530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  737 TQSDKRHVKIERDKLGRVIAQHASLATDTHIVNNANYYSYNLQGKITRAHNAQSTLKQQFDKGGRLVRAEQVHNQQQAHT 816
Cdd:COG3209    610 TSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  817 LKYSYDDYGRRQSLTLPDSSKLNYSYNKFGQLSGIHLQQANSTAVELAALSYDSQGNIQTQKFGNDITLTQQFDVFNRLT 896
Cdd:COG3209    690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  897 QQQLTHPAQALFDTCTYNYDSVNQLIARKEQgvSSHNINFEYNSLGQLIQQNLVSSENTKTTQ---YQWDSFGNPVSQST 973
Cdd:COG3209    770 SETTPGGVTQGTYTTRYTYDALGRLTSVTYP--DGETVTYTYDALGRLTSVITVGSGGGTDLQdrtYTYDAAGNITSITD 847
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  974 VQNNDLTEQQTTqlnddsdasdasnkvesnqsesthnvihseYNDssvatlasdlsddvitgttdADRLT----HFGDSD 1049
Cdd:COG3209    848 ALRAGTLTQTYT------------------------------YDA--------------------LGRLTsatdPGTTES 877
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1050 FHYDEFGNQIRETGKGIkTRREYNAFNQLSCF-NNNGTLTQYDYDPLGrriakhtehgkidyiwdndqligeyqhgeytw 1128
Cdd:COG3209    878 YTYDANGNLTSRTDGGT-TTYTYDALGRLVSVtKPDGTTTTYTYDALG-------------------------------- 924
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1129 yinlpnqfhplalikkgevyyyHLDQLNTPRFVTNNKTEVVWENQADVYGyeESEANTTNSFTQPIRFQGQYLDEESGLH 1208
Cdd:COG3209    925 ----------------------HTDHLGSVRALTDASGQVVWRYDYDPFG--NLLAETSGAAANPLRFTGQEYDAETGLY 980
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*..
gi 2258096898 1209 YNRYRYYSPKQQRFINQDPIGLVGGINHYQYA-PNPVNWVDPFGLSC 1254
Cdd:COG3209    981 YNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAA 1027
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
1175-1252 7.51e-34

Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 124.92  E-value: 7.51e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2258096898 1175 DVYGYEESEantTNSFTQPIRFQGQYLDEESGLHYNRYRYYSPKQQRFINQDPIGLVGGINHYQYAP-NPVNWVDPFGL 1252
Cdd:TIGR03696    2 DPYGEVLSE---SGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
RHS_core NF041261
RHS element core protein;
550-1099 1.43e-09

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 63.10  E-value: 1.43e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  550 VLYSYDKFGQLTQHINKDG-QVTKFVWNEQ--GELLAKHHNDE-LIRYSYDSLGRVNATINNAGLLTQYKYnEHGQLAQT 625
Cdd:NF041261   320 VRYTYTEAGELLAVYDRSNtQVRAFTYDAQhpGRMVAHRYAGRpEMCYRYDDTGRVTEQLNPAGLSYRYQY-EQDRITIT 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  626 IAFDEKDPEHKQqqhfsyDDAG--RLISSKNSkgdtteqhfeglsqphcviqpDGSALHLTYDKERNLTAieRSDGHVYR 703
Cdd:NF041261   399 DSLNRREVLHTE------GEGGlkRVVKKEHA---------------------DGSVTRSGYDAAGRLTA--QTDAAGRR 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  704 ISYDAN---ENPTQVTGFDGTLQQYKYDACNRLTSVTQSDKRHVKIERDKLGRVIAQHASlatdthivnnanyysynlQG 780
Cdd:NF041261   450 TEYSLNvvsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSR------------------SG 511
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  781 KITRahnaqstlkqqfdkggrlvraeqvhnqqqahtlkYSYDDYGRRQSLTLPDS--SKLNYSYNKFGQLsgihlqqans 858
Cdd:NF041261   512 ETTR----------------------------------YRYDDPHSELPATTTDAtgSTKQMTWSRYGQL---------- 547
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  859 tavelaaLSYDSQGNIQTQkfgnditltqqfdvfnrltqqqlthpaqalfdtctYNYDSVNQLIA-RKEQGVSSHNinfE 937
Cdd:NF041261   548 -------LAFTDCSGYQTR-----------------------------------YEYDRFGQMTAvHREEGISTYR---R 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  938 YNSLGQLIQqnlVSSENTKTTQYQWDSFGnpvsqstvqnnDLTEqqttqlnddsdasdasnkvesnqsesthnvihseyn 1017
Cdd:NF041261   583 YDNRGQLTS---VKDAQGRETRYEYNAAG-----------DLTA------------------------------------ 612
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1018 dssvatlasdlsddVITgtTDADRlthfgdSDFHYDEFGNQIRETGKGIKTRREYNAFNQLSCFNN-NGTLTQYDYDPLG 1096
Cdd:NF041261   613 --------------VIT--PDGNR------SETQYDAWGKAVSTTQGGLTRSMEYDAAGRITTLTNeNGSHSTFLYDALD 670

                   ...
gi 2258096898 1097 RRI 1099
Cdd:NF041261   671 RLV 673
RHS_core NF041261
RHS element core protein;
296-762 1.71e-09

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 62.71  E-value: 1.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  296 VINEKGQSIGFYYDAKERLHRVEVNKARGCILKYNPQGllaNISAYRTGDNN----------KPVLLTP--LLAQYDYDE 363
Cdd:NF041261   571 VHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAG---DLTAVITPDGNrsetqydawgKAVSTTQggLTRSMEYDA 647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  364 NKNLVRATNQQGETEHYGYNAANLLTKRTRASGFSHHFEWD-----SYSSSAKCIKQWG-DNNTYTYQFEYNLDAGETTC 437
Cdd:NF041261   648 AGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDltgklTQSEDEGLVTLWHyDESDRITHRTVNGEPAEQWQ 727
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  438 IDSRGHKEHFVHNAQGKLVkhtdpngnVWHYGYNTKG----QKITEIKPDSSEI------KYSYTPYGqLESITQPDgSV 507
Cdd:NF041261   728 YDEHGWLTDISHLSEGHRV--------AVHYGYDDKGrltgERQTVENPETGELlwqhetGHAYNEQG-LANRVTPD-SL 797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  508 TKFAYNQLGQRVLTTLPDGQTITRKYSvAGLLQSET---FG-DGRTVLY----SYDKFGQL-TQHINKDGQVTKFVWNEQ 578
Cdd:NF041261   798 PPVEWLTYGSGYLAGMKLGGTPLVEYT-RDRLHRETvrsFGgAGSNAAYelttAYTPAGQLqSQHLNSLVYDRDYTWNDN 876
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  579 GELLAKHHNDELIRYSYDSLGRVNA--------------TINNAG------------LLTQ-------------YKYNEH 619
Cdd:NF041261   877 GDLVRISGPRQTREYGYSATGRLTGvhttaanldiripyATDPAGnrlpdpelhpdsTLTAwpdnriaedahyvYRYDEY 956
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  620 GQLAQTIafdEKDPE------HKQQQHFSYDDAGRLISSknskgdTTEQHFEGLSQPHCVIQPDGSAL-HLTYDKERNLT 692
Cdd:NF041261   957 GRLTEKT---DRIPEgvirtdDERTHHYHYDSQHRLVFY------TRIQHGEPLVESRYLYDPLGRRMaKRVWRRERDLT 1027
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  693 AiersdghvyrisYDANENPTQVT--GFDGtlqqykydacNRLTSVtQSDKRHV-------------KIERDKLGRVIAQ 757
Cdd:NF041261  1028 G------------WMSLSRKPEVTwyGWDG----------DRLTTV-QTDTTRIqtvyqpgsftpliRVETENGERAKAQ 1084

                   ....*
gi 2258096898  758 HASLA 762
Cdd:NF041261  1085 RRSLA 1089
RHS pfam03527
RHS protein;
1147-1178 1.13e-04

Pssm-ID: 427349 [Multi-domain]  Cd Length: 38  Bit Score: 40.75  E-value: 1.13e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2258096898 1147 VYYYHLDQLNTPRFVTNNKTEVVWENQADVYG 1178
Cdd:pfam03527    1 IYYYHTDHLGTPEELTDEAGEIVWSAEYDAWG 32
Bacuni_01323_like cd12871
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.
552-713 3.36e-03

Pssm-ID: 214015 [Multi-domain]  Cd Length: 231  Bit Score: 40.87  E-value: 3.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  552 YSYDKFGQLTQHINKDGQVTKFvwnEQGELLAKHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQlAQTIAFDek 631
Cdd:cd12871     21 FEYDADGRLTSITTTQEGEAEE---ITYTTTITYEPNVITVTDDGGKTVSTYTLNEKGYVTSCTETEYGK-GQLRTYT-- 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  632 dpehkqqqhFSYDDAGRLISSKNSKGDTTEQHfeglsqphcviqpdgsalHLTYDKERNLTAIERSDGHV--YRISYDAN 709
Cdd:cd12871     95 ---------FTYNADGQLTKIVESIGTEYSTI------------------TITWNNGDIVSISTKSNTEEneSKITYTSD 147

                   ....
gi 2258096898  710 ENPT 713
Cdd:cd12871    148 KVYN 151
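
The alignment blocks in this list pair a query row (`gi ...`) with a model row (`Cdd:...`). The following is a minimal sketch of counting identical positions between two such rows once the sequence strings have been extracted and concatenated. It assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned columns; the lowercase rule is an assumption about the display convention, and the function name `alignment_identity` is hypothetical.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows.

    Columns with a gap ('-') or a lowercase residue in either row are
    skipped, on the assumption that they are not part of the aligned core.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned residue: not counted
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Sequence strings copied from the pfam03527 (RHS) rows in the hit list.
    query_row = "VYYYHLDQLNTPRFVTNNKTEVVWENQADVYG"
    model_row = "IYYYHTDHLGTPEELTDEAGEIVWSAEYDAWG"
    same, total = alignment_identity(query_row, model_row)
    print(f"{same}/{total} identical ({100.0 * same / total:.1f}%)")
```

For the pfam03527 rows used in the example, the count is 14 identical positions out of 32 aligned columns. This is a plain identity count and is not how the reported bit score or E-value is derived.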
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
421-1252 4.66e-58

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 220.65  E-value: 4.66e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  421 TYTYQFEYNldagETTCIDSRGHKEhfVHNAQG-----KLVKHTDPNGNVWHYGYNTKGQkiTEIKPDSSEIKYSYTPY- 494
Cdd:NF041261   385 SYRYQYEQD----RITITDSLNRRE--VLHTEGegglkRVVKKEHADGSVTRSGYDAAGR--LTAQTDAAGRRTEYSLNv 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  495 --GQLESITQPDGSVTKFAYNQLGQRVLTTLPDGQTITRKYSVAGLLQSETFGDGRTVLYSYDKFGQLTQHINKD--GQV 570
Cdd:NF041261   457 vsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTTDatGST 536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  571 TKFVWNEQGELLA-KHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQLAQTiafdeKDPEHKQQQhFSYDDAGRL 649
Cdd:NF041261   537 KQMTWSRYGQLLAfTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSV-----KDAQGRETR-YEYNAAGDL 610
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  650 ISSKNSKGDTTEQHFEGLSQPHCVIQpDGSALHLTYDKERNLTAIERSDGHVYRISYDANENPTQVTGFDGTLQQYKYDA 729
Cdd:NF041261   611 TAVITPDGNRSETQYDAWGKAVSTTQ-GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDL 689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  730 CNRLtsvTQSDKRhvkierdklGRVIAQHaslatdthivnnanyysYNLQGKIT-RAHNAQSTLKQQFDKGGRLVRAEQV 808
Cdd:NF041261   690 TGKL---TQSEDE---------GLVTLWH-----------------YDESDRIThRTVNGEPAEQWQYDEHGWLTDISHL 740
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  809 HNQQQAhTLKYSYDDYGR----RQSLTLPDSSKL------NYSYNKfgqlSGIHLQQANSTAVELAALSYDSqGNIQTQK 878
Cdd:NF041261   741 SEGHRV-AVHYGYDDKGRltgeRQTVENPETGELlwqhetGHAYNE----QGLANRVTPDSLPPVEWLTYGS-GYLAGMK 814
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  879 FGNDITLTQQFDVFNRLTQQQLTHPAQALFDTCTYNYDSVNQLIARKEQGVSShNINFEYNSLGQLIQqnlVSSENtKTT 958
Cdd:NF041261   815 LGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPAGQLQSQHLNSLVY-DRDYTWNDNGDLVR---ISGPR-QTR 889
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  959 QYQWDSFGNPVS-QSTVQNNDL-----TEQQTTQLNDDSDASDASNKV--ESNQSESTHNVI-HSEYNDSSVATlasdls 1029
Cdd:NF041261   890 EYGYSATGRLTGvHTTAANLDIripyaTDPAGNRLPDPELHPDSTLTAwpDNRIAEDAHYVYrYDEYGRLTEKT------ 963
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1030 DDVITGT--TDADRLTHfgdsdFHYDE-----FGNQIRETGKGIKTRreynafnqlscfnnngtltqYDYDPLGRRIAKH 1102
Cdd:NF041261   964 DRIPEGVirTDDERTHH-----YHYDSqhrlvFYTRIQHGEPLVESR--------------------YLYDPLGRRMAKR 1018
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1103 TEHGKID---------------YIWDNDQLI----------GEYQHGEYTWYI--------------------------- 1130
Cdd:NF041261  1019 VWRRERDltgwmslsrkpevtwYGWDGDRLTtvqtdttriqTVYQPGSFTPLIrvetengerakaqrrslaetlqqegse 1098
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1131 -------------------------------------------NLPNQFHPlALIKKGEVYYYHLDQLNTPRFVTNNKTE 1167
Cdd:NF041261  1099 nghgvvfpaelvrmldrleeeiradrvseesrawlaqcgltveQMARQVEP-EYTPARKLHLYHCDHRGLPLALISEEGN 1177
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1168 VVWENQADVYGYEESEANtTNSFTQPIRFQGQYLDEESGLHYNRYRYYSPKQQRFINQDPIGLVGGINHYQYAPNPVNWV 1247
Cdd:NF041261  1178 TAWQGEYDEWGNLLNEEN-PHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFI 1256

                   ....*
gi 2258096898 1248 DPFGL 1252
Cdd:NF041261  1257 DPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
417-1254 4.77e-41

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 165.31  E-value: 4.77e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  417 GDNNTYTYQFEYNLDAGETTCIDSRGHKEHFVHNAQGKLVKHTDPNGNVWHYGYNTKGQKITEIKPDSSEIKYSYTPYGQ 496
Cdd:COG3209    290 ATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTS 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  497 LESITQPDGSVTKFAYNQLGQRVLTTLPDGQTITRKYSVAGLLQSETFGDGRTVLYSYDKFGQLTQHINKDGQVTKFVWN 576
Cdd:COG3209    370 VGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDAT 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  577 EQGELLAKHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQLAQTIAFDEKDPEHKQQQHFSYDDAGRLISSKNSK 656
Cdd:COG3209    450 TTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTL 529
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  657 GDTTEQHFEGLSQPHCVIQPDGSALHLTYDKERNLTAIERSDGHVYRISYDANENPTQVTGFDGTLQQYKYDACNRLTSV 736
Cdd:COG3209    530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  737 TQSDKRHVKIERDKLGRVIAQHASLATDTHIVNNANYYSYNLQGKITRAHNAQSTLKQQFDKGGRLVRAEQVHNQQQAHT 816
Cdd:COG3209    610 TSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  817 LKYSYDDYGRRQSLTLPDSSKLNYSYNKFGQLSGIHLQQANSTAVELAALSYDSQGNIQTQKFGNDITLTQQFDVFNRLT 896
Cdd:COG3209    690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  897 QQQLTHPAQALFDTCTYNYDSVNQLIARKEQgvSSHNINFEYNSLGQLIQQNLVSSENTKTTQ---YQWDSFGNPVSQST 973
Cdd:COG3209    770 SETTPGGVTQGTYTTRYTYDALGRLTSVTYP--DGETVTYTYDALGRLTSVITVGSGGGTDLQdrtYTYDAAGNITSITD 847
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  974 VQNNDLTEQQTTqlnddsdasdasnkvesnqsesthnvihseYNDssvatlasdlsddvitgttdADRLT----HFGDSD 1049
Cdd:COG3209    848 ALRAGTLTQTYT------------------------------YDA--------------------LGRLTsatdPGTTES 877
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1050 FHYDEFGNQIRETGKGIkTRREYNAFNQLSCF-NNNGTLTQYDYDPLGrriakhtehgkidyiwdndqligeyqhgeytw 1128
Cdd:COG3209    878 YTYDANGNLTSRTDGGT-TTYTYDALGRLVSVtKPDGTTTTYTYDALG-------------------------------- 924
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1129 yinlpnqfhplalikkgevyyyHLDQLNTPRFVTNNKTEVVWENQADVYGyeESEANTTNSFTQPIRFQGQYLDEESGLH 1208
Cdd:COG3209    925 ----------------------HTDHLGSVRALTDASGQVVWRYDYDPFG--NLLAETSGAAANPLRFTGQEYDAETGLY 980
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*..
gi 2258096898 1209 YNRYRYYSPKQQRFINQDPIGLVGGINHYQYA-PNPVNWVDPFGLSC 1254
Cdd:COG3209    981 YNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAA 1027
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1175-1252 7.51e-34

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 124.92  E-value: 7.51e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2258096898 1175 DVYGYEESEantTNSFTQPIRFQGQYLDEESGLHYNRYRYYSPKQQRFINQDPIGLVGGINHYQYAP-NPVNWVDPFGL 1252
Cdd:TIGR03696    2 DPYGEVLSE---SGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
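The pairwise rows above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts aligned and identical columns for one query/model row pair. It assumes that lowercase letters and `-` mark unaligned or gap columns, as they appear to in this display; the function name `alignment_identity` is hypothetical.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two aligned rows.

    Assumption: lowercase letters and '-' denote unaligned or gap columns,
    so only columns where both rows hold an uppercase residue are counted.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the TIGR03696 alignment above (query 1175-1252, model 2-77).
    query = "DVYGYEESEantTNSFTQPIRFQGQYLDEESGLHYNRYRYYSPKQQRFINQDPIGLVGGINHYQYAP-NPVNWVDPFGL"
    model = "DPYGEVLSE---SGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL"
    same, total = alignment_identity(query, model)
    print(f"{same} identical of {total} aligned columns ({100 * same / total:.1f}%)")
```

Percent identity computed this way is only a rough descriptive figure; the bit scores and E-values reported by CD-Search come from position-specific scoring and are not derived from it.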
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
161-246 7.61e-14

Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 67.94  E-value: 7.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  161 DPVSMLSGEEILPLIDFTLAGSKQLIWRRLYRSSHADVcSVMGNGWRHDFMVQLTEHylpppkvgpkqkGTYWLEYQDEH 240
Cdd:pfam20148    2 DPVNVATGNKVLEETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE------------GDGGVVYIDAD 68

                   ....*.
gi 2258096898  241 GAKHRF 246
Cdd:pfam20148   69 GREVTF 74
RHS_core NF041261
RHS element core protein;
550-1099 1.43e-09

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 63.10  E-value: 1.43e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  550 VLYSYDKFGQLTQHINKDG-QVTKFVWNEQ--GELLAKHHNDE-LIRYSYDSLGRVNATINNAGLLTQYKYnEHGQLAQT 625
Cdd:NF041261   320 VRYTYTEAGELLAVYDRSNtQVRAFTYDAQhpGRMVAHRYAGRpEMCYRYDDTGRVTEQLNPAGLSYRYQY-EQDRITIT 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  626 IAFDEKDPEHKQqqhfsyDDAG--RLISSKNSkgdtteqhfeglsqphcviqpDGSALHLTYDKERNLTAieRSDGHVYR 703
Cdd:NF041261   399 DSLNRREVLHTE------GEGGlkRVVKKEHA---------------------DGSVTRSGYDAAGRLTA--QTDAAGRR 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  704 ISYDAN---ENPTQVTGFDGTLQQYKYDACNRLTSVTQSDKRHVKIERDKLGRVIAQHASlatdthivnnanyysynlQG 780
Cdd:NF041261   450 TEYSLNvvsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSR------------------SG 511
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  781 KITRahnaqstlkqqfdkggrlvraeqvhnqqqahtlkYSYDDYGRRQSLTLPDS--SKLNYSYNKFGQLsgihlqqans 858
Cdd:NF041261   512 ETTR----------------------------------YRYDDPHSELPATTTDAtgSTKQMTWSRYGQL---------- 547
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  859 tavelaaLSYDSQGNIQTQkfgnditltqqfdvfnrltqqqlthpaqalfdtctYNYDSVNQLIA-RKEQGVSSHNinfE 937
Cdd:NF041261   548 -------LAFTDCSGYQTR-----------------------------------YEYDRFGQMTAvHREEGISTYR---R 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  938 YNSLGQLIQqnlVSSENTKTTQYQWDSFGnpvsqstvqnnDLTEqqttqlnddsdasdasnkvesnqsesthnvihseyn 1017
Cdd:NF041261   583 YDNRGQLTS---VKDAQGRETRYEYNAAG-----------DLTA------------------------------------ 612
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898 1018 dssvatlasdlsddVITgtTDADRlthfgdSDFHYDEFGNQIRETGKGIKTRREYNAFNQLSCFNN-NGTLTQYDYDPLG 1096
Cdd:NF041261   613 --------------VIT--PDGNR------SETQYDAWGKAVSTTQGGLTRSMEYDAAGRITTLTNeNGSHSTFLYDALD 670

                   ...
gi 2258096898 1097 RRI 1099
Cdd:NF041261   671 RLV 673
RHS_core NF041261
RHS element core protein;
296-762 1.71e-09

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 62.71  E-value: 1.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  296 VINEKGQSIGFYYDAKERLHRVEVNKARGCILKYNPQGllaNISAYRTGDNN----------KPVLLTP--LLAQYDYDE 363
Cdd:NF041261   571 VHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAG---DLTAVITPDGNrsetqydawgKAVSTTQggLTRSMEYDA 647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  364 NKNLVRATNQQGETEHYGYNAANLLTKRTRASGFSHHFEWD-----SYSSSAKCIKQWG-DNNTYTYQFEYNLDAGETTC 437
Cdd:NF041261   648 AGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDltgklTQSEDEGLVTLWHyDESDRITHRTVNGEPAEQWQ 727
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  438 IDSRGHKEHFVHNAQGKLVkhtdpngnVWHYGYNTKG----QKITEIKPDSSEI------KYSYTPYGqLESITQPDgSV 507
Cdd:NF041261   728 YDEHGWLTDISHLSEGHRV--------AVHYGYDDKGrltgERQTVENPETGELlwqhetGHAYNEQG-LANRVTPD-SL 797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  508 TKFAYNQLGQRVLTTLPDGQTITRKYSvAGLLQSET---FG-DGRTVLY----SYDKFGQL-TQHINKDGQVTKFVWNEQ 578
Cdd:NF041261   798 PPVEWLTYGSGYLAGMKLGGTPLVEYT-RDRLHRETvrsFGgAGSNAAYelttAYTPAGQLqSQHLNSLVYDRDYTWNDN 876
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  579 GELLAKHHNDELIRYSYDSLGRVNA--------------TINNAG------------LLTQ-------------YKYNEH 619
Cdd:NF041261   877 GDLVRISGPRQTREYGYSATGRLTGvhttaanldiripyATDPAGnrlpdpelhpdsTLTAwpdnriaedahyvYRYDEY 956
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  620 GQLAQTIafdEKDPE------HKQQQHFSYDDAGRLISSknskgdTTEQHFEGLSQPHCVIQPDGSAL-HLTYDKERNLT 692
Cdd:NF041261   957 GRLTEKT---DRIPEgvirtdDERTHHYHYDSQHRLVFY------TRIQHGEPLVESRYLYDPLGRRMaKRVWRRERDLT 1027
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  693 AiersdghvyrisYDANENPTQVT--GFDGtlqqykydacNRLTSVtQSDKRHV-------------KIERDKLGRVIAQ 757
Cdd:NF041261  1028 G------------WMSLSRKPEVTwyGWDG----------DRLTTV-QTDTTRIqtvyqpgsftpliRVETENGERAKAQ 1084

                   ....*
gi 2258096898  758 HASLA 762
Cdd:NF041261  1085 RRSLA 1089
RHS pfam03527
RHS protein;
1147-1178 1.13e-04

Pssm-ID: 427349 [Multi-domain]  Cd Length: 38  Bit Score: 40.75  E-value: 1.13e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2258096898 1147 VYYYHLDQLNTPRFVTNNKTEVVWENQADVYG 1178
Cdd:pfam03527    1 IYYYHTDHLGTPEELTDEAGEIVWSAEYDAWG 32
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
361-396 1.26e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.66  E-value: 1.26e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2258096898  361 YDENKNLVRATNQQGETEHYGYNAANLLTKRTRASG 396
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
554-593 1.89e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 40.27  E-value: 1.89e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2258096898  554 YDKFGQLTQHINKDGQVTKFVWNEQGELLAKHHNDELIRY 593
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTR 40
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
533-569 2.37e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.89  E-value: 2.37e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2258096898  533 YSVAGLLQSETFGDGRTVLYSYDKFGQLTQHINKDGQ 569
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
449-483 4.36e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.12  E-value: 4.36e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2258096898  449 HNAQGKLVKHTDPNGNVWHYGYNTKGQKITEIKPD 483
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
491-527 6.53e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 38.35  E-value: 6.53e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2258096898  491 YTPYGQLESITQPDGSVTKFAYNQLGQRVLTTLPDGQ 527
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
706-742 9.21e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.96  E-value: 9.21e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2258096898  706 YDANENPTQVTGFDGTLQQYKYDACNRLTSVTQSDKR 742
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Bacuni_01323_like cd12871
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded ...
552-713 3.36e-03

Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.


Pssm-ID: 214015 [Multi-domain]  Cd Length: 231  Bit Score: 40.87  E-value: 3.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  552 YSYDKFGQLTQHINKDGQVTKFvwnEQGELLAKHHNDELIRYSYDSLGRVNATINNAGLLTQYKYNEHGQlAQTIAFDek 631
Cdd:cd12871     21 FEYDADGRLTSITTTQEGEAEE---ITYTTTITYEPNVITVTDDGGKTVSTYTLNEKGYVTSCTETEYGK-GQLRTYT-- 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2258096898  632 dpehkqqqhFSYDDAGRLISSKNSKGDTTEQHfeglsqphcviqpdgsalHLTYDKERNLTAIERSDGHV--YRISYDAN 709
Cdd:cd12871     95 ---------FTYNADGQLTKIVESIGTEYSTI------------------TITWNNGDIVSISTKSNTEEneSKITYTSD 147

                   ....
gi 2258096898  710 ENPT 713
Cdd:cd12871    148 KVYN 151
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
491-528 4.24e-03

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.41  E-value: 4.24e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2258096898  491 YTPYGQLESITQPDGSVTKFAYNQLGQRVLTTLPDGQT 528
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGS 38
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
449-511 9.15e-03

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 35.26  E-value: 9.15e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2258096898  449 HNAQGKLVKHTDPNGNVWhygyntkgqkiteikpdsseiKYSYTPYGQLESITQPDGSVTKFA 511
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTT---------------------RYTYDAAGRLVEITDADGGSTRYE 42
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
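
To illustrate how an E-value threshold like the one listed above acts on the hit list, here is a minimal sketch that parses interval/E-value lines in the form used on this page (for example `1175-1252 7.51e-34`) and keeps hits at or below a cutoff. The names `parse_hit` and `filter_hits` are hypothetical, and treating the cutoff as inclusive is an assumption rather than documented CD-Search behavior.

```python
import re

# Matches lines such as "421-1252 4.66e-58": start-end, whitespace, E-value.
HIT_LINE = re.compile(r"^(\d+)-(\d+)\s+(\d+(?:\.\d+)?e[+-]?\d+)$", re.IGNORECASE)


def parse_hit(line: str) -> tuple[int, int, float]:
    """Parse one 'start-end evalue' line into (start, end, evalue)."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not an interval/E-value line: {line!r}")
    start, end, evalue = match.groups()
    return int(start), int(end), float(evalue)


def filter_hits(lines, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    hits = [parse_hit(line) for line in lines]
    return [hit for hit in hits if hit[2] <= threshold]


if __name__ == "__main__":
    # Three interval lines copied from the hit list on this page.
    sample = ["1175-1252 7.51e-34", "552-713 3.36e-03", "449-511 9.15e-03"]
    for start, end, evalue in filter_hits(sample):
        print(f"{start}-{end}: {end - start + 1} residues, E-value {evalue:g}")
```

The span length is computed as `end - start + 1` because the intervals on this page are 1-based and inclusive.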
