View
Concise Results
Standard Results
Full Results
nipped-B-like protein isoform X3 [Rattus norvegicus]
Protein Classification
List of domain hits
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1146-2356
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
:Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1439.79
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1146 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1225
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1226 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1305
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1306 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD vdgepm Y IQMVTAL V LQL I 1381
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1382 Q CV V H LP SS EK DPNSEEDSNKKV D Q ----- DVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1454
Cdd:cd23958 230 Q SS V K LP NL EK ESSRDKSLEEDS D E llede ESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1455 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1534
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1535 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K SQKD E ESSDGAHHA keiettgqimhra E S RK R FL R S I i 1614
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K AAEE E DTILKLELS ------------- E L RK K FL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1615 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1694
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1695 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1774
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1775 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1848
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1849 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 1927
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1928 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2007
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2008 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2081
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2082 AF I Q HP S L MFEQ EV KN L YNS IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2156
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2157 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2236
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2237 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKTEVTM 2310
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPSDLDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 1958753999 2311 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2356
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
PspC_subgroup_2 super family
cl41463
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-719
2.82e-14
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
The actual alignment was detected with superfamily member NF033839 :Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 78.66
E-value: 2.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 482 P ENH PE TPKN K SD PE LS K S E M K qne SR L SES KP NENQLG E SKSN E S K letktetqteel K Q S E NKTT E S K QSESA vve PK 561
Cdd:NF033839 301 P SPQ PE KKEV K PE PE TP K P E V K --- PQ L EKP KP EVKPQP E KPKP E V K ------------ P Q L E TPKP E V K PQPEK --- PK 362
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 562 QNENRLCDT - KP NDNK Q NN T RSENT K AR PE T PK QKAESR PE T PK QKSEGR PE T PK QKGDGR PE T PK QKSEGR PE T PK QKG 640
Cdd:NF033839 363 PEVKPQPEK p KP EVKP Q PE T PKPEV K PQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEV 442
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 641 EGR PE T PK hr H E NRKDSGK P ST E K KP DVS K H K QDI K SDSSRL K SERA eal K QRP D GRSE S LRRD - HDS KQ K S DDRGES E R 719
Cdd:NF033839 443 KPQ PE K PK -- P E VKPQPET P KP E V KP QPE K P K PEV K PQPEKP K PDNS --- K PQA D DKKP S TPNN l SKD KQ P S NQASTN E K 517
PTZ00121 super family
cl31754
MAEBL; Provisional
448-999
2.21e-10
MAEBL; Provisional
The actual alignment was detected with superfamily member PTZ00121 :Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 66.70
E-value: 2.21e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 448 Q D SDNI KK P EE TK qc N DAP I SVLQ E DSVGSLKSIPENHPETPKN K S D p EL S K S E -- M K QN E SRLS E S K pnenqlge S K SN 525
Cdd:PTZ00121 1237 K D AEEA KK A EE ER -- N NEE I RKFE E ARMAHFARRQAAIKAEEAR K A D - EL K K A E ek K K AD E AKKA E E K -------- K K AD 1305
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 526 E S K LETKTETQTE E L K QSENKTTESKQSESAVV E PKQNENRLCDTKPNDNKQNNTRS E NTKARP E TP K QK A ESRPETP K Q 605
Cdd:PTZ00121 1306 E A K KKAEEAKKAD E A K KKAEEAKKKADAAKKKA E EAKKAAEAAKAEAEAAADEAEAA E EKAEAA E KK K EE A KKKADAA K K 1385
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 606 K S E grpet P K Q K G D grpe TP K Q K S E grpetpkqkgegrp E TP K HRH E NR K dsg KPSTE KK P D VS K H K QDI K SDSSRL K SE 685
Cdd:PTZ00121 1386 K A E ----- E K K K A D ---- EA K K K A E -------------- E DK K KAD E LK K --- AAAAK KK A D EA K K K AEE K KKADEA K KK 1439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 686 RA EA L K - QRPDGRS E SLRRDHDS K Q K SDDRGESERHRGDQSRVRRPETLRSSSR n E HST K S D GS K TEKLER K HRH E SGDS 764
Cdd:PTZ00121 1440 AE EA K K a DEAKKKA E EAKKAEEA K K K AEEAKKADEAKKKAEEAKKADEAKKKAE - E AKK K A D EA K KAAEAK K KAD E AKKA 1518
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 765 RDRPSGEQKSRPDSPR vkqg DTNKSRPGFKSPNSKDD K RT E GNRSKVDSN KA HTDN KAE FPSYLLGGRSSAL K N fv IPKI 844
Cdd:PTZ00121 1519 EEAKKADEAKKAEEAK ---- KADEAKKAEEKKKADEL K KA E ELKKAEEKK KA EEAK KAE EDKNMALRKAEEA K K -- AEEA 1592
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 845 KRDKDGNITQ E T KK md MK G E QKD K V E KMGL - V E D L N K GA --- K P V VV L Q K LSLDDVQ K L -- I K DR EE KSRSSLKSLKN K P 918
Cdd:PTZ00121 1593 RIEEVMKLYE E E KK -- MK A E EAK K A E EAKI k A E E L K K AE eek K K V EQ L K K KEAEEKK K A ee L K KA EE ENKIKAAEEAK K A 1670
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 919 SKSN K GS id QSVL K ELPP E LL A EIESTMPLC E RV K MNKR K RSTVN EK P K YA E ISSD E DNDSDE A f E SSR K RHKK D DD KA W 998
Cdd:PTZ00121 1671 EEDK K KA -- EEAK K AEED E KK A AEALKKEAE E AK K AEEL K KKEAE EK K K AE E LKKA E EENKIK A - E EAK K EAEE D KK KA E 1747
.
gi 1958753999 999 E 999
Cdd:PTZ00121 1748 E 1748
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1146-2356
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1439.79
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1146 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1225
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1226 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1305
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1306 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD vdgepm Y IQMVTAL V LQL I 1381
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1382 Q CV V H LP SS EK DPNSEEDSNKKV D Q ----- DVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1454
Cdd:cd23958 230 Q SS V K LP NL EK ESSRDKSLEEDS D E llede ESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1455 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1534
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1535 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K SQKD E ESSDGAHHA keiettgqimhra E S RK R FL R S I i 1614
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K AAEE E DTILKLELS ------------- E L RK K FL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1615 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1694
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1695 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1774
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1775 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1848
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1849 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 1927
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1928 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2007
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2008 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2081
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2082 AF I Q HP S L MFEQ EV KN L YNS IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2156
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2157 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2236
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2237 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKTEVTM 2310
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPSDLDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 1958753999 2311 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2356
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
Nipped-B_C
pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2158-2339
1.27e-69
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.
Pssm-ID: 463722
Cd Length: 180
Bit Score: 232.04
E-value: 1.27e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2158 S S SIM Q L YLK QV LE AFFHTQSS VR HF AL N V I AL T L N QGL I HP VQ C V P Y LIA MG T D P E P AM R NK A DQQLV E IDK K YAGFIH 2237
Cdd:pfam12830 1 C S ALV Q R YLK HI LE ICLSSDDQ VR LL AL E V L AL I L R QGL V HP KE C I P T LIA LE T S P N P YI R KL A FELHK E LHE K HESLLE 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2238 MKAVA G MKMSYQV Q QAINTC lkdpvrgf RQD E SSSALC S H LYS MI R G N RQH R RA FL I SL LN LF D D ------ TAKTEVTM L 2311
Cdd:pfam12830 81 SRYME G IRLAFEY Q RRVLSG -------- ATL E PPTSFL S L LYS LL R S N KKS R KK FL K SL VK LF F D ldlsse SSPSDLDF L 152
170 180
....*....|....*....|....*...
gi 1958753999 2312 LYI A D NLA CF PYQTQ E E P LF IM HHID IT 2339
Cdd:pfam12830 153 RFL A E NLA FL PYQTQ D E V LF LI HHID RI 180
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-719
2.82e-14
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 78.66
E-value: 2.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 482 P ENH PE TPKN K SD PE LS K S E M K qne SR L SES KP NENQLG E SKSN E S K letktetqteel K Q S E NKTT E S K QSESA vve PK 561
Cdd:NF033839 301 P SPQ PE KKEV K PE PE TP K P E V K --- PQ L EKP KP EVKPQP E KPKP E V K ------------ P Q L E TPKP E V K PQPEK --- PK 362
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 562 QNENRLCDT - KP NDNK Q NN T RSENT K AR PE T PK QKAESR PE T PK QKSEGR PE T PK QKGDGR PE T PK QKSEGR PE T PK QKG 640
Cdd:NF033839 363 PEVKPQPEK p KP EVKP Q PE T PKPEV K PQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEV 442
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 641 EGR PE T PK hr H E NRKDSGK P ST E K KP DVS K H K QDI K SDSSRL K SERA eal K QRP D GRSE S LRRD - HDS KQ K S DDRGES E R 719
Cdd:NF033839 443 KPQ PE K PK -- P E VKPQPET P KP E V KP QPE K P K PEV K PQPEKP K PDNS --- K PQA D DKKP S TPNN l SKD KQ P S NQASTN E K 517
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
454-822
8.29e-14
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 77.12
E-value: 8.29e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 454 KK PE ET K QCND AP ISVLQEDSV G SLK S I P ENHP E tpknksd P E LS K SEMKQNE S RLSESKPNENQ lge S K SNESKLETKT 533
Cdd:NF033839 165 EN PE HQ K PTTP AP DTKPSPQPE G KKP S V P DINQ E ------- K E KA K LAVATYM S KILDDIQKHHL --- Q K EKHRQIVALI 234
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 534 ETQT E EL KQ -- SE NKTTES K QSESAV V EPKQNENRLCD TK PNDNKQNN T RS E NTKAR P ET PK QKAESR P ETP K QKSEGR P 611
Cdd:NF033839 235 KELD E LK KQ al SE IDNVNT K VEIENT V HKIFADMDAVV TK FKKGLTQD T PK E PGNKK P SA PK PGMQPS P QPE K KEVKPE P 314
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 612 ETPK QKGDGRP E T PK QKSEGR PE T PK QKGEGRP ETPK hr H E NRKDSG KP ST E K KP DVS K H K QDI K SDSSRL K S E - RAEAL 690
Cdd:NF033839 315 ETPK PEVKPQL E K PK PEVKPQ PE K PK PEVKPQL ETPK -- P E VKPQPE KP KP E V KP QPE K P K PEV K PQPETP K P E v KPQPE 392
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 691 K QR P DGRSESLRRDHDS K --- Q K SDDRGESERHRGDQSRVRR PE TLRSSSRNEHSTKSDGS K TEK le RKHRH E SGDSRDR 767
Cdd:NF033839 393 K PK P EVKPQPEKPKPEV K pqp E K PKPEVKPQPEKPKPEVKPQ PE KPKPEVKPQPEKPKPEV K PQP -- ETPKP E VKPQPEK 470
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1958753999 768 P SG E Q K SR P DS P R vkq G D TN K SRPGF K S P NSKD dkrt EGNRS K VD SN K A H T DN KA 822
Cdd:NF033839 471 P KP E V K PQ P EK P K --- P D NS K PQADD K K P STPN ---- NLSKD K QP SN Q A S T NE KA 518
PTZ00121
PTZ00121
MAEBL; Provisional
448-999
2.21e-10
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 66.70
E-value: 2.21e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 448 Q D SDNI KK P EE TK qc N DAP I SVLQ E DSVGSLKSIPENHPETPKN K S D p EL S K S E -- M K QN E SRLS E S K pnenqlge S K SN 525
Cdd:PTZ00121 1237 K D AEEA KK A EE ER -- N NEE I RKFE E ARMAHFARRQAAIKAEEAR K A D - EL K K A E ek K K AD E AKKA E E K -------- K K AD 1305
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 526 E S K LETKTETQTE E L K QSENKTTESKQSESAVV E PKQNENRLCDTKPNDNKQNNTRS E NTKARP E TP K QK A ESRPETP K Q 605
Cdd:PTZ00121 1306 E A K KKAEEAKKAD E A K KKAEEAKKKADAAKKKA E EAKKAAEAAKAEAEAAADEAEAA E EKAEAA E KK K EE A KKKADAA K K 1385
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 606 K S E grpet P K Q K G D grpe TP K Q K S E grpetpkqkgegrp E TP K HRH E NR K dsg KPSTE KK P D VS K H K QDI K SDSSRL K SE 685
Cdd:PTZ00121 1386 K A E ----- E K K K A D ---- EA K K K A E -------------- E DK K KAD E LK K --- AAAAK KK A D EA K K K AEE K KKADEA K KK 1439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 686 RA EA L K - QRPDGRS E SLRRDHDS K Q K SDDRGESERHRGDQSRVRRPETLRSSSR n E HST K S D GS K TEKLER K HRH E SGDS 764
Cdd:PTZ00121 1440 AE EA K K a DEAKKKA E EAKKAEEA K K K AEEAKKADEAKKKAEEAKKADEAKKKAE - E AKK K A D EA K KAAEAK K KAD E AKKA 1518
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 765 RDRPSGEQKSRPDSPR vkqg DTNKSRPGFKSPNSKDD K RT E GNRSKVDSN KA HTDN KAE FPSYLLGGRSSAL K N fv IPKI 844
Cdd:PTZ00121 1519 EEAKKADEAKKAEEAK ---- KADEAKKAEEKKKADEL K KA E ELKKAEEKK KA EEAK KAE EDKNMALRKAEEA K K -- AEEA 1592
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 845 KRDKDGNITQ E T KK md MK G E QKD K V E KMGL - V E D L N K GA --- K P V VV L Q K LSLDDVQ K L -- I K DR EE KSRSSLKSLKN K P 918
Cdd:PTZ00121 1593 RIEEVMKLYE E E KK -- MK A E EAK K A E EAKI k A E E L K K AE eek K K V EQ L K K KEAEEKK K A ee L K KA EE ENKIKAAEEAK K A 1670
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 919 SKSN K GS id QSVL K ELPP E LL A EIESTMPLC E RV K MNKR K RSTVN EK P K YA E ISSD E DNDSDE A f E SSR K RHKK D DD KA W 998
Cdd:PTZ00121 1671 EEDK K KA -- EEAK K AEED E KK A AEALKKEAE E AK K AEEL K KKEAE EK K K AE E LKKA E EENKIK A - E EAK K EAEE D KK KA E 1747
.
gi 1958753999 999 E 999
Cdd:PTZ00121 1748 E 1748
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-680
1.94e-08
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 59.78
E-value: 1.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 482 PENH PE T PK NKSD P E L SK s EMKQNESRLSES KP NENQLG E SKSN E S K LETK T ETQTEELKQSEN K TTESK Q S E SAVV E P K 561
Cdd:NF033839 332 VKPQ PE K PK PEVK P Q L ET - PKPEVKPQPEKP KP EVKPQP E KPKP E V K PQPE T PKPEVKPQPEKP K PEVKP Q P E KPKP E V K 410
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 562 QNENR lcd T KP NDNK Q NNTRSENT K AR PE T PK QKAESR PE T P ----- K Q ksegr PETPK QKGDGR PE T PK QKSEGR PE T P 636
Cdd:NF033839 411 PQPEK --- P KP EVKP Q PEKPKPEV K PQ PE K PK PEVKPQ PE K P kpevk P Q ----- PETPK PEVKPQ PE K PK PEVKPQ PE K P 482
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1958753999 637 K ------ Q KGEGR P E TP KHRHENRKD S GKP ST EK K P d VS K H K QDIK S DS S 680
Cdd:NF033839 483 K pdnskp Q ADDKK P S TP NNLSKDKQP S NQA ST NE K A - TN K P K KSLP S TG S 531
PRK12678
PRK12678
transcription termination factor Rho; Provisional
591-808
2.64e-07
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 56.45
E-value: 2.64e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 591 TPKQK A ESRPETPKQKSEG R PETPKQKGD -- GRPETPKQKSEGRPETPKQKGEGRPETPKHRHEN R KDSGKPSTEKKPDV 668
Cdd:PRK12678 63 AAAAA A TPAAPAAAARRAA R AAAAARQAE qp AAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA R ERRERGEAARRGAA 142
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 669 S K HKQDIKSDSSRLKSER AE A l KQRPDGRSESL R R D HDSK Q KSDD RGE SE R HRGDQSRVRRPE tl R SSS R NEHSTKSDGS 748
Cdd:PRK12678 143 R K AGEGGEQPATEARADA AE R - TEEEERDERRR R G D REDR Q AEAE RGE RG R REERGRDGDDRD -- R RDR R EQGDRREERG 219
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 749 KTEKLE R KH R HESG D S RD RPSGEQKSRPDSPRVKQ G DTNKS R P G FKS p NSK D DKRTE G NR 808
Cdd:PRK12678 220 RRDGGD R RG R RRRR D R RD ARGDDNREDRGDRDGDD G EGRGG R R G RRF - RDR D RRGRR G GD 278
ftsN
TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
481-666
3.45e-07
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain]
Cd Length: 298
Bit Score: 54.70
E-value: 3.45e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 481 IPENHPE tpkn KSD PE LSKSEMKQNESRLSESK P N -- E NQLGESKSN E SKLETKTETQTEELKQSENKTTESKQSESAVV 558
Cdd:TIGR02223 47 LLTESKQ ---- ANE PE TLQPKNQTENGETAADL P P kp E ERWSYIEEL E AREVLINDPEEPSNGGGVEESAQLTAEQRQLL 122
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 559 E PK Q NEN R lcdtkp NDN K QNN T RSENTKARP E TP KQ K AE SR P ETPKQKSEGRPETPKQ K GDGRPET -- P KQK SEGRPETP 636
Cdd:TIGR02223 123 E QM Q ADM R ------ AAE K VLA T APSEQTVAV E AR KQ T AE KK P QKARTAEAQKTPVETE K IASKVKE ak Q KQK ALPKQTAE 196
170 180 190
....*....|....*....|....*....|
gi 1958753999 637 K Q KGEGRP ET P kh RHENRK D SG KP STEK K P 666
Cdd:TIGR02223 197 T Q SNSKPI ET A -- PKADKA D KT KP KPKE K A 224
SF-CC1
TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
691-808
4.77e-04
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain]
Cd Length: 494
Bit Score: 45.30
E-value: 4.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 691 KQ R PDG R SESLRRDH D SKQKS D - D R ges ER H R g D Q SR V R RPE tl RS SS R NE H stk S D GSKTEKL ER KH R HESGDS R D RP S 769
Cdd:TIGR01622 3 RD R ERE R LRDSSSAG D RDRRR D k G R --- ER S R - D R SR D R ERS -- RS RR R DR H --- R D RDYYRGR ER RS R SRRPNR R Y RP R 73
90 100 110
....*....|....*....|....*....|....*....
gi 1958753999 770 GEQKS R P DS P R VKQG D TNKS R PGFKSPNSKDDKR TE GN R 808
Cdd:TIGR01622 74 EKRRR R G DS Y R RRRD D RRSR R EKPRARDGTPEPL TE DE R 112
Caldesmon
pfam02029
Caldesmon;
369-749
5.98e-04
Caldesmon;
Pssm-ID: 460421 [Multi-domain]
Cd Length: 495
Bit Score: 45.24
E-value: 5.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 369 IE R E SAIE RER FSKEVQDK dkplkk R K Q DSYPQEA G GA T GGNR P ASQETGSTGNGSR P ALMVSI D LHQ A G radsqasltq 448
Cdd:pfam02029 1 IE D E EEAA RER RRRAREER ------ R R Q KEEEEPS G QV T ESVE P NEHNSYEEDSELK P SGQGGL D EEE A F ---------- 64
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 449 d S D NIK K P EE TK Q CNDAPISVL Q EDSVGSLKSIP E NHP E TPK N KSDP E L S KS E MK - QNE SRL SES K PN E NQLG E SKSN E S 527
Cdd:pfam02029 65 - L D RTA K R EE RR Q KRLQEALER Q KEFDPTIADEK E SVA E RKE N NEEE E N S SW E KE e KRD SRL GRY K EE E TEIR E KEYQ E N 143
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 528 K LE T KTETQT EE LKQS E N K TT E SK qsesav VE P KQ N ENRLCDTKPNDN K QNNTRS E NTK --- ARPET P KQ K AESRP E tp K 604
Cdd:pfam02029 144 K WS T EVRQAE EE GEEE E D K SE E AE ------ EV P TE N FAKEEVKDEKIK K EKKVKY E SKV fld QKRGH P EV K SQNGE E -- E 215
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 605 QKSEGRPETPK Q K G DGRPETPKQKS E GRP E TPKQKG E G R petpkhrhen R KDSG K P S T E KKP dv SKH KQ - DIKSDSSR LK 683
Cdd:pfam02029 216 VTKLKVTTKRR Q G G LSQSQEREEEA E VFL E AEQKLE E L R ---------- R RRQE K E S E E FEK -- LRQ KQ q EAELELEE LK 283
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958753999 684 SE R aeal KQ R PDGRS E SLR R DHDSKQKSDD R G E S E RH R - GDQSRV RR P E TLRSSSRNEHSTK S D G S K 749
Cdd:pfam02029 284 KK R ---- EE R RKLLE E EEQ R RKQEEAERKL R E E E E KR R m KEEIER RR A E AAEKRQKLPEDSS S E G K K 346
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
454-819
4.45e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 42.31
E-value: 4.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 454 KKPEE TK QCN DA PISVLQE D SVGSL K SIP E NHPETPKNKSDPELS K S E MKQ N --------- E SRLS ES ---- K PN E NQ L G 520
Cdd:NF033838 114 ELTSK TK KEL DA AFEQFKK D TLEPG K KVA E ATKKVEEAEKKAKDQ K E E DRR N yptntyktl E LEIA ES dvev K KA E LE L V 193
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 521 ESKSN E SK letktet QT E EL KQ SEN K T t ESK QS E SAVV E ---- PKQNENRLCDTKPNDNKQNNTRSENTKARPET PK QK A 596
Cdd:NF033838 194 KEEAK E PR ------- DE E KI KQ AKA K V - ESK KA E ATRL E kikt DREKAEEEAKRRADAKLKEAVEKNVATSEQDK PK RR A 265
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 597 E ---- SR P E TP KQ K SEGRPETPKQK G DGRPET P KQ K S E GR - P E TP K QKG E GRPETPKHRH E N R KDS g KPS T E K KPDVSKH 671
Cdd:NF033838 266 K rgvl GE P A TP DK K ENDAKSSDSSV G EETLPS P SL K P E KK v A E AE K KVE E AKKKAKDQKE E D R RNY - PTN T Y K TLELEIA 344
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 672 KQ D I K SDSSR L KSERA EA LKQ R PDGRSESLRRDHD SK QKSDD R G E serhrgdqsrvrrpetlrsssrneh ST K S D GS K T E 751
Cdd:NF033838 345 ES D V K VKEAE L ELVKE EA KEP R NEEKIKQAKAKVE SK KAEAT R L E ------------------------- KI K T D RK K A E 399
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958753999 752 KLERKHRH E SGDSRDR P S g EQ KSRPDS P RVK qgdtnks R P GF K SPNSKDDKRT E gnrs K VDSNK A HT D 819
Cdd:NF033838 400 EEAKRKAA E EDKVKEK P A - EQ PQPAPA P QPE ------- K P AP K PEKPAEQPKA E ---- K PADQQ A EE D 455
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
364-648
7.99e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 41.54
E-value: 7.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 364 AE I E RIE R ES A IER E RFS K E V -- QDK DKP l K K R KQDSYPQ E AGGATGGNRP A SQETG S T G NGSR P A lmvsidlhqagra D 441
Cdd:NF033838 233 AE E E AKR R AD A KLK E AVE K N V at SEQ DKP - K R R AKRGVLG E PATPDKKEND A KSSDS S V G EETL P S ------------- P 298
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 442 S QASLTQDSDNI KK P EE T - K QCN D AP isvl Q ED S vgslksip E N H P ETPKNKSDP E LSK S EM K QN E SR L SES K PNENQ lg 520
Cdd:NF033838 299 S LKPEKKVAEAE KK V EE A k K KAK D QK ---- E ED R -------- R N Y P TNTYKTLEL E IAE S DV K VK E AE L ELV K EEAKE -- 364
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 521 es KS NE S K L etktetqteel KQ SEN K T t ESK QS E SAVV E PKQNENR lcd TKPNDN K QNNTRSENT K AR P - E T P KQKAESR 599
Cdd:NF033838 365 -- PR NE E K I ----------- KQ AKA K V - ESK KA E ATRL E KIKTDRK --- KAEEEA K RKAAEEDKV K EK P a E Q P QPAPAPQ 427
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 600 PE T P KQ K S E GRP E T PK - Q K GDGR ---------- P E TPKQKSEGR P et PK QKGEGR P E TPK 648
Cdd:NF033838 428 PE K P AP K P E KPA E Q PK a E K PADQ qaeedyarrs E E EYNRLTQQQ P -- PK TEKPAQ P S TPK 485
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1146-2356
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1439.79
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1146 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1225
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1226 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1305
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1306 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD vdgepm Y IQMVTAL V LQL I 1381
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1382 Q CV V H LP SS EK DPNSEEDSNKKV D Q ----- DVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1454
Cdd:cd23958 230 Q SS V K LP NL EK ESSRDKSLEEDS D E llede ESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1455 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1534
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1535 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K SQKD E ESSDGAHHA keiettgqimhra E S RK R FL R S I i 1614
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K AAEE E DTILKLELS ------------- E L RK K FL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1615 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1694
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1695 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1774
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1775 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1848
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1849 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 1927
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1928 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2007
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2008 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2081
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2082 AF I Q HP S L MFEQ EV KN L YNS IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2156
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2157 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2236
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2237 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKTEVTM 2310
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPSDLDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 1958753999 2311 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2356
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
Nipped-B_C
pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2158-2339
1.27e-69
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.
Pssm-ID: 463722
Cd Length: 180
Bit Score: 232.04
E-value: 1.27e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2158 S S SIM Q L YLK QV LE AFFHTQSS VR HF AL N V I AL T L N QGL I HP VQ C V P Y LIA MG T D P E P AM R NK A DQQLV E IDK K YAGFIH 2237
Cdd:pfam12830 1 C S ALV Q R YLK HI LE ICLSSDDQ VR LL AL E V L AL I L R QGL V HP KE C I P T LIA LE T S P N P YI R KL A FELHK E LHE K HESLLE 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 2238 MKAVA G MKMSYQV Q QAINTC lkdpvrgf RQD E SSSALC S H LYS MI R G N RQH R RA FL I SL LN LF D D ------ TAKTEVTM L 2311
Cdd:pfam12830 81 SRYME G IRLAFEY Q RRVLSG -------- ATL E PPTSFL S L LYS LL R S N KKS R KK FL K SL VK LF F D ldlsse SSPSDLDF L 152
170 180
....*....|....*....|....*...
gi 1958753999 2312 LYI A D NLA CF PYQTQ E E P LF IM HHID IT 2339
Cdd:pfam12830 153 RFL A E NLA FL PYQTQ D E V LF LI HHID RI 180
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-719
2.82e-14
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 78.66
E-value: 2.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 482 P ENH PE TPKN K SD PE LS K S E M K qne SR L SES KP NENQLG E SKSN E S K letktetqteel K Q S E NKTT E S K QSESA vve PK 561
Cdd:NF033839 301 P SPQ PE KKEV K PE PE TP K P E V K --- PQ L EKP KP EVKPQP E KPKP E V K ------------ P Q L E TPKP E V K PQPEK --- PK 362
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 562 QNENRLCDT - KP NDNK Q NN T RSENT K AR PE T PK QKAESR PE T PK QKSEGR PE T PK QKGDGR PE T PK QKSEGR PE T PK QKG 640
Cdd:NF033839 363 PEVKPQPEK p KP EVKP Q PE T PKPEV K PQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEV 442
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 641 EGR PE T PK hr H E NRKDSGK P ST E K KP DVS K H K QDI K SDSSRL K SERA eal K QRP D GRSE S LRRD - HDS KQ K S DDRGES E R 719
Cdd:NF033839 443 KPQ PE K PK -- P E VKPQPET P KP E V KP QPE K P K PEV K PQPEKP K PDNS --- K PQA D DKKP S TPNN l SKD KQ P S NQASTN E K 517
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
454-822
8.29e-14
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 77.12
E-value: 8.29e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 454 KK PE ET K QCND AP ISVLQEDSV G SLK S I P ENHP E tpknksd P E LS K SEMKQNE S RLSESKPNENQ lge S K SNESKLETKT 533
Cdd:NF033839 165 EN PE HQ K PTTP AP DTKPSPQPE G KKP S V P DINQ E ------- K E KA K LAVATYM S KILDDIQKHHL --- Q K EKHRQIVALI 234
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 534 ETQT E EL KQ -- SE NKTTES K QSESAV V EPKQNENRLCD TK PNDNKQNN T RS E NTKAR P ET PK QKAESR P ETP K QKSEGR P 611
Cdd:NF033839 235 KELD E LK KQ al SE IDNVNT K VEIENT V HKIFADMDAVV TK FKKGLTQD T PK E PGNKK P SA PK PGMQPS P QPE K KEVKPE P 314
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 612 ETPK QKGDGRP E T PK QKSEGR PE T PK QKGEGRP ETPK hr H E NRKDSG KP ST E K KP DVS K H K QDI K SDSSRL K S E - RAEAL 690
Cdd:NF033839 315 ETPK PEVKPQL E K PK PEVKPQ PE K PK PEVKPQL ETPK -- P E VKPQPE KP KP E V KP QPE K P K PEV K PQPETP K P E v KPQPE 392
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 691 K QR P DGRSESLRRDHDS K --- Q K SDDRGESERHRGDQSRVRR PE TLRSSSRNEHSTKSDGS K TEK le RKHRH E SGDSRDR 767
Cdd:NF033839 393 K PK P EVKPQPEKPKPEV K pqp E K PKPEVKPQPEKPKPEVKPQ PE KPKPEVKPQPEKPKPEV K PQP -- ETPKP E VKPQPEK 470
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1958753999 768 P SG E Q K SR P DS P R vkq G D TN K SRPGF K S P NSKD dkrt EGNRS K VD SN K A H T DN KA 822
Cdd:NF033839 471 P KP E V K PQ P EK P K --- P D NS K PQADD K K P STPN ---- NLSKD K QP SN Q A S T NE KA 518
PTZ00121
PTZ00121
MAEBL; Provisional
448-999
2.21e-10
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 66.70
E-value: 2.21e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 448 Q D SDNI KK P EE TK qc N DAP I SVLQ E DSVGSLKSIPENHPETPKN K S D p EL S K S E -- M K QN E SRLS E S K pnenqlge S K SN 525
Cdd:PTZ00121 1237 K D AEEA KK A EE ER -- N NEE I RKFE E ARMAHFARRQAAIKAEEAR K A D - EL K K A E ek K K AD E AKKA E E K -------- K K AD 1305
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 526 E S K LETKTETQTE E L K QSENKTTESKQSESAVV E PKQNENRLCDTKPNDNKQNNTRS E NTKARP E TP K QK A ESRPETP K Q 605
Cdd:PTZ00121 1306 E A K KKAEEAKKAD E A K KKAEEAKKKADAAKKKA E EAKKAAEAAKAEAEAAADEAEAA E EKAEAA E KK K EE A KKKADAA K K 1385
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 606 K S E grpet P K Q K G D grpe TP K Q K S E grpetpkqkgegrp E TP K HRH E NR K dsg KPSTE KK P D VS K H K QDI K SDSSRL K SE 685
Cdd:PTZ00121 1386 K A E ----- E K K K A D ---- EA K K K A E -------------- E DK K KAD E LK K --- AAAAK KK A D EA K K K AEE K KKADEA K KK 1439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 686 RA EA L K - QRPDGRS E SLRRDHDS K Q K SDDRGESERHRGDQSRVRRPETLRSSSR n E HST K S D GS K TEKLER K HRH E SGDS 764
Cdd:PTZ00121 1440 AE EA K K a DEAKKKA E EAKKAEEA K K K AEEAKKADEAKKKAEEAKKADEAKKKAE - E AKK K A D EA K KAAEAK K KAD E AKKA 1518
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 765 RDRPSGEQKSRPDSPR vkqg DTNKSRPGFKSPNSKDD K RT E GNRSKVDSN KA HTDN KAE FPSYLLGGRSSAL K N fv IPKI 844
Cdd:PTZ00121 1519 EEAKKADEAKKAEEAK ---- KADEAKKAEEKKKADEL K KA E ELKKAEEKK KA EEAK KAE EDKNMALRKAEEA K K -- AEEA 1592
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 845 KRDKDGNITQ E T KK md MK G E QKD K V E KMGL - V E D L N K GA --- K P V VV L Q K LSLDDVQ K L -- I K DR EE KSRSSLKSLKN K P 918
Cdd:PTZ00121 1593 RIEEVMKLYE E E KK -- MK A E EAK K A E EAKI k A E E L K K AE eek K K V EQ L K K KEAEEKK K A ee L K KA EE ENKIKAAEEAK K A 1670
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 919 SKSN K GS id QSVL K ELPP E LL A EIESTMPLC E RV K MNKR K RSTVN EK P K YA E ISSD E DNDSDE A f E SSR K RHKK D DD KA W 998
Cdd:PTZ00121 1671 EEDK K KA -- EEAK K AEED E KK A AEALKKEAE E AK K AEEL K KKEAE EK K K AE E LKKA E EENKIK A - E EAK K EAEE D KK KA E 1747
.
gi 1958753999 999 E 999
Cdd:PTZ00121 1748 E 1748
Cohesin_HEAT
pfam12765
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most ...
1677-1718
2.73e-09
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most frequently in sister chromatid cohesion proteins such as Nipped-B. HEAT repeats are found tandemly repeated in many proteins, and they appear to serve as flexible scaffolding on which other components can assemble.
Pssm-ID: 403845 [Multi-domain]
Cd Length: 42
Bit Score: 54.77
E-value: 2.73e-09
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1958753999 1677 K C LS EV V AV DPSIL ARL D MQRGVHG RL M D N S T SVR E AA V ELL 1718
Cdd:pfam12765 1 K A LS SL V EK DPSIL DSP D VKEAISR RL T D S S P SVR D AA L ELL 42
PTZ00121
PTZ00121
MAEBL; Provisional
370-871
9.94e-09
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 61.31
E-value: 9.94e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 370 ERES A IERERFSK E VQDK D KPL KK RKQ D SYPQEAGGATGGNRPASQ E TGSTGNGSRP A LMVSIDLHQ A GR AD S --- Q A SL 446
Cdd:PTZ00121 1376 AKKK A DAAKKKAE E KKKA D EAK KK AEE D KKKADELKKAAAAKKKAD E AKKKAEEKKK A DEAKKKAEE A KK AD E akk K A EE 1455
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 447 TQDSDNI - KK P EE T K QCND A PISVLQEDSVGSL K sipe NHP E TP K N K S D PELSKS E M K Q -- N E SRLS E SKPNENQL ge S K 523
Cdd:PTZ00121 1456 AKKAEEA k KK A EE A K KADE A KKKAEEAKKADEA K ---- KKA E EA K K K A D EAKKAA E A K K ka D E AKKA E EAKKADEA -- K K 1529
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 524 SN E S K letktet QTE E L K QS E N -- K TT E S K QS E savv E P K QN E NR lcd T K PNDN K QNNTRSENTKARP E TP K QKA E S R P E 601
Cdd:PTZ00121 1530 AE E A K ------- KAD E A K KA E E kk K AD E L K KA E ---- E L K KA E EK --- K K AEEA K KAEEDKNMALRKA E EA K KAE E A R I E 1595
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 602 TPKQKS E grp E TP K Q K GD -- GRP E TP K Q K S E GRPETPKQ K GEGRPETP K HRH E NR K DSGKPST E KKPDVSKHKQDI K SDS 679
Cdd:PTZ00121 1596 EVMKLY E --- E EK K M K AE ea KKA E EA K I K A E ELKKAEEE K KKVEQLKK K EAE E KK K AEELKKA E EENKIKAAEEAK K AEE 1672
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 680 SRL K S E R A EALKQRPDGRS E S L RRDHDSKQ K SD drges E RHRGDQSRVRRP E T L RSSSR n E HST K SDGS K T E KL E R K HRH 759
Cdd:PTZ00121 1673 DKK K A E E A KKAEEDEKKAA E A L KKEAEEAK K AE ----- E LKKKEAEEKKKA E E L KKAEE - E NKI K AEEA K K E AE E D K KKA 1746
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 760 E SG dsrd RPSG E Q K SRPDSPRVKQGDTNKSRPGF K SPNSKDDKRT E GNRSKVDSN K AHT D NKAE F PSYLL GG RSS alk N F 839
Cdd:PTZ00121 1747 E EA ---- KKDE E E K KKIAHLKKEEEKKAEEIRKE K EAVIEEELDE E DEKRRMEVD K KIK D IFDN F ANIIE GG KEG --- N L 1819
490 500 510
....*....|....*....|....*....|..
gi 1958753999 840 VI PKI K RDK D GN I TQETKKMD M KG E QK D KV EK 871
Cdd:PTZ00121 1820 VI NDS K EME D SA I KEVADSKN M QL E EA D AF EK 1851
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
482-680
1.94e-08
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 59.78
E-value: 1.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 482 PENH PE T PK NKSD P E L SK s EMKQNESRLSES KP NENQLG E SKSN E S K LETK T ETQTEELKQSEN K TTESK Q S E SAVV E P K 561
Cdd:NF033839 332 VKPQ PE K PK PEVK P Q L ET - PKPEVKPQPEKP KP EVKPQP E KPKP E V K PQPE T PKPEVKPQPEKP K PEVKP Q P E KPKP E V K 410
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 562 QNENR lcd T KP NDNK Q NNTRSENT K AR PE T PK QKAESR PE T P ----- K Q ksegr PETPK QKGDGR PE T PK QKSEGR PE T P 636
Cdd:NF033839 411 PQPEK --- P KP EVKP Q PEKPKPEV K PQ PE K PK PEVKPQ PE K P kpevk P Q ----- PETPK PEVKPQ PE K PK PEVKPQ PE K P 482
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1958753999 637 K ------ Q KGEGR P E TP KHRHENRKD S GKP ST EK K P d VS K H K QDIK S DS S 680
Cdd:NF033839 483 K pdnskp Q ADDKK P S TP NNLSKDKQP S NQA ST NE K A - TN K P K KSLP S TG S 531
PTZ00121
PTZ00121
MAEBL; Provisional
359-1075
9.60e-08
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 58.23
E-value: 9.60e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 359 E LDAL AE IERIERESAI E RE R FSK E VQD K DKPLK K RKQDSYPQE A GG A TGGNR pa SQETGSTGNGSRPALMVSIDLHQAG 438
Cdd:PTZ00121 1095 E AFGK AE EAKKTETGKA E EA R KAE E AKK K AEDAR K AEEARKAED A RK A EEARK -- AEDAKRVEIARKAEDARKAEEARKA 1172
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 439 RADSQ A SLTQDSDNIK K P EE TKQCN DA P i SVLQEDSVGSLKSIP E NHPETPKN K SDPELSKS E M K QNESRLSESKPNE N Q 518
Cdd:PTZ00121 1173 EDAKK A EAARKAEEVR K A EE LRKAE DA R - KAEAARKAEEERKAE E ARKAEDAK K AEAVKKAE E A K KDAEEAKKAEEER N N 1251
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 519 LGES K SN E SKLETKTET Q TEELKQSEN K TT E S K QS E SA -- VV E P K QN E NRLCDTKPNDNKQNNTRSENT K ARP E TP K Q KA 596
Cdd:PTZ00121 1252 EEIR K FE E ARMAHFARR Q AAIKAEEAR K AD E L K KA E EK kk AD E A K KA E EKKKADEAKKKAEEAKKADEA K KKA E EA K K KA 1331
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 597 ES --- RP E TP K QKS E GRPETPKQKG D grpet PKQKS E GRP E TPKQ K G E grp E TP K HRHENR K dsg K PSTE KK P D VS K H K - 672
Cdd:PTZ00121 1332 DA akk KA E EA K KAA E AAKAEAEAAA D ----- EAEAA E EKA E AAEK K K E --- E AK K KADAAK K --- K AEEK KK A D EA K K K a 1400
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 673 QDI K SDSSR LK se R A E A L K QRP D --- GRS E SLRRDHDS K Q K SDDRGESERHRGDQSRVRRP E TLRSSS rn E HST K S D GS K 749
Cdd:PTZ00121 1401 EED K KKADE LK -- K A A A A K KKA D eak KKA E EKKKADEA K K K AEEAKKADEAKKKAEEAKKA E EAKKKA -- E EAK K A D EA K 1476
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 750 TEKL E R K hrhe SG D SRDRPSG E Q K SRP D SPRVKQGDTN K SRPGF K S pns KDD K RTEGNRSKVDSN KA HTDN KAE fpsyll 829
Cdd:PTZ00121 1477 KKAE E A K ---- KA D EAKKKAE E A K KKA D EAKKAAEAKK K ADEAK K A --- EEA K KADEAKKAEEAK KA DEAK KAE ------ 1543
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 830 ggrssalknfvip KI K RDKDGNITQ E T KK MD -- M K G E QKD K V E kmglv ED L N KGAKPVVVLQ K LSLDDVQKLI K DR EE KS 907
Cdd:PTZ00121 1544 ------------- EK K KADELKKAE E L KK AE ek K K A E EAK K A E ----- ED K N MALRKAEEAK K AEEARIEEVM K LY EE EK 1605
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 908 RSSLKSL K NKPSKSN K GS idqsvlkelpp EL LAEI E stmplc E RV K MNKR K RSTVN EK P K YA E ISSD E DNDSDE A F E SSR 987
Cdd:PTZ00121 1606 KMKAEEA K KAEEAKI K AE ----------- EL KKAE E ------ E KK K VEQL K KKEAE EK K K AE E LKKA E EENKIK A A E EAK 1668
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 988 K RH --- KK DDDKAWEY E ERDRRSSGDHRRSGHSHDGRRSSGGGRYRNRSPSDSDMEDYSPPPSLS E VARKMKK kekq K K R 1064
Cdd:PTZ00121 1669 K AE edk KK AEEAKKAE E DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE E AKKEAEE ---- D K K 1744
730
....*....|.
gi 1958753999 1065 KA Y E P K LTP EE 1075
Cdd:PTZ00121 1745 KA E E A K KDE EE 1755
PRK12678
PRK12678
transcription termination factor Rho; Provisional
591-808
2.64e-07
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 56.45
E-value: 2.64e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 591 TPKQK A ESRPETPKQKSEG R PETPKQKGD -- GRPETPKQKSEGRPETPKQKGEGRPETPKHRHEN R KDSGKPSTEKKPDV 668
Cdd:PRK12678 63 AAAAA A TPAAPAAAARRAA R AAAAARQAE qp AAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA R ERRERGEAARRGAA 142
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 669 S K HKQDIKSDSSRLKSER AE A l KQRPDGRSESL R R D HDSK Q KSDD RGE SE R HRGDQSRVRRPE tl R SSS R NEHSTKSDGS 748
Cdd:PRK12678 143 R K AGEGGEQPATEARADA AE R - TEEEERDERRR R G D REDR Q AEAE RGE RG R REERGRDGDDRD -- R RDR R EQGDRREERG 219
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 749 KTEKLE R KH R HESG D S RD RPSGEQKSRPDSPRVKQ G DTNKS R P G FKS p NSK D DKRTE G NR 808
Cdd:PRK12678 220 RRDGGD R RG R RRRR D R RD ARGDDNREDRGDRDGDD G EGRGG R R G RRF - RDR D RRGRR G GD 278
ftsN
TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
481-666
3.45e-07
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain]
Cd Length: 298
Bit Score: 54.70
E-value: 3.45e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 481 IPENHPE tpkn KSD PE LSKSEMKQNESRLSESK P N -- E NQLGESKSN E SKLETKTETQTEELKQSENKTTESKQSESAVV 558
Cdd:TIGR02223 47 LLTESKQ ---- ANE PE TLQPKNQTENGETAADL P P kp E ERWSYIEEL E AREVLINDPEEPSNGGGVEESAQLTAEQRQLL 122
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 559 E PK Q NEN R lcdtkp NDN K QNN T RSENTKARP E TP KQ K AE SR P ETPKQKSEGRPETPKQ K GDGRPET -- P KQK SEGRPETP 636
Cdd:TIGR02223 123 E QM Q ADM R ------ AAE K VLA T APSEQTVAV E AR KQ T AE KK P QKARTAEAQKTPVETE K IASKVKE ak Q KQK ALPKQTAE 196
170 180 190
....*....|....*....|....*....|
gi 1958753999 637 K Q KGEGRP ET P kh RHENRK D SG KP STEK K P 666
Cdd:TIGR02223 197 T Q SNSKPI ET A -- PKADKA D KT KP KPKE K A 224
PTZ00449
PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
365-798
1.66e-06
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain]
Cd Length: 943
Bit Score: 53.93
E-value: 1.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 365 EI ERIERE S AIERERFSK E VQ DK - D K P LKKRKQDSY P QE A G G --- ATG G NRPA S Q E TGSTGN G SR P AL mvsidlhqagra 440
Cdd:PTZ00449 484 EI KKLIKK S KKKLAPIEE E DS DK h D E P PEGPEASGL P PK A P G dke GEE G EHED S K E SDEPKE G GK P GE ------------ 551
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 441 dsqasl T QDSDNI KKP EET K QCNDAP I SV L QEDSVGSLKSIPENH PE T PK NKSD P ELSKSEMKQNESR L S E SK -- P NENQ 518
Cdd:PTZ00449 552 ------ T KEGEVG KKP GPA K EHKPSK I PT L SKKPEFPKDPKHPKD PE E PK KPKR P RSAQRPTRPKSPK L P E LL di P KSPK 625
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 519 LG ES KSNESKLETKTETQTE E LKQSEN -- K TTESKQ S ESAVVE PK QN E N ---------- RLCD TK PNDNKQNNTR S ENTK 586
Cdd:PTZ00449 626 RP ES PKSPKRPPPPQRPSSP E RPEGPK ii K SPKPPK S PKPPFD PK FK E K fyddyldaaa KSKE TK TTVVLDESFE S ILKE 705
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 587 AR PETP KQKAES ------- R P ETPKQKS E -- G R P ETPKQKGDGRPET P KQKSEGRP ETP KQKGE ---------------- 641
Cdd:PTZ00449 706 TL PETP GTPFTT prplppk L P RDEEFPF E pi G D P DAEQPDDIEFFTP P EEERTFFH ETP ADTPL pdilaeefkeedihae 785
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 642 - G R P ETPKH R ------ HE NRKDSGK PS TE KK pdvs K H KQ D - IKSDSSR L K S ERAEAL K Q r PD G RSES L R R dhdsk Q KS D D 713
Cdd:PTZ00449 786 t G E P DEAMK R pdspse HE DKPPGDH PS LP KK ---- R H RL D g LALSTTD L E S DAGRIA K D - AS G KIVK L K R ----- S KS F D 855
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 714 rgeserhrg D QSR V RRP E TLRSSS R ---- NEHS T KS D GSK T EKL E R KH RH E S gd S R D RP S g EQK S R P DS P rvkqgd TNKS 789
Cdd:PTZ00449 856 --------- D LTT V EEA E EMGAEA R kivv DDDG T EA D DED T HPP E E KH KS E V -- R R R RP P - KKP S K P KK P ------ SKPK 917
....*....
gi 1958753999 790 R P gf K S P N S 798
Cdd:PTZ00449 918 K P -- K K P D S 924
PRK12678
PRK12678
transcription termination factor Rho; Provisional
549-785
2.26e-06
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 53.37
E-value: 2.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 549 ESKQSES A VVEPKQNENRLCDTKPNDNKQNNT R SENTK A RPETPKQKAESRPETPKQKSEGRPETPKQKG d GRP E TPKQK 628
Cdd:PRK12678 56 KEARGGG A AAAAATPAAPAAAARRAARAAAAA R QAEQP A AEAAAAKAEAAPAARAAAAAAAEAASAPEAA - QAR E RRERG 134
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 629 SEG R PETPKQK GE - G RPETPKH R HENRKDSGKPSTEKKPDVSKHKQDIKSDSSRLKSE R A E ALKQRP D GRSESL R RDH D S 707
Cdd:PRK12678 135 EAA R RGAARKA GE g G EQPATEA R ADAAERTEEEERDERRRRGDREDRQAEAERGERGR R E E RGRDGD D RDRRDR R EQG D R 214
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958753999 708 KQKSDD R GESE R HRGDQS R V RR PETLRSSSRNEHSTKS D GSKTEKLE R KH R HESG D S R D R PS G EQKSRPD s P RVKQG D 785
Cdd:PRK12678 215 REERGR R DGGD R RGRRRR R D RR DARGDDNREDRGDRDG D DGEGRGGR R GR R FRDR D R R G R RG G DGGNERE - P ELRED D 291
PTZ00449
PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
604-1017
7.80e-06
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain]
Cd Length: 943
Bit Score: 51.61
E-value: 7.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 604 K Q K SEGRP E TPKQ K G D GR PE T P kq KSE G R P E tpkq K GE G RP E TPKHR HE NR K D S GK P STEK KP DVS K HKQDI K S -- DSSR 681
Cdd:PTZ00449 493 K K K LAPIE E EDSD K H D EP PE G P -- EAS G L P P ---- K AP G DK E GEEGE HE DS K E S DE P KEGG KP GET K EGEVG K K pg PAKE 566
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 682 L K SERAEA L KQR P DG -- RSESLRRDHDS K Q ksddrge SE R H R GD Q SRV R RPETLRSSS rnehstk S D GS K TE K lerkh R H 759
Cdd:PTZ00449 567 H K PSKIPT L SKK P EF pk DPKHPKDPEEP K K ------- PK R P R SA Q RPT R PKSPKLPEL ------- L D IP K SP K ----- R P 627
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 760 ES GD S RD RP SGE Q ks RP D SP RVKQ G DTNKSR P gf K S P N S K ------------- DD KRTEGNR SK vd SN K AHTDNKAE F P S 826
Cdd:PTZ00449 628 ES PK S PK RP PPP Q -- RP S SP ERPE G PKIIKS P -- K P P K S P kppfdpkfkekfy DD YLDAAAK SK -- ET K TTVVLDES F E S 701
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 827 Y L ------ LG G RSSALKNFVI PK IK RD KD gni TQETKKM D MKG EQ K D KV E KMG -------- LV E DLNKGAK P VVVLQKLS 892
Cdd:PTZ00449 702 I L ketlpe TP G TPFTTPRPLP PK LP RD EE --- FPFEPIG D PDA EQ P D DI E FFT ppeeertf FH E TPADTPL P DILAEEFK 778
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 893 LD D VQKLIKDRE E ------------- K SRSSLK SL KN K PSKSNKGSIDQSV L KELPPELLAE iestm PLCER VK MNKR K R 959
Cdd:PTZ00449 779 EE D IHAETGEPD E amkrpdspsehed K PPGDHP SL PK K RHRLDGLALSTTD L ESDAGRIAKD ----- ASGKI VK LKRS K S 853
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958753999 960 ---- S TV N E K ---- PKYAE I SS D E D NDS -- DE AFESSRKR HK K dddkawey E E R D RR SSGDHRRSGHS 1017
Cdd:PTZ00449 854 fddl T TV E E A eemg AEARK I VV D D D GTE ad DE DTHPPEEK HK S -------- E V R R RR PPKKPSKPKKP 913
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
751-996
3.11e-05
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 49.66
E-value: 3.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 751 EK LER K HRHESGDSRDRP S GEQ K SRP --- DSPRV K QGDTNKSRPGFKSPNSKD D KRTEGNRSKVDS NK AHTDNK aefp S Y 827
Cdd:PTZ00108 1149 EK EIA K EQRLKSKTKGKA S KLR K PKL kkk EKKKK K SSADKSKKASVVGNSKRV D SDEKRKLDDKPD NK KSNSSG ---- S D 1224
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 828 LLGGRSSAL K NFVIPKIKRDKDG N ITQETKKMDMKGEQK D kvekmglved L N K GA KP VVVLQKL S LDDVQKLIKDREEKS 907
Cdd:PTZ00108 1225 QEDDEEQKT K PKKSSVKRLKSKK N NSSKSSEDNDEFSSD D ---------- L S K EG KP KNAPKRV S AVQYSPPPPSKRPDG 1294
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 908 R S SLK S LKNK P S K SNKGSIDQSV L KE L PPELLA E IEST mplc ERV K MNK R ------- KR S TVNEK P KYAEIS S DEDN D S D 980
Cdd:PTZ00108 1295 E S NGG S KPSS P T K KKVKKRLEGS L AA L KKKKKS E KKTA ---- RKK K SKT R vkqasas QS S RLLRR P RKKKSD S SSED D D D 1370
250
....*....|....*.
gi 1958753999 981 EAFES S RKRHKK DD DK 996
Cdd:PTZ00108 1371 SEVDD S EDEDDE DD ED 1386
PRK08581
PRK08581
amidase domain-containing protein;
431-653
1.89e-04
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain]
Cd Length: 619
Bit Score: 47.09
E-value: 1.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 431 S I D LHQAGRADSQASL T QDS DN IK K PEE T KQCNDAPISVLQEDSVGS L KSIPE N HPE T PK ---- N K S DPE L ------ SK S 500
Cdd:PRK08581 52 S K D TSSKDTDKADNNN T SNQ DN ND K KFS T IDSSTSDSNNIIDFIYKN L PQTNI N QLL T KN kydd N Y S LTT L iqnlfn LN S 131
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 501 EMKQN E SRLSES K PNENQLGE S K S NESKLETKTETQTEELKQS - ENKTTES K Q S E S A vv EPKQNENRLCDTKP N DNKQNN 579
Cdd:PRK08581 132 DISDY E QPRNSE K STNDSNKN S D S SIKNDTDTQSSKQDKADNQ k APSSNNT K P S T S N -- KQPNSPKPTQPNQS N SQPASD 209
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958753999 580 TRSENTKARPETPKQKAESRPETPK Q K SE GRPE T P K Q - KGDGRPETPKQKSEGR P ET P K Q KGEGRPET P KHRH EN 653
Cdd:PRK08581 210 DTANQKSSSKDNQSMSDSALDSILD Q Y SE DAKK T Q K D y ASQSKKDKTETSNTKN P QL P T Q DELKHKSK P AQSF EN 284
PDS5
cd19953
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. ...
1669-1765
4.61e-04
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. Together with WapI and Scc3, it is involved in the release of the cohesin complex from chromosomes during S phase. The core of the cohesin complex consists of a coiled-coiled heterodimer of Smc1 and Smc30, together with Scc1 (also called kleisin). Pds5 interacts with Scc1 via a conserved patch on the surface of its heat repeats. Pds5 also promotes the acetylation of Smc3 that protects cohesin from releasing activity in G2 phase.
Pssm-ID: 410996 [Multi-domain]
Cd Length: 630
Bit Score: 45.59
E-value: 4.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1669 IA VR TK A M K C L SEVV A VDP S - IL A R ldmqrg VH -------- GR LM D N S TS VR E A A VE LLGRFV L CR P Q LAE QYYDM L IE R 1739
Cdd:cd19953 259 VD VR LL A T K L L GKMF A EKG S a GF A Q ------ TY pslwkefl GR FN D K S PE VR L A W VE SAKHIL L NH P D LAE DILEA L KK R 332
90 100
....*....|....*....|....*.
gi 1958753999 1740 I LD TGIS VR KRVI K ILR D ICI E QPTF 1765
Cdd:cd19953 333 L LD PDEK VR LAAV K AIC D LAY E DLLH 358
PDS5
pfam20168
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid ...
1668-1811
4.76e-04
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid cohesion protein PDS5. The large PDS5 molecule is exclusively alpha helical, composed of a large number of HEAT-like repeats and helical extensions/additions that deviate from the HEAT repeat pattern.
Pssm-ID: 466319 [Multi-domain]
Cd Length: 1051
Bit Score: 45.66
E-value: 4.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1668 AI AVR TKAMKCLSEVVAVD P SI la R LDMQRGVHG RL M D NSTS VR E AAV ELL G RF ------- V LCRPQ L AE qyydm L I ER I 1740
Cdd:pfam20168 297 SV AVR IAWVEAAKQILLNH P DL -- R SEILEALKD RL L D PDEK VR L AAV KAI G DL dyetllh V VSEKL L KT ----- L A ER L 369
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 1741 L D TGI SVRK RVI K I L RDI ------- CI E QP tf PKIT E MCV ---- K MIR -- RV ND E E g I KK LV NETFQKLWF t P TPHN D K E 1807
Cdd:pfam20168 370 R D KKP SVRK EAL K T L AKL ynvayge IE E GD -- EEAI E KFG wipn K ILH ly YI ND P E - I RA LV ERVLFEYLL - P ALLD D E E 445
....
gi 1958753999 1808 AMT R 1811
Cdd:pfam20168 446 RVK R 449
SF-CC1
TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
691-808
4.77e-04
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain]
Cd Length: 494
Bit Score: 45.30
E-value: 4.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 691 KQ R PDG R SESLRRDH D SKQKS D - D R ges ER H R g D Q SR V R RPE tl RS SS R NE H stk S D GSKTEKL ER KH R HESGDS R D RP S 769
Cdd:TIGR01622 3 RD R ERE R LRDSSSAG D RDRRR D k G R --- ER S R - D R SR D R ERS -- RS RR R DR H --- R D RDYYRGR ER RS R SRRPNR R Y RP R 73
90 100 110
....*....|....*....|....*....|....*....
gi 1958753999 770 GEQKS R P DS P R VKQG D TNKS R PGFKSPNSKDDKR TE GN R 808
Cdd:TIGR01622 74 EKRRR R G DS Y R RRRD D RRSR R EKPRARDGTPEPL TE DE R 112
Caldesmon
pfam02029
Caldesmon;
369-749
5.98e-04
Caldesmon;
Pssm-ID: 460421 [Multi-domain]
Cd Length: 495
Bit Score: 45.24
E-value: 5.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 369 IE R E SAIE RER FSKEVQDK dkplkk R K Q DSYPQEA G GA T GGNR P ASQETGSTGNGSR P ALMVSI D LHQ A G radsqasltq 448
Cdd:pfam02029 1 IE D E EEAA RER RRRAREER ------ R R Q KEEEEPS G QV T ESVE P NEHNSYEEDSELK P SGQGGL D EEE A F ---------- 64
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 449 d S D NIK K P EE TK Q CNDAPISVL Q EDSVGSLKSIP E NHP E TPK N KSDP E L S KS E MK - QNE SRL SES K PN E NQLG E SKSN E S 527
Cdd:pfam02029 65 - L D RTA K R EE RR Q KRLQEALER Q KEFDPTIADEK E SVA E RKE N NEEE E N S SW E KE e KRD SRL GRY K EE E TEIR E KEYQ E N 143
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 528 K LE T KTETQT EE LKQS E N K TT E SK qsesav VE P KQ N ENRLCDTKPNDN K QNNTRS E NTK --- ARPET P KQ K AESRP E tp K 604
Cdd:pfam02029 144 K WS T EVRQAE EE GEEE E D K SE E AE ------ EV P TE N FAKEEVKDEKIK K EKKVKY E SKV fld QKRGH P EV K SQNGE E -- E 215
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 605 QKSEGRPETPK Q K G DGRPETPKQKS E GRP E TPKQKG E G R petpkhrhen R KDSG K P S T E KKP dv SKH KQ - DIKSDSSR LK 683
Cdd:pfam02029 216 VTKLKVTTKRR Q G G LSQSQEREEEA E VFL E AEQKLE E L R ---------- R RRQE K E S E E FEK -- LRQ KQ q EAELELEE LK 283
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958753999 684 SE R aeal KQ R PDGRS E SLR R DHDSKQKSDD R G E S E RH R - GDQSRV RR P E TLRSSSRNEHSTK S D G S K 749
Cdd:pfam02029 284 KK R ---- EE R RKLLE E EEQ R RKQEEAERKL R E E E E KR R m KEEIER RR A E AAEKRQKLPEDSS S E G K K 346
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
597-827
2.18e-03
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 43.88
E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 597 E SRPETP K QKSEGRPETP K Q KG DGRP e TP K Q K SE g RP E TP K Q K GEGRPETPKHRHE N R K DSGKPSTE K KP D VSKH K QDIK 676
Cdd:PTZ00108 1143 E QEEVEE K EIAKEQRLKS K T KG KASK - LR K P K LK - KK E KK K K K SSADKSKKASVVG N S K RVDSDEKR K LD D KPDN K KSNS 1220
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 677 S D S SRLKS E RAEALKQRPDGRSESLRRDHD SK QKS D DRGE S ERHR g DQSRVRRPETL R S S SRNEHSTKS dg SK TEKL E RK 756
Cdd:PTZ00108 1221 S G S DQEDD E EQKTKPKKSSVKRLKSKKNNS SK SSE D NDEF S SDDL - SKEGKPKNAPK R V S AVQYSPPPP -- SK RPDG E SN 1297
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958753999 757 HRHESGDS ------- R DRP S GEQKSRPDSPRV K QGDTN KS RPGF K SPNSKDDK R TEGNRS K VD S NKAHT D NKAEFPSY 827
Cdd:PTZ00108 1298 GGSKPSSP tkkkvkk R LEG S LAALKKKKKSEK K TARKK KS KTRV K QASASQSS R LLRRPR K KK S DSSSE D DDDSEVDD 1375
U2AF_lg
TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
683-805
2.40e-03
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
Pssm-ID: 273727 [Multi-domain]
Cd Length: 509
Bit Score: 43.34
E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 683 KS E RAEALKQRPD GR SESLRRDHDSKQKS D DRGESE RHR GDQS R VR R PETLRSSS R NEH S TKSD g S KTEKLE R KH R H - ES 761
Cdd:TIGR01642 1 RD E EPDREREKSR GR DRDRSSERPRRRSR D RSRFRD RHR RSRE R SY R EDSRPRDR R RYD S RSPR - S LRYSSV R RS R D r PR 79
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1958753999 762 GD SR DRP S G EQ --- KS R PD SP RVKQGDTN K S R PGFKSPNSKDDKR T E 805
Cdd:TIGR01642 80 RR SR SVR S I EQ hrr RL R DR SP SNQWRKDD K K R SLWDIKPPGYELV T A 126
PTZ00112
PTZ00112
origin recognition complex 1 protein; Provisional
483-824
3.34e-03
origin recognition complex 1 protein; Provisional
Pssm-ID: 240274 [Multi-domain]
Cd Length: 1164
Bit Score: 43.05
E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 483 ENHPE TP K nksdpels K S E M K QNESR L SESKPNE N ------- Q L G E SKSNES K LETKTE T QTEEL K QSENKTTE S KQ S ES 555
Cdd:PTZ00112 59 LSFEN TP R -------- K E E K K KKNLN L PDYNQIQ N nthdfyi D L N E RSKTPI K NNDNVT T PIKAN K KEKHNLDS S SS S SI 130
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 556 AVVEPKQN enrl CDTK P NDNKQNNTR S ENT K AR P ETP K QKAE ----- S RPET P KQ K SEGRPETP KQ KG ------- D GRPE 623
Cdd:PTZ00112 131 SSSLTNIS ---- FFSS P TSIYSCLSN S LSS K HS P KVI K ENQS thvni S SDNS P RN K EISNKQLK KQ TN vthttcy D KMRR 206
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 624 T P KQK S EGRPE T PKQKG E GRP E TP K HRHEN R KDS -- G K PST EK KPDVSK H --------- KQDI K S D SSRLK S - E R AEA L K 691
Cdd:PTZ00112 207 S P RNT S TIKNN T NDKNK E KNK E KD K NIKKD R DGD kq T K RNS EK SKVQNS H fdvrilrsy TKEN K K D EKNVV S g I R SSV L L 286
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 692 Q R pdg R S ES LR R D HDSKQKSDDR gese RHR GD QSRVRRPETLR S S S R N EHSTK S DGSKT eklerkhrhesgdsr D R P S GE 771
Cdd:PTZ00112 287 K R --- K S QC LR K D SYVYSNHQKK ---- AKT GD PKNIIHRNNGS S N S N N DDTSS S NHLGS --------------- N R I S NR 344
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1958753999 772 QK S R P DSPRVKQGD TN KSR pgfksp N S K DD K RTEGNRSKVDSNKAH T D NK AEF 824
Cdd:PTZ00112 345 NP S S P YKKQTTTKH TN NTK ------ N N K YN K TKTTQKFNHPLRHHA T I NK RSS 391
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
454-819
4.45e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 42.31
E-value: 4.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 454 KKPEE TK QCN DA PISVLQE D SVGSL K SIP E NHPETPKNKSDPELS K S E MKQ N --------- E SRLS ES ---- K PN E NQ L G 520
Cdd:NF033838 114 ELTSK TK KEL DA AFEQFKK D TLEPG K KVA E ATKKVEEAEKKAKDQ K E E DRR N yptntyktl E LEIA ES dvev K KA E LE L V 193
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 521 ESKSN E SK letktet QT E EL KQ SEN K T t ESK QS E SAVV E ---- PKQNENRLCDTKPNDNKQNNTRSENTKARPET PK QK A 596
Cdd:NF033838 194 KEEAK E PR ------- DE E KI KQ AKA K V - ESK KA E ATRL E kikt DREKAEEEAKRRADAKLKEAVEKNVATSEQDK PK RR A 265
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 597 E ---- SR P E TP KQ K SEGRPETPKQK G DGRPET P KQ K S E GR - P E TP K QKG E GRPETPKHRH E N R KDS g KPS T E K KPDVSKH 671
Cdd:NF033838 266 K rgvl GE P A TP DK K ENDAKSSDSSV G EETLPS P SL K P E KK v A E AE K KVE E AKKKAKDQKE E D R RNY - PTN T Y K TLELEIA 344
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 672 KQ D I K SDSSR L KSERA EA LKQ R PDGRSESLRRDHD SK QKSDD R G E serhrgdqsrvrrpetlrsssrneh ST K S D GS K T E 751
Cdd:NF033838 345 ES D V K VKEAE L ELVKE EA KEP R NEEKIKQAKAKVE SK KAEAT R L E ------------------------- KI K T D RK K A E 399
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958753999 752 KLERKHRH E SGDSRDR P S g EQ KSRPDS P RVK qgdtnks R P GF K SPNSKDDKRT E gnrs K VDSNK A HT D 819
Cdd:NF033838 400 EEAKRKAA E EDKVKEK P A - EQ PQPAPA P QPE ------- K P AP K PEKPAEQPKA E ---- K PADQQ A EE D 455
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
364-648
7.99e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 41.54
E-value: 7.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 364 AE I E RIE R ES A IER E RFS K E V -- QDK DKP l K K R KQDSYPQ E AGGATGGNRP A SQETG S T G NGSR P A lmvsidlhqagra D 441
Cdd:NF033838 233 AE E E AKR R AD A KLK E AVE K N V at SEQ DKP - K R R AKRGVLG E PATPDKKEND A KSSDS S V G EETL P S ------------- P 298
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 442 S QASLTQDSDNI KK P EE T - K QCN D AP isvl Q ED S vgslksip E N H P ETPKNKSDP E LSK S EM K QN E SR L SES K PNENQ lg 520
Cdd:NF033838 299 S LKPEKKVAEAE KK V EE A k K KAK D QK ---- E ED R -------- R N Y P TNTYKTLEL E IAE S DV K VK E AE L ELV K EEAKE -- 364
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 521 es KS NE S K L etktetqteel KQ SEN K T t ESK QS E SAVV E PKQNENR lcd TKPNDN K QNNTRSENT K AR P - E T P KQKAESR 599
Cdd:NF033838 365 -- PR NE E K I ----------- KQ AKA K V - ESK KA E ATRL E KIKTDRK --- KAEEEA K RKAAEEDKV K EK P a E Q P QPAPAPQ 427
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 600 PE T P KQ K S E GRP E T PK - Q K GDGR ---------- P E TPKQKSEGR P et PK QKGEGR P E TPK 648
Cdd:NF033838 428 PE K P AP K P E KPA E Q PK a E K PADQ qaeedyarrs E E EYNRLTQQQ P -- PK TEKPAQ P S TPK 485
PRK12678
PRK12678
transcription termination factor Rho; Provisional
603-814
8.83e-03
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 41.43
E-value: 8.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 603 PKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPKHRHENRKD S GKPSTE kkpdvskhkqd IKSDSS R L 682
Cdd:PRK12678 66 AAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAA S APEAAQ ----------- ARERRE R G 134
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958753999 683 KSE R AE A LKQRPD G RSESLRRDHDSKQKSDDRG E SERH R GDQS R VR R PETLRSSS R NEH stksdgsktekl E RKH R HESG 762
Cdd:PRK12678 135 EAA R RG A ARKAGE G GEQPATEARADAAERTEEE E RDER R RRGD R ED R QAEAERGE R GRR ------------ E ERG R DGDD 202
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1958753999 763 DS R DRPSGEQKS R PDSP R VKQ GD t NKS R PGFKSPNSKDDKRTEGN R SKV D SN 814
Cdd:PRK12678 203 RD R RDRREQGDR R EERG R RDG GD - RRG R RRRRDRRDARGDDNRED R GDR D GD 253
Blast search parameters
Data Source:
Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01