Conserved domains on  [gi|225403216|gb|ACN89703|]

ORF1a polyprotein [Murine coronavirus RJHM/A]

Protein Classification

Graphical summary


List of domain hits

Name Accession Description Interval E-value
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2285-2839 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.



Pssm-ID: 409662  Cd Length: 555  Bit Score: 1117.53  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2285 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGPLPMFVGQIVAWVKTTFGVLTICDFYQVTDLGYRSSFCNGSMVCEL 2364
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2365 CFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLGTMHWSAR 2444
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2445 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2524
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2525 KHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2604
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2605 LLHSKVKGVPETHVVVVENEADKAGFLNAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDRKS 2684
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2685 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVAAD 2764
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 2765 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
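
Each domain hit on this page carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`. As a minimal sketch, assuming that layout holds for every hit, the line can be turned into a small record with a regular expression. The function name `parse_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The field names in the named groups are hypothetical labels.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the four summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        # float() accepts scientific notation such as "0e+00" or "7.59e-165".
        "e_value": float(match["e_value"]),
    }


if __name__ == "__main__":
    print(parse_summary("Pssm-ID: 409662  Cd Length: 555  Bit Score: 1117.53  E-value: 0e+00"))
```

An E-value printed as `0e+00` parses to `0.0`; it is reported that way because the true value is below the precision shown, not because it is exactly zero.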
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
250-832 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. Unlike SARS-CoV Nsp2, MHV Nsp2 (also known as p65) may play an important role in the viral life cycle.



Pssm-ID: 394870  Cd Length: 586  Bit Score: 1100.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  250 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSIEYVGQPIED 329
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  330 MVDGDVVMREPAHLLAPNAIVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCGFRGWVPGNMMDGF 409
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  410 PCPGCCKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRESFKLYGHAVVPFGGAAYWSPYPGMWLPVIWSSVKSYSYLTY 489
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  490 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLEQRAILGVDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 569
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  570 D-GLAPLLLDGLVPRSYYLIKSGQAFTSLMVNFSREVVDMCMDMALLFMHDVKVATKYVKKVTGKLAVRFKALGIAVVRK 648
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  649 ITEWFDLAVDTAASAAGWLCYQLVNGLFAVANGVITFVQEVPELVKNFVDKFKTFFKVLIDSMSVSILSGLTVVKTASNR 728
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  729 VCLAGSKVYEVVQKSLPAYIMPVGC--SEATCLVGEIEPAVFEDDVVDVVKAPLTYQGCCKPPSSFEKICIVDKLYMAKC 806
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 225403216  807 GDQFYPVVVDNDTVGVLDQCWRFPCA 832
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
1-355 0e+00

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.



Pssm-ID: 152398  Cd Length: 355  Bit Score: 699.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    81 FVDEDPQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   161 CLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFRRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYA 240
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   241 LLRGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSI 320
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 225403216   321 EYVGQPIEDMVDGDVVMREPAHLLAPNAIVKRLPR 355
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
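
The paired `gi ...` / `Cdd:...` rows above can be compared position by position to estimate how closely the query matches the domain model's representative sequence. The following is a rough sketch under stated assumptions: it assumes each row has the shape `label  start  SEQUENCE  end`, it skips columns where either row has a `-` gap, and it ignores letter case (lowercase residues are treated like any other residue, which is a simplification). The helper names `parse_row` and `identity` are hypothetical.

```python
def parse_row(line):
    """Split an alignment row into (label, start, sequence, end)."""
    # The sequence and end coordinate are the last two whitespace-separated
    # fields; the start coordinate is the last field of what remains.
    head, sequence, end = line.rsplit(None, 2)
    label, start = head.rsplit(None, 1)
    return label.strip(), int(start), sequence, int(end)


def identity(query, subject):
    """Return (matches, compared_columns) for two equal-length aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    columns = [
        (q, s)
        for q, s in zip(query.upper(), subject.upper())
        if q != "-" and s != "-"  # skip gap columns
    ]
    matches = sum(q == s for q, s in columns)
    return matches, len(columns)


if __name__ == "__main__":
    # Final alignment block of the B-CoV_A_NSP1 hit, copied from this page.
    _, _, q_seq, _ = parse_row("gi 225403216   321 EYVGQPIEDMVDGDVVMREPAHLLAPNAIVKRLPR 355")
    _, _, s_seq, _ = parse_row("Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355")
    matches, compared = identity(q_seq, s_seq)
    print(f"{matches}/{compared} identical ({100 * matches / compared:.1f}%)")
```

Running the same two functions over every block of a hit and summing the counts gives a per-hit identity figure; this is a simple descriptive statistic and is not how the reported bit score is computed.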
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3340-3633 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.
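
The substrate rule stated above (Gln at P1, and a small residue such as Gly, Ala, or Ser at P1') can be expressed as a simple sequence scan. This is only a sketch of that one rule: the function name `candidate_mpro_sites` is hypothetical, and the rule is a necessary condition taken from the description, so the scan will flag many positions that are not real cleavage sites.

```python
def candidate_mpro_sites(sequence, small_residues=("G", "A", "S")):
    """Return 1-based positions of Gln (P1) directly followed by a small P1' residue.

    Cleavage, if it occurs, would fall between the reported position and the next one.
    """
    seq = sequence.upper()
    return [
        i + 1  # convert 0-based index to 1-based residue number
        for i in range(len(seq) - 1)
        if seq[i] == "Q" and seq[i + 1] in small_residues
    ]


if __name__ == "__main__":
    # Toy peptide, not taken from the polyprotein on this page.
    print(candidate_mpro_sites("AVLQSGFR"))
```

To apply this to a polyprotein with its own numbering, add the offset of the first residue to each returned position.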



Pssm-ID: 394887  Cd Length: 297  Bit Score: 576.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3340 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3419
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3420 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3499
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3500 QLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3579
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 3580 AD--LVLDALASMTGVTVEQMLAAIKRLHSGF-QGKQILGSCVLEDELTPSDVYQQL 3633
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2851-3233 7.59e-165

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.



Pssm-ID: 394836  Cd Length: 376  Bit Score: 514.45  E-value: 7.59e-165
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2851 FVANLICFIVLWALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGlVYYRNSKACPVV 2930
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2931 VAVIDqDIGHTLFNVPTKVLRYGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYT 3010
Cdd:cd21473    80 VGVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3011 EGVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraM 3090
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--G 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3091 PGTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFLMLFV 3170
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 225403216 3171 FQVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCITYVAVVVSNHALWLF 3233
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1608-1905 1.98e-161

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal region of non-structural protein 3, spanning transmembrane regions TM1 and TM2 with the 3Ecto domain in between, is important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and the Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. 
Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows that MERS-CoV is substantially more IFN-sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.



Pssm-ID: 409649  Cd Length: 304  Bit Score: 501.35  E-value: 1.98e-161
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1608 NKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSAADLAAVKDAFGFDEP-QLL 1686
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1687 QYYSMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1765
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1766 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCGDKLVHCTQFNV-PF 1843
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 225403216 1844 LICSNTPEGKKLPD-DVVAANIFTGG-SVGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1905
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3640-3926 1.91e-150

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how the autophagosomes were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.



Pssm-ID: 394846  Cd Length: 290  Bit Score: 469.42  E-value: 1.91e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3640 SKRTRVIKGTCCWILASTFLFCSIISAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3718
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3719 TNYLVVYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVfVTMRSINHDVFSTMFLVGRLVSLVSMWYFGaNLE 3798
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3799 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3874
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 225403216 3875 LGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
Peptidase_C16 super family cl03374
Peptidase C16 family;
1083-1331 4.73e-127

Peptidase C16 family;


The actual alignment was detected with superfamily member pfam01831:

Pssm-ID: 460353  Cd Length: 249  Bit Score: 400.61  E-value: 4.73e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1083 AFDAIYSETLSAFYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLGMQKLWLSYKAGYDQCFVDKLVK 1162
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1163 SAPKSIILPQGGYVADFAYFFLSQCSFKVHANWRCLKCGMELKLQGLDAMFFYGDVVSHMCKCGNSMTLLSADIPYTLHF 1242
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1243 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKQIDGKVVTKFNGDKFDFMVGHGMTFSMSPFEIAQLYGSCITPN 1322
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 225403216  1323 VCFVKGDVI 1331
Cdd:pfam01831  241 LCFGKGDVI 249
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4019-4212 4.94e-116

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by the main protease (Mpro), the four released small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The resulting Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. 
The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.



Pssm-ID: 409258  Cd Length: 196  Bit Score: 366.42  E-value: 4.94e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4019 SEFVNMASFVEYELAKKNLDEAKASGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4098
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4099 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4178
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 225403216 4179 DGAVKQLNEID---VNSTWPLVISANRHNEvSTVVLQ 4212
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4323-4452 2.47e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. 
The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DLL4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.



Pssm-ID: 409326  Cd Length: 130  Bit Score: 275.70  E-value: 2.47e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4323 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4402
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 225403216 4403 VEHPDVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
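A pairwise block like the one above can be summarized as a percent identity between the query row and the Cdd row. The following is a minimal sketch, not part of the CD-Search output: the function name `percent_identity` is hypothetical, and skipping lowercase residues as unaligned is an assumption about this display style.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of compared alignment columns where both rows carry the same residue.

    Columns with a gap ('-') in either row are skipped. Columns with a
    lowercase residue are also skipped, on the assumption that lowercase
    marks residues not aligned to the domain model.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue
        if q.islower() or s.islower():
            continue
        compared += 1
        if q == s:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# First ten columns of the Nsp10 block (query residues 4323-4332 vs. cd21901 1-10).
print(percent_identity("AGTATEYASN", "AGKQTEVASN"))
```

To score a whole hit, concatenate the residue strings of each row across the wrapped blocks before calling the function.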
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1942-2060 2.20e-82

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.



Pssm-ID: 409350  Cd Length: 119  Bit Score: 266.63  E-value: 2.20e-82
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1942 GKYYTKPIIKAQFRTFEKVEGVYTNFKLVGHDIAEKLNAKLGFDCNSPFMEYKITEWPTATGDVVLASDDLYVSRYSGGC 2021
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 225403216 2022 VTFGKPVIWRGHEEASLKSLTYFNRPSVVCENKFNVLPV 2060
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
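Each hit carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) and an interval on the query. The sketch below shows one way to read both from text laid out as in this listing; the helper names `parse_hit_stats` and `interval_length` are hypothetical, and the regular expression assumes the field labels appear exactly as printed here.

```python
import re

# Field labels are taken from the statistics lines in this listing.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line: str) -> dict:
    """Extract the four numeric fields from one statistics line."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not a hit statistics line: " + line)
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


stats = parse_hit_stats(
    "Pssm-ID: 409350  Cd Length: 119  Bit Score: 266.63  E-value: 2.20e-82"
)
print(stats["cd_length"], interval_length("1942-2060"))
```

The interval length and the Cd Length agree for this hit, but they need not: gaps in the alignment or a hit that covers only part of the model (as in the pfam16348 and cd21898 hits in this listing) make the two numbers differ.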
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2112-2236 6.11e-78

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.



Pssm-ID: 409627  Cd Length: 125  Bit Score: 254.14  E-value: 6.11e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2112 VMEAQKRSSVTTVAVKEVKLNGVKKPVKVEDSVVVNDPTSETKVVKSLSIVDVYDMFLTGCRYVVWTANELSRLINSPTV 2191
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 225403216 2192 REYVKWGMSKLIIPANLLLLRDEKQEFVAPKVVKAKAIACYGAVK 2236
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4213-4322 1.85e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity with the oligonucleotide/oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization, as seen in porcine deltacoronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.



Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.59  E-value: 1.85e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
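The Nsp9 description names GXXXG as the protein-protein interaction motif in the dimer-forming helices. As an illustration only, the sketch below scans a residue string for that pattern and reports query coordinates; `find_gxxxg` is a hypothetical helper, and a pattern match by itself says nothing about whether the site takes part in dimerization.

```python
import re

# Lookahead so that overlapping occurrences (e.g. GAAAGAAAG) are all reported.
GXXXG = re.compile(r"(?=(G[A-Z]{3}G))")


def find_gxxxg(sequence: str, first_residue: int = 1) -> list:
    """Return (residue_number, matched_text) for every G-X-X-X-G occurrence.

    `first_residue` is the query coordinate of sequence[0]; the sequence
    is expected to be ungapped uppercase one-letter codes.
    """
    return [
        (first_residue + match.start(), match.group(1))
        for match in GXXXG.finditer(sequence)
    ]


# Second row of the query in the Nsp9 block above (residues 4292-4322).
print(find_gxxxg("LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ", first_residue=4292))
```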
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3247-3334 1.59e-48

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.



Pssm-ID: 465099  Cd Length: 92  Bit Score: 168.86  E-value: 1.59e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3247 GTFEEMALTTFMITKESYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREAACSQLAKAMETFNhNNGNDVLYQPP 3326
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGNDVLYTPP 84

                   ....*...
gi 225403216  3327 TASVTTSF 3334
Cdd:pfam16348   85 TVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1533-1607 3.90e-46

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenus (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.



Pssm-ID: 394840  Cd Length: 75  Bit Score: 161.43  E-value: 3.90e-46
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 1533 QLDDDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGEVFVSSQGKKFGYVQNGSFKEASVSQIRALLA 1607
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3927-4015 1.17e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by the main protease (Mpro), the four released small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The resulting Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially in alpha-helix 4.
SARS-CoV Nsp7 forms an 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.



Pssm-ID: 409253  Cd Length: 83  Bit Score: 134.49  E-value: 1.17e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4006
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 225403216 4007 YvrdnTVLQ 4015
Cdd:cd21827    79 P----TVLQ 83
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
852-941 2.96e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.



Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 2.96e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  852 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHGViGTKVCALLERLVDDYVYLFDEGGEE 931
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 225403216  932 VIASRMYCSF 941
Cdd:cd21467    80 VLASEMYCSF 89
B-CoV_A_NSP1 super family cl13410
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
922-1074 3.41e-32

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


The actual alignment was detected with superfamily member pfam11963:

Pssm-ID: 152398  Cd Length: 355  Bit Score: 131.21  E-value: 3.41e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   922 VYLFDEGGEEVIASRMYCS-FSAPDEDCVATDV--VYADENQDDDADDPVVLVADTQEEDGVAKEQVDSADSEICVAH-- 996
Cdd:pfam11963  190 IYLRKGGNKGSVTSDHFRRaFTMPVYDFNVEDAyaEVHDEPKGKYSQKAYALLRGYRGVKPVLFVDQYGCDYTGCLADgl 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   997 TGGQEMTEPDAVGSQTPIASAEETEVGEA--CDREGIAEVK----ATVCADALDACP--DQVEAFDIEKVEDSILSELQT 1068
Cdd:pfam11963  270 EAYGDYTLQDMKQLQPVWLANLDFDVVVAwhVVRDPRAVMRlqtiATICGIAYVAQPteDVVDGDVVIKEPVHLLSADAI 349

                   ....*.
gi 225403216  1069 ELNAPA 1074
Cdd:pfam11963  350 VLRLPS 355
Macro_SF super family cl00019
macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular ...
1340-1464 6.03e-23

macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Macrodomains include the yeast macrodomain Poa1 which is a phosphatase of ADP-ribose-1"-phosphate, a by-product of tRNA splicing. Some macrodomains have ADPr-unrelated binding partners such as the coronavirus SUD-N (N-terminal subdomain) and SUD-M (middle subdomain) of the SARS-unique domain (SUD) which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). Macrodomains regulate a wide variety of cellular and organismal processes, including DNA damage repair, signal transduction, and immune response.


The actual alignment was detected with superfamily member cd21557:

Pssm-ID: 469581  Cd Length: 127  Bit Score: 97.24  E-value: 6.03e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1340 EVIVNPANGRMAHGAGVAGAIAKAAGKAFINETaDMVKAQGVCQVGGCYESTGGKLCKKVLNIVGPDARGHgkQCYSLLE 1419
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 225403216 1420 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1464
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
 
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2285-2839 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1117.53  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2285 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGPLPMFVGQIVAWVKTTFGVLTICDFYQVTDLGYRSSFCNGSMVCEL 2364
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2365 CFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLGTMHWSAR 2444
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2445 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2524
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2525 KHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2604
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2605 LLHSKVKGVPETHVVVVENEADKAGFLNAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDRKS 2684
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2685 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVAAD 2764
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 2765 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
250-832 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, may, unlike SARS-CoV Nsp2, play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1100.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  250 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSIEYVGQPIED 329
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  330 MVDGDVVMREPAHLLAPNAIVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCGFRGWVPGNMMDGF 409
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  410 PCPGCCKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRESFKLYGHAVVPFGGAAYWSPYPGMWLPVIWSSVKSYSYLTY 489
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  490 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLEQRAILGVDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 569
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  570 D-GLAPLLLDGLVPRSYYLIKSGQAFTSLMVNFSREVVDMCMDMALLFMHDVKVATKYVKKVTGKLAVRFKALGIAVVRK 648
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  649 ITEWFDLAVDTAASAAGWLCYQLVNGLFAVANGVITFVQEVPELVKNFVDKFKTFFKVLIDSMSVSILSGLTVVKTASNR 728
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  729 VCLAGSKVYEVVQKSLPAYIMPVGC--SEATCLVGEIEPAVFEDDVVDVVKAPLTYQGCCKPPSSFEKICIVDKLYMAKC 806
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 225403216  807 GDQFYPVVVDNDTVGVLDQCWRFPCA 832
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
1-355 0e+00

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 699.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    81 FVDEDPQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   161 CLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFRRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYA 240
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   241 LLRGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSI 320
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 225403216   321 EYVGQPIEDMVDGDVVMREPAHLLAPNAIVKRLPR 355
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3340-3633 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 576.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3340 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3419
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3420 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3499
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3500 QLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3579
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 3580 AD--LVLDALASMTGVTVEQMLAAIKRLHSGF-QGKQILGSCVLEDELTPSDVYQQL 3633
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
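
A rough way to summarize an alignment block like the one above is percent identity over the aligned columns. The sketch below assumes that dashes and lowercase letters in either row mark unaligned positions, as they appear to in these blocks; the function name `percent_identity` is hypothetical, and the rows must be passed without the label and coordinate columns.

```python
def percent_identity(query_row, cdd_row):
    """Percent identity over columns where both rows hold an aligned (uppercase) residue."""
    if len(query_row) != len(cdd_row):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    matches = 0
    for q, c in zip(query_row, cdd_row):
        # Skip gap columns and lowercase (unaligned) residues in either row.
        if q == "-" or c == "-" or q.islower() or c.islower():
            continue
        aligned += 1
        if q == c:
            matches += 1
    return 100.0 * matches / aligned if aligned else 0.0


# First six columns of the last block above: only D/D matches among four aligned columns.
print(percent_identity("AD--LV", "QDhvDI"))  # → 25.0
```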
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3365-3639 8.43e-166

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornavirus 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 512.76  E-value: 8.43e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3365 GLWLDDKVYCPRHVICSSADMTdPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKP 3444
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3445 GETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQLELSTGCHTGTDFSGNFYGPYRDA 3524
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3525 QVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLVLDALASMTGVTVEQMLAAIKR 3604
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 225403216  3605 LHSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3639
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
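
Each hit on this page carries a one-line summary (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`). If the page text needs to be processed programmatically, that line can be pulled apart with a regular expression. This is a minimal sketch that assumes the field labels and order shown on this page; `parse_summary` and the dictionary keys are hypothetical names.

```python
import re

# Field labels and order are taken from the summary lines on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s+Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line):
    """Return the four summary fields as a dict, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),  # "0e+00" parses to 0.0
    }


print(parse_summary("Pssm-ID: 398852  Cd Length: 274  Bit Score: 512.76  E-value: 8.43e-166"))
```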
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2851-3233 7.59e-165

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 514.45  E-value: 7.59e-165
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2851 FVANLICFIVLWALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGlVYYRNSKACPVV 2930
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2931 VAVIDqDIGHTLFNVPTKVLRYGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYT 3010
Cdd:cd21473    80 VGVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3011 EGVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraM 3090
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--G 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3091 PGTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFLMLFV 3170
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 225403216 3171 FQVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCITYVAVVVSNHALWLF 3233
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1608-1905 1.98e-161

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high-value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2, with the 3Ecto domain in between, is important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.
Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows that MERS-CoV is substantially more IFN-sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 501.35  E-value: 1.98e-161
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1608 NKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSAADLAAVKDAFGFDEP-QLL 1686
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1687 QYYSMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1765
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1766 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCGDKLVHCTQFNV-PF 1843
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 225403216 1844 LICSNTPEGKKLPD-DVVAANIFTGG-SVGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1905
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-242 9.67e-155

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. 
Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple lines of evidence: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause a drastic reduction or elimination of infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying a potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 479.19  E-value: 9.67e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    6 KYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDIFVDED 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   86 PQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGVCLGNG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  166 RFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFrRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYALL 242
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHK-RGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3640-3926 1.91e-150

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how the autophagosomes were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 469.42  E-value: 1.91e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3640 SKRTRVIKGTCCWILASTFLFCSIISAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3718
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3719 TNYLVVYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVfVTMRSINHDVFSTMFLVGRLVSLVSMWYFGaNLE 3798
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3799 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3874
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 225403216 3875 LGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2339-2826 1.14e-148

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 471.82  E-value: 1.14e-148
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2339 TICDFYQVtdlGYRSS------FCNGSMVCELCFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGYSLYTV 2412
Cdd:pfam19218    2 YPCDGYVD---GYSNSsfnksdYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2413 CFYPLFVLVGMQLLTTWLPEFFMLGTMHWsarlfvfVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYK 2492
Cdd:pfam19218   79 TFVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSK 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2493 RNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEV 2572
Cdd:pfam19218  152 SARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKV 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2573 KQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVE-NEADKAGFLNAAVFYAQSLYRPMLMVEKK 2651
Cdd:pfam19218  232 EFQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNsNGSNLAQAKNACVYYSQLLCKPIKLVDSN 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2652 LITTANTGLSVSRTMFDLYVDSLLSVLDVDRKSLTSFVNAAHnslkegvqleqvmdtfvgcarrkcAIDSDVETKSITKS 2731
Cdd:pfam19218  312 LLSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNA 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2732 VMAAVNAGVEVTDESCNNLVPTYVKS-DTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKAC 2810
Cdd:pfam19218  368 VLNAHRYDVLLTDDSFNNFVPTYAKPeDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTA 447
                          490
                   ....*....|....*.
gi 225403216  2811 VKTGLKIKLTYNKQEA 2826
Cdd:pfam19218  448 KKKGVTFMLTFNTNRM 463
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2862-3218 2.05e-141

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 445.95  E-value: 2.05e-141
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2862 WALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGlvYYRNSKACPVVVAVIDQDIGHT 2941
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG--SPTNSRSCPIVVGVVDEVVGRV 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2942 LFNVPTKVLRYGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHADGTPHPYCYTEGVMHNASLYS 3021
Cdd:pfam19217   79 VPGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3022 SLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYramPGTFCGRNAFD 3101
Cdd:pfam19217  159 DLVPHVRYKLVDGN-YVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3102 LIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDYTSVVVINVIVWCINFLMLFVFQVYPTLSCLY 3181
Cdd:pfam19217  235 LLTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVY 314
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 225403216  3182 ACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCI 3218
Cdd:pfam19217  315 AVLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
Peptidase_C16 pfam01831
Peptidase C16 family;
1083-1331 4.73e-127

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 400.61  E-value: 4.73e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1083 AFDAIYSETLSAFYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLGMQKLWLSYKAGYDQCFVDKLVK 1162
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1163 SAPKSIILPQGGYVADFAYFFLSQCSFKVHANWRCLKCGMELKLQGLDAMFFYGDVVSHMCKCGNSMTLLSADIPYTLHF 1242
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1243 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKQIDGKVVTKFNGDKFDFMVGHGMTFSMSPFEIAQLYGSCITPN 1322
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 225403216  1323 VCFVKGDVI 1331
Cdd:pfam01831  241 LCFGKGDVI 249
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4019-4212 4.94e-116

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain.
The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 366.42  E-value: 4.94e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4019 SEFVNMASFVEYELAKKNLDEAKASGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4098
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4099 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4178
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 225403216 4179 DGAVKQLNEID---VNSTWPLVISANRHNEvSTVVLQ 4212
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
4016-4211 2.48e-104

Coronavirus replicase NSP8; Viral NSP8 (non-structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has conserved N- and C-terminal D/ExD/E motifs; the N-terminal motif is critical for RNA polymerase activity because its residues are part of the Mg2+-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 332.97  E-value: 2.48e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4016 ALQSEFVNMASFVEYELAKKNLDEAKASGSAnQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4095
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4096 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSI 4175
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 225403216  4176 QDADGAVKQLNEIDVNS----TWPLVISANRHNEVstVVL 4211
Cdd:pfam08717  160 KDADGKIVHLKEITMDNspnlAWPLIVTAERANSA--VKL 197
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4323-4452 2.47e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, but not its N7-MTase activity. ExoN is important for proofreading and, therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure.
The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DLL4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 275.70  E-value: 2.47e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4323 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4402
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 225403216 4403 VEHPDVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
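
The paired query/Cdd rows in each alignment block can be compared column by column. The following is a minimal sketch, not part of the CD-Search output, of one way to estimate percent identity from two such rows. The function name `percent_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that upper- and lower-case letters may be compared case-insensitively.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over columns where neither row has a gap.

    Assumptions (not taken from the page itself): '-' marks a gap and
    residue letters are compared case-insensitively.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns in which both rows carry a residue.
    pairs = [(q.upper(), s.upper())
             for q, s in zip(query, subject)
             if q != "-" and s != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for q, s in pairs if q == s)
    return 100.0 * matches / len(pairs)


# Usage with the last segment of the cd21901 alignment shown above.
query_row = "VEHPDVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV"
model_row = "VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD"
print(f"{percent_identity(query_row, model_row):.1f}% identity")
```

Identity computed this way is only a rough descriptive figure; the bit score and E-value reported by CD-Search come from the position-specific scoring matrix, not from a simple match count.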
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3667-3926 4.50e-83

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 274.90  E-value: 4.50e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3667 FVKWTMFMYVTTHML-GVTLCALCFVSFAMLLIKHKHLYLTMYIMPVLCTLFYTNYLVVYK-QSFRGLAYAWlshfvpAV 3744
Cdd:pfam19213    1 LLMYTALYWLPPNLItPVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3745 DYTYMDEVLYGVVLLVAMVFV--TMRSINHDvFSTMFLVGRLVSLVSMWYFGANLEEE-----VLLFLTSLFGTYTWTTM 3817
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVRSK-YSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3818 LSLATAKVIAKWlaVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRP 3897
Cdd:pfam19213  154 IAYKLAKYIVVY--VPPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 225403216  3898 PRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1942-2060 2.20e-82

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 266.63  E-value: 2.20e-82
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1942 GKYYTKPIIKAQFRTFEKVEGVYTNFKLVGHDIAEKLNAKLGFDCNSPFMEYKITEWPTATGDVVLASDDLYVSRYSGGC 2021
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 225403216 2022 VTFGKPVIWRGHEEASLKSLTYFNRPSVVCENKFNVLPV 2060
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2112-2236 6.11e-78

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 254.14  E-value: 6.11e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2112 VMEAQKRSSVTTVAVKEVKLNGVKKPVKVEDSVVVNDPTSETKVVKSLSIVDVYDMFLTGCRYVVWTANELSRLINSPTV 2191
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 225403216 2192 REYVKWGMSKLIIPANLLLLRDEKQEFVAPKVVKAKAIACYGAVK 2236
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4334-4452 1.32e-73

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2'-O-MTase activities. When bound to the N-terminus of NSP14, NSP10 allows the ExoN active site to adopt a stably closed conformation; it is also an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other Coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 241.57  E-value: 1.32e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4334 AILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRVEHPDVDGLCK 4413
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 225403216  4414 LRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4213-4322 1.85e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine deltacoronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.59  E-value: 1.85e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1946-2060 8.44e-61

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, also described as the NSP3e domain. NSP3 is part of the Orf1a polyprotein in SARS-CoV and is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. A group of residues forms a positively charged patch on the protein surface that serves as the nucleic acid binding site. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. The positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 205.48  E-value: 8.44e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1946 TKPIIKAQFRTFEKVEGVYTNFKLV--GHDIAEKLNAKLGFDCNSPFM-EYKITEWPTATGDVVLASDDLYVSRYSGGCV 2022
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLTcsGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 225403216  2023 TFGKPVIWRGHEEASLKSLTYFNRPSVVC-ENKFNVLPV 2060
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVClECKFNTKPV 129
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4213-4322 4.82e-58

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 196.93  E-value: 4.82e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4213 NNELMPQKLRTQVVNSGS-DMNCNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVtDAHCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 225403216  4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3247-3334 1.59e-48

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 168.86  E-value: 1.59e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3247 GTFEEMALTTFMITKESYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREAACSQLAKAMETFNhNNGNDVLYQPP 3326
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGNDVLYTPP 84

                   ....*...
gi 225403216  3327 TASVTTSF 3334
Cdd:pfam16348   85 TVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1533-1607 3.90e-46

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenus (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 161.43  E-value: 3.90e-46
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 1533 QLDDDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGEVFVSSQGKKFGYVQNGSFKEASVSQIRALLA 1607
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3927-4015 1.17e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. 
SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 134.49  E-value: 1.17e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4006
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 225403216 4007 YvrdnTVLQ 4015
Cdd:cd21827    79 P----TVLQ 83
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3927-4015 1.42e-36

Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA replication and is predominantly alpha helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 134.50  E-value: 1.42e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4006
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*....
gi 225403216  4007 YVRDNTVLQ 4015
Cdd:pfam08716   75 YLENRTILQ 83
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1607-1914 2.93e-36

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode one or two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encode only one, called PL-PRO. The structure of this protein shows that it adopts a fold similar to that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 142.04  E-value: 2.93e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1607 ANKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSAADLAAVKDA---FGFDEP 1683
Cdd:pfam08715    2 CKQITIYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVPTTDWVEFYGFKSIleyYTLDAS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1684 QLLQYYSMLgmcKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKF 1763
Cdd:pfam08715   82 KYVIYLSAL---TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1764 NEPSDSTDFIRVVLREADLSGATCDLEFI--CKCGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCG-DKLVHCTQFN 1840
Cdd:pfam08715  159 GDFGDANWTLTNLAEHFDAEYTNAFLKKRvcCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGaNNTDEVIEAS 238
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  1841 VPFLICSNT--PEGKKLPDDVVAANIFTGG-SVGHYTHVKCKPkyQLYDACNVSKVSEAKGNFTDCLYLKNLKQTFS 1914
Cdd:pfam08715  239 LPYLLLSATdgPAAVDCLEDGVGTVAFVGStNSGHYTYQTAKQ--AFYDGAKDRKFGKKSPYVTAVYTRFAFKNETS 313
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
852-941 2.96e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 2.96e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  852 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHGViGTKVCALLERLVDDYVYLFDEGGEE 931
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 225403216  932 VIASRMYCSF 941
Cdd:cd21467    80 VLASEMYCSF 89
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
922-1074 3.41e-32

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 131.21  E-value: 3.41e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   922 VYLFDEGGEEVIASRMYCS-FSAPDEDCVATDV--VYADENQDDDADDPVVLVADTQEEDGVAKEQVDSADSEICVAH-- 996
Cdd:pfam11963  190 IYLRKGGNKGSVTSDHFRRaFTMPVYDFNVEDAyaEVHDEPKGKYSQKAYALLRGYRGVKPVLFVDQYGCDYTGCLADgl 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   997 TGGQEMTEPDAVGSQTPIASAEETEVGEA--CDREGIAEVK----ATVCADALDACP--DQVEAFDIEKVEDSILSELQT 1068
Cdd:pfam11963  270 EAYGDYTLQDMKQLQPVWLANLDFDVVVAwhVVRDPRAVMRlqtiATICGIAYVAQPteDVVDGDVVIKEPVHLLSADAI 349

                   ....*.
gi 225403216  1069 ELNAPA 1074
Cdd:pfam11963  350 VLRLPS 355
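
Each hit carries a statistics line (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`), and the `Cdd:` rows give the model positions covered by the alignment. The sketch below is an illustration only, with hypothetical helper names `parse_hit_stats` and `model_coverage`. It assumes the statistics line keeps the field order shown on this page, and it shows one way to read that line and to estimate how much of a model a hit covers. For the pfam11963 hit above, the alignment spans model positions 190 to 355 of a 355-residue model.

```python
import re

# Field order is assumed to match the statistics lines shown on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s+Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_hit_stats(line: str) -> dict:
    """Parse one 'Pssm-ID: ... E-value: ...' line into typed fields."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit statistics line: {line!r}")
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the model spanned by the aligned interval (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


# Usage with the pfam11963 hit: statistics line plus first/last Cdd positions.
stats = parse_hit_stats(
    "Pssm-ID: 152398  Cd Length: 355  Bit Score: 131.21  E-value: 3.41e-32"
)
print(stats["cd_length"], model_coverage(190, 355, stats["cd_length"]))
```

A coverage well below 1.0 indicates that only part of the model aligned to the query, which is worth keeping in mind when interpreting a hit alongside its E-value.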
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1340-1464 6.03e-23

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, this activity was modest, and it was unclear why it would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which is involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 97.24  E-value: 6.03e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1340 EVIVNPANGRMAHGAGVAGAIAKAAGKAFINETaDMVKAQGVCQVGGCYESTGGKLCKKVLNIVGPDARGHgkQCYSLLE 1419
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 225403216 1420 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1464
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
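The query/model row pairs shown in each alignment can be compared column by column. The following is a minimal Python sketch, not part of the CD-Search output, that estimates percent identity for one block of rows. It assumes the convention visible in the alignments here: `-` marks a gap and lowercase letters mark residues that are not aligned to the model. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over the aligned columns of two equal-length rows.

    Assumption: a column counts as aligned only when both rows hold an
    uppercase residue; '-' gaps and lowercase (unaligned) residues are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    # Avoid division by zero when no column is aligned.
    return 100.0 * identical / aligned if aligned else 0.0


# Last block of the cd21557 alignment (query residues 1420-1464).
query_row = "RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS"
subject_row = "AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC"
print(f"{percent_identity(query_row, subject_row):.1f}% identity in this block")
```

This figure is only a rough, per-block indicator; the Bit Score and E-value reported for each hit come from the position-specific scoring model, not from simple identity.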
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
671-832 5.97e-22

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha- and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 95.41  E-value: 5.97e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   671 LVNGLFAVANGVITFVqeVPELVKNFVDKFKTFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSkVYEVVQKSLPAYIMP 750
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSNA-LVKVVSVKLKGKKQA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   751 V--GCSEATCLVG---EIEPAVFEDDvvdvvKAPLTYQGCCKPPSSFEKICIVDKLYMAKCGDQFYPVVvdnDTVGVLDQ 825
Cdd:pfam19212   78 GlkGAKEATVFVGatvPVTPTRVEVV-----TVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPAS---TDGVVVPP 149

                   ....*..
gi 225403216   826 CWRFPCA 832
Cdd:pfam19212  150 VFKLKGG 156
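Each hit carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`. When working from a plain-text copy of the page, that line can be turned into typed values. The sketch below is one possible way to do this in Python; the regular expression and the name `parse_hit_stats` are assumptions based only on the layout of the lines shown here, not an official NCBI format specification.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not a documented format).
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line: str) -> dict:
    """Parse one per-hit summary line into a dict of typed values."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        # float() accepts scientific notation such as 5.97e-22 and 0e+00.
        "e_value": float(match["e_value"]),
    }


stats = parse_hit_stats(
    "Pssm-ID: 465996  Cd Length: 156  Bit Score: 95.41  E-value: 5.97e-22"
)
print(stats)
```

Parsing the E-value as a float makes it straightforward to sort or filter hits, for example keeping only those below a chosen threshold.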
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1322-1452 1.82e-17

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson & Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 81.58  E-value: 1.82e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   1322 NVCFVKGDVIKVlrrvGAEVIVNPANGRMAHGAGVAGAIAKAAGKAFinETADMVK-AQGVCQVGGCYESTGGKL-CKKV 1399
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKAL--SKEEVRKlAGGECPVGTAVVTEGGNLpAKYV 74
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 225403216   1400 LNIVGPDARGHGKQCYSLLERAYQ------HINKCDNVVTTLISAGIFSVPTDVSLTYL 1452
Cdd:smart00506   75 IHAVGPRASGHSKEGFELLENAYRnclelaIELGITSVALPLIGTGIYGVPKDRSAQAL 133
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1343-1446 7.98e-10

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 59.12  E-value: 7.98e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1343 VNPANGRMAHGAGVAGAIAKAAGKAFINETADMVKaqGVCQVGGCYESTGGKL-CKKVLNIVGPDARGHGKQ-CYSLLER 1420
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELKK--GGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHgEEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 225403216  1421 AYQHI------NKCDNVVTTLISAGIFSVPTD 1446
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1323-1475 4.83e-05

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 46.71  E-value: 4.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1323 VCFVKGDvikvLRRVGAEVIVNPANGRMAHGAGVAGAIAKAAGKAfINETADMVKAQGVCQVGGCYESTGGKL-CKKVLN 1401
Cdd:COG2110     1 IEIVQGD----ITELDVDAIVNAANSSLLGGGGVAGAIHRAAGPE-LLEECRRLCKQGGCPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1402 IVGPDARGHGKQCYSLLERAYQHI------NKCDNVVTTLISAGIFSVPTD----VSLTYLLGVVTKN-----VILVSNN 1466
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaeeLGIRSIAFPAIGTGVGGFPWEeaapIAVETLRDFLEEHpsleeVRFVLFD 155

                  ....*....
gi 225403216 1467 QDDFDVIEK 1475
Cdd:COG2110   156 EEDYEAYRR 164
 
Name Accession Description Interval E-value
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2285-2839 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1117.53  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2285 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGPLPMFVGQIVAWVKTTFGVLTICDFYQVTDLGYRSSFCNGSMVCEL 2364
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2365 CFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLGTMHWSAR 2444
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2445 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2524
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2525 KHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2604
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2605 LLHSKVKGVPETHVVVVENEADKAGFLNAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDRKS 2684
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2685 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVAAD 2764
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 2765 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
250-832 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, unlike SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1100.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  250 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSIEYVGQPIED 329
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  330 MVDGDVVMREPAHLLAPNAIVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCGFRGWVPGNMMDGF 409
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  410 PCPGCCKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRESFKLYGHAVVPFGGAAYWSPYPGMWLPVIWSSVKSYSYLTY 489
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  490 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLEQRAILGVDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 569
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  570 D-GLAPLLLDGLVPRSYYLIKSGQAFTSLMVNFSREVVDMCMDMALLFMHDVKVATKYVKKVTGKLAVRFKALGIAVVRK 648
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  649 ITEWFDLAVDTAASAAGWLCYQLVNGLFAVANGVITFVQEVPELVKNFVDKFKTFFKVLIDSMSVSILSGLTVVKTASNR 728
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  729 VCLAGSKVYEVVQKSLPAYIMPVGC--SEATCLVGEIEPAVFEDDVVDVVKAPLTYQGCCKPPSSFEKICIVDKLYMAKC 806
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 225403216  807 GDQFYPVVVDNDTVGVLDQCWRFPCA 832
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
TM_Y_betaCoV_Nsp3_C cd21713
C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2285-2839 0e+00

C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409661  Cd Length: 545  Bit Score: 835.61  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2285 VSRGFFLVATVFLLWFNFLYANVILSDFylpnigplPMFVGQIVAWVKTTFGVLTICDFYQVTDLGYRSSFCNGSMVCEL 2364
Cdd:cd21713     1 VSLLLFLCLTVLLLWFNFLYANFILSDS--------PTFVGSIVAWFKYTLGISTICDFYQVTYLGDISEFCTGSMLCSL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2365 CFSGFDMLDNYDAINVVQHVVDRRVSFDYIslFKLVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLgtMHWSAR 2444
Cdd:cd21713    73 CLSGMDSLDNYDALNMVQHTVSSRLSDDYI--FKLVLELFFAYLLYTVAFYVLGLLAILQLFFSYLPLFFML--NSWLVV 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2445 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2524
Cdd:cd21713   149 LFVYVINMVPASTLVRMYIVVASLYFVYKLYVHVVYGCNDTACLMCYKRNRATRVECSTVVNGSKRSFYVMANGGTGFCT 228
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2525 KHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2604
Cdd:cd21713   229 KHNWNCVNCDTYGPGNTFICDEVAADLSTQFKRPINPTDSSYYSVTSVEVKNGSVHLYYERDGQRVYERFSLSLFVNLDK 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2605 LLHSKVKGVP--ETHVVVVENEADKAGFLNAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDR 2682
Cdd:cd21713   309 LKHSEVKGSPpfNVIVFDASNRAEENGAKSAAVYYSQLLCKPILLVDKKLVTTVGDSAEVARKMFDAYVNSFLSTYNVTM 388
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2683 KSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVA 2762
Cdd:cd21713   389 DKLKTLVSTAHNSLKEGVQLEQVLKTFIGAARQKAAVESDVETKDIVKCVQLAHQADVDFTTDSCNNLVPTYVKVDTITT 468
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 2763 ADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21713   469 ADLGVLIDNNAKHVNANVAKAANVALIWNVAAFLKLSESLRRQLRSAARKTGLNFKLTTSKLRAVVPILTTPFSLKG 545
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
1-355 0e+00

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 699.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    81 FVDEDPQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   161 CLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFRRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYA 240
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   241 LLRGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELSPVWRDSLDNEVVVAWHVDRDPRAVMRLQTLATVRSI 320
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 225403216   321 EYVGQPIEDMVDGDVVMREPAHLLAPNAIVKRLPR 355
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3340-3633 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 576.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3340 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3419
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3420 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3499
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3500 QLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3579
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 3580 AD--LVLDALASMTGVTVEQMLAAIKRLHSGF-QGKQILGSCVLEDELTPSDVYQQL 3633
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
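The Mpro description above states that the protease requires Gln at the P1 position and a small residue (Gly, Ala, or Ser) at P1'. As a rough illustration of that rule alone, the Python sketch below lists positions in a sequence where a Gln is followed by one of those residues. It is a deliberately simplified filter that uses only the two positions named in the description; it is not a cleavage-site predictor, and the function name `candidate_mpro_sites` and the toy peptide are hypothetical.

```python
import re
from typing import List

# Gln (Q) at P1 followed by Gly, Ala, or Ser at P1'.
# The lookahead leaves P1' unconsumed, so adjacent candidates are all reported.
SITE_RE = re.compile(r"Q(?=[GAS])")


def candidate_mpro_sites(seq: str) -> List[int]:
    """Return 1-based positions of P1 Gln residues followed by a small P1' residue."""
    return [m.start() + 1 for m in SITE_RE.finditer(seq.upper())]


# Toy peptide for illustration only (not taken from the polyprotein).
toy_peptide = "AAQSAAQPQG"
for pos in candidate_mpro_sites(toy_peptide):
    print(f"candidate P1 Gln at position {pos}, P1' = {toy_peptide[pos]}")
```

Many Q-[GAS] pairs in a real polyprotein are not cleaved, so hits from such a scan would need to be checked against annotated Nsp boundaries.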
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3365-3639 8.43e-166

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 512.76  E-value: 8.43e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3365 GLWLDDKVYCPRHVICSSADMTdPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKP 3444
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3445 GETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQLELSTGCHTGTDFSGNFYGPYRDA 3524
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3525 QVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLVLDALASMTGVTVEQMLAAIKR 3604
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 225403216  3605 LHSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3639
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2851-3233 7.59e-165

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 514.45  E-value: 7.59e-165
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2851 FVANLICFIVLWALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGlVYYRNSKACPVV 2930
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2931 VAVIDqDIGHTLFNVPTKVLRYGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYT 3010
Cdd:cd21473    80 VGVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3011 EGVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraM 3090
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--G 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3091 PGTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFLMLFV 3170
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 225403216 3171 FQVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCITYVAVVVSNHALWLF 3233
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1608-1905 1.98e-161

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2, with the 3Ecto domain in between, is important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and the Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows that MERS-CoV is substantially more IFN-sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 501.35  E-value: 1.98e-161
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1608 NKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSAADLAAVKDAFGFDEP-QLL 1686
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1687 QYYSMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1765
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1766 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCGDKLVHCTQFNV-PF 1843
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 225403216 1844 LICSNTPEGKKLPD-DVVAANIFTGG-SVGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1905
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
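The paired rows in an alignment block like the one above can be compared column by column. The following is a minimal sketch, not part of the CDD output, of one way to estimate percent identity between a query row and a model row. The function name `percent_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that any column containing a gap in either row is excluded from the count.

```python
def percent_identity(query_row: str, model_row: str) -> float:
    """Percent identity over aligned columns of two equal-length rows.

    Assumption: '-' marks a gap; columns with a gap in either row are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have the same length")
    # Keep only columns where both rows carry a residue.
    columns = [(q, m) for q, m in zip(query_row, model_row)
               if q != "-" and m != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for q, m in columns if q == m)
    return 100.0 * matches / len(columns)


# First 20 aligned columns of the cd21732 hit (query residues 1608-1627).
query_row = "NKVDVLCTVDGVNFRSCCVA"
model_row = "KTIEVLTTVDGVNFRTVLVN"
print(f"{percent_identity(query_row, model_row):.1f}% identity over 20 columns")
```

Other definitions of identity exist (for example, dividing by the full alignment length including gaps), so the figure from this sketch is only comparable with values computed the same way.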
CoV_Nsp5_Mpro cd21646
coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains ...
3340-3632 5.56e-160

coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.
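The substrate rule stated above (Gln at P1, a small residue such as Gly, Ala, or Ser at P1') can be written as a simple sequence pattern. The sketch below is illustrative only: `candidate_mpro_sites` is a hypothetical helper, and a pattern match is at most a necessary condition, since the description does not cover the other substrate positions or structural accessibility. The example fragment is the last query row of the cd21646 alignment in this record.

```python
import re
from typing import List

# Gln (Q) at P1 followed by Gly, Ala or Ser at P1'.
# The lookahead keeps P1' out of the match so adjacent candidates are all found.
P1_P1PRIME = re.compile(r"Q(?=[GAS])")


def candidate_mpro_sites(seq: str, first_residue: int = 1) -> List[int]:
    """Return the residue numbers of every Gln that satisfies the P1/P1' rule.

    first_residue is the residue number of seq[0] in the full polyprotein.
    """
    return [m.start() + first_residue for m in P1_P1PRIME.finditer(seq.upper())]


# Query residues 3580-3632 of gi 225403216, copied from the alignment.
fragment = "ADLVLDALASMTGVTVEQMLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQ"
print(candidate_mpro_sites(fragment, first_residue=3580))
```

On this fragment the only match is the Gln in `FQG`, which lies inside the aligned protease domain itself rather than at a domain boundary. That illustrates why a P1/P1' match alone should be treated as a candidate, not a predicted cleavage site.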


Pssm-ID: 394885  Cd Length: 292  Bit Score: 496.56  E-value: 5.56e-160
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3340 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADmTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3419
Cdd:cd21646     1 KKMAQPSGKVERCMVSVTYGSTTLNGLWLDDTVYCPRHVICKSTT-SGPDYDDLLSRARNHNFSVQSGGVQLRVVGVTMQ 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3420 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3499
Cdd:cd21646    80 GALLRLKVDTSNPHTPKYKFVTVKPGDSFTILACYNGSPSGVYGVNMRSNYTIKGSFLNGACGSVGYNIDGGTVEFCYMH 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3500 QLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3579
Cdd:cd21646   160 HLELPNGCHTGTDLTGKFYGPYVDQQVAQVEGADTLITDNVVAWLYAAIINGDRWWLNSSRTTVNDFNEWAMANGFTPVS 239
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 225403216 3580 ADLVLDALASMTGVTVEQMLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQ 3632
Cdd:cd21646   240 QVDCLSILAAKTGVSVERLLAAIQQLHQNFGGKQILGSTSLEDEFTPEDVVRQ 292
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-242 9.67e-155

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. 
Nsp1 is critical in regulating viral replication and gene expression, as shown by several lines of evidence: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause a drastic reduction in, or elimination of, infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying a potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 479.19  E-value: 9.67e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    6 KYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDIFVDED 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   86 PQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGVCLGNG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  166 RFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFrRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYALL 242
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHK-RGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
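Each hit in this listing carries a summary line (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`) and a query interval such as `6-242`. The sketch below shows one way these text fields might be parsed for downstream filtering. The names `HitSummary`, `parse_summary`, and `interval_length` are hypothetical, and the regular expression assumes the field order and labels seen in this page dump.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order shown in this listing.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s+Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ... E-value: ...' summary line."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return HitSummary(int(m.group(1)), int(m.group(2)),
                      float(m.group(3)), float(m.group(4)))


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


hit = parse_summary(
    "Pssm-ID: 409341  Cd Length: 236  Bit Score: 479.19  E-value: 9.67e-155"
)
print(hit, interval_length("6-242"))
```

The query interval can be slightly longer than `Cd Length` (here 237 residues against a model of length 236) because unaligned insertions in the query, shown in lowercase in the alignment, count toward the interval but not toward the model.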
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3640-3926 1.91e-150

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how the autophagosomes were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 469.42  E-value: 1.91e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3640 SKRTRVIKGTCCWILASTFLFCSIISAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3718
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3719 TNYLVVYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVfVTMRSINHDVFSTMFLVGRLVSLVSMWYFGaNLE 3798
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3799 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3874
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 225403216 3875 LGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2339-2826 1.14e-148

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 471.82  E-value: 1.14e-148
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2339 TICDFYQVtdlGYRSS------FCNGSMVCELCFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGYSLYTV 2412
Cdd:pfam19218    2 YPCDGYVD---GYSNSsfnksdYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2413 CFYPLFVLVGMQLLTTWLPEFFMLGTMHWsarlfvfVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYK 2492
Cdd:pfam19218   79 TFVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSK 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2493 RNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEV 2572
Cdd:pfam19218  152 SARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKV 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2573 KQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVE-NEADKAGFLNAAVFYAQSLYRPMLMVEKK 2651
Cdd:pfam19218  232 EFQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNsNGSNLAQAKNACVYYSQLLCKPIKLVDSN 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2652 LITTANTGLSVSRTMFDLYVDSLLSVLDVDRKSLTSFVNAAHnslkegvqleqvmdtfvgcarrkcAIDSDVETKSITKS 2731
Cdd:pfam19218  312 LLSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNA 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2732 VMAAVNAGVEVTDESCNNLVPTYVKS-DTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKAC 2810
Cdd:pfam19218  368 VLNAHRYDVLLTDDSFNNFVPTYAKPeDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTA 447
                          490
                   ....*....|....*.
gi 225403216  2811 VKTGLKIKLTYNKQEA 2826
Cdd:pfam19218  448 KKKGVTFMLTFNTNRM 463
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2862-3218 2.05e-141

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 445.95  E-value: 2.05e-141
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2862 WALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGlvYYRNSKACPVVVAVIDQDIGHT 2941
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG--SPTNSRSCPIVVGVVDEVVGRV 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  2942 LFNVPTKVLRYGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHADGTPHPYCYTEGVMHNASLYS 3021
Cdd:pfam19217   79 VPGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3022 SLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYramPGTFCGRNAFD 3101
Cdd:pfam19217  159 DLVPHVRYKLVDGN-YVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3102 LIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDYTSVVVINVIVWCINFLMLFVFQVYPTLSCLY 3181
Cdd:pfam19217  235 LLTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVY 314
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 225403216  3182 ACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCI 3218
Cdd:pfam19217  315 AVLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
betaCoV_Nsp2_SARS_MHV-like cd21515
betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), ...
251-832 6.20e-134

betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), and related proteins; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This family includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. Rather than playing a role in viral replication, SARS-CoV Nsp2 may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, unlike SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 439198  Cd Length: 562  Bit Score: 433.43  E-value: 6.20e-134
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  251 ILFVDQYGCDYTGCLAKGLEDYGDLTLSE----------MKELSPVWRDSLDNEVVVAWHVDRdPRAVMRLQTLATVRSI 320
Cdd:cd21515     2 TRYVDQYFCGPDGYPLECIKDLLAKAGKSsctlsdeqldFKELKRGGYCCRDHEHEIAWYVER-SDAPYELQTPFTIKSA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  321 EYvgqpiedmvdGDVVMREPAHLLAPNAIVK-RLPRLVETMLYTDSS-VTEFCYKTKLCDCGFITQFGYVDCcgDTCGFR 398
Cdd:cd21515    81 KK----------DTFKGEVPAFVFPLNSKVKvLKPRVVKKKLEGFMGkIRTVYPVASPNECNPMTLSALMKC--DHCDET 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  399 GWVPGNMMdGFPCPgCCKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRES-----------------FKLYGHAVVPFG 461
Cdd:cd21515   149 SWQTGNFV-GATCL-CGAEYTLTKEDATSAGYLPPGAVVKMPCPACKNDEVgpehsfadyhnssgiktFLRKGGRTVPFG 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  462 GAAYWSPYP----GMWLPVIWSSVKSYsyltYTGVVGCKAIVQETDAICRSLYmdyvqhkcgNLEQRAILGVDDVYHRQL 537
Cdd:cd21515   227 GCVFAYVGCyngcAYWVPRAWSNIGSN----HTGVVGSGVEVLNDDLLEILLR---------EKVNINIVGDFKLNEEVV 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  538 LVNRGDYSLLLENVDLFVKRRAEFAcKFATCGDGLAPLLLDGLVPRSYYLIKSGQAFTSLMVnfsrEVVDMCMDMALLFM 617
Cdd:cd21515   294 IILASFSASVLAFVDTVKGLDFETF-KFIVESCGNFPVTKGKFVPGAWNLGKSKQVLTPLPA----FPSQAAMVVRSIFA 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  618 HDVKVATKYVK----------------KVTGKLAVRFKALGiavvrkitewfdlaVDTAASAAGWLCYQLV----NGLFA 677
Cdd:cd21515   369 RTVFTATHSVPalqeaaitiidgispqALRLLDAMRFTADL--------------VTNSVLAMAYVTGGLVqvtsQWLDN 434
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  678 VANGVITFVQEVPELVKNfvdKFKTFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSKVYEVVQKSLPAYIMPVGcseat 757
Cdd:cd21515   435 LFGTVVDLLKPVLEWLEE---KISSGIEFLIDLWEILKLLVTGAYKIVKGQIVLAGKNVSEVVQSFLSVLNKALG----- 506
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 225403216  758 clvgeiepavfeddVVDVVKAPLTYQGCckppssfeKICIVDKLYMAKCGDQFYPVVVDND---TVGVLDQCWRFPCA 832
Cdd:cd21515   507 --------------LLLPLKAPKEELFL--------TEGDTVDTSLTSEEVVVKTGVLEELdtpTSKVVDGPLVGTPV 562
Peptidase_C16 pfam01831
Peptidase C16 family;
1-249 1.95e-129

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 407.16  E-value: 1.95e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216    81 FVDEDPQKVEASTMMALQFGSAVLVKPSKRLSVQAWAKLGVLPKTPAMGLFKRFCLCNTRECVCDAHVAFQLFTVQPDGV 160
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   161 CLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRRGGNKGSVTSGHFRRAVTMPVYDFNVEDACEEVHLNPRGKYSCKAYA 240
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 225403216   241 LLRGYRGVK 249
Cdd:pfam01831  241 LCFGKGDVI 249
Peptidase_C16 pfam01831
Peptidase C16 family;
1083-1331 4.73e-127

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 400.61  E-value: 4.73e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1083 AFDAIYSETLSAFYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLGMQKLWLSYKAGYDQCFVDKLVK 1162
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1163 SAPKSIILPQGGYVADFAYFFLSQCSFKVHANWRCLKCGMELKLQGLDAMFFYGDVVSHMCKCGNSMTLLSADIPYTLHF 1242
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1243 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKQIDGKVVTKFNGDKFDFMVGHGMTFSMSPFEIAQLYGSCITPN 1322
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 225403216  1323 VCFVKGDVI 1331
Cdd:pfam01831  241 LCFGKGDVI 249
CoV_PLPro cd21688
Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease ...
1608-1905 7.13e-120

Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high-value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave the Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites, and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409647  Cd Length: 299  Bit Score: 381.83  E-value: 7.13e-120
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1608 NKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSaIHKGKVFFQYSglsAADLAAVKDAFGFDEP-QLL 1686
Cdd:cd21688     1 KTKKVLVTVDGVNFRTIVVTTGDTYGQQLGPVYLDGADVTKGKPD-NHEGETFFVLP---STPDKAALEYYGFLDPsFLG 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1687 QYYSMLGMCKWpVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNEP 1766
Cdd:cd21688    77 RYLSTLAHKWK-VKVVDGLRSLKWSDNNCYVSAVILALQQLKIKFKAPALQEAWNKFLGGDPARFVALIYASGNKTVGEP 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1767 SDSTDFIRVVLREADLSGATCDLEFICK-CGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCGD-KLVHCTQFNVPFL 1844
Cdd:cd21688   156 GDVRETLTHLLQHADLSSATRVLRVVCKhCGIKTTTLTGVEAVMYVGALSYDDLKTGVSIPCPCGGeWTVQVIQQESPFL 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 225403216 1845 ICSN-TPEGKKLP-DDVVAANIFTGGS-VGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1905
Cdd:cd21688   236 LLSAaPPAEYKLQqDTFVAANVFTGNTnVGHYTHVTAKELLQKFDGAKVTKTSEDKGPVTDVLY 299
alphaCoV_Nsp5_Mpro cd21665
alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3341-3635 1.59e-116

alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in alphacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394886  Cd Length: 296  Bit Score: 372.40  E-value: 1.59e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3341 KMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTdPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQG 3420
Cdd:cd21665     3 KMAQPSGVVEKCVVRVSYGNMVLNGLWLGDTVYCPRHVIASDTTST-IDYDHEYSLMRLHNFSISVGNVFLGVVGVTMRG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3421 SLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQ 3500
Cdd:cd21665    82 ALLVIKVNQNNVNTPKYTFRTLKPGDSFNILACYDGVPSGVYGVNMRTNYTIKGSFINGACGSPGYNLNNGTVEFCYMHQ 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3501 LELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKA 3580
Cdd:cd21665   162 LELGSGCHVGSDLDGVMYGGYEDQPTLQVEGANVLVTENVVAFLYAALLNGCNWWLSSDRVTVEAFNEWAVANGFTTVSS 241
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 3581 DLVLDALASMTGVTVEQMLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQLAG 3635
Cdd:cd21665   242 TDCFSILAAKTGVDVERLLAAIQRLSKGFGGKTILGYTSLTDEFTLSEVIKQMYG 296
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4019-4212 4.94e-116

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by the main protease (Mpro), the four released small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The resulting Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain.
The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 366.42  E-value: 4.94e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4019 SEFVNMASFVEYELAKKNLDEAKASGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4098
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4099 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4178
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 225403216 4179 DGAVKQLNEID---VNSTWPLVISANRHNEvSTVVLQ 4212
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
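
The alignment rows above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical and aligned columns for one pair of rows and checks the printed end coordinates. It assumes that '-' marks a gap and that lowercase letters mark residues outside the aligned columns; the function names alignment_identity and residue_count are hypothetical.

```python
from typing import Tuple


def alignment_identity(query_row: str, model_row: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one pair of alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Assumption: a column is aligned only when both rows show an
        # uppercase residue ('-' is a gap, lowercase is unaligned).
        if q.isalpha() and m.isalpha() and q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


def residue_count(row: str) -> int:
    """Count residues in a row (either case), ignoring gap characters."""
    return sum(1 for ch in row if ch.isalpha())


# Last block of the cd21831 alignment, copied from the rows above.
query_row = "DGAVKQLNEID---VNSTWPLVISANRHNEvSTVVLQ"
model_row = "DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ"

identical, aligned = alignment_identity(query_row, model_row)
print(identical, aligned)

# Coordinate check: start position plus residue count minus one should
# reproduce the end coordinate printed at the right of each row.
print(4179 + residue_count(query_row) - 1)  # query row ends at 4212
print(161 + residue_count(model_row) - 1)   # model row ends at 196
```

The same two functions can be applied block by block and the counts summed to obtain an identity figure for a whole hit.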
TM_Y_SARS-CoV-like_Nsp3_C cd21717
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe ...
2333-2839 3.25e-110

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe acute respiratory syndrome-related coronavirus and betacoronavirus in the B lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the sarbecovirus subgenus (B lineage), including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). This conserved C-terminus includes two transmembrane (TM) regions, TM1 and TM2, an ectodomain (3Ecto) between TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and the related murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to carry out Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidoviruses and coronaviruses, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409665  Cd Length: 531  Bit Score: 363.92  E-value: 3.25e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2333 TTFGVLTICDFYQVtdlGYRSS-------FCNGSMVCELCFSGFDMLDNYDAINVVQHVVDrrvSFDY-ISLFKLVVELV 2404
Cdd:cd21717    24 SNLGAPSYCDGVRE---SYLNSsnvttmdFCEGSFPCSVCLSGLDSLDSYPALETIQVTIS---SYKLdLTILGLAAEWF 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2405 IGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLGTmhWSARLFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSK 2484
Cdd:cd21717    98 LAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNS--WLMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTS 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2485 PGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDS 2564
Cdd:cd21717   176 STCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFCKTHNWNCLNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQ 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2565 AYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVENEA--DKAGFLNAAVFYAQSLY 2642
Cdd:cd21717   256 SSYVVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSkcDESAAKSASVYYSQLMC 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2643 RPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKcAIDSD 2722
Cdd:cd21717   336 QPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQG-VVDTD 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2723 VETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADL 2802
Cdd:cd21717   415 VDTKDVIECLKLSHHSDLEVTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQL 494
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 225403216 2803 QHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21717   495 RKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKG 531
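
Each hit in this report carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". The sketch below shows one way to turn such a line into typed values. It assumes the field labels and their order are exactly as they appear in this text dump; the function name parse_hit_stats and the dictionary keys are hypothetical.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line: str) -> dict:
    """Parse one summary line into a dict of typed values."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("line does not look like a hit summary: %r" % line)
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        # float() accepts scientific notation such as 3.25e-110 or 0e+00.
        "e_value": float(match.group("e_value")),
    }


# Summary line of the cd21717 hit, copied from this report.
stats = parse_hit_stats(
    "Pssm-ID: 409665  Cd Length: 531  Bit Score: 363.92  E-value: 3.25e-110"
)
print(stats)
```

Parsed values can then be sorted or filtered, for example by keeping only hits below a chosen E-value threshold.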
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
4016-4211 2.48e-104

Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has N- and C-terminal D/ExD/E conserved motifs; the N-terminal motif is critical for RNA polymerase activity because its residues are part of the Mg2+-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 332.97  E-value: 2.48e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4016 ALQSEFVNMASFVEYELAKKNLDEAKASGSAnQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4095
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4096 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSI 4175
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 225403216  4176 QDADGAVKQLNEIDVNS----TWPLVISANRHNEVstVVL 4211
Cdd:pfam08717  160 KDADGKIVHLKEITMDNspnlAWPLIVTAERANSA--VKL 197
TM_Y_HKU9-like_Nsp3_C cd21715
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus ...
2294-2839 7.38e-95

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus bat coronavirus HKU9 and betacoronavirus in the D lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9. This conserved C-terminus includes two transmembrane (TM) regions, TM1 and TM2, an ectodomain (3Ecto) between TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to carry out Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidoviruses and coronaviruses, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409663  Cd Length: 526  Bit Score: 319.11  E-value: 7.38e-95
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2294 TVFLLWFNFLYANVILSDFYLPNIgplpmfvgqiVAWVKTTFGVLTICDFYQVTDLGyrssfcnGSMVCELCFSGFDMLD 2373
Cdd:cd21715     1 YLWFVWTCLAICGVWLSEPYAPSL----------LTRFKHFLGIVMPCDYVLVNETG-------TGWLHHLCMAGMDGLD 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2374 nYDAINVVQHvvdRRVS-FDYISLFkLVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPefFMLGTmHWSARLFVFVANM 2452
Cdd:cd21715    64 -YPALRMQQH---RYGSpYDYTYIL-MLLEAFCAYLLYTPALPIVGILAVLHLLVLYLP--IPLGN-SWLVVFLYYIIRL 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2453 LPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLN 2532
Cdd:cd21715   136 VPFTSMLRMYIVIAFLWLCYKGFVHVRYGCNNVACLMCYKKNVAKRIECSTVVNGVKRMFYVNANGGTYFCTKHNWNCVS 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2533 CNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKG 2612
Cdd:cd21715   216 CDTYTVDSTFISRQVALDLSAQFKRPINHTDEAYYEVTSVEVRNGYVYCYFDSDGQRSYERFPMDAFTNVSKLHYSELKG 295
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2613 VPETHVVVV---ENEADKAGFLNAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDLYVDSLLSVLDVDRKSLTSFV 2689
Cdd:cd21715   296 AAPAFNVLVfdaTNRIEENAVKTAAIYYAQLACKPILLVDKRMVGVVGDDATIAKAMFEAYAQNYLLKYSIAMDKVKHLY 375
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2690 NAAHNSLKEGVQLEQVMDTFVGCARRKCA-IDSDVETKSITKSVMAAVNAGVEVTDESCNNLVPTYVKSDTIVAADLGVL 2768
Cdd:cd21715   376 STALQQIASGMTVESVLKVFVGSTRAEAKdLESDVDTNDLVSCIRLCHQEGWDWTTDSWNNLVPTYIKQDTLSTLEVGQF 455
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 225403216 2769 IQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFSLKG 2839
Cdd:cd21715   456 MTANARYVNANVAKGAAVNLVWRYADFIKLSESMRRQLRVAARKTGLNLLVTTSSLKADVPCVVTPFKIVG 526
TM_Y_MERS-CoV-like_Nsp3_C cd21716
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle ...
2289-2836 2.65e-93

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. This conserved C-terminus includes two transmembrane (TM) regions, TM1 and TM2, an ectodomain (3Ecto) between TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to carry out Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidoviruses and coronaviruses, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409664  Cd Length: 566  Bit Score: 315.98  E-value: 2.65e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2289 FFLVATVFLL---WFNFLYANVILSDFYLPNIGPLPMFVGQIVAWVkttfGVLTICDFYQVTdlgYR------SSFC-NG 2358
Cdd:cd21716     5 LMLCTTGLLLssvYHLYVFNQVLSSDVMLEDATGLKAFYKEVRSYL----GISSACDGLASA---YRansfdvPDFCaNR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2359 SMVCELCFSGFDMLDNYDAINVVQ-HVVDRRVSFDYISLfklVVELVIGYSLYTVCFYPLFVLVGMQLLTTWLPEFFMLG 2437
Cdd:cd21716    78 SALCNWCLIGQDSITHYSALKMVQtHLSHYVLNIDWLWF---ALELLLAYVLYTSAFNWLLLACTLQYFFAQTSAFVDWR 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2438 TMHWSARLFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMAN 2517
Cdd:cd21716   155 SYNYVVSGIFLLFTHIPLDGLVRIYNVLACLWFLRKFYNHVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYITAN 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2518 GGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYERDGQRVYDDVSAS 2597
Cdd:cd21716   235 GGTSFCRRHNWNCVDCDTAGVGNTFICEEVANDLTTSLRRLVKPTDRSHYYVDSVEVKDTVVQLNYRRDGQSCYERFPLC 314
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2598 LFVDMNGLLHSKV----KGVPEtHVVVVENEADKaGFLN----AAVFYAQSLYRPMLMVEKKLITTANTGLSVSRTMFDL 2669
Cdd:cd21716   315 YFTNLDKLKFKEVckttTGIPE-HNFIIYDSSDR-GQENlarsACVYYSQVLCKPILLVDSNLVTSVGDSSEIAIKMFDS 392
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2670 YVDSLLSVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETKSITKSVMAAVNAGVEVTDESCNN 2749
Cdd:cd21716   393 FVNSFVSLYNVTRDKLEKLISTARDGVKRGDNFQSVLKTFIDAARGPAGVESDVETNEIVDAVQYAHKHDIQLTTESYNN 472
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2750 LVPTYVKSDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANVP 2829
Cdd:cd21716   473 YVPSYVKPDSVATSDLGSLIDCNAASVNQTSMRNANGACIWNAAAYMKLSDSLKRQIRIACRKCNLNFRLTTSKLRANDN 552

                  ....*..
gi 225403216 2830 ILTTPFS 2836
Cdd:cd21716   553 ILSVKFS 559
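
The cd21716 alignment above spans model positions 5 to 559, while the summary line gives a Cd Length of 566, so the hit covers most but not all of the model. A small sketch of that arithmetic follows; the function name model_coverage is hypothetical, and the positions are read off the "Cdd:" rows of this report.

```python
def model_coverage(first: int, last: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by an alignment (1-based, inclusive)."""
    if not 1 <= first <= last <= cd_length:
        raise ValueError("expected 1 <= first <= last <= cd_length")
    return (last - first + 1) / cd_length


# cd21716: first and last model positions from the alignment, Cd Length from the summary line.
coverage = model_coverage(5, 559, 566)
print("cd21716 model coverage: {:.1%}".format(coverage))
```

The span includes any model positions that fall in gaps inside the alignment, so it is an upper bound on the fraction of model positions actually matched by query residues.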
gammaCoV_Nsp5_Mpro cd21667
gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3338-3639 1.08e-91

gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in gammacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.
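
The substrate preference described above (Gln at P1, and Gly, Ala, or Ser at P1') can be expressed as a simple scan over a polyprotein sequence. The sketch below is illustrative only: it uses just the two positions named in the description, so a match is a candidate site rather than evidence of cleavage. The function name candidate_mpro_sites is hypothetical; the test fragment is the start of the query row of the pfam08717 alignment in this report (query residues 4016-4035).

```python
# Residues allowed at P1' according to the description above.
SMALL_P1_PRIME = frozenset("GAS")


def candidate_mpro_sites(sequence: str, first_position: int = 1) -> list:
    """Return the positions of Gln residues (P1) followed by Gly, Ala, or Ser (P1').

    Positions are numbered from first_position, so a fragment of a longer
    protein can be reported in the coordinates of the full sequence.
    """
    seq = sequence.upper()
    sites = []
    for i in range(len(seq) - 1):
        if seq[i] == "Q" and seq[i + 1] in SMALL_P1_PRIME:
            sites.append(first_position + i)
    return sites


# Query residues 4016-4035, copied from the pfam08717 alignment row.
fragment = "ALQSEFVNMASFVEYELAKK"
sites = candidate_mpro_sites(fragment, first_position=4016)
print(sites)
```

For this fragment the only Gln followed by a small residue is at position 4018, immediately before residue 4019, where the cd21831 (Nsp8) hit begins.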


Pssm-ID: 394888  Cd Length: 306  Bit Score: 301.32  E-value: 1.08e-91
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3338 GIVKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVIcssADMTDPDYPNLLCRVTSSDFCVMS-DRMSLTVMSY 3416
Cdd:cd21667     1 GFKKLVSPSSAVEKCIVSVSYRGNNLNGLWLGDSIYCPRHVL---GKFSGDQWQDVLNLANNHEFEVVTqNGVTLNVVSR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3417 QMQGSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFV 3496
Cdd:cd21667    78 RLKGAVLILQTAVANANTPKYKFVKANCGDSFTIACSYGGTVVGLYPVTMRSNGTIRASFLAGACGSVGFNIEKGVVNFF 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3497 YMHQLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCN---WF---VQSDSCSLEEFNVWA 3570
Cdd:cd21667   158 YMHHLELPNALHTGTDLMGEFYGGYVDEEVAQRVPPDNLVTNNIVAWLYAAIISVKEssfSLpkwLESTTVSVEDYNKWA 237
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 225403216 3571 MTNGFSSIKADLVLDALASMTGVTVEQMLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3639
Cdd:cd21667   238 SDNGFTPFSTSTAITKLSAITGVDVCKLLRTIMVKSAQWGSDPILGQYNFEDELTPESVFNQVGGVRLQ 306
alpha_betaCoV_Nsp2 cd21511
alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ...
251-753 3.58e-89

alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This alpha- and betacoronavirus family includes alphacoronavirus human coronavirus 229E (HCoV-229E) Nsp2, betacoronavirus Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. SARS-CoV Nsp2 has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, unlike SARS-CoV Nsp2, may play an important role in the viral life cycle. This family may be distantly related to the gammacoronavirus Avian infectious bronchitis virus (IBV) Nsp2; IBV Nsp2 is a weak protein kinase R (PKR) antagonist, which may suggest that it plays a role in interfering with intracellular immunity.


Pssm-ID: 439197  Cd Length: 399  Bit Score: 297.93  E-value: 3.58e-89
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  251 ILFVDQYGCDYTGCLAKGLEDYGDLTLSEM---------KELSPVWRDSLDNEVVVAWHVDRdPRAVMRLQTLATVRSIe 321
Cdd:cd21511     2 VTYVDQYGCGPDGKPVECIKDLLDVAKKGSctlseqldgIELKNGVYDLRDHEVVIAWYVER-KDVPYEKQTIFTIKSA- 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  322 yvgqpiedmVDGDVVMREPAHLLAPNAIVK-RLPRLVETMLYTDSSVTE-FCYKTKLCDCGFITQFGYVDCCgdTCGFRG 399
Cdd:cd21511    80 ---------KFGTFVGEVPAHVFPLNSIVKeIQPRVKKKKKVTLSGVIRsFYSKASPNECNPITLSALVKCT--HCDEKS 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  400 WVPGNMMDGFPCPgCCKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRESFKLYGHAVVPFGGAAYWSPYP----GMWLP 475
Cdd:cd21511   149 WQTGDFVDGFTCE-CGAEYLNWKLDAQSSGVLPPGAVVKTQCPACVNRETFLRGGGRIVYFGGAVYSYVGCingvAYWVP 227
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  476 VIWSSVKSysylTYTGVVGckaivqetdaicrslymdyvqhkcgnleqrailgvddvyhrqllvnrgdyslllenvdlfv 555
Cdd:cd21511   228 RASSSVGC----FHTGVVG------------------------------------------------------------- 242
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  556 krraefackfatcgdglapllldGLVPRSYYLIKSGQAFTSLMvnfsrevVDMCMDMALLFMHDVKVATKYVKKV----- 630
Cdd:cd21511   243 -----------------------KIVPGAWGLGASAQKLTPLT-------TGAAVVFVLIFARTLFAAVGSVPQLqasap 292
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  631 -----------TGKLAVRFKALGIAvvrkitewfdlAVDTAASAAGWLCYQLVN---GLFAVANGVItfvqevpelvknf 696
Cdd:cd21511   293 tildgivnasdRLVDAMQFSADLVV-----------ATTTSAGAAGYVVAGLVDllkPILEWVLSKI------------- 348
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  697 vdkfktffkvlidsmsvsilsgltvvktasNRVCLAGSKVYEVVQKSLPAYIMPVGC 753
Cdd:cd21511   349 ------------------------------GQVCYAGCDVYERVMAFLNVVVKAAGK 375
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4323-4452 2.47e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by the main protease (Mpro), the four released small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, but not its N7-MTase activity. ExoN is important for proofreading and therefore for the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts after the Nsp14 MTase in RNA cap methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DLL4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 275.70  E-value: 2.47e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4323 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4402
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 225403216 4403 VEHPDVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3667-3926 4.50e-83

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 274.90  E-value: 4.50e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3667 FVKWTMFMYVTTHML-GVTLCALCFVSFAMLLIKHKHLYLTMYIMPVLCTLFYTNYLVVYK-QSFRGLAYAWlshfvpAV 3744
Cdd:pfam19213    1 LLMYTALYWLPPNLItPVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3745 DYTYMDEVLYGVVLLVAMVFV--TMRSINHDvFSTMFLVGRLVSLVSMWYFGANLEEE-----VLLFLTSLFGTYTWTTM 3817
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVRSK-YSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3818 LSLATAKVIAKWlaVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRP 3897
Cdd:pfam19213  154 IAYKLAKYIVVY--VPPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 225403216  3898 PRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1942-2060 2.20e-82

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 266.63  E-value: 2.20e-82
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1942 GKYYTKPIIKAQFRTFEKVEGVYTNFKLVGHDIAEKLNAKLGFDCNSPFMEYKITEWPTATGDVVLASDDLYVSRYSGGC 2021
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 225403216 2022 VTFGKPVIWRGHEEASLKSLTYFNRPSVVCENKFNVLPV 2060
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
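
A rough way to summarize an alignment block like the one above is percent identity over the aligned columns. The sketch below is a minimal illustration, not part of the CD-Search output: the function name `percent_identity` is hypothetical, gap columns ("-") are skipped, and lowercase residues are compared like any others, which is an assumption about how to treat them.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of gap-free aligned columns whose residues match (case-insensitive)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where neither row has a gap character.
    columns = [
        (q, s)
        for q, s in zip(query.upper(), subject.upper())
        if q != "-" and s != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(q == s for q, s in columns)
    return 100.0 * matches / len(columns)


# Last alignment row of the cd21824 hit (query residues 2022-2060, model 81-119).
query_row = "VTFGKPVIWRGHEEASLKSLTYFNRPSVVCENKFNVLPV"
model_row = "ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV"
print(round(percent_identity(query_row, model_row), 1))
```

The same function can be applied row by row or to the concatenated rows of a full hit; identity is only a crude summary and does not replace the bit score or E-value reported for the hit.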
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2112-2236 6.11e-78

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 254.14  E-value: 6.11e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2112 VMEAQKRSSVTTVAVKEVKLNGVKKPVKVEDSVVVNDPTSETKVVKSLSIVDVYDMFLTGCRYVVWTANELSRLINSPTV 2191
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 225403216 2192 REYVKWGMSKLIIPANLLLLRDEKQEFVAPKVVKAKAIACYGAVK 2236
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
deltaCoV_Nsp5_Mpro cd21668
deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3336-3632 9.29e-74

deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in deltacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394889  Cd Length: 302  Bit Score: 249.73  E-value: 9.29e-74
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3336 QSGIVKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDPDYPNLL-CRvtssDFCVMS--DRMSLT 3412
Cdd:cd21668     1 QAGIKILLHPSGVVERCMVSVTYNGSTLNGIWLHNVVYCPRHVIGKYTGSQWQDMVSIAdCR----DFVIFCptQGIQLT 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3413 VMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDS 3492
Cdd:cd21668    77 VQSVKMVGAVLQLTVHTKNLHTPDYEFERATPGSSMTIACAYDGIVRNVYHVVLQTNNLIYASFLNGACGSVGYTLKGKT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3493 VRFVYMHQLELSTGCHTGTDFSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILN---RCNWFVQSDsCSLEEFNVW 3569
Cdd:cd21668   157 LLLHYMHHLEFNNKTHGGTDLHGHFYGPYVDEEVAQHQTAFQYYTDNVVAQIYAHLLTidaKPKWLASQE-ISVEDFNEW 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 3570 AMTNGFSSIKADL----VLDALASMTGVTVEQMLAAIKRLHSGFQGKQILGSCVLEDELTPSDVYQQ 3632
Cdd:cd21668   236 AANNSFANFPCESsnmaYLEGLAQTTKVSVGRVLNTIIQLTLNRGGALIMGKPDFECDWTPEMVYNQ 302
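
The Mpro/Nsp5 description above states a simple substrate rule: Gln at P1 and a small residue such as Gly, Ala, or Ser at P1'. The sketch below scans a sequence for positions that satisfy only that rule. It is an illustration under stated assumptions: the name `candidate_cleavage_sites` is hypothetical, and a match is a candidate, not a confirmed cleavage site. The example fragment is the junction between the Nsp9 hit (ending at residue 4322) and the Nsp10 hit (starting at 4323) as shown in the alignments on this page.

```python
import re

# Gln at P1, followed by a small residue (Gly, Ala, or Ser) at P1'.
# The lookahead keeps the match on the P1 residue only.
P1_P1PRIME = re.compile(r"Q(?=[GAS])")


def candidate_cleavage_sites(seq: str, offset: int = 1) -> list[int]:
    """Return positions of Gln residues that satisfy the P1/P1' rule.

    `offset` is the sequence position of the first character of `seq`,
    so results are reported in polyprotein numbering.
    """
    return [m.start() + offset for m in P1_P1PRIME.finditer(seq.upper())]


# Residues 4317-4328 of the query: end of Nsp9 (STVRLQ) and start of Nsp10 (AGTATE).
fragment = "STVRLQAGTATE"
print(candidate_cleavage_sites(fragment, offset=4317))
```

Scanning the full ORF1a sequence this way would report every Gln followed by a small residue, so the output needs to be filtered against known Nsp boundaries before being interpreted.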
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4334-4452 1.32e-73

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as part of a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc-binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2'-O-MTase activities. When binding to the N-terminus of NSP14, NSP10 allows the ExoN active site to adopt a stably closed conformation; it is also an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 241.57  E-value: 1.32e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4334 AILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRVEHPDVDGLCK 4413
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 225403216  4414 LRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
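
Each hit record on this page carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." plus an interval such as "4334-4452". A minimal parsing sketch follows. The names `parse_summary` and `interval_length` are hypothetical, and the regular expression assumes only the field layout visible in this listing, not a documented format.

```python
import re

# Field layout as it appears in the hit summary lines of this listing.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[0-9.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Extract the four numeric fields from a hit summary line."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


hit = parse_summary("Pssm-ID: 462788  Cd Length: 119  Bit Score: 241.57  E-value: 1.32e-73")
print(hit["cd_length"], interval_length("4334-4452"))
```

Comparing the interval length with the Cd Length gives a quick indication of whether the query spans the whole domain model; for the pfam09401 hit both values are 119.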
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4213-4322 1.85e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity with the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function; characterizing these forms may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may play a critical role in dimerization, as seen in porcine deltacoronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.59  E-value: 1.85e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
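
The Nsp9 description above names GXXXG as the protein-protein interaction motif in the dimer interface. The sketch below locates that motif in an ungapped sequence. The name `find_gxxxg` is hypothetical, and a pattern match only marks a candidate; it says nothing about whether the residues actually sit in a helix-helix interface.

```python
import re

# Lookahead with a capture group so that overlapping motifs are all reported.
GXXXG = re.compile(r"(?=(G...G))")


def find_gxxxg(seq: str, offset: int = 1) -> list[tuple[int, str]]:
    """Return (position, motif) pairs for every G-x-x-x-G in an ungapped sequence.

    `offset` is the sequence position of the first character of `seq`.
    """
    return [(m.start() + offset, m.group(1)) for m in GXXXG.finditer(seq.upper())]


# Last alignment row of the cd21898 hit (query residues 4292-4322).
query_row = "LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ"
print(find_gxxxg(query_row, offset=4292))
```

On the query row above this reports a single motif, GWVVG at position 4309; the corresponding model row contains GQVLG in the same columns.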
CoV_Nsp8 cd21816
Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) ...
4019-4206 2.50e-67

Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. 
The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of its oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409256  Cd Length: 194  Bit Score: 227.02  E-value: 2.50e-67
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4019 SEFVNMASFVEYELAKKNLDEAKASGsANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4098
Cdd:cd21816     1 SEFSHLPSYAAYATAQAAYEQAVKNG-DSPQELKKLTKALNIAKSEFDRDAAVQKKLEKMADQAMTSMYKEARAEDRRAK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4099 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4178
Cdd:cd21816    80 ITSAMHALLFSMLKKLDSDAVNNIFEQARDGVVPLNIIPLTTANKLMVVIPDYETYKKTVDGNTFTYAGALWSIVTVVDA 159
                         170       180       190
                  ....*....|....*....|....*....|..
gi 225403216 4179 DGAVKQLNEIDV----NSTWPLVISANRHNEV 4206
Cdd:cd21816   160 DGKIVHLSEINMdnspNIAWPLIVTCLRAGAV 191
alphaCoV_Nsp8 cd21830
alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4019-4212 3.51e-67

alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine epidemic diarrhea coronavirus (PEDV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. 
FCoV Nsp8 forms a 1:2 heterotrimer with Nsp7; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409257  Cd Length: 195  Bit Score: 226.46  E-value: 3.51e-67
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4019 SEFVNMASFVEYELAKKNLDEAKASGSaNQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4098
Cdd:cd21830     4 STFANMPSFIAYETARQDYEDAVKNGS-SPQLIKQLKKAMNIAKSEFDREASVQRKLDRMAEQAAAQMYKEARAVNRKSK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4099 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4178
Cdd:cd21830    83 VISAMHSLLFGMLRRLDMSSVDTILNLAKDGVVPLSIIPAASATRLVVVVPDLESFSKIRRDGCVHYAGVVWTIVDIKDN 162
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 225403216 4179 DGAVKQLNEIDV----NSTWPLVISANRhnevsTVVLQ 4212
Cdd:cd21830   163 DGKVVHLKEVTAaneeSLAWPLHLNCER-----IVKLQ 195
CoV_Nsp10 cd21872
coronavirus non-structural protein 10; This model represents the non-structural protein 10 ...
4323-4451 4.96e-61

coronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, but not its N7-MTase activity. ExoN is important for proofreading and, therefore, for the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. 
Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16, and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409325  Cd Length: 131  Bit Score: 206.17  E-value: 4.96e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4323 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4402
Cdd:cd21872     1 AGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVKPEANMDQESFGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 225403216 4403 VEHPDVDGLCKLRGKFVQVPL-GIKDPVSYVLTHDVCQVCGFWRDGSCSC 4451
Cdd:cd21872    81 IDHPNPDGFCDYKGKFVQIPTtCANDPVGFTLRNTVCTVCQMWKGYGCSC 130
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1946-2060 8.44e-61

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, also described as the NSP3e domain. NSP3 is part of the Orf1a polyprotein in SARS-CoV and is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. A group of residues forms a positively charged patch on the protein surface that serves as the binding site for nucleic acids. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. The positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 205.48  E-value: 8.44e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1946 TKPIIKAQFRTFEKVEGVYTNFKLV--GHDIAEKLNAKLGFDCNSPFM-EYKITEWPTATGDVVLASDDLYVSRYSGGCV 2022
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLTcsGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 225403216  2023 TFGKPVIWRGHEEASLKSLTYFNRPSVVC-ENKFNVLPV 2060
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVClECKFNTKPV 129
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4213-4322 4.82e-58

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 196.93  E-value: 4.82e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  4213 NNELMPQKLRTQVVNSGS-DMNCNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVtDAHCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 225403216  4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
TM_Y_CoV_Nsp3_C cd21686
C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; ...
2354-2834 5.02e-57

C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409657  Cd Length: 476  Bit Score: 207.43  E-value: 5.02e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2354 SFCNGSMVCELCFSGFDMLDNYDAINVVQHVVDRRVSFDY-ISLFKLVVELVIGYSLYTVCFYPLFVLVGMQLLTTwlpe 2432
Cdd:cd21686    54 SYCAGDLVCQVCLDGQDSLHLYPHLRVVQQPLQTTDYTVYaLSLILYLANMTLFMGTFIVTFFVNFYGVGIPFYGW---- 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2433 ffmlgtmhwsarlfvFVANMLPAFTLLRFYIVVtaMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYY 2512
Cdd:cd21686   130 ---------------LLIDVPQSAFMMTFSVFF--FYYVLKFFVHVTHGCKIPTCMVCAKLARPPRVEVETVVQGRKYSF 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2513 DVMANGGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSvieVKQVGCSMRLFYERDGQRVYD 2592
Cdd:cd21686   193 YVYTNGGFTFCKEHNFYCKNCDLYGPGCTFISDEVAEELSRATKLSVKPTAPAFLL---VDDVEVQNDVVFARAKYNQNA 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2593 DVSASLFVDMngllhskvkgvpeTHVVVVENEADKAGFL----NAAVFYAQSLYRPMLMVEKKLITTANTGLSVSRtmfd 2668
Cdd:cd21686   270 HVSLSKFSDI-------------PDFIIAANFGSNCEQLstakNAAVYYSQDLCKPILILDQALSRPIDNYQEVAS---- 332
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2669 lyvdSLLSVLDVDRksLTSFVNAAHNSlKEGVQlEQVMDTFVgCArrkcaidsdvetksitksVMAAVNAGVEVTDESCN 2748
Cdd:cd21686   333 ----RIEKYYPVAK--IKPTGDIFTDI-KQGTD-GEASDSAI-NA------------------AVLAHQRDVEFTGDSFN 385
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2749 NLVPTYVKSDTIVAADLGVlIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLTYNKQEANV 2828
Cdd:cd21686   386 NILPSYAKDESKLTAEDQA-MSVIAESGNANVNVKGTIPVVWLVADFIRLSEQARKYIISAAKKNGVTFALTPSTLRMRG 464

                  ....*.
gi 225403216 2829 PILTTP 2834
Cdd:cd21686   465 NIATQP 470
TM_Y_alphaCoV_Nsp3_C cd21712
C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2352-2839 7.23e-53

C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alphacoronavirus, including Porcine epidemic diarrhea virus and Human coronavirus 229E, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409660  Cd Length: 501  Bit Score: 195.92  E-value: 7.23e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2352 RSSFCNGSMVCELCFSGFDMLDNYDAINVVQHVVDRRVSFDYISLFKLVVELVIGySLYTVCFYPLFVLvgmQLLTTWLp 2431
Cdd:cd21712    54 KSEVCGNSLLCKACLAGYDELSDFPHLQVVWDHVSDPLFSNVLPLFYFAFLLIFG-NNYVRCFLLYFVA---QYINNWG- 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2432 EFFMLGTMHWsarlfvfVANMLPaFTLLRFYIVVT-AMYKVYCLCRHVMYGCSKPGCLFCYKRNRSVRVKCSTVVGGSLR 2510
Cdd:cd21712   129 VYFGYQDYSW-------FLHFVP-FDSFSDEIVVIfIVVKVLLFLKHVIFGCDKPSCKACSKSARLTRIPVQTIVNGSMK 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2511 YYDVMANGGTGFCTKHQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYysvIEVKQVGCS---MRLFYERDG 2587
Cdd:cd21712   201 SFYVHANGGGKFCKKHNFFCVNCDSYGVGNTFINDEVARELSNVVKTTVQPTGPAY---IEVDKVEFSngfYYLYSGDTF 277
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2588 QRVYDDVSASLFVDMNGLlhsKVKGVPETHVVVVENEADKAGFLNAAVFYAQSLYRPMlmvekKLittantglsvsrtmf 2667
Cdd:cd21712   278 WRYNFDITEKKYSCKEVL---KNCNLLDDFIVYNNNGSNVAQVKNACVYFSQLLCKPI-----KL--------------- 334
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2668 dlyVDS-LLSVLDVDRKS--LTSFVNAAHNSLkeGVQLEQVMDTfvgcarRKCAIDSDVETkSITKSVMAAVNA---GVE 2741
Cdd:cd21712   335 ---VDSaLLSSLSVDFNGalHKAFVKVLKNSF--NKDLSNCKTL------EECKKALGLDV-SDDEFESAVSNAhryDVL 402
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2742 VTDESCNNLVPTYVK-SDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACVKTGLKIKLT 2820
Cdd:cd21712   403 LTDRSFNNFVTSYAKpEEKLSTHDIAVCMRAGAKVVNHNVLTKENVPIVWLAKDFSALSEEARKYIVKTTKAKGVNFLLT 482
                         490
                  ....*....|....*....
gi 225403216 2821 YNKQEANVPILTTPFSLKG 2839
Cdd:cd21712   483 FNDNRMTTTLPAVSIVSKK 501
gammaCoV_Nsp8 cd21832
gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4016-4212 7.09e-52

gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. 
SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409259  Cd Length: 210  Bit Score: 183.23  E-value: 7.09e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4016 ALQSEFVNMASFVEYELAKKNLDEAKA---SGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARI 4092
Cdd:cd21832     1 SVTQEFSHIPSYAEYERAKDLYEKVLAdskNGGVTQQELAAYRKAANIAKSVFDRDLAVQKKLDSMAERAMTTMYKEARV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4093 NDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHI 4172
Cdd:cd21832    81 TDRRAKLVSSLHALLFSMLKKIDSEKLNVLFDQASSGVVPLATVPIVCSNKLTLVIPDPETWVKCVEGMHVTYSTVVWNI 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 225403216 4173 QSIQDADGavkqlNEIDVNST--------------WPLVISANR--HNEVStVVLQ 4212
Cdd:cd21832   161 DTVIDADG-----TELHPTSTgsgltycisgdniaWPLKVNLTRngHNKVD-AVLQ 210
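Each hit on this page carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) with a regular layout, so it can be read mechanically. The following is a minimal sketch, assuming the field labels appear exactly as they do here; the function name `parse_hit_stats` is hypothetical and is not part of any NCBI tool.

```python
import re

# Assumed layout, copied from this page:
# "Pssm-ID: 409259  Cd Length: 210  Bit Score: 183.23  E-value: 7.09e-52"
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the four statistics of one domain hit as a dict, or None if absent."""
    match = _STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


stats = parse_hit_stats(
    "Pssm-ID: 409259  Cd Length: 210  Bit Score: 183.23  E-value: 7.09e-52"
)
print(stats)
```

Returning `None` for non-matching lines lets the same function be applied to every line of a saved page without pre-filtering.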
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3247-3334 1.59e-48

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 168.86  E-value: 1.59e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3247 GTFEEMALTTFMITKESYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREAACSQLAKAMETFNhNNGNDVLYQPP 3326
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGNDVLYTPP 84

                   ....*...
gi 225403216  3327 TASVTTSF 3334
Cdd:pfam16348   85 TVSVTSSL 92
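The model coordinates in the alignment rows, together with the Cd Length, indicate how much of the domain model the hit spans. Here the pfam16348 rows run from model position 6 to 92 and the Cd Length is 92. A minimal sketch of that arithmetic follows; the helper name `model_coverage` is hypothetical, and the result counts the spanned model range without subtracting gap columns.

```python
def model_coverage(model_start, model_end, model_length):
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    if not (1 <= model_start <= model_end <= model_length):
        raise ValueError("coordinates must satisfy 1 <= start <= end <= length")
    return (model_end - model_start + 1) / model_length


# Values read from the pfam16348 hit: model positions 6 to 92, Cd Length 92.
coverage = model_coverage(6, 92, 92)
print(f"{coverage:.3f}")
```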
alphaCoV-Nsp6 cd21558
alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3625-3926 5.79e-48

alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394844  Cd Length: 293  Bit Score: 175.08  E-value: 5.79e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3625 TPSDVYQQLAGVKLQS-KRTRVIKGtccwILASTFLFCSIISAFVKWTMFMYVTTHMLGVTLCALCFVS-FAMLLIKHKH 3702
Cdd:cd21558     2 TTSEVIKQMYGVNLQSgKVKSAFKN----VLLVGVFLFMFWSELLMYTSFFWINPGLVTPVFLVLVLVSlLLTLFLKHKM 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3703 LYLTMYIMPVLCtlfytnYLVVYKQSFRGLAYAWLS-HFVPAVDYTYMDevLYGVVLLVAMVFV----TMRSINHDVFST 3777
Cdd:cd21558    78 LFLQTFLLPSVI------VTAFYNLAWDYYVTAVLAeYFDYHVSLMSFD--IQGVLNIFVCLFVfflhTYRFVTSGTSWF 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3778 MFLVGRLVSLVSMWYFGanleEEVLLFLTSLFG-TYTWttMLSLATAKVIAKWLAVNVLYFTDIPQIKLVLLSYLCIGYV 3856
Cdd:cd21558   150 TYVVSLVFVLYNYFYGN----DYLSLLMMVLSSiTNNW--YVGAIAYKLAYYIVYVPPSLVADFGTVKAVMLVYVALGYL 223
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3857 CCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21558   224 CCVYYGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLKAPTGVFDALLLSFKLIGIGGERTIKISTVQ 293
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1533-1607 3.90e-46

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenus (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 161.43  E-value: 3.90e-46
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 1533 QLDDDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGEVFVSSQGKKFGYVQNGSFKEASVSQIRALLA 1607
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
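Each alignment block pairs a query row (`gi ...`) with a model row (`Cdd:...`), so percent identity can be estimated by comparing the two rows column by column. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, the two sequences are copied from the cd21524 rows above, and columns where either row has a gap (`-`) are skipped, which is one common convention rather than a definition taken from this page.

```python
def percent_identity(query_row, subject_row):
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # skip gap columns (assumed convention)
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


# Rows copied from the DPUP_MHV_Nsp3 (cd21524) alignment on this page.
query = "QLDDDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGEVFVSSQGKKFGYVQNGSFKEASVSQIRALLA"
model = "QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA"
pid = percent_identity(query, model)
print(f"{pid:.1f}% identity over {len(query)} columns")
```

For hits whose alignment is split across several blocks, the rows of each sequence would need to be concatenated before calling the function.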
betaCoV_Nsp3_NAB cd21795
nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents ...
1952-2060 1.06e-43

nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus including highly pathogenic human coronaviruses (CoVs) such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but may not be conserved in the Nsp3 NAB from betacoronaviruses in other lineages.


Pssm-ID: 409347  Cd Length: 110  Bit Score: 155.81  E-value: 1.06e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1952 AQFRTFEKVEGVYTNFKLVG---HDIAEKLNAKLGFdcNSPFMEYKITEWPTATGDVVLASDDLYVSRYSGGCVTFGKPV 2028
Cdd:cd21795     1 LDVPAAPKPVTVYDNFKLVScqnQSIADDFNRTLGF--TKPGSELLLTVYPNTSGDVVAVSDDNYTVVYKKGSLLMGKPV 78
                          90       100       110
                  ....*....|....*....|....*....|...
gi 225403216 2029 IWRgHEEASLKSLTYFNRPSVVCENK-FNVLPV 2060
Cdd:cd21795    79 LWV-HKNNTWKKLVPLNKPNVVCLRNlFSVLPI 110
gammaCoV_Nsp10 cd21902
gammacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4324-4451 2.18e-43

gammacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of gammacoronaviruses, including Infectious bronchitis virus (IBV) and Bottlenose dolphin coronavirus HKU22 (BdCoV HKU22). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, but not its N7-MTase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity.
Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DLL4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409327  Cd Length: 134  Bit Score: 155.82  E-value: 2.18e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4324 GTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRV 4403
Cdd:cd21902     2 GHETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVHNGSGFAITSKPSPTPDQDSYGGASVCLYCRAHI 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 225403216 4404 EHP----DVDGLCKLRGKFVQVPLGIKDPVSYVLTHDVCQVCGFWRDGSCSC 4451
Cdd:cd21902    82 AHPggagNLDGRCQFKGSFVQIPTTEKDPVGFCLRNKVCTVCQCWIGYGCQC 133
deltaCoV_Nsp10 cd21903
deltacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4324-4452 5.21e-42

deltacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of deltacoronaviruses, including Thrush coronavirus HKU12-600 and Wigeon coronavirus HKU20. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, but not its N7-MTase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity.
Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DLL4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409328  Cd Length: 128  Bit Score: 151.55  E-value: 5.21e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4324 GTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDhAGTGMAITIKPEATTNQDSYGGASVCIYCRSRV 4403
Cdd:cd21903     2 GTQIEYQENASLLTYLAFAVDPKEAYLKHLADGGKPIQGCIQMIAP-LGPGFAVTTKPQPNEHQYSYGGASICLYCRAHI 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 225403216 4404 EHPDVDGLCKLRGKFVQVPLGiKDPVSYVLTHDVCQVCGFWRDGSCSCV 4452
Cdd:cd21903    81 PHPGVDGRCPYKGRFVHIDKD-KEPVSFALTHEPCNSCQRWVNYDCTCG 128
CoV_Nsp6 cd21526
coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3629-3926 1.31e-40

coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394843  Cd Length: 287  Bit Score: 153.45  E-value: 1.31e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3629 VYQQLAGVKLQSkrTRVIKGTCCWIlaSTFLFCSIISAF-VKWTMFMyvttHMLGVTLCALCFVSFAmllIKHKHLYLTM 3707
Cdd:cd21526     1 VYNQAPGVLLQS--VFVVKKTSTFW--SHFLFAAFTMLLaAPLVFPV----HAYVILLMCFTVVTFT---VKHKVAFLTT 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3708 YIMPVLCTL-FYTNYLVVYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVFVTMRSINHDVFSTMFLvgrLVS 3786
Cdd:cd21526    70 FLLPSLITMvAIANTFWIQVVTFLRTWYDTVFVSPIAQDLYGYTVALYMLIYAGLATNYTLKTLRYRATSFLSF---LMQ 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3787 LVSMWYFGANLEEEVLLFLTSLFGTYTWTTMLSLATAKV--IAKWLaVNVLYFTDIPQIKLVLLSYLCIGYVCCCYWGVL 3864
Cdd:cd21526   147 NFLTLYTAHYAYKLLPWTESLLFTALTMLSSHSLIGAIVfwLARWM-LRVEYPIIFPDLAIRVLAYNVIGYVCTCYFGLM 225
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 225403216 3865 SLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21526   226 WLANRFFTLTLGVYDYMVSVEQFRYMMAVKLNPPKNAFEVFILNIKLLGIGGNRNIKVATVQ 287
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3927-4015 1.17e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. 
SARS-CoV Nsp7 forms an 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 134.49  E-value: 1.17e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4006
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 225403216 4007 YvrdnTVLQ 4015
Cdd:cd21827    79 P----TVLQ 83
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3927-4015 1.42e-36

Coronavirus replicase NSP7; NSP7 (non-structural protein 7) has been implicated in viral RNA replication and is predominantly alpha-helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 134.50  E-value: 1.42e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4006
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*....
gi 225403216  4007 YVRDNTVLQ 4015
Cdd:pfam08716   75 YLENRTILQ 83
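Several hits in this list cover the same stretch of the query: the two Nsp6 models both end at residue 3926, and the two Nsp7 models share the interval 3927-4015. One simple way to reduce such a list to a single hit per region is a greedy pass that keeps the lowest E-value first. The sketch below illustrates that idea only; it is not a description of how CD-Search builds its own concise view, and the names `Hit`, `overlaps`, and `best_non_overlapping` are hypothetical. The intervals and E-values are copied from this page.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    start: int      # first query residue of the hit (1-based, inclusive)
    end: int        # last query residue of the hit (inclusive)
    e_value: float


def overlaps(a, b):
    """True if the two inclusive query intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def best_non_overlapping(hits):
    """Greedily keep the lowest-E-value hit in each overlapping region."""
    chosen = []
    for hit in sorted(hits, key=lambda h: h.e_value):
        if not any(overlaps(hit, kept) for kept in chosen):
            chosen.append(hit)
    return sorted(chosen, key=lambda h: h.start)


hits = [
    Hit("alphaCoV-Nsp6", 3625, 3926, 5.79e-48),
    Hit("CoV_Nsp6", 3629, 3926, 1.31e-40),
    Hit("betaCoV_Nsp7", 3927, 4015, 1.17e-36),
    Hit("CoV_NSP7", 3927, 4015, 1.42e-36),
    Hit("gammaCoV_Nsp8", 4016, 4212, 7.09e-52),
]
selected = best_non_overlapping(hits)
for hit in selected:
    print(hit.name, hit.start, hit.end)
```

A lowest-E-value rule can prefer a model from another genus (here an alphacoronavirus or gammacoronavirus model for a betacoronavirus query), so the selection is best read as "which region is covered" rather than as a taxonomic assignment.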
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1607-1914 2.93e-36

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode one or two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encode only one, called PL-PRO. The structure of this protein shows that it adopts a fold similar to that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 142.04  E-value: 2.93e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1607 ANKVDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSAADLAAVKDA---FGFDEP 1683
Cdd:pfam08715    2 CKQITIYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVPTTDWVEFYGFKSIleyYTLDAS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1684 QLLQYYSMLgmcKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKF 1763
Cdd:pfam08715   82 KYVIYLSAL---TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1764 NEPSDSTDFIRVVLREADLSGATCDLEFI--CKCGVKQEQRKGVDAVMHFGTLDKSGLVKGYNIACTCG-DKLVHCTQFN 1840
Cdd:pfam08715  159 GDFGDANWTLTNLAEHFDAEYTNAFLKKRvcCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGaNNTDEVIEAS 238
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  1841 VPFLICSNT--PEGKKLPDDVVAANIFTGG-SVGHYTHVKCKPkyQLYDACNVSKVSEAKGNFTDCLYLKNLKQTFS 1914
Cdd:pfam08715  239 LPYLLLSATdgPAAVDCLEDGVGTVAFVGStNSGHYTYQTAKQ--AFYDGAKDRKFGKKSPYVTAVYTRFAFKNETS 313
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
852-941 2.96e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 2.96e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  852 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHGViGTKVCALLERLVDDYVYLFDEGGEE 931
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 225403216  932 VIASRMYCSF 941
Cdd:cd21467    80 VLASEMYCSF 89
betaCoV_Nsp1 cd21876
non-structural protein 1 from betacoronavirus; This model represents the non-structural ...
57-196 4.04e-36

non-structural protein 1 from betacoronavirus; This model represents the non-structural protein 1 (Nsp1) from betacoronaviruses, including highly pathogenic coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. 
Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple lines of evidence, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and murine hepatitis virus (MHV) genomes cause drastic reduction or elimination of infectious virus; bovine coronavirus (BCoV) Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409338  Cd Length: 114  Bit Score: 134.07  E-value: 4.04e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   57 HVRVDCSRLPALECCVQSAIIRdifvdedPQKVEASTMMALQFGSAVLVKPSKRLSVQawaklgvlPKTPAMGLFKRFC- 135
Cdd:cd21876     1 HVSLTLPWLQALENPVQPWIDR-------PEEALESAKAALAEGKLVFVPPYKGLHPL--------LPGPRVFLVRRHGn 65
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 225403216  136 ---LCNTRECVCDAHVAFqlftvqpdgvCLGNGRFIGWFVPVTAIpeyakQWLQPWSILLRRGG 196
Cdd:cd21876    66 ptrPFDVRELAADADGVN----------YGRSGRTIGVLVPLDGE-----QPYGYINILLRKYG 114
CoV_Nsp9 cd21881
coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) ...
4213-4322 5.79e-34

coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from coronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for CoV replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG at the C-terminus; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409329  Cd Length: 111  Bit Score: 128.02  E-value: 5.79e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVVNSGSDMNCNTPT-QCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4291
Cdd:cd21881     1 NNELSPVALKQMSCAAGTDQTCTDDEaKAYYNNSKGGRFVLAITSDKPDLKVARFLKEDGGTIYTELEPPCRFVTDVPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21881    81 PKVKYLYFIKNLNSLNRGMVLGSISATVRLQ 111
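Each alignment row on this page carries a label, a start coordinate, the residues, and an end coordinate. A minimal sketch of reading such rows and checking that the coordinates agree with the residue count follows; the regular expression and the function names `parse_row` and `coordinates_consistent` are illustrative assumptions based only on the layout seen here, not part of any NCBI tool.

```python
import re

# Row layout assumed from the blocks on this page: label, start, residues, end.
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")

def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, residues, end = m.groups()
    return label, int(start), residues, int(end)

def coordinates_consistent(line):
    """True if start + (number of non-gap residues) - 1 equals end."""
    _, start, residues, end = parse_row(line)
    n = sum(1 for c in residues if c != "-")  # '-' columns consume no residue
    return start + n - 1 == end

# The last two rows of the cd21881 block above.
rows = [
    "gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322",
    "Cdd:cd21881    81 PKVKYLYFIKNLNSLNRGMVLGSISATVRLQ 111",
]
for r in rows:
    print(parse_row(r)[0], coordinates_consistent(r))
```

A check like this is mainly useful for catching rows that were truncated or merged when the page was saved as text.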
deltaCoV-Nsp6 cd21561
deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3621-3926 1.06e-33

deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394847  Cd Length: 296  Bit Score: 134.03  E-value: 1.06e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3621 EDELTPSDVYQQlAGVKLQSKrtrVIKGTCCWILasTFLFCSIISAFVKWTMFmyvTTHMLGVTLC-ALCFVSFAMLLIK 3699
Cdd:cd21561     2 ECDWTPEMVYNQ-APINLQSG---VVKKTCMWFF--HFLFMAVIFLLAALHVF---PVHLYPIVLPvFTILAFLLTLTIK 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3700 HKHLYLTMYIMPVL-------CTLFYTNYLVVYkqsfrglAYAWLSHFVPAVDYTYMDEVLYGVVLLVAMVFVTMRSINH 3772
Cdd:cd21561    73 HTVVFTTTYLLPSLlmmvvnaNTFWIPNTYLRS-------IYEYVFGSFISERLYGYTVALYILVYAQLAINYTLRTRRY 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3773 dvFSTMFLVGrlvSLVSMWYFganleeEVLLFLTSLFgTYTWT-----TMLSLATAKVIAK----WLAVNVLYFTDIPQI 3843
Cdd:cd21561   146 --RATSFISF---CMQALQYG------YVAHIVYRLL-TTPWTegllfTAFSLLTSHPLLAalswWLAGRIPLPLILPDL 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3844 KLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVS 3923
Cdd:cd21561   214 AIRVIVYYVIGYVMCMRFGLFWLINKFTTIPMGTYKYMVSIEQLKYMMAVKMSPPRNAFEVLWANIRLLGLGGNRNIAVS 293

                  ...
gi 225403216 3924 QIQ 3926
Cdd:cd21561   294 TVQ 296
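The statistics line that precedes each alignment can be read into a small record, and the model coordinates in the alignment give a rough idea of how much of the domain model the hit spans. The sketch below assumes the field labels exactly as printed on this page; `parse_stats` and `model_coverage` are hypothetical helper names, and the coverage figure is a simple span ratio rather than any statistic reported by NCBI.

```python
import re

# Field labels assumed to appear exactly as on this page.
STAT = re.compile(
    r"Pssm-ID:\s*(\d+)\s+Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)

def parse_stats(line):
    """Read one 'Pssm-ID ... E-value' line into a dict of typed values."""
    m = STAT.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }

def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment."""
    return (model_end - model_start + 1) / cd_length

stats = parse_stats("Pssm-ID: 394847  Cd Length: 296  Bit Score: 134.03  E-value: 1.06e-33")
# 2 and 296 are the first and last Cdd coordinates in the cd21561 block above.
print(stats, f"{model_coverage(2, 296, stats['cd_length']):.3f}")
```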
betaCoV_Nsp3_betaSM cd21727
betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model ...
2120-2236 8.68e-33

betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409626  Cd Length: 125  Bit Score: 125.34  E-value: 8.68e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2120 SVTTVAVKEVKLNGVKKPVKVEDSVVVNDPTSETKVVKSLSIVDVYDMFLTGCR-YVVWTANELSRLINSPTVRE--YVK 2196
Cdd:cd21727     9 SVSASQQKMVILKGLKKPFVVNGNVSVVDNDSGTKVVEELSKTDLYTMYVDGKYqVVVLKANELSRVLGLHTVEShaAVN 88
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 225403216 2197 WGMSKLIIPANLLLLRDEKQefvAPKVVKAKAIACYGAVK 2236
Cdd:cd21727    89 VLASGSVTRYAKLLLRASFY---FVEFTKATFTATNAVSK 125
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus ...
922-1074 3.41e-32

Betacoronavirus, lineage A, NSP1; This family represents the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 131.21  E-value: 3.41e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   922 VYLFDEGGEEVIASRMYCS-FSAPDEDCVATDV--VYADENQDDDADDPVVLVADTQEEDGVAKEQVDSADSEICVAH-- 996
Cdd:pfam11963  190 IYLRKGGNKGSVTSDHFRRaFTMPVYDFNVEDAyaEVHDEPKGKYSQKAYALLRGYRGVKPVLFVDQYGCDYTGCLADgl 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   997 TGGQEMTEPDAVGSQTPIASAEETEVGEA--CDREGIAEVK----ATVCADALDACP--DQVEAFDIEKVEDSILSELQT 1068
Cdd:pfam11963  270 EAYGDYTLQDMKQLQPVWLANLDFDVVVAwhVVRDPRAVMRlqtiATICGIAYVAQPteDVVDGDVVIKEPVHLLSADAI 349

                   ....*.
gi 225403216  1069 ELNAPA 1074
Cdd:pfam11963  350 VLRLPS 355
alphaCoV_Nsp9 cd21897
alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4213-4322 2.08e-29

alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) of alphacoronaviruses, including Porcine epidemic diarrhea virus (PEDV), Porcine transmissible gastroenteritis coronavirus (TGEV), and Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409330  Cd Length: 108  Bit Score: 114.72  E-value: 2.08e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVVNSGSDMNCNTPTQCYYNTTGTGkIVYAILSDCDGLKYTKIvKEDGNCVVLELDPPCKFSVQDVKGL 4292
Cdd:cd21897     1 NNEIMPGKLKQRAVKAEGDGFSGDGKALYNNEGGKT-FMYAFIADKPDLKYVKW-EFDGGCNTIELEPPCKFLVDTPNGP 78
                          90       100       110
                  ....*....|....*....|....*....|
gi 225403216 4293 KIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21897    79 QIKYLYFVKNLNTLRRGAVLGYIGATVRLQ 108
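The Nsp9 description mentions a GXXXG protein-protein interaction motif. As a rough illustration, the last alignment row of the cd21897 block can be scanned for that pattern with a regular expression. The helper `find_gxxxg` is a hypothetical name, and the position arithmetic assumes ungapped rows, which holds for the two rows used here. A pattern match alone does not show that the window is the functional dimerization motif.

```python
import re

def find_gxxxg(residues, start):
    """Return (position, motif) for every G-x-x-x-G window, overlaps included."""
    # The lookahead lets finditer report windows that overlap each other.
    return [(start + m.start(), m.group(1))
            for m in re.finditer(r"(?=(G[A-Z]{3}G))", residues)]

query_row = "KIKYLYFVKGCNTLARGWVVGTLSSTVRLQ"  # gi 225403216, residues 4293-4322
model_row = "QIKYLYFVKNLNTLRRGAVLGYIGATVRLQ"  # cd21897, positions 79-108

print(find_gxxxg(query_row, 4293))  # [(4309, 'GWVVG')]
print(find_gxxxg(model_row, 79))    # [(95, 'GAVLG')]
```

Both rows give a single match in the same alignment column, near the C-terminal end of the domain.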
TM_Y_deltaCoV_Nsp3_C cd21711
C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2341-2840 1.40e-27

C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from deltacoronavirus, including Magpie-robin coronavirus HKU18 and Bulbul coronavirus HKU11, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409659  Cd Length: 490  Bit Score: 120.58  E-value: 1.40e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2341 CDFYQVTDLGYrSSFCNGSMVCELCFSGFDMLDNYDAINVVQhvvdrrvsfdyislfklVVELVIGYSLYTVCFYPLFVL 2420
Cdd:cd21711    43 CYYNATQHYDY-NSFCAGDLTCQACFDGQDSLHLYKHLRVNQ-----------------QPVQTTDYTVYALSIVLLLAN 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2421 VGMQLLTTwlpeffmlgtmhwsarLFVFVANM------------LPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSKPGCL 2488
Cdd:cd21711   105 PTLVLGTL----------------LVVFFVNFygvqipfygtlqLDYQNTLVMVFSVYYFYKVMKFFRHLAKGCKKPTCS 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2489 FCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCNSWKPGNTFIThEAAADLSKELKRPVNPTDSAYys 2568
Cdd:cd21711   169 ICAKKRIPPTITVETVVQGRKYPSVIETNGGFNICKEHNFYCKNCDSQTPGTFIPT-EAVESLSRKTRLSVKPTAPAY-- 245
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2569 vIEVKQVGCSMRLFYER---DGQRV-----YDDVSASLFVDMNGLLHSKvkgvpeTHVVVVENEADKAGFLNA----AVF 2636
Cdd:cd21711   246 -LLARDVECQTDVVVARathNGNAHvciskYSDIRTVDQLLKPTPLFSY------TPDVIIAADFDNAGSLKTakelAVV 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2637 YAQSLYRPMLMVEKKlittantglsvsrtmFDLYVDSLLSVLDVDRKSLTSfvnaahnslkegVQLEQVMDTFVGCARrk 2716
Cdd:cd21711   319 LSMDLKRTIIIIDQA---------------YSRPIDNYQEVKSRIEKYYPF------------QKITPTGDIFADIKQ-- 369
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2717 cAIDSDVeTKSITKSVMAAVNAGVEVTDESCNNLVPTYV-KSDTIVAADLGVLIQNNakhVQSNVAKAANVACIWSVDAF 2795
Cdd:cd21711   370 -ATNGQA-SDSAINAAILAVQRGLDFTIDNPNNILPHYAfDFSTLSAEDQSTLIESG---CAKGNLKGTNVGVVLSANLV 444
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*.
gi 225403216 2796 NQLSADLQHRLRKACVKTGLKIKLTYNKQEANVPILTTPFS-LKGG 2840
Cdd:cd21711   445 TRLSQKAIRVIANAASRNGVTCAVTPSTLVLRGNIATQPLTrIKAG 490
SUD_C_DPUP_CoV_Nsp3 cd21513
C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This ...
1536-1607 1.89e-24

C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This family contains the SUD-C of Nsp3 from Severe Acute Respiratory Syndrome (SARS) coronavirus (CoV), Middle East respiratory syndrome-related (MERS) CoV, and Rousettus bat CoV HKU9, as well as the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) Nsp3. Though structurally similar, there is little sequence similarity between these four domain subfamilies: SARS SUD-C, MERS SUD-C, HKU9 SUD-C, and MHV DPUP. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Nsp3 of SARS coronavirus includes a SARS-unique domain (SUD) consisting of three globular domains separated by short linker peptide segments: SUD-N, SUD-M, and SUD-C. SUD-N and SUD-M are macro domains which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). The SUD-C domain adopts a frataxin-like fold and has structural similarity to DNA-binding domains of DNA-modifying enzymes. It binds to single-stranded RNA and recognizes purine bases more strongly than pyrimidine bases. SUD-C also regulates the RNA binding behavior of the SUD-M macrodomain. SUD-C is not as specific to SARS CoV Nsp3 as originally thought, and is conserved in the Nsp3s of all four lineages (A-D) of betacoronavirus.


Pssm-ID: 394838  Cd Length: 71  Bit Score: 99.17  E-value: 1.89e-24
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 225403216 1536 DDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGeVFVSSQGKKFGYVQNGSFKEASVSQIRALLA 1607
Cdd:cd21513     1 TDERVFVQAVMLNGPRDWRLVNKFDSVDGVRYKKYLKRGG-IFVCSQDKKFYYVQNDVFLEFSVSKIRALLA 71
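A quick way to summarize an alignment row pair like the one above is to count identical residues over the aligned columns. The sketch below assumes that lowercase query residues and `-` characters mark columns outside the aligned core, which is how the rows on this page appear to be laid out. `aligned_identity` is an illustrative helper, not an NCBI-reported statistic.

```python
def aligned_identity(query, model):
    """Return (identical, aligned) over columns where both rows hold an aligned residue."""
    if len(query) != len(model):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, m in zip(query, model):
        # Assumption: lowercase residues and '-' are unaligned columns, so skip them.
        if q.isupper() and m.isupper():
            aligned += 1
            identical += q == m
    return identical, aligned

# The single row pair of the cd21513 block above.
query = "DDARVFVQANMDCLPTDWRLVNKFDSVDGVRTIKYFECPGeVFVSSQGKKFGYVQNGSFKEASVSQIRALLA"
model = "TDERVFVQAVMLNGPRDWRLVNKFDSVDGVRYKKYLKRGG-IFVCSQDKKFYYVQNDVFLEFSVSKIRALLA"

same, total = aligned_identity(query, model)
print(f"{same}/{total} aligned columns identical ({100 * same / total:.1f}%)")
```

For multi-row blocks, the rows of each sequence would need to be concatenated in order before calling the function.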
deltaCoV_Nsp8 cd21833
deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4022-4181 2.47e-24

deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of deltacoronaviruses that include White-eye coronavirus HKU16 and Quail coronavirus UAE-HKU30, among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by the main protease (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. 
SARS-CoV Nsp8 forms an 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409260  Cd Length: 189  Bit Score: 103.17  E-value: 2.47e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4022 VNMASFVEYELAKKNLDEAKASGSANQQQIKQLeKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSKVVS 4101
Cdd:cd21833     7 INLDSYRIYKEADAAYKKSVELNEPPQEQKKKL-KAVNIAKAEWEREAASQRKLEKLADAAMKSMYLAERAEDRRIKLTS 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4102 ALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPPLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDADGA 4181
Cdd:cd21833    86 GLTAMLYHMLRRLDSDRVKALFECAKQQILPIHAIVGVSNDNLKVIFNDKESYLQYVDGNTLIYKGVRYTIVKKLSLDNA 165
TM_Y_gammaCoV_Nsp3_C cd21710
C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2293-2820 4.44e-24

C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from gammacoronavirus, including Infectious bronchitis virus. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphipathic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409658  Cd Length: 525  Bit Score: 110.23  E-value: 4.44e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2293 ATVFLLWFNFLYANVILSDFYLPNIGPLPMFVGQIVAWVKTTFGVLticdfyqvtdlgyrsSFCNGSMVCELCFSGFDML 2372
Cdd:cd21710    14 TALLILWFVYTSNPVMFTGIRVLDFLFEGSFCGPYNDYGKDSFDVL---------------RYCGDDFTCRVCLHDKDSL 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2373 DNYDAINVVQHVVDRRVS---FD----YISLFKLVVELVIGYSLytVCFYPLFVLVGMQLLTTWLpeffmlGTMHWsarl 2445
Cdd:cd21710    79 HLYKHAYSVEQFYKDAVSgisFNwnwlYLVFLILFVKPVAGFVI--ICYCVKYLVLSSTVLQTGV------GFLDW---- 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2446 fvFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYgCSKPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTK 2525
Cdd:cd21710   147 --FIQTVFTHFNFMGAGFYFWLFYKIYIQVHHILY-CKDITCEVCKRVARSNRHEVSVVVGGRKQLVHVYTNSGYNFCKR 223
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2526 HQWNCLNCNSWKPGNTFITHEAAADLSKELKRPVNPTDSAYYSVIEVKQVGCSMRLFYE-----RDGQR-VYDDVSASLF 2599
Cdd:cd21710   224 HNWYCRNCDKYGHQNTFMSPEVAGELSEKLKRHVKPTAHAYHVVDDACLVDDFVNLKYKaatpgKDGAHsAVKCFSVSDF 303
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2600 VDMNGLLHSKVKG---------VPETHVVVVENEADkagflNAAVFYAQSLYRPMLMVEKKLITTANTGlSVSRTMFDLY 2670
Cdd:cd21710   304 LKKAVFLKDALKCeqisndsfiVCNTQSAHALEEAK-----NAAIYYAQYLCKPILILDQALYEQLVVE-PVSKSVVDKV 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 2671 VDSLLSVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCArrkcaidsdvetksitksvmaavNAGVEVTDESCNNL 2750
Cdd:cd21710   378 CSILSNIISVDTAALNYKAGTLRDALLSVTKDEEAVDMAIFCH-----------------------NNDVEYTSDGFTNV 434
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 225403216 2751 VPTY-VKSDTIVAADLGVLIQNNAKHVQSNVAKAANVacIWSVDAFNQLSADLQHRLRKACVKTGLKIKLT 2820
Cdd:cd21710   435 VPSYgIDTDKLTPRDRGFLINADASIANLRVKNAPPV--VWKFSDLIKLSDSCLKYLISATVKSGGRFFIT 503
gammaCoV_Nsp9 cd21899
gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4211-4322 1.06e-23

gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from gammacoronaviruses such as Avian infectious bronchitis virus (IBV). CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409332  Cd Length: 113  Bit Score: 98.77  E-value: 1.06e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4211 LQNNELMPQKLRTQVVNSGSDM-NCNTPTQCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDV 4289
Cdd:cd21899     1 LQNNELMPHGVKTKACVAGVDQaHCSVESKCYYTNISGNSVVAAITSSNPNLKVASFLNEAGNQIYVDLDPPCKFGMKVG 80
                          90       100       110
                  ....*....|....*....|....*....|...
gi 225403216 4290 KGLKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21899    81 DKVEVVYLYFIKNTRSIVRGMVLGAISNVVVLQ 113
gammaCoV-Nsp6 cd21559
gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3629-3926 1.38e-23

gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, has the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394845  Cd Length: 307  Bit Score: 104.85  E-value: 1.38e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3629 VYQQLAGVKLQSKrtrVIKGTCCWILASTFLFC--SIISAFVKWTMF---MYVttHMLGVTLCALCFVSFAmllIKHKHL 3703
Cdd:cd21559     3 VFNQVGGVRLQSS---FVKKATSWFWSRCVLACflFVLCAIVLFTAVplkYYV--HAAVILLVAVLFISFT---VKHVMA 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3704 YLTMYIMPVLCTLFYTNYLVVyKQSFRGLAYAWLSHFVPAVDYTYMDeVLYGVVLLVAMVFVTMRSI----NHDVFSTMF 3779
Cdd:cd21559    75 FMDTFLLPTLCTVIIGVCAEV-PFIYNTLISQVVIFFSQWYDPVVFD-TVVPWMFLPLVLYTAFKCVqgcySINSFSTSL 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3780 LVG-RLVSLVSMWYFGANLEEEVLLFLTSLFGTYTWTTMLSLATAK----VIAKWLAVNVLYFTD-IPQIKLVLLSYL-- 3851
Cdd:cd21559   153 LVLyQFMKLGFVIYTSSNTLTAYTEGNWELFFELVHTTVLANFSSNsligLIVFKIAKWMLYYCNaTYFNSYVLMAVMvn 232
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 225403216 3852 CIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPRNSFEALMLNFKLLGIGGVPVIEVSQIQ 3926
Cdd:cd21559   233 VIGWLFTCYFGLYWWLNKVFGLTLGKYNYKVSVEQYKYMCLHKIRPPKSVWDVFSTNMLIQGIGGERVLPIATVQ 307
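Each hit on this page carries a statistics line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". The following is a minimal sketch of how such a line could be parsed, assuming the plain-text layout shown here stays as it is; the function name parse_hit_stats and the returned field names are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears in this plain-text dump.
# The optional bracketed flag (e.g. "[Multi-domain]") follows the Pssm-ID on some hits.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        # float() accepts the scientific notation used on the page, e.g. "1.38e-23".
        "e_value": float(match.group("e_value")),
    }


stats = parse_hit_stats(
    "Pssm-ID: 394845  Cd Length: 307  Bit Score: 104.85  E-value: 1.38e-23"
)
print(stats)
```

Returning None for non-matching lines lets the same function be run over every line of the dump, keeping only the hits.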
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1340-1464 6.03e-23

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, the activity was modest and it was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 97.24  E-value: 6.03e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1340 EVIVNPANGRMAHGAGVAGAIAKAAGKAFINETaDMVKAQGVCQVGGCYESTGGKLCKKVLNIVGPDARGHgkQCYSLLE 1419
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 225403216 1420 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1464
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
671-832 5.97e-22

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 95.41  E-value: 5.97e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   671 LVNGLFAVANGVITFVqeVPELVKNFVDKFKTFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSkVYEVVQKSLPAYIMP 750
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSNA-LVKVVSVKLKGKKQA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   751 V--GCSEATCLVG---EIEPAVFEDDvvdvvKAPLTYQGCCKPPSSFEKICIVDKLYMAKCGDQFYPVVvdnDTVGVLDQ 825
Cdd:pfam19212   78 GlkGAKEATVFVGatvPVTPTRVEVV-----TVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPAS---TDGVVVPP 149

                   ....*..
gi 225403216   826 CWRFPCA 832
Cdd:pfam19212  150 VFKLKGG 156
gammaCoV_PLPro cd21733
gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1615-1873 1.47e-20

gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of gammacoronavirus, including Avian coronavirus, Canada goose coronavirus, and Beluga whale coronavirus SW1. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in several CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409650  Cd Length: 304  Bit Score: 95.57  E-value: 1.47e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1615 TVDGVNFRSCCVAEGEVFGKtLGSVFCDGINVTKVRcsAIHKGKVFFqysgLSAADlAAVKDAFGFDEPQLLQYYSMLGM 1694
Cdd:cd21733    10 TEDGVKYRSVVVKPGDSLSQ-FGQVFARNKTVFTAD--DVEDKEILF----IPTTD-KAVLEYYGLDAQKYVIYLQTLAQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1695 cKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFpKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNEPSDStDFIR 1774
Cdd:cd21733    82 -KWNVQYRDNFLILEWRDGNCWISSAIVLLQAAKIRF-KGFLAEAWAKFLGGDPTEFVAWCYASCNAKVGDFSDA-NWLL 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1775 VVLRE---ADLSGATCDLEFICKCGVKQEQRKGVDA---------VMHFGTldksglvkGYNIACTCGDKLV-HCTQFNV 1841
Cdd:cd21733   159 ANLAEyfdADYTNAFLKRRVSCNCGVKNYELRGLEAciqpvrapnLLHFKT--------QYSNCPTCGANSVdEVVEASL 230
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 225403216 1842 PF--LICSNTPEGKKLPDDVVAANIFTGG-SVGHY 1873
Cdd:cd21733   231 PYllLLATDGPATVDCDENAVGNVVFIGStNSGHC 265
alphaCoV_Nsp7 cd21826
alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3927-4015 5.48e-20

alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine transmissible gastroenteritis coronavirus (TGEV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. 
FCoV Nsp7 forms a 2:1 heterotrimer with Nsp8; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409252  Cd Length: 83  Bit Score: 87.04  E-value: 5.48e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4006
Cdd:cd21826     1 SKLTDIKCTNVVLLGCLSSMNVAANSKEWAYCVDLHNKINLCDDPEKAQEMLLALLAFFLSKQKDFG------LDDLLDS 74

                  ....*....
gi 225403216 4007 YVRDNTVLQ 4015
Cdd:cd21826    75 YFDNNSILQ 83
deltaCoV_Nsp9 cd21900
deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4213-4322 6.18e-20

deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from deltacoronaviruses such as the Porcine delta coronavirus (PDCoV) Porcine coronavirus HKU15. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoV replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409333  Cd Length: 109  Bit Score: 87.87  E-value: 6.18e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 4213 NNELMPQKLRTQVvNSGSDMNCNTPT-QCYYNTTGTGKIVYAILSDCDGLKYTKIVKEDGNcVVLELDPPCKFSVQDVKG 4291
Cdd:cd21900     1 NNELCLRNVFTAQ-NTASDGNGNESTaKSFYVSRTGKKILVAVTSTKDNLKTVTCDTDTGK-VVLNLDPPMRFSHVVGGK 78
                          90       100       110
                  ....*....|....*....|....*....|.
gi 225403216 4292 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4322
Cdd:cd21900    79 QSVVYLYFIQNISSLNRGMVIGHISGTTILQ 109
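The pairwise alignments on this page can be summarized by counting identical residues over the aligned columns. The sketch below assumes the usual reading of this display: uppercase letters are aligned residues, lowercase letters are unaligned residues, and "-" marks a gap. That reading is an assumption about the layout, and the function name aligned_identity is hypothetical.

```python
def aligned_identity(query_row, model_row):
    """Count (identical, aligned) columns for two rows of one alignment block.

    A column counts as aligned only when both rows hold an uppercase residue;
    lowercase (unaligned) residues and '-' gaps are skipped.
    """
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Residue strings copied from the last block of the cd21900 alignment above.
query = "LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ"
model = "QSVVYLYFIQNISSLNRGMVIGHISGTTILQ"
identical, aligned = aligned_identity(query, model)
print(f"{identical} identical of {aligned} aligned columns")
```

To score a whole hit, the residue strings of all blocks for that hit would be concatenated before calling the function; the coordinates at either end of each row are not part of the residue string and must be stripped first.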
CoV_Nsp7 cd21811
coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) ...
3927-4015 1.30e-17

coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. 
SARS-CoV Nsp7 forms an 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409251  Cd Length: 83  Bit Score: 80.22  E-value: 1.30e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4006
Cdd:cd21811     1 SKLTDVKCTAVVLLSLLQKLRVESNSKLWKQCVQLHNDILLAKDTTEVFEKLVSLLSVLLSMQGAVD------LNRLCEE 74

                  ....*....
gi 225403216 4007 YVRDNTVLQ 4015
Cdd:cd21811    75 MLENRAVLQ 83
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1322-1452 1.82e-17

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson & Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 81.58  E-value: 1.82e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216   1322 NVCFVKGDVIKVlrrvGAEVIVNPANGRMAHGAGVAGAIAKAAGKAFinETADMVK-AQGVCQVGGCYESTGGKL-CKKV 1399
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKAL--SKEEVRKlAGGECPVGTAVVTEGGNLpAKYV 74
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 225403216   1400 LNIVGPDARGHGKQCYSLLERAYQ------HINKCDNVVTTLISAGIFSVPTDVSLTYL 1452
Cdd:smart00506   75 IHAVGPRASGHSKEGFELLENAYRnclelaIELGITSVALPLIGTGIYGVPKDRSAQAL 133
gammaCoV_Nsp7 cd21828
gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3927-4015 1.92e-15

gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. 
SARS-CoV Nsp7 forms an 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409254  Cd Length: 83  Bit Score: 74.06  E-value: 1.92e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 3927 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4006
Cdd:cd21828     1 SKLTDVKCTTVVLMQLLTKLNVEANSKMHKYLVELHNKILASDDVVECMDNLLGMLVTLLCIDSTID------LSEYCDD 74

                  ....*....
gi 225403216 4007 YVRDNTVLQ 4015
Cdd:cd21828    75 ILKRSTVLQ 83
alphaCoV_PLPro cd21731
alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1610-1887 1.67e-10

alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alphacoronavirus, including Swine acute diarrhea syndrome coronavirus (SADS-CoV) which causes severe diarrhea in piglets, and Human coronavirus 229E which infects humans and bats and causes the common cold. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in SADS-CoV and many others has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409648  Cd Length: 289  Bit Score: 65.34  E-value: 1.67e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1610 VDVLCTVDGVNFRSCCVAEGEVFGKTLGSVFCDGINVTKVRC-SAIHKG-KV-----FFQYSGLSAADLAAVKD--AFGF 1680
Cdd:cd21731     3 VVVKVTEDGRNVKDVVVDTDKTFGEQLGVCSVNDKDVTGVVPpDDSDKVvSVapdvdWDSHYGFPNAAVFHTLDhsAYAF 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1681 DepqllqyysmlgmckwpVVVCGNYFAFKQSNNNCYINVACLMLQHLSLKFPKWQWQEAWNEFRSGKPLRFVSLV----- 1755
Cdd:cd21731    83 E-----------------SDIVNGKRVLKQSDNNCWVNAVCLQLQFAKPTFKSEGLQALWNKFLTGDVAGFVHWLywitg 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1756 LAKGsfkfnEPSDSTDfirVVLREADL--SGATCDLEFICKCGVKQEQRKGVDAVMHFGTLdKSGLVKGYniaCTCGDKL 1833
Cdd:cd21731   146 ANKG-----DPGDAEN---TLNKLSKYlvSSGSVTVERTTGCDSCNSKRTVTTPVVNASVL-RSGVDDGV---CKHGVKV 213
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 225403216 1834 ---VHCTQFNVPFLICSNTPEGKK-LPDDVVAANIFTG-GSVGHYThVKCKPKYQLYDA 1887
Cdd:cd21731   214 ttrVVSVKGTVIITSVGKPVVSDAlLLLDGVSYTAFSGdVDNGHYT-VYDKATGKVYDG 271
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1343-1446 7.98e-10

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 59.12  E-value: 7.98e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  1343 VNPANGRMAHGAGVAGAIAKAAGKAFINETADMVKaqGVCQVGGCYESTGGKL-CKKVLNIVGPDARGHGKQ-CYSLLER 1420
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELKK--GGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHgEEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 225403216  1421 AYQHI------NKCDNVVTTLISAGIFSVPTD 1446
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
MERS-CoV-like_Nsp3_NAB cd21823
nucleic acid binding domain of non-structural protein 3 from Middle East respiratory ...
1943-2060 1.02e-07

nucleic acid binding domain of non-structural protein 3 from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), and appears to be partially conserved in the Nsp3 NAB from betacoronaviruses in the C lineage.


Pssm-ID: 409349  Cd Length: 123  Bit Score: 53.60  E-value: 1.02e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1943 KYYT-KPIIKAQFRTFekVEG-VYTNFKLVGHD-------IAEKLNAKLGFDCNSPFME-YKITEWPTATGDVVLASDDL 2012
Cdd:cd21823     1 KYFTsKPPIEYSPATV--LAGsVYTNSCLVASDgtpggdaISLAFNNLLGFDESKPVSKkLTYSLLPNEDGDVLLAEFST 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 225403216 2013 YVSRYSGGCVTFGKPVIW--RGHEEASLKSltyFNRPSVvcENKFNVLPV 2060
Cdd:cd21823    79 YDPIYKNGAMLKGKPILWvnNGLFDSALNK---FNRASL--RQIYDVAPV 123
Macro_Af1521_BAL-like cd02907
macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse ...
1322-1447 2.36e-07

macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. The macrodomains in this family show similarity to Af1521, a protein from Archaeoglobus fulgidus containing a stand-alone macrodomain. Af1521 binds ADP-ribose and exhibits phosphatase activity toward ADP-ribose-1"-monophosphate (Appr-1"-p). Also included in this family are the N-terminal (or first) macrodomains of BAL (B-aggressive lymphoma) proteins which contain multiple macrodomains, such as the first macrodomain of mono-ADP-ribosyltransferase PARP14 (PARP-14, also known as ADP-ribosyltransferase diphtheria toxin-like 8, ATRD8, B aggressive lymphoma protein 2, or BAL2). Most BAL proteins also contain a C-terminal PARP active site and are also named as PARPs. Human BAL1 (or PARP-9) was originally identified as a risk-related gene in diffuse large B-cell lymphoma that promotes malignant B-cell migration. Some BAL family proteins exhibit PARP activity. Poly (ADP-ribosyl)ation is an immediate DNA-damage-dependent post-translational modification of histones and other nuclear proteins. BAL proteins may also function as transcriptional repressors.


Pssm-ID: 394877 [Multi-domain]  Cd Length: 158  Bit Score: 53.26  E-value: 2.36e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1322 NVCFVKGDvikvLRRVGAEVIVNPANGRMAHGAGVAGAIAKAAGKAFINETADMVKAQGVCQVGGCYESTGGKL-CKKVL 1400
Cdd:cd02907     3 KVSVYKGD----ITKEKVDAIVNAANERLKHGGGVAGAISKAGGPEIQEECDKYIKKNGKLRVGEVVVTSAGKLpCKYVI 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 225403216 1401 NIVGPDARGHGKQ-CYSLLERA-YQHINKCDNV-VTTL----ISAGIFSVPTDV 1447
Cdd:cd02907    79 HAVGPRWSGGSKEeCEDLLYKAvLNSLEEAEELkATSIaipaISSGIFGFPLDL 132
betaCoV_Nsp2_HKU9-like cd21518
betacoronavirus non-structural protein 2 (Nsp2) similar to bat coronavirus HKU9 Nsp2, and ...
255-494 1.33e-05

betacoronavirus non-structural protein 2 (Nsp2) similar to bat coronavirus HKU9 Nsp2, and related proteins from betacoronaviruses in the D lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Rousettus bat coronavirus HKU9 and betacoronaviruses in the nobecovirus subgenus (D lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. SARS-CoV Nsp2 has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, unlike SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394869  Cd Length: 597  Bit Score: 51.31  E-value: 1.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  255 DQYGCDYTGCLAKGLEDY-----GDLTLSEM-KELSPVWRDS----LDNEVVVAWHVDRDPRAVMRlQTLATVRSIeyvg 324
Cdd:cd21518     6 DQYGFDNNGVLVKPVKDLlgdikSDFTLEQLlLALSPYRTDDgydlPGGFVKVAVKVVRKPVPVVK-QTIFTVQGV---- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  325 qpIEDMVDGDVvmrepaHLLAPNAIVKRL-PRLVETMLYTDSSVTEFCYKTKlcdcGF--ITQFGYVDCcgDTCGFRGWV 401
Cdd:cd21518    81 --LEQLVEGYY------YPYSTGSVVKHTkPRRDSPVGKTVESIMLSLYGTS----GYnpATPVVRLRC--SYCDFYGWV 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  402 PGNMMDGFPCpGCCKSYMPWE--LEAQSSGVIPEGGVLFTQSTdtvnrESFKLY--GHAVVPFGGaAYWSPYP-----GM 472
Cdd:cd21518   147 PLKDMGTVVC-SCGAEYQLTSscVDAESAGFIKPGCVMLLDKS-----PGMRLIpgNRTYVAFGG-AIWSPIGkvndvTV 219
                         250       260
                  ....*....|....*....|....
gi 225403216  473 WLPviwssvKSYSYLT--YTGVVG 494
Cdd:cd21518   220 WVP------RAYSVVAgdHSGAVG 237
betaCoV_Nsp2_MERS-like cd21517
betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins ...
254-437 1.63e-05

betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins from betacoronaviruses in the C lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Middle East respiratory syndrome-related coronavirus (MERS-CoV) and betacoronaviruses in the merbecovirus subgenus (C lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. SARS-CoV Nsp2 has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, unlike SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394868  Cd Length: 660  Bit Score: 51.27  E-value: 1.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  254 VDQYGCDYTGclaKGLEDYGDLTLSE-MKELSPVWRD-----------SLDNEVV-VAWHVDRDPRAVMRlQTLATVRSI 320
Cdd:cd21517     5 IDQYMCGKDG---KPIADYAALAAKEgLTKLADVEADvssradsdgfiTFKNKLYrIVWHVERKDVPYPK-QTIFTINSV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  321 eyvgqpiedmVDGDVVMREPAH---------LLAPNAIV--KRLPRLVETMLYTdssvteFCYKTKLCDCGFITQFGYVD 389
Cdd:cd21517    81 ----------VQKDGIEDVPPHsftlggkvlVLVPRNKWggKSDLTLKQKLLYT------FYGKDAVENPSYIYHSAFVD 144
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 225403216  390 CCGdtCGFRGWVPGNMMDGFPCpGCCKSYMPWELEAQSSGVIPEGGVL 437
Cdd:cd21517   145 CTS--CGNGSWLTGNAVQGFAC-DCGASYSANDVELQSSGLVKPNALF 189
betaCoV_Nsp2_SARS-like cd21516
betacoronavirus non-structural protein 2 (Nsp2) similar to SARS-CoV Nsp2, and related proteins ...
253-714 2.47e-05

betacoronavirus non-structural protein 2 (Nsp2) similar to SARS-CoV Nsp2, and related proteins from betacoronaviruses in the B lineage; Non-structural proteins (Nsps) from Severe acute respiratory syndrome coronavirus (SARS-CoV) and betacoronaviruses in the sarbecovirus subgenus (B lineage) are encoded in ORF1a and ORF1b. Post infection, the SARS-CoV genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases, Nsp3 and Nsp5, into smaller subunits. Nsp2 is one of these subunits. The function of Nsp2 remains unknown. Deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. Rather than playing a role in viral replication, SARS-CoV Nsp2 may be involved in altering the host cell environment; it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis.


Pssm-ID: 439199  Cd Length: 637  Bit Score: 50.54  E-value: 2.47e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  253 FVDQYGCDYTG----CLAKGLEDYG--DLTLSEMKELSPVWRDSL---DNEVVVAWHVDRDpRAVMRLQTLATVRSIEYV 323
Cdd:cd21516     4 YVDNNFCGPDGypleCIKDLLARAGksSCPLSEQLDFIGLKRGVYccrEHEHEIAWYTERS-EKSYELQTPFEIKSAKKF 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  324 gqpieDMVDGDVvmrePAHLLAPNAIVKRL-PRLVET----------MLYTDSSVTEF-------------CYKTKLCDC 379
Cdd:cd21516    83 -----DTFKGEC----PHFVFPLNSTVKVIqPRVEKKktegfmgrirSVYPVASPGECnpmalstlmkcnhCGETSWQTS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  380 GFITQF------GYVDCCG-DTCGFrgwVPGNMMDGFPCPGCcksympweleaQSSGVIPEGGVlfTQSTDTVNRESFKL 452
Cdd:cd21516   154 DFLKATcefcgtENLTKEGpTTCGY---LPQNAVVKMPCPAC-----------KNDEVGPEHSL--ADYHNHSGIETRLR 217
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  453 YGHAVVPFGGA--AYWSPYPG--MWLPVIWSSVKSysylTYTGVVG---------CKAIVQ---------------ETDA 504
Cdd:cd21516   218 KGGRTVCFGGCvfAYVGCYNKcaYWVPRASANIGS----NHTGVVGedvetlnddLLEILQrekvninivgdfklnEEVA 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  505 I---------------CRSLymDYVQHKCgNLEQRAILGVDDVYHRQLLVNRGDYSLLLENVDLFVKR-----RAEFACK 564
Cdd:cd21516   294 IilasfsastsafietVKGL--DYKTFKQ-IVESCGNFKVTKGKAKKGAWNIGTQKSVLTPLLAFPSQaagvvRSIFSRT 370
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216  565 FATCGDGLAPL------LLDGLVPRSYYLIKSGQAFTSLMVNfsrEVVDMCMDMALLfmhdVKVATKYVKKVTGKLAVRF 638
Cdd:cd21516   371 LDTAGHSLRALqraaitILDGISPQSLRLLDAMVFTSDLATN---SVLVMAYDTGGL----VQVTSQWLDNLFGTCADKL 443
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216  639 KalgiAVVRKITEWFDLAVDTAASAAGWLCYqLVNGLFAVANG-VITFVQEVPELVKNFVDKFKTFFKVLIDSMSVS 714
Cdd:cd21516   444 K----PVLTWLEEKLKEGVDFLRDAWEILKF-LVTGAYKIVKGqIVLAAKNIKECVQSFVAVVNKVLSLCYDQIQIA 515
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1323-1475 4.83e-05

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 46.71  E-value: 4.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1323 VCFVKGDvikvLRRVGAEVIVNPANGRMAHGAGVAGAIAKAAGKAfINETADMVKAQGVCQVGGCYESTGGKL-CKKVLN 1401
Cdd:COG2110     1 IEIVQGD----ITELDVDAIVNAANSSLLGGGGVAGAIHRAAGPE-LLEECRRLCKQGGCPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1402 IVGPDARGHGKQCYSLLERAYQHI------NKCDNVVTTLISAGIFSVPTD----VSLTYLLGVVTKN-----VILVSNN 1466
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaeeLGIRSIAFPAIGTGVGGFPWEeaapIAVETLRDFLEEHpsleeVRFVLFD 155

                  ....*....
gi 225403216 1467 QDDFDVIEK 1475
Cdd:COG2110   156 EEDYEAYRR 164
deltaCoV_PLPro cd21734
deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1685-1757 8.32e-05

deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in the non-structural protein 3 (Nsp3) region of deltacoronavirus, including Porcine deltacoronavirus, Bulbul coronavirus HKU11, and Common moorhen coronavirus HKU21. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high-value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses related enzymatic activities that promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15) are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409651  Cd Length: 313  Bit Score: 48.19  E-value: 8.32e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 225403216 1685 LLQYYSMLGMC--KWPVVVCGNYFAFKQSNNNCYINVACLMLQ--HLSLKFPkwqWQEAWNEFRSGKPLRFVSLVLA 1757
Cdd:cd21734    75 LSQYCVYLKYChhKWSVSRTNGLMHLKQKDNNCFVSAAINLFQntHYQLRPA---IDALYQEYLNGNPSRFVAWIYA 148
Macro_SF cd02749
macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular ...
1341-1453 2.41e-03

macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains, as in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, related ligands such as O-acyl-ADP-ribose (OAADPr), and even ligands unrelated to ADPr. Macrodomains include the yeast macrodomain Poa1, which is a phosphatase of ADP-ribose-1"-phosphate, a by-product of tRNA splicing. Some macrodomains have ADPr-unrelated binding partners; for example, the coronavirus SUD-N (N-terminal subdomain) and SUD-M (middle subdomain) of the SARS-unique domain (SUD) bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). Macrodomains regulate a wide variety of cellular and organismal processes, including DNA damage repair, signal transduction, and immune response.


Pssm-ID: 394871  Cd Length: 121  Bit Score: 40.84  E-value: 2.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1341 VIVNPANGRMAHGAGVAGAIAKAAGKAFINETADmVKAQGVCQVGGCYESTGGKL-CKKVLNIVGPDARGHgKQCYSLLE 1419
Cdd:cd02749     2 AIVNPANNDLYLGGGVAKAISKKAGGDLQEECEE-RKKNGYLKVGEVAVTKGGNLpARYIIHVVGPVASSK-KKTYEPLK 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 225403216 1420 RAYQHIN-KCDN-----VVTTLISAGIFSVPTDVSLTYLL 1453
Cdd:cd02749    80 KCVKNCLsLADEkglksVAFPAIGTGIAGFPPEEAARIML 119
HKU9-like_Nsp3_NAB cd21825
nucleic acid binding domain of non-structural protein 3 from Rousettus bat coronavirus HKU9 ...
1959-2060 4.66e-03

nucleic acid binding domain of non-structural protein 3 from Rousettus bat coronavirus HKU9 and betacoronavirus in the D lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the D lineage.


Pssm-ID: 409351  Cd Length: 117  Bit Score: 39.82  E-value: 4.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 225403216 1959 KVEGVYTNFKLVG---HDIAEKLNAKLGFDCNSPfmEYKITEWPTATGDVVLASDDLyVSRYSGGCVTFGKPVIWRGHEE 2035
Cdd:cd21825    16 KLVTPYDGFYLSScqnLALAESFNKAINATKQGP--KKLLTVYPNCSGDVVAVSDDN-VTAHPYGSLIMGKPVLFVTKPN 92
                          90       100
                  ....*....|....*....|....*
gi 225403216 2036 ASLKSLTYFNRPSVVCENKFNVLPV 2060
Cdd:cd21825    93 TWKKLVPLLSALVVETTNKYEVLPV 117
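
Each hit above carries a statistics line with the same four fields (Pssm-ID, Cd Length, Bit Score, E-value), and each alignment reports the first and last aligned position of the domain model. The following is a minimal Python sketch for reading those lines and estimating how much of a model the alignment spans. It assumes the line layout seen on this page holds for every hit; the names `parse_hit_stats` and `model_coverage` are hypothetical helpers written for illustration and are not part of any NCBI tool.

```python
import re

# Layout of the per-hit statistics line as it appears on this page.
# The bracketed tag (for example "[Multi-domain]") is optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive ends)."""
    return (cd_to - cd_from + 1) / cd_length


stats = parse_hit_stats(
    "Pssm-ID: 394877 [Multi-domain]  Cd Length: 158  Bit Score: 53.26  E-value: 2.36e-07"
)
# The cd02907 alignment above runs from model position 3 to 132.
coverage = model_coverage(3, 132, stats["cd_length"])
```

The coverage figure counts the span between the first and last aligned model positions, so gaps inside the alignment are included in it. It is a rough indicator of a partial hit, not a replacement for the reported E-value.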
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
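
The E-value threshold of 0.01 listed in the search parameters determines which hits are reported. As a rough illustration, the Python sketch below applies a threshold to the hits shown in this section, using the accessions, query intervals, and E-values as they appear above. The function name `filter_hits` is hypothetical, and treating the cutoff as inclusive (less than or equal) is an assumption; the page does not state how CD-Search handles a value exactly at the threshold.

```python
# (accession, query interval, E-value) for the hits shown in this section.
# The cd02907 interval is taken from its alignment endpoints.
HITS = [
    ("cd02907", (1322, 1447), 2.36e-07),
    ("cd21518", (255, 494), 1.33e-05),
    ("cd21517", (254, 437), 1.63e-05),
    ("cd21516", (253, 714), 2.47e-05),
    ("COG2110", (1323, 1475), 4.83e-05),
    ("cd21734", (1685, 1757), 8.32e-05),
    ("cd02749", (1341, 1453), 2.41e-03),
    ("cd21825", (1959, 2060), 4.66e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold, best first."""
    kept = [hit for hit in hits if hit[2] <= threshold]
    return sorted(kept, key=lambda hit: hit[2])


reported = filter_hits(HITS)            # page default: every hit above passes
stricter = filter_hits(HITS, 1e-4)      # drops cd02749 and cd21825
```

Tightening the cutoff to 1e-4 removes the two weakest hits in this section (the macrodomain superfamily model and the HKU9-like NAB model), which is one way to focus on the more confident assignments.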
