NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|381354069|gb|AFG25768|]
View 

1ab polyprotein [Rat coronavirus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
HCoV_HKU1-like_RdRp cd21593
human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, ...
4457-5381 0e+00

human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the A lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of human coronavirus HKU1, murine hepatitis virus, and similar proteins from betacoronaviruses in the embecovirus subgenera (A lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


:

Pssm-ID: 394897  Cd Length: 925  Bit Score: 2065.32  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKE 4536
Cdd:cd21593     1 TNFLNRVRGTSVNARLVPCASGLSTDVQLRAFDICNANRAGIGLYYKVNCCRFQRLDEDGNKLDKFFVVKRTNLEVYNKE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4537 KECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWY 4616
Cdd:cd21593    81 KECYELTKSCGVVAEHEFFTFDVDGSRVPHIVRKDLSKYTMLDLCYALRHFDRNDCSTLCEILSMYAECDESYFTKKDWY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4617 DFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 4696
Cdd:cd21593   161 DFVENPDIINVYKKLGPIFNRALVNTAKFADTLVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4697 MLTMCHALDSELYVNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFG 4776
Cdd:cd21593   241 MLTMCHALDCELFVNDTYRQFDLVQYDFTDYKLELFNKYFKYWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPNTCFG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4777 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 4856
Cdd:cd21593   321 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4857 QTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYEGGC 4936
Cdd:cd21593   401 QTVKPGNFNQDFYDFILSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGC 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4937 IPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 5016
Cdd:cd21593   481 IPASQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDDIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5017 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 5096
Cdd:cd21593   561 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5097 HTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 5176
Cdd:cd21593   641 HGDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5177 YSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETDIEK 5256
Cdd:cd21593   721 YSNVYRSDYVDPTFVNEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMSESKCWVETDINN 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5257 GPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVYLEY 5336
Cdd:cd21593   801 GPHEFCSQHTMLVKMDGDYVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVYHENEEYQNVFRVYLEY 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 5337 IKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21593   881 IKKLYNDLGNQILDSYSVILSTCDGQKFTDESFYKNMYLRSAVMQ 925
betaCoV_Nsp14 cd21659
nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5984-6500 0e+00

nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


:

Pssm-ID: 394958  Cd Length: 519  Bit Score: 1138.29  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRV 6063
Cdd:cd21659     1 TGLFKDCSKSYVGLHPAYAPTFLSVDDKYKTNGDLCVCLNIIDSVVTYSRLISLMGFKLDLTLPGYPKLFITREEAIKRV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6064 RAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVR 6143
Cdd:cd21659    81 RAWIGFDVEGAHATRDAIGTNFPLQLGFSTGVNFVVEPTGLVDTEDGYMFTKIVAKAPPGEQFKHLIPLMSKGQPWDVVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6144 IRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIVD 6223
Cdd:cd21659   161 IRIVQMLSDTLDDLSDSVVFVTWAHGFELTSLRYFAKIGKERTCCMCTKRATCYSSRTGYYGCWRHSVGCDYVYNPFIVD 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6224 IQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAMLC 6303
Cdd:cd21659   241 VQQWGYTGNLQSNHDRYCSVHKGAHVASSDAIMTRCLAVHDCFCKRVNWDVEYPIISNELSINSSCRLVQRVVLKAALLA 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6304 NRYDVC*DIGNPKGLACVK--GYDFKFYDASPVVKSVKQFVYKYEAHKDQFLDGLCMFW*CNVDKYPANAVVCRFDTRVL 6381
Cdd:cd21659   321 NRFDLCYDIGNPKGIACVKdpVVDWKFYDAQPVVKSVKQLFYTYEAHKDQFKDGLCMFWNCNVDKYPANAIVCRFDTRVL 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6382 NKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPLRSATCITRCNLGGAVCLK 6461
Cdd:cd21659   401 SKLNLPGCNGGSLYVNKHAFHTPAFDKSAFENLKPLPFFYYSDTPCEYHGGNDVKDVDYVPLKSATCITRCNLGGAVCRK 480
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 381354069 6462 HAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:cd21659   481 HAEEYREYLEAYNTATTAGFTLWVYKTFDFYNLWNTFTK 519
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2280-2834 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


:

Pssm-ID: 409662  Cd Length: 555  Bit Score: 1120.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2280 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFKTTFGVSTICDFYQVTDLGYRSSFCNGSMVCEL 2359
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2360 CFSGFDMLDSYDAINVVQHVVDRRVSFDYISILKLVVELIIGYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLETMHWSAR 2439
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2440 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2519
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2520 KHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2599
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2600 LLHSKVKGVPETHVVVVENEADKAGFLGAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKS 2679
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2680 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAAD 2759
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 2760 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-831 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


:

Pssm-ID: 394870  Cd Length: 586  Bit Score: 1088.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  249 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSIDYVGQPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  329 VVDGDVVVRAPAHLLAADALVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCDFRGWVPGNMLDGF 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  409 PCPGCSKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNREAFKLYGHAVVPFGSAVYWSPYPGMWLPVVWSSVKSYSGLTY 488
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  489 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLDQRATLGLDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 568
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  569 D-GFVPLLLDGLVPRSYYLIKSGQAYTSMMVNFSHEVIDMCMDMALLFMHDVKVATKYVKKFTGKLAVRFKALGVAVVRK 647
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  648 ITEWFDLAVDIAASAAGWLCYQLVNGLFAVANGVITFVQEAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNR 727
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  728 VCLAGSKVYEVVQKSLSAYVLPVGC--SEATCLVGESEPAVFEDDVVGVVKTPLTYQGCCKPPTSFEKICIVDKLYMAKC 805
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 381354069  806 GDQFYPVVVDNDTVGVLDQCWRFPCA 831
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
betaCoV_Nsp13-helicase cd21722
helicase domain of betacoronavirus non-structural protein 13; This model represents the ...
5631-5970 0e+00

helicase domain of betacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from betacoronavirus, including pathogenic human viruses such as Severe acute respiratory syndrome coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


:

Pssm-ID: 409655 [Multi-domain]  Cd Length: 340  Bit Score: 728.14  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5631 RFASVYSVPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDC 5710
Cdd:cd21722     1 GLYPTYNVPEEFQNNVVNYQKIGMKRYCTVQGPPGTGKSHLAIGLAVYYPTARVVYTACSHAAVDALCEKAFKFLNINKC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5711 TRIVPAKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRV 5790
Cdd:cd21722    81 SRIIPAKARVECYDKFKVNDTSRQYVFSTINALPETVTDILVVDEVSMCTNYDLSVINARVRAKHIVYIGDPAQLPAPRT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5791 LLNKGTLEPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNM 5870
Cdd:cd21722   161 LLTKGTLEPEYFNSVTRLMCCLGPDIFLGTCYRCPKEIVDTVSALVYDNKLKAKKDNSGQCFKVYYKGSVTHDSSSAINR 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5871 QQIHLISKFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKK 5950
Cdd:cd21722   241 PQIYLVKKFLKANPAWSKAVFISPYNSQNAVARRVLGLQTQTVDSSQGSEYDYVIYCQTAETAHSVNVNRFNVAITRAKK 320
                         330       340
                  ....*....|....*....|
gi 381354069 5951 GILCVMSSMQLFESLNFTTL 5970
Cdd:cd21722   321 GILCVMSSMQLFESLQFTEL 340
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


:

Pssm-ID: 152398  Cd Length: 355  Bit Score: 659.71  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    81 FVDKDPQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   161 WLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGH-RRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYT 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHfRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   240 LLKGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSI 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 381354069   320 DYVGQPTEDVVDGDVVVRAPAHLLAADALVKRLPR 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
CoV_Methyltr_2 pfam06460
Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus ...
6878-7173 0e+00

Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus polyprotein. The SARS-CoV RNA cap SAM-dependent (nucleoside-2'-O-)-methyltransferase (2'-O-MTase) is a heterodimer comprising SARS-CoV nsp10 and nsp16. When bound to nsp10, nsp16 is active as a type-0 RNA cap-dependent 2'-O-MTase, ie., active only when the cap guanine is methylated at its N7 position. Nsp10 binds to nsp16 through an activation surface area in nsp10, and the resulting complex exhibits RNA cap (nucleoside-2'-O)-methyltransferase activity. Nsp10 is a double zinc finger protein together with nsp4, nsp5, nsp12, nsp14, and nsp16, nsp10 has been found to be essential in the assembly of a functional replication/transcription complex. Nsp16 adopts a typical fold of the S-adenosylmethionine-dependent methyltransferase (SAM) family as defined initially for the catechol O-MTase but it lacks several elements of the canonical MTase fold, such as helices B and C. The nsp16 topology matches those of dengue virus NS5 N-terminal domain and of vaccinia virus VP39 MTases.


:

Pssm-ID: 461919  Cd Length: 296  Bit Score: 600.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6878 AADWKPGYVMPVLYKYLESPLERVNLWNYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPG 6957
Cdd:pfam06460    1 SAAWKPGYSMPVLYKYQRMCLERCNLYNYGAGITLPSGIMMNVAKYTQLCQYLNTTTLAVPHNMRVLHLGAGSDKGVAPG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6958 SAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKL 7037
Cdd:pfam06460   81 SAVLRQWLPAGTILVDNDLNDFVSDADFSVTGDCATLYTEDKWDLIISDMYDPRTKNIDGENVSKDGFFTYLCGFIREKL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  7038 ALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGA 7117
Cdd:pfam06460  161 ALGGSIAIKITEFSWNADLYKLMGRFAWWTMFCTNVNASSSEAFLIGINYLGKPKVEIDGNTMHANYIFWRNSTVMQLSA 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069  7118 YSLFDMSKFPLKAAGTAVVSLKPDQINDLVLSLIEKGRLLVRDTRKEVFVGDSLVN 7173
Cdd:pfam06460  241 YSLFDMSKFPLKLKGTAVVNLKEDQINDMVYSLLEKGKLLIRDNGKEVFFSDSLVN 296
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3334-3627 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


:

Pssm-ID: 394887  Cd Length: 297  Bit Score: 575.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3334 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSDDMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3413
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3414 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3493
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3494 QLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3573
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 3574 AD--LV*DALASMTGVTVEQVLAAIKRLYSGF-QGKQILGSCVLEDELTPSDVYQQL 3627
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1607-1904 4.81e-160

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


:

Pssm-ID: 409649  Cd Length: 304  Bit Score: 497.88  E-value: 4.81e-160
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1607 NKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDAFGFDEP-QLL 1685
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1686 KYYNMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1764
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1765 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLVHCTQFNV-PF 1842
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069 1843 LICSYTPEGRKLPD-DVVAANIFTGG-SLGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1904
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2846-3227 2.73e-159

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


:

Pssm-ID: 394836  Cd Length: 376  Bit Score: 499.04  E-value: 2.73e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2846 FVANLICFIVLWALIPTYAVHKSDMQLPLYASFKVIENGVLRDVSVTDATSANKFNQFDQWYESTFGLAYYRTSSCPVVV 2925
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYGSVPTNSKSCPIVV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2926 AVIDqDIGHTLFNVPTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYTE 3005
Cdd:cd21473    81 GVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYDT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3006 GVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraMP 3085
Cdd:cd21473   159 GLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--GP 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3086 GTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFMMLFVF 3165
Cdd:cd21473   236 GVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYVV 314
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3166 QVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCIIYVAVVVSNHALWLF 3227
Cdd:cd21473   315 TQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3634-3920 1.09e-149

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


:

Pssm-ID: 394846  Cd Length: 290  Bit Score: 467.87  E-value: 1.09e-149
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3634 SKRTRVIKGTCCWILASTFLFCSIIAAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3712
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3713 TNYL*VYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLIAMVfVTMRSINHDVFSIMFLVGRLVSLVSMWYFGaNLE 3792
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3793 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3868
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3869 LGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
Peptidase_C16 super family cl03374
Peptidase C16 family;
1082-1330 1.23e-125

Peptidase C16 family;


The actual alignment was detected with superfamily member pfam01831:

Pssm-ID: 460353  Cd Length: 249  Bit Score: 397.14  E-value: 1.23e-125
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1082 AFDAIYSKALSAVYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLEMQKLWLSYKAGYDQCFVDKLVK 1161
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1162 SVPRSIILPQGGYVADFAYYFLSQCSFKAHANWRCLKCDMALKLQGLDAMFFYGDVVSHMCKCGSGMTLLSADIPYTLHF 1241
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1242 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKLIDGKNVTKFTGDKFDFMVGHGMTFSMSPFETAQLYGSCITPN 1321
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 381354069  1322 VCFVKGDVI 1330
Cdd:pfam01831  241 LCFGKGDVI 249
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4013-4206 3.91e-115

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


:

Pssm-ID: 409258  Cd Length: 196  Bit Score: 364.49  E-value: 3.91e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4013 SEFVNMASFVEYELAKKNLDEAK*SGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4092
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4093 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4172
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 381354069 4173 DGAVKQLNEID---VNSIWPLVIAANRHNEvSTVVLQ 4206
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4317-4446 3.07e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


:

Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.09  E-value: 3.07e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4317 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4396
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 381354069 4397 VEHPDVDGLCKLRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
NendoU_cv_Nsp15-like cd21161
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural ...
6724-6874 4.16e-81

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural Protein 15 (Nsp15) and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Except for turkey coronavirus (TCoV) Nsp15, Mn2+ is generally essential for the catalytic activity of coronavirus Nsp15. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and murine hepatitis virus (MHV) form a functional hexamer while Porcine DeltaCoronavirus (PDCoV) Nsp15 has been shown to exist as a dimer and a monomer in solution. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


:

Pssm-ID: 439158  Cd Length: 151  Bit Score: 264.89  E-value: 4.16e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6724 FTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSYDS 6803
Cdd:cd21161     1 FTQGRSLEDFKPRSQMERDFLSMDQDVFIQKYGLEDLGFEHIVYGDFSKPTIGGLHLLIGLVRLKKEGKLYVEEFHNSDS 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6804 SIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:cd21161    81 TVQNYFVTDANNGSSKQVCTVVDLLLDDFVDILKSQDLSVVSKVVTVSIDYKPIRFMLWCKDGKVKTFYPQ 151
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2107-2231 3.04e-79

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


:

Pssm-ID: 409627  Cd Length: 125  Bit Score: 258.77  E-value: 3.04e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2107 VTEVHQEPSVSAVDVKEVKLNGVKKPVKVEDSVVVNDPTSDTKVVKSLSIVDVYDMFLTGCKYVVWTANELSRLVNSPTV 2186
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 2187 REYVKWGMGKIVNSTKLLLLRDERQEFVAPKVVKAKAIACYGAVK 2231
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1941-2059 2.87e-78

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


:

Pssm-ID: 409350  Cd Length: 119  Bit Score: 255.45  E-value: 2.87e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1941 GKYYTKPIIKAQFRTFEK*DGVYTNFKL*GHSIAEKL*AKLGFDCDSPFVEYKITEWPTATGDV*LASDDLYVSRYLSGC 2020
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 381354069 2021 ITFGKPVVWLGHEEASLKSLTYFNRPSVVCENKFNVLPV 2059
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4207-4316 3.75e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


:

Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.21  E-value: 3.75e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 381354069 4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
M_alpha_beta_cv_Nsp15-like cd21167
middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6567-6689 8.22e-57

middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain and a C-terminal catalytic (NendoU) domain. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. This middle domain harbors residues involved in hexamer formation and in trimer stability. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of the other coronaviruses; it has been shown to exist as a dimer and a monomer in solution.


:

Pssm-ID: 439161  Cd Length: 127  Bit Score: 194.47  E-value: 8.22e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6567 PHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKN 6646
Cdd:cd21167     1 PVPELKLLRNLGVDICYKFVLWDYEREAPFTSSTIGVCKYTDIDKKSDLNVLFDGRDPGSLERFRSARNAVLISTTKVKG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 381354069 6647 LSMIKGPQRADLNGVVVEKVGDSDVEFWFAMRSDGDDVIFSRT 6689
Cdd:cd21167    81 LKPIKGPNYASLNGVVVESVDKKKVKFYYYVRKDGEFVDLTDT 123
ZBD_cv_Nsp13-like cd21401
Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related ...
5382-5476 1.68e-56

Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. This coronavirus family includes Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) non-structural protein 13 (SARS-Nsp13) and belongs to helicase superfamily 1 (SF1) and to a family of nindoviral replication helicases. SARS-Nsp13 has an N-terminal CH/ZBD, a stalk domain, a 1B regulatory domain, and SF1 helicase core. The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase (RdRp). Structural studies of a stable SARS-CoV-2 RTC which included two molecules of Nsp13, the RdRp holoenzyme (Nsp7, two molecules of Nsp8, Nsp12), and an RNA template product, show that one Nsp13 CH/ZBD domain interacts with Nsp12, and both Nsp13-CH/ZBD domains interact with the Nsp8. This stable SARS-CoV-2 RTC suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching.


:

Pssm-ID: 439168  Cd Length: 95  Bit Score: 192.22  E-value: 1.68e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5382 SVGACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNSPGCDVNDVTKLYLGGMSYYCEDHKPQYS 5461
Cdd:cd21401     1 AVGLCVVCNSQTVLRCGDCIRRPFLCCKCCYDHVMSTSHKFILSINPYVCNAPGCGVSDVTKLYLGGMSYYCEDHKPSLS 80
                          90
                  ....*....|....*
gi 381354069 5462 FKLVMNGMVFGLYKQ 5476
Cdd:cd21401    81 FPLCANGFVFGLYKN 95
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3234-3328 1.32e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


:

Pssm-ID: 465099  Cd Length: 92  Bit Score: 163.85  E-value: 1.32e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3234 GTEVRsdGTFEEMALTTFMITKVSYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREA*CSQLAKAMETFNhNNGN 3313
Cdd:pfam16348    1 GDKFV--GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGN 77
                           90
                   ....*....|....*
gi 381354069  3314 DVLYQPPTASVTTSF 3328
Cdd:pfam16348   78 DVLYTPPTVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1532-1606 1.28e-45

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


:

Pssm-ID: 394840  Cd Length: 75  Bit Score: 160.27  E-value: 1.28e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 1532 QLDDDARVFVQANMDCLPTDWRLVNKLDVVDGVRTIKYFECPGEIFVSSQGKKFGYVQNGLFKVASVSQIRALLA 1606
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
1B_cv_Nsp13-like cd21409
1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze ...
5531-5609 5.93e-45

1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. Members of this subfamily belong to helicase superfamily 1 (SF1) and include coronavirus helicases such as Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13). SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). Structural studies of a stable RTC which included the RNA-dependent RNA polymerase holoenzyme (Nsp7, two molecules of Nsp82, Nsp12), two molecules of Nsp13 helicase accessory factor and an RNA template product suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching. SARS-Nsp13 is a multidomain protein; its other domains include an N-terminal Cys/His rich zinc-binding domain (CH/ZBD) and a SF1 helicase core. The 1B domain is involved in nucleic acid substrate binding; the 1B domain of the related Equine arteritis virus (EAV) Nsp10 undergoes large conformational change upon substrate binding, and together with the 1A and 2A domains of the helicase core form a channel that accommodates the single stranded nucleic acids.


:

Pssm-ID: 394817  Cd Length: 79  Bit Score: 158.66  E-value: 5.93e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 381354069 5531 ASATIREIVSDRELILSWEIGKVRPPLNKNYVFTGYHFTNNGKTVLGEYVFDKSELTNGVYYRATTTYKLSVGDVFILT 5609
Cdd:cd21409     1 ASATVKEVVGPRELVLSWEAGKTKPPLNRNYVFTGYHITKNSKTQLGEYTFEKSDYSDSVYYKSTTTYKLQPGDIFVLT 79
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
851-940 6.18e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


:

Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 6.18e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  851 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHDViGTKVCALLDRLAEDYVYLFDEGGDE 930
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 381354069  931 VIAPRMYCSF 940
Cdd:cd21467    80 VLASEMYCSF 89
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3921-4001 7.49e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


:

Pssm-ID: 409253  Cd Length: 83  Bit Score: 132.95  E-value: 7.49e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4000
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  .
gi 381354069 4001 Y 4001
Cdd:cd21827    79 P 79
NTD_alpha_betaCoV_Nsp15-like cd21171
N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6503-6563 3.48e-31

N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Coronavirus (CoV) Nsp15 is a nidovirus endoribonuclease (NendoU). NendoUs are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include CoV Nsp15 and arterivirus Nsp11, both of which may participate in the viral replication process and in the evasion of the host immune system. This small NTD structure, present in coronavirus Nsp15, is missing in Nsp11. CoV Nsp15 has an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Nsp15 from Severe Acute Respiratory Syndrome (SARS)-CoV, human CoV229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Residues in this N-terminal domain are important for hexamer (dimer of trimers) formation.


:

Pssm-ID: 439163  Cd Length: 61  Bit Score: 118.82  E-value: 3.48e-31
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6503 SLENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:cd21171     1 SLENVAYNVVKKGHFVGVEGELPVAIVNDKVFVKDGGVDVLVFTNKTSLPTNVAFELYAKR 61
stalk_CoV_Nsp13-like cd21689
stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the ...
5480-5527 8.93e-23

stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the stalk domain of coronavirus non-structural protein 13 (Nsp13) helicase, found in the Nsp3s of alpha-, beta-, gamma-, and deltacoronaviruses, including Severe Acute Respiratory Syndrome coronavirus (SARS-CoV), SARS-CoV-2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome coronavirus (MERS-CoV). Helicases are classified based on the arrangement of conserved motifs into six superfamilies; coronavirus helicases in this family belong to superfamily 1 (SF1). Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It consists of an N-terminal ZBD (Cys/His rich zinc-binding domain), a stalk domain, a 1B regulatory domain, and SF1 helicase core. The stalk domain lies between the ZBD domain and the 1B domain; a short loop connects the ZBD to the stalk domain. The stalk domain is comprised of three tightly-interacting alpha-helices connected to the 1B domain, transferring the effect from the ZBD domain onto the helicase core domains. The ZBD and stalk domains are critical for the helicase activity of SARS-CoV Nsp13.


:

Pssm-ID: 410205  Cd Length: 48  Bit Score: 94.21  E-value: 8.93e-23
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5480 GSPYIEDFNKIASCKWTEVDDYALANECTERLKLFAAETQKATEEAFK 5527
Cdd:cd21689     1 GSPDVDDFNRLATSDWSDVEDYKLANTCKDSLKLFAAETIKAKEESVK 48
Macro_SF super family cl00019
macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular ...
1339-1463 1.34e-21

macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Macrodomains include the yeast macrodomain Poa1 which is a phosphatase of ADP-ribose-1"-phosphate, a by-product of tRNA splicing. Some macrodomains have ADPr-unrelated binding partners such as the coronavirus SUD-N (N-terminal subdomain) and SUD-M (middle subdomain) of the SARS-unique domain (SUD) which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). Macrodomains regulate a wide variety of cellular and organismal processes, including DNA damage repair, signal transduction, and immune response.


The actual alignment was detected with superfamily member cd21557:

Pssm-ID: 469581  Cd Length: 127  Bit Score: 93.77  E-value: 1.34e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1339 EVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETaDMVKNQGVCLVGECYESAGGKLCKKVLNIVGPDARGQgrQCYSLLE 1418
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 1419 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1463
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
B-CoV_A_NSP1 super family cl13410
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
967-1073 4.85e-17

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


The actual alignment was detected with superfamily member pfam11963:

Pssm-ID: 152398  Cd Length: 355  Bit Score: 86.92  E-value: 4.85e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   967 SVVLVADAQE-DGVAKEQVE-VDSEICVAH---TGGqdELTEPDAVGSQTPIASAEKTEVGEAS--DREGIAEAKR---- 1035
Cdd:pfam11963  238 AYALLRGYRGvKPVLFVDQYgCDYTGCLADgleAYG--DYTLQDMKQLQPVWLANLDFDVVVAWhvVRDPRAVMRLqtia 315
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 381354069  1036 TVCADDLDACP--DQVEAFEIEEVEDSILDELQTELNAPS 1073
Cdd:pfam11963  316 TICGIAYVAQPteDVVDGDVVIKEPVHLLSADAIVLRLPS 355
 
Name Accession Description Interval E-value
HCoV_HKU1-like_RdRp cd21593
human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, ...
4457-5381 0e+00

human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the A lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of human coronavirus HKU1, murine hepatitis virus, and similar proteins from betacoronaviruses in the embecovirus subgenera (A lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394897  Cd Length: 925  Bit Score: 2065.32  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKE 4536
Cdd:cd21593     1 TNFLNRVRGTSVNARLVPCASGLSTDVQLRAFDICNANRAGIGLYYKVNCCRFQRLDEDGNKLDKFFVVKRTNLEVYNKE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4537 KECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWY 4616
Cdd:cd21593    81 KECYELTKSCGVVAEHEFFTFDVDGSRVPHIVRKDLSKYTMLDLCYALRHFDRNDCSTLCEILSMYAECDESYFTKKDWY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4617 DFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 4696
Cdd:cd21593   161 DFVENPDIINVYKKLGPIFNRALVNTAKFADTLVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4697 MLTMCHALDSELYVNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFG 4776
Cdd:cd21593   241 MLTMCHALDCELFVNDTYRQFDLVQYDFTDYKLELFNKYFKYWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPNTCFG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4777 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 4856
Cdd:cd21593   321 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4857 QTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYEGGC 4936
Cdd:cd21593   401 QTVKPGNFNQDFYDFILSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGC 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4937 IPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 5016
Cdd:cd21593   481 IPASQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDDIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5017 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 5096
Cdd:cd21593   561 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5097 HTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 5176
Cdd:cd21593   641 HGDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5177 YSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETDIEK 5256
Cdd:cd21593   721 YSNVYRSDYVDPTFVNEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMSESKCWVETDINN 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5257 GPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVYLEY 5336
Cdd:cd21593   801 GPHEFCSQHTMLVKMDGDYVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVYHENEEYQNVFRVYLEY 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 5337 IKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21593   881 IKKLYNDLGNQILDSYSVILSTCDGQKFTDESFYKNMYLRSAVMQ 925
betaCoV_Nsp14 cd21659
nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5984-6500 0e+00

nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394958  Cd Length: 519  Bit Score: 1138.29  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRV 6063
Cdd:cd21659     1 TGLFKDCSKSYVGLHPAYAPTFLSVDDKYKTNGDLCVCLNIIDSVVTYSRLISLMGFKLDLTLPGYPKLFITREEAIKRV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6064 RAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVR 6143
Cdd:cd21659    81 RAWIGFDVEGAHATRDAIGTNFPLQLGFSTGVNFVVEPTGLVDTEDGYMFTKIVAKAPPGEQFKHLIPLMSKGQPWDVVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6144 IRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIVD 6223
Cdd:cd21659   161 IRIVQMLSDTLDDLSDSVVFVTWAHGFELTSLRYFAKIGKERTCCMCTKRATCYSSRTGYYGCWRHSVGCDYVYNPFIVD 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6224 IQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAMLC 6303
Cdd:cd21659   241 VQQWGYTGNLQSNHDRYCSVHKGAHVASSDAIMTRCLAVHDCFCKRVNWDVEYPIISNELSINSSCRLVQRVVLKAALLA 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6304 NRYDVC*DIGNPKGLACVK--GYDFKFYDASPVVKSVKQFVYKYEAHKDQFLDGLCMFW*CNVDKYPANAVVCRFDTRVL 6381
Cdd:cd21659   321 NRFDLCYDIGNPKGIACVKdpVVDWKFYDAQPVVKSVKQLFYTYEAHKDQFKDGLCMFWNCNVDKYPANAIVCRFDTRVL 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6382 NKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPLRSATCITRCNLGGAVCLK 6461
Cdd:cd21659   401 SKLNLPGCNGGSLYVNKHAFHTPAFDKSAFENLKPLPFFYYSDTPCEYHGGNDVKDVDYVPLKSATCITRCNLGGAVCRK 480
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 381354069 6462 HAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:cd21659   481 HAEEYREYLEAYNTATTAGFTLWVYKTFDFYNLWNTFTK 519
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2280-2834 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1120.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2280 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFKTTFGVSTICDFYQVTDLGYRSSFCNGSMVCEL 2359
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2360 CFSGFDMLDSYDAINVVQHVVDRRVSFDYISILKLVVELIIGYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLETMHWSAR 2439
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2440 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2519
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2520 KHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2599
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2600 LLHSKVKGVPETHVVVVENEADKAGFLGAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKS 2679
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2680 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAAD 2759
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 2760 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-831 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1088.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  249 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSIDYVGQPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  329 VVDGDVVVRAPAHLLAADALVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCDFRGWVPGNMLDGF 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  409 PCPGCSKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNREAFKLYGHAVVPFGSAVYWSPYPGMWLPVVWSSVKSYSGLTY 488
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  489 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLDQRATLGLDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 568
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  569 D-GFVPLLLDGLVPRSYYLIKSGQAYTSMMVNFSHEVIDMCMDMALLFMHDVKVATKYVKKFTGKLAVRFKALGVAVVRK 647
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  648 ITEWFDLAVDIAASAAGWLCYQLVNGLFAVANGVITFVQEAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNR 727
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  728 VCLAGSKVYEVVQKSLSAYVLPVGC--SEATCLVGESEPAVFEDDVVGVVKTPLTYQGCCKPPTSFEKICIVDKLYMAKC 805
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 381354069  806 GDQFYPVVVDNDTVGVLDQCWRFPCA 831
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
CoV_ExoN pfam06471
Coronavirus proofreading exoribonuclease; This region of coronavirus polyproteins encodes the ...
5982-6500 0e+00

Coronavirus proofreading exoribonuclease; This region of coronavirus polyproteins encodes the NSP14 protein. Its N-terminal exoribonuclease (ExoN) domain plays a proofreading role for prevention of lethal mutagenesis, and the C-terminal domain functions as a (guanine-N7) methyl transferase (N7-MTase) for mRNA capping. NSP14 forms the nsp14-nsp10 complex involved in RNA viral proofreading.


Pssm-ID: 399465  Cd Length: 515  Bit Score: 951.90  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5982 CTTNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIK 6061
Cdd:pfam06471    1 NTTGLFKDCSKEYSGLHPAHAPTYLSLDDKFKTSGDLAVCVGVSDKDVTYKRLISLMGFKMSLNVEGYHNMFITRDEAIR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6062 RVRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDV 6141
Cdd:pfam06471   81 HVRAWIGFDVEGAHATGDNVGTNLPLQLGFSTGVDFVVTPEGCVDTENGSVFEPVNAKAPPGEQFKHLIPLMRKGQPWHV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6142 VRIRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLI 6221
Cdd:pfam06471  161 VRIRIVQMLADTLAGLSDRVVFVLWAHGLELTTMRYFVKIGREQVCS-CGKRATCFNSSTDTYACWKHSLGCDYVYNPFL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6222 VDIQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAM 6301
Cdd:pfam06471  240 IDIQQWGYTGSLSSNHDEHCNVHGNAHVASGDAIMTRCLAVHDCFVKRVDWSLEYPIIANELRVNKACRLVQRMVLKAAL 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6302 LCNRYDVC*DIGNPKGLACV--KGYDFKFYDASPVVKSVKQFVYKYEAHKDqFLDGLCMFW*CNVDKYPANAVVCRFDTR 6379
Cdd:pfam06471  320 LADKPPVVHDIGNPKGIKCVrrAGVKWKFYDANPIVKNVKQLEYDYETHKD-KMDGLCLFWNCNVDMYPANAIVCRFDTR 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6380 VLNKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCvymeGMESKQVDYVPLRSATCITRCNLGGAVC 6459
Cdd:pfam06471  399 VLSKLNLPGCNGGSLYVNKHAFHTPAFDRRAFANLKPMPFFYYSDSPC----ESVGKQVDYVPLKSATCITRCNIGGAVC 474
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 381354069  6460 LKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:pfam06471  475 KKHANEYREYVESYNMMTTAGFTFWVPKNFDTYNLWNTFTR 515
betaCoV_Nsp13-helicase cd21722
helicase domain of betacoronavirus non-structural protein 13; This model represents the ...
5631-5970 0e+00

helicase domain of betacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from betacoronavirus, including pathogenic human viruses such as Severe acute respiratory syndrome coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409655 [Multi-domain]  Cd Length: 340  Bit Score: 728.14  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5631 RFASVYSVPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDC 5710
Cdd:cd21722     1 GLYPTYNVPEEFQNNVVNYQKIGMKRYCTVQGPPGTGKSHLAIGLAVYYPTARVVYTACSHAAVDALCEKAFKFLNINKC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5711 TRIVPAKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRV 5790
Cdd:cd21722    81 SRIIPAKARVECYDKFKVNDTSRQYVFSTINALPETVTDILVVDEVSMCTNYDLSVINARVRAKHIVYIGDPAQLPAPRT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5791 LLNKGTLEPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNM 5870
Cdd:cd21722   161 LLTKGTLEPEYFNSVTRLMCCLGPDIFLGTCYRCPKEIVDTVSALVYDNKLKAKKDNSGQCFKVYYKGSVTHDSSSAINR 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5871 QQIHLISKFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKK 5950
Cdd:cd21722   241 PQIYLVKKFLKANPAWSKAVFISPYNSQNAVARRVLGLQTQTVDSSQGSEYDYVIYCQTAETAHSVNVNRFNVAITRAKK 320
                         330       340
                  ....*....|....*....|
gi 381354069 5951 GILCVMSSMQLFESLNFTTL 5970
Cdd:cd21722   321 GILCVMSSMQLFESLQFTEL 340
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 659.71  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    81 FVDKDPQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   161 WLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGH-RRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYT 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHfRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   240 LLKGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSI 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 381354069   320 DYVGQPTEDVVDGDVVVRAPAHLLAADALVKRLPR 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
CoV_RPol_N pfam06478
Coronavirus RNA-dependent RNA polymerase, N-terminal; This family covers the N-terminal region ...
4467-4815 0e+00

Coronavirus RNA-dependent RNA polymerase, N-terminal; This family covers the N-terminal region of the coronavirus RNA-directed RNA Polymerase which corresponds to the nonstructural protein 12 (NSP12) produced by cleavage of ORF1b. NSP12 contains a polymerase domain that assumes a structure resembling a cupped 'right hand', similar to other polymerases, containing a fingers domain, a palm domain and a thumb domain. Coronavirus NSP12 also contains a nidovirus-unique N-terminal extension that possesses a kinase-like fold allowing the binding of NSP12 to NSP7 and NSP8. NSP12 possesses some minimal activity on its own, but the addition of the NSP7 and NSP8 co-factors greatly stimulates polymerase activity.


Pssm-ID: 461929  Cd Length: 353  Bit Score: 652.22  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4467 SVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEKECYELTKEC 4546
Cdd:pfam06478    1 SSAARLEPCASGTDPDVVYRAFDIYNKDVAGIGKFLKTNCCRFQEVDKDGNLLDSYFVVKRCTKSVYEHEESCYNLLKDC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4547 GVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYDFVENPDIIN 4626
Cdd:pfam06478   81 GVVAEHDFFKFDVGGDMVPNISRQDLTKYTMMDLCYALRHFDEKDCEVLKEILVTYGCCEEDYFEKKDWYDPVENPDIYR 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4627 VYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMPMLTMCHALDS 4706
Cdd:pfam06478  161 VYAKLGPIVRRALLKTVAFCDAMVEAGLVGVLTLDNQDLNGNFYDFGDFVKTAPGCGVPVVDSYYSYMMPIMTMTHALAS 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4707 ELYVNGT----YREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFGPLVRQI 4782
Cdd:pfam06478  241 ECFMDSDlgkdYKKYDLLKYDFTEEKLELFDKYFKYWDQTYHPNCVDCLDDRCILHCANFNVLFSTVIPNTAFGPLVRKV 320
                          330       340       350
                   ....*....|....*....|....*....|...
gi 381354069  4783 FVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRL 4815
Cdd:pfam06478  321 FVDGVPFVVTAGYHFKELGVVMNQDVNTHSSRL 353
CoV_Methyltr_2 pfam06460
Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus ...
6878-7173 0e+00

Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus polyprotein. The SARS-CoV RNA cap SAM-dependent (nucleoside-2'-O-)-methyltransferase (2'-O-MTase) is a heterodimer comprising SARS-CoV nsp10 and nsp16. When bound to nsp10, nsp16 is active as a type-0 RNA cap-dependent 2'-O-MTase, ie., active only when the cap guanine is methylated at its N7 position. Nsp10 binds to nsp16 through an activation surface area in nsp10, and the resulting complex exhibits RNA cap (nucleoside-2'-O)-methyltransferase activity. Nsp10 is a double zinc finger protein together with nsp4, nsp5, nsp12, nsp14, and nsp16, nsp10 has been found to be essential in the assembly of a functional replication/transcription complex. Nsp16 adopts a typical fold of the S-adenosylmethionine-dependent methyltransferase (SAM) family as defined initially for the catechol O-MTase but it lacks several elements of the canonical MTase fold, such as helices B and C. The nsp16 topology matches those of dengue virus NS5 N-terminal domain and of vaccinia virus VP39 MTases.


Pssm-ID: 461919  Cd Length: 296  Bit Score: 600.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6878 AADWKPGYVMPVLYKYLESPLERVNLWNYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPG 6957
Cdd:pfam06460    1 SAAWKPGYSMPVLYKYQRMCLERCNLYNYGAGITLPSGIMMNVAKYTQLCQYLNTTTLAVPHNMRVLHLGAGSDKGVAPG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6958 SAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKL 7037
Cdd:pfam06460   81 SAVLRQWLPAGTILVDNDLNDFVSDADFSVTGDCATLYTEDKWDLIISDMYDPRTKNIDGENVSKDGFFTYLCGFIREKL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  7038 ALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGA 7117
Cdd:pfam06460  161 ALGGSIAIKITEFSWNADLYKLMGRFAWWTMFCTNVNASSSEAFLIGINYLGKPKVEIDGNTMHANYIFWRNSTVMQLSA 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069  7118 YSLFDMSKFPLKAAGTAVVSLKPDQINDLVLSLIEKGRLLVRDTRKEVFVGDSLVN 7173
Cdd:pfam06460  241 YSLFDMSKFPLKLKGTAVVNLKEDQINDMVYSLLEKGKLLIRDNGKEVFFSDSLVN 296
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3334-3627 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 575.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3334 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSDDMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3413
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3414 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3493
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3494 QLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3573
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 3574 AD--LV*DALASMTGVTVEQVLAAIKRLYSGF-QGKQILGSCVLEDELTPSDVYQQL 3627
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3359-3633 6.89e-164

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III,,. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad,.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 507.75  E-value: 6.89e-164
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3359 GLWLDDKVYCPRHVICSSDDMTdPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKP 3438
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3439 GETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQLELSTGCHTGTDLSGNFYGPYRDA 3518
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3519 QVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLV*DALASMTGVTVEQVLAAIKR 3598
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 381354069  3599 LYSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3633
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1607-1904 4.81e-160

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 497.88  E-value: 4.81e-160
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1607 NKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDAFGFDEP-QLL 1685
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1686 KYYNMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1764
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1765 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLVHCTQFNV-PF 1842
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069 1843 LICSYTPEGRKLPD-DVVAANIFTGG-SLGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1904
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2846-3227 2.73e-159

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 499.04  E-value: 2.73e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2846 FVANLICFIVLWALIPTYAVHKSDMQLPLYASFKVIENGVLRDVSVTDATSANKFNQFDQWYESTFGLAYYRTSSCPVVV 2925
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYGSVPTNSKSCPIVV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2926 AVIDqDIGHTLFNVPTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYTE 3005
Cdd:cd21473    81 GVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYDT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3006 GVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraMP 3085
Cdd:cd21473   159 GLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--GP 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3086 GTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFMMLFVF 3165
Cdd:cd21473   236 GVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYVV 314
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3166 QVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCIIYVAVVVSNHALWLF 3227
Cdd:cd21473   315 TQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
capping_2-OMTase_betaCoV_Nsp16 cd23528
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of betacoronavirus, also called ...
6905-7120 2.74e-155

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of betacoronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. The betacoronavirus (betaCoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467740  Cd Length: 216  Bit Score: 480.35  E-value: 2.74e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6905 NYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDTV 6984
Cdd:cd23528     1 NYGQPATLPTGTMMNVAKYTQLCQYLNTCTLAVPANMRVIHFGAGSDKGVAPGTAVLRQWLPTDAILVDNDLNPFVSDAD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6985 ASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFA 7064
Cdd:cd23528    81 ATYFGDCVTVPTDCKWDLIISDMYDPRTKNVGGENVSKEGFFTYLCGFIKDKLALGGSVAIKITEHSWSADLYKLMGHFA 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069 7065 FWTIFCTNVNASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGAYSL 7120
Cdd:cd23528   161 WWTVFCTNVNASSSEAFLIGINYLGKPKEEIDGNVMHANYIFWRNSTPMNLSSYSL 216
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-241 5.21e-155

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause drastic reduction or elimination of infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 480.35  E-value: 5.21e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    6 KYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDIFVDKD 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   86 PQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGVWLGNG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069  166 RFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGHRRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYTLL 241
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHKRGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3634-3920 1.09e-149

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 467.87  E-value: 1.09e-149
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3634 SKRTRVIKGTCCWILASTFLFCSIIAAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3712
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3713 TNYL*VYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLIAMVfVTMRSINHDVFSIMFLVGRLVSLVSMWYFGaNLE 3792
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3793 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3868
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3869 LGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2334-2821 7.22e-149

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 472.98  E-value: 7.22e-149
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2334 TICDFYQVtdlGYRSS------FCNGSMVCELCFSGFDMLDSYDAINVVQHVVDRRVSFDYISILKLVVELIIGYSLYTV 2407
Cdd:pfam19218    2 YPCDGYVD---GYSNSsfnksdYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2408 CFYPLFVLIGMQLLTTWLPEFFMLETMHWsarlfvfVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYK 2487
Cdd:pfam19218   79 TFVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSK 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2488 RNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEV 2567
Cdd:pfam19218  152 SARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKV 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2568 KQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVE-NEADKAGFLGAAVFYAQSLYRPMLMVEKK 2646
Cdd:pfam19218  232 EFQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNsNGSNLAQAKNACVYYSQLLCKPIKLVDSN 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2647 LITTANTGLSVSQTMFDLYVDSLLNVLDVDRKSLTSFVNAAHnslkegvqleqvmdtfvgcarrkcAIDSDVETRSITKS 2726
Cdd:pfam19218  312 LLSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNA 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2727 VMSAVNAGVDFTDESCNNLVPTYVKS-DTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKAC 2805
Cdd:pfam19218  368 VLNAHRYDVLLTDDSFNNFVPTYAKPeDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTA 447
                          490
                   ....*....|....*.
gi 381354069  2806 SKTGLKIKLTYNKQEA 2821
Cdd:pfam19218  448 KKKGVTFMLTFNTNRM 463
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2857-3212 2.25e-135

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 429.38  E-value: 2.25e-135
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2857 WALIPTYAVHKSDMQLPLYASFKVIENGVLRDVSVTDATSANKFNQFDQWYESTFGlAYYRTSSCPVVVAVIDQDIGHTL 2936
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG-SPTNSRSCPIVVGVVDEVVGRVV 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2937 FNVPTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHADGTPHPYCYTEGVMHNASLYSS 3016
Cdd:pfam19217   80 PGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYSD 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3017 LVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYramPGTFCGRNAFDL 3096
Cdd:pfam19217  160 LVPHVRYKLVDGN-YVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLSL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3097 IHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDYTSVVVINVIVWCINFMMLFVFQVYPTLSCLYA 3176
Cdd:pfam19217  236 LTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVYA 315
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 381354069  3177 CFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCI 3212
Cdd:pfam19217  316 VLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
Peptidase_C16 pfam01831
Peptidase C16 family;
1082-1330 1.23e-125

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 397.14  E-value: 1.23e-125
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1082 AFDAIYSKALSAVYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLEMQKLWLSYKAGYDQCFVDKLVK 1161
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1162 SVPRSIILPQGGYVADFAYYFLSQCSFKAHANWRCLKCDMALKLQGLDAMFFYGDVVSHMCKCGSGMTLLSADIPYTLHF 1241
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1242 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKLIDGKNVTKFTGDKFDFMVGHGMTFSMSPFETAQLYGSCITPN 1321
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 381354069  1322 VCFVKGDVI 1330
Cdd:pfam01831  241 LCFGKGDVI 249
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4013-4206 3.91e-115

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 364.49  E-value: 3.91e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4013 SEFVNMASFVEYELAKKNLDEAK*SGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4092
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4093 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4172
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 381354069 4173 DGAVKQLNEID---VNSIWPLVIAANRHNEvSTVVLQ 4206
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
4010-4205 6.78e-104

Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has N- and C-terminal D/ExD/E conserved motifs, being the N-terminal motif critical for RNA polymerase activity as these residues are part of the Mg2-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 332.20  E-value: 6.78e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4010 ALQSEFVNMASFVEYELAKKNLDEAK*SGSAnQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4089
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4090 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSI 4169
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 381354069  4170 QDADGAVKQLNEIDVNS----IWPLVIAANRHNEVstVVL 4205
Cdd:pfam08717  160 KDADGKIVHLKEITMDNspnlAWPLIVTAERANSA--VKL 197
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4317-4446 3.07e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.09  E-value: 3.07e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4317 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4396
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 381354069 4397 VEHPDVDGLCKLRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3661-3920 1.61e-83

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 276.44  E-value: 1.61e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3661 FVKWTMFMYVTTHML-GVTLCALCFVSFAMLLIKHKHLYLTMYIMPVLCTLFYTNYL*VYK-QSFRGLAYAWlshfvpAV 3738
Cdd:pfam19213    1 LLMYTALYWLPPNLItPVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3739 DYTYMDEVLYGVVLLIAMVFV--TMRSINHDvFSIMFLVGRLVSLVSMWYFGANLEEE-----VLLFLTSLFGTYTWTTM 3811
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVRSK-YSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3812 LSLATAKVIAKWlaVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRP 3891
Cdd:pfam19213  154 IAYKLAKYIVVY--VPPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 381354069  3892 PKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
NendoU_cv_Nsp15-like cd21161
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural ...
6724-6874 4.16e-81

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural Protein 15 (Nsp15) and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Except for turkey coronavirus (TCoV) Nsp15, Mn2+ is generally essential for the catalytic activity of coronavirus Nsp15. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and murine hepatitis virus (MHV) form a functional hexamer while Porcine DeltaCoronavirus (PDCoV) Nsp15 has been shown to exist as a dimer and a monomer in solution. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


Pssm-ID: 439158  Cd Length: 151  Bit Score: 264.89  E-value: 4.16e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6724 FTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSYDS 6803
Cdd:cd21161     1 FTQGRSLEDFKPRSQMERDFLSMDQDVFIQKYGLEDLGFEHIVYGDFSKPTIGGLHLLIGLVRLKKEGKLYVEEFHNSDS 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6804 SIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:cd21161    81 TVQNYFVTDANNGSSKQVCTVVDLLLDDFVDILKSQDLSVVSKVVTVSIDYKPIRFMLWCKDGKVKTFYPQ 151
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2107-2231 3.04e-79

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 258.77  E-value: 3.04e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2107 VTEVHQEPSVSAVDVKEVKLNGVKKPVKVEDSVVVNDPTSDTKVVKSLSIVDVYDMFLTGCKYVVWTANELSRLVNSPTV 2186
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 2187 REYVKWGMGKIVNSTKLLLLRDERQEFVAPKVVKAKAIACYGAVK 2231
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1941-2059 2.87e-78

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 255.45  E-value: 2.87e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1941 GKYYTKPIIKAQFRTFEK*DGVYTNFKL*GHSIAEKL*AKLGFDCDSPFVEYKITEWPTATGDV*LASDDLYVSRYLSGC 2020
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 381354069 2021 ITFGKPVVWLGHEEASLKSLTYFNRPSVVCENKFNVLPV 2059
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
CoV_NSP15_C pfam19215
Coronavirus replicase NSP15, uridylate-specific endoribonuclease; This entry represents the ...
6722-6874 2.28e-74

Coronavirus replicase NSP15, uridylate-specific endoribonuclease; This entry represents the C-terminal domain of coronavirus non-structural protein 15 (NSP15 or nsp15). NSP15 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. This domain exhibits endoribonuclease activity designated EndoU, highly conserved in all known CoVs and is part of the replicase-transcriptase complex that plays important roles in virus replication and transcription. NSP15 is a Uridylate-specific endoribonuclease that cleaves the 5'-polyuridines from negative-sense viral RNA, termed PUN RNA either upstream or downstream of uridylates, at GUU or GU to produce molecules with 2',3'-cyclic phosphate ends. PUN RNA is a CoV MDA5-dependent pathogen-associated molecular pattern (PAMP).


Pssm-ID: 465999  Cd Length: 155  Bit Score: 246.09  E-value: 2.28e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6722 TIFTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSY 6801
Cdd:pfam19215    2 TLFTQGRTLEDFVPRSTMEKDFLNMDQQQFIQKYGLEDLGFEHIVYGDFSKTTIGGLHLLISLVRLTKMGILKVEEFVPN 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069  6802 -DSSIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:pfam19215   82 dDSTVKNCSVTYANDGSSKAVCTVLDLLLDDFVDILKSLDLSVVSKVVTVNIDFQPVRFMLWCKDGKVQTFYPQ 155
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4328-4446 6.10e-73

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2-O-MTase activities. When binding to the N-terminal of NSP14, nsp10 allows the ExoN active site to adopt a stably closed conformation and is an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other Coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 240.42  E-value: 6.10e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4328 AILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRVEHPDVDGLCK 4407
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 381354069  4408 LRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4207-4316 3.75e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.21  E-value: 3.75e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 381354069 4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1945-2059 1.25e-59

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, and described as NSP3e domain. NSP3 is part of Orf1a polyproteins in SARS-CoV. It is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands and a group of residues form a positively charged patch on the protein surface as the binding site responsible for binding affinity for nucleic acids. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. A positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 202.78  E-value: 1.25e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1945 TKPIIKAQFRTFEK*DGVYTNFKL--*GHSIAEKL*AKLGFDCDSPFV-EYKITEWPTATGDV*LASDDLYVSRYLSGCI 2021
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLtcSGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 381354069  2022 TFGKPVVWLGHEEASLKSLTYFNRPSVVC-ENKFNVLPV 2059
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVClECKFNTKPV 129
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4207-4316 7.39e-58

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 196.93  E-value: 7.39e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4207 NNELMPQKLRTQVVNSGS-DMNCNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVtDAHCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 381354069  4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
M_alpha_beta_cv_Nsp15-like cd21167
middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6567-6689 8.22e-57

middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain and a C-terminal catalytic (NendoU) domain. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. This middle domain harbors residues involved in hexamer formation and in trimer stability. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of the other coronaviruses; it has been shown to exist as a dimer and a monomer in solution.


Pssm-ID: 439161  Cd Length: 127  Bit Score: 194.47  E-value: 8.22e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6567 PHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKN 6646
Cdd:cd21167     1 PVPELKLLRNLGVDICYKFVLWDYEREAPFTSSTIGVCKYTDIDKKSDLNVLFDGRDPGSLERFRSARNAVLISTTKVKG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 381354069 6647 LSMIKGPQRADLNGVVVEKVGDSDVEFWFAMRSDGDDVIFSRT 6689
Cdd:cd21167    81 LKPIKGPNYASLNGVVVESVDKKKVKFYYYVRKDGEFVDLTDT 123
ZBD_cv_Nsp13-like cd21401
Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related ...
5382-5476 1.68e-56

Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. This coronavirus family includes Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) non-structural protein 13 (SARS-Nsp13) and belongs to helicase superfamily 1 (SF1) and to a family of nindoviral replication helicases. SARS-Nsp13 has an N-terminal CH/ZBD, a stalk domain, a 1B regulatory domain, and SF1 helicase core. The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase (RdRp). Structural studies of a stable SARS-CoV-2 RTC which included two molecules of Nsp13, the RdRp holoenzyme (Nsp7, two molecules of Nsp8, Nsp12), and an RNA template product, show that one Nsp13 CH/ZBD domain interacts with Nsp12, and both Nsp13-CH/ZBD domains interact with the Nsp8. This stable SARS-CoV-2 RTC suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching.


Pssm-ID: 439168  Cd Length: 95  Bit Score: 192.22  E-value: 1.68e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5382 SVGACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNSPGCDVNDVTKLYLGGMSYYCEDHKPQYS 5461
Cdd:cd21401     1 AVGLCVVCNSQTVLRCGDCIRRPFLCCKCCYDHVMSTSHKFILSINPYVCNAPGCGVSDVTKLYLGGMSYYCEDHKPSLS 80
                          90
                  ....*....|....*
gi 381354069 5462 FKLVMNGMVFGLYKQ 5476
Cdd:cd21401    81 FPLCANGFVFGLYKN 95
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3234-3328 1.32e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 163.85  E-value: 1.32e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3234 GTEVRsdGTFEEMALTTFMITKVSYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREA*CSQLAKAMETFNhNNGN 3313
Cdd:pfam16348    1 GDKFV--GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGN 77
                           90
                   ....*....|....*
gi 381354069  3314 DVLYQPPTASVTTSF 3328
Cdd:pfam16348   78 DVLYTPPTVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1532-1606 1.28e-45

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 160.27  E-value: 1.28e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 1532 QLDDDARVFVQANMDCLPTDWRLVNKLDVVDGVRTIKYFECPGEIFVSSQGKKFGYVQNGLFKVASVSQIRALLA 1606
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
1B_cv_Nsp13-like cd21409
1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze ...
5531-5609 5.93e-45

1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. Members of this subfamily belong to helicase superfamily 1 (SF1) and include coronavirus helicases such as Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13). SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). Structural studies of a stable RTC which included the RNA-dependent RNA polymerase holoenzyme (Nsp7, two molecules of Nsp82, Nsp12), two molecules of Nsp13 helicase accessory factor and an RNA template product suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching. SARS-Nsp13 is a multidomain protein; its other domains include an N-terminal Cys/His rich zinc-binding domain (CH/ZBD) and a SF1 helicase core. The 1B domain is involved in nucleic acid substrate binding; the 1B domain of the related Equine arteritis virus (EAV) Nsp10 undergoes large conformational change upon substrate binding, and together with the 1A and 2A domains of the helicase core form a channel that accommodates the single stranded nucleic acids.


Pssm-ID: 394817  Cd Length: 79  Bit Score: 158.66  E-value: 5.93e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 381354069 5531 ASATIREIVSDRELILSWEIGKVRPPLNKNYVFTGYHFTNNGKTVLGEYVFDKSELTNGVYYRATTTYKLSVGDVFILT 5609
Cdd:cd21409     1 ASATVKEVVGPRELVLSWEAGKTKPPLNRNYVFTGYHITKNSKTQLGEYTFEKSDYSDSVYYKSTTTYKLQPGDIFVLT 79
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
851-940 6.18e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 6.18e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  851 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHDViGTKVCALLDRLAEDYVYLFDEGGDE 930
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 381354069  931 VIAPRMYCSF 940
Cdd:cd21467    80 VLASEMYCSF 89
CoV_NSP15_M pfam19216
Coronavirus replicase NSP15, middle domain; This entry represents the non-catalytic middle ...
6564-6681 6.67e-36

Coronavirus replicase NSP15, middle domain; This entry represents the non-catalytic middle domain from coronavirus non-structural protein 15 (NSP15). NSP15 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. This domain is formed by ten beta strands organized into three beta hairpins.


Pssm-ID: 466000  Cd Length: 118  Bit Score: 134.38  E-value: 6.67e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6564 SIRPHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCiESLNVLFDGRDNGALEAFKKCRNGVYINTTK 6643
Cdd:pfam19216    1 NVGLTPPLKLLRNLGVTATYNFVLWDYENERPFTNYTINVCKYTDIIN-EDVCVLYDNRIKGSLERFCQLKNAVLISPTK 79
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 381354069  6644 IKNLSMIKGPQRADLNGVVVEKVGDSDVEFWFAMRSDG 6681
Cdd:pfam19216   80 IKKLVAIKIPNYGYLNGVPVSTTEKKPVTFYIYVRKNG 117
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3921-4001 7.49e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 132.95  E-value: 7.49e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4000
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  .
gi 381354069 4001 Y 4001
Cdd:cd21827    79 P 79
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1606-1918 7.89e-36

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode between one and two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encodes only one called PL-PRO. The structure of this protein has shown it adopts a fold similar that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 141.27  E-value: 7.89e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1606 ANKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDA---FGFDEP 1682
Cdd:pfam08715    2 CKQITIYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVPTTDWVEFYGFKSIleyYTLDAS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1683 QLLKYYNMLgmcKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKF 1762
Cdd:pfam08715   82 KYVIYLSAL---TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1763 NEPSDSTDFIRVVLREADLSGATCDLEFI--CKCGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLV-HCTQFN 1839
Cdd:pfam08715  159 GDFGDANWTLTNLAEHFDAEYTNAFLKKRvcCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGANNTdEVIEAS 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1840 VPFLICSYT--PEGRKLPDDVVAANIFTGG-SLGHYTHVKCKPkyQLYDACNVSKVSEAKGNFTDCLYLKNLKQTFSSVL 1916
Cdd:pfam08715  239 LPYLLLSATdgPAAVDCLEDGVGTVAFVGStNSGHYTYQTAKQ--AFYDGAKDRKFGKKSPYVTAVYTRFAFKNETSLPV 316

                   ..
gi 381354069  1917 AT 1918
Cdd:pfam08715  317 AK 318
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3921-4008 1.56e-35

Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA replication and is predominantly alpha helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 131.80  E-value: 1.56e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4000
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*...
gi 381354069  4001 YVRDNTVL 4008
Cdd:pfam08716   75 YLENRTIL 82
NTD_alpha_betaCoV_Nsp15-like cd21171
N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6503-6563 3.48e-31

N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Coronavirus (CoV) Nsp15 is a nidovirus endoribonuclease (NendoU). NendoUs are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include CoV Nsp15 and arterivirus Nsp11, both of which may participate in the viral replication process and in the evasion of the host immune system. This small NTD structure, present in coronavirus Nsp15, is missing in Nsp11. CoV Nsp15 has an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Nsp15 from Severe Acute Respiratory Syndrome (SARS)-CoV, human CoV229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Residues in this N-terminal domain are important for hexamer (dimer of trimers) formation.


Pssm-ID: 439163  Cd Length: 61  Bit Score: 118.82  E-value: 3.48e-31
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6503 SLENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:cd21171     1 SLENVAYNVVKKGHFVGVEGELPVAIVNDKVFVKDGGVDVLVFTNKTSLPTNVAFELYAKR 61
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
670-831 4.25e-23

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 99.26  E-value: 4.25e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   670 LVNGLFAVANGVITFVqeAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSkVYEVVQKSLSAYVLP 749
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSNA-LVKVVSVKLKGKKQA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   750 V--GCSEATCLVGESEPAVfeDDVVGVVKTPLTYQGCCKPPTSFEKICIVDKLYMAKCGDQFYPVVvdnDTVGVLDQCWR 827
Cdd:pfam19212   78 GlkGAKEATVFVGATVPVT--PTRVEVVTVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPAS---TDGVVVPPVFK 152

                   ....
gi 381354069   828 FPCA 831
Cdd:pfam19212  153 LKGG 156
stalk_CoV_Nsp13-like cd21689
stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the ...
5480-5527 8.93e-23

stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the stalk domain of coronavirus non-structural protein 13 (Nsp13) helicase, found in the Nsp3s of alpha-, beta-, gamma-, and deltacoronaviruses, including Severe Acute Respiratory Syndrome coronavirus (SARS-CoV), SARS-CoV-2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome coronavirus (MERS-CoV). Helicases are classified based on the arrangement of conserved motifs into six superfamilies; coronavirus helicases in this family belong to superfamily 1 (SF1). Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It consists of an N-terminal ZBD (Cys/His rich zinc-binding domain), a stalk domain, a 1B regulatory domain, and SF1 helicase core. The stalk domain lies between the ZBD domain and the 1B domain; a short loop connects the ZBD to the stalk domain. The stalk domain is comprised of three tightly-interacting alpha-helices connected to the 1B domain, transferring the effect from the ZBD domain onto the helicase core domains. The ZBD and stalk domains are critical for the helicase activity of SARS-CoV Nsp13.


Pssm-ID: 410205  Cd Length: 48  Bit Score: 94.21  E-value: 8.93e-23
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5480 GSPYIEDFNKIASCKWTEVDDYALANECTERLKLFAAETQKATEEAFK 5527
Cdd:cd21689     1 GSPDVDDFNRLATSDWSDVEDYKLANTCKDSLKLFAAETIKAKEESVK 48
DNA2 COG1112
Superfamily I DNA and/or RNA helicase [Replication, recombination and repair];
5749-5970 2.05e-22

Superfamily I DNA and/or RNA helicase [Replication, recombination and repair];


Pssm-ID: 440729 [Multi-domain]  Cd Length: 819  Bit Score: 107.14  E-value: 2.05e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5749 DIIVVDEVSMLTNYE-LSVInsrVRAKHYVYIGDPAQLPaPRVLLNKGTLEPRYF--NSVTKLMCCLGPD--IFLGTCYR 5823
Cdd:COG1112   557 DLVIIDEASQATLAEaLGAL---ARAKRVVLVGDPKQLP-PVVFGEEAEEVAEEGldESLLDRLLARLPErgVMLREHYR 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5824 CPKEIVDTVSALVYNNKLKA---------KNDNSSMCFkVYYKGQTTHESSSAVNMQQIHLISKFLKAN-----PSWSNA 5889
Cdd:COG1112   633 MHPEIIAFSNRLFYDGKLVPlpspkarrlADPDSPLVF-IDVDGVYERRGGSRTNPEEAEAVVELVRELledgpDGESIG 711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5890 VfISPYNSQ-NYVAKRVLGLQTQ--------TVDSAQGSEYDFVI----YSQTAETAHSV-----NVNRFNVAITRAKKG 5951
Cdd:COG1112   712 V-ITPYRAQvALIRELLREALGDglepvfvgTVDRFQGDERDVIIfslvYSNDEDVPRNFgflngGPRRLNVAVSRARRK 790
                         250
                  ....*....|....*....
gi 381354069 5952 iLCVMSSMQLFESLNFTTL 5970
Cdd:COG1112   791 -LIVVGSRELLDSDPSTPA 808
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1339-1463 1.34e-21

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, activity was modest and was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 93.77  E-value: 1.34e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1339 EVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETaDMVKNQGVCLVGECYESAGGKLCKKVLNIVGPDARGQgrQCYSLLE 1418
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 1419 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1463
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
CoV_NSP15_N pfam19219
Coronavirus replicase NSP15, N-terminal oligomerization; This is the N-terminal domain of the ...
6503-6563 3.96e-20

Coronavirus replicase NSP15, N-terminal oligomerization; This is the N-terminal domain of the coronavirus nonstructural protein 15 (NSP15), which is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP15, is a nidoviral RNA uridylate-specific endoribonuclease (NendoU) carrying C-terminal catalytic domain belonging to the EndoU family. The SARS-CoV-2 NendoU monomers assemble into a double-ring hexamer, generated by a dimer of trimers. The hexamer is stabilized by the interactions of N-terminal oligomerization domain.


Pssm-ID: 466003  Cd Length: 61  Bit Score: 87.36  E-value: 3.96e-20
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069  6503 SLENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:pfam19219    1 SLENLAYNVVKKGHFVGVDGELPVAIVNDKVFVKVGGVDVLLFENKTSLPTNVAFELYAKR 61
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
967-1073 4.85e-17

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 86.92  E-value: 4.85e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   967 SVVLVADAQE-DGVAKEQVE-VDSEICVAH---TGGqdELTEPDAVGSQTPIASAEKTEVGEAS--DREGIAEAKR---- 1035
Cdd:pfam11963  238 AYALLRGYRGvKPVLFVDQYgCDYTGCLADgleAYG--DYTLQDMKQLQPVWLANLDFDVVVAWhvVRDPRAVMRLqtia 315
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 381354069  1036 TVCADDLDACP--DQVEAFEIEEVEDSILDELQTELNAPS 1073
Cdd:pfam11963  316 TICGIAYVAQPteDVVDGDVVIKEPVHLLSADAIVLRLPS 355
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1321-1451 1.17e-16

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson _ Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 80.04  E-value: 1.17e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   1321 NVCFVKGDVIKVarlvEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADmVKNQGVCLVGECYESAGGKL-CKKVL 1399
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKALSKEEVR-KLAGGECPVGTAVVTEGGNLpAKYVI 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 381354069   1400 NIVGPDARGQGRQCYSLLERAYQ------HINKCDNVVTTLISAGIFSVPTDVSLTYL 1451
Cdd:smart00506   76 HAVGPRASGHSKEGFELLENAYRnclelaIELGITSVALPLIGTGIYGVPKDRSAQAL 133
AAA_12 pfam13087
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
5807-5955 2.34e-13

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins.


Pssm-ID: 463780 [Multi-domain]  Cd Length: 196  Bit Score: 72.58  E-value: 2.34e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5807 KLMCCLGPD--IFLGTCYRCPKEIVDTVSALVYNNKLKA-KNDNSSMCFKVYY----------------KGQTTHESSSA 5867
Cdd:pfam13087    7 ERLQELGPSavVMLDTQYRMHPEIMEFPSKLFYGGKLKDgPSVAERPLPDDFHlpdplgplvfidvdgsEEEESDGGTSY 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5868 VNMQQI----HLISKFLKANPS-WSNAVFISPYNSQNYVAKRVL--------GLQTQTVDSAQGSEYDFVIYSqT--AET 5932
Cdd:pfam13087   87 SNEAEAelvvQLVEKLIKSGPEePSDIGVITPYRAQVRLIRKLLkrklggklEIEVNTVDGFQGREKDVIIFS-CvrSNE 165
                          170       180
                   ....*....|....*....|....*..
gi 381354069  5933 AHSV----NVNRFNVAITRAKKGiLCV 5955
Cdd:pfam13087  166 KGGIgflsDPRRLNVALTRAKRG-LII 191
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1342-1445 5.26e-10

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 60.27  E-value: 5.26e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1342 VNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKnqGVCLVGECYESAGGKL-CKKVLNIVGPDARGQGRQ-CYSLLER 1419
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELKK--GGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHgEEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 381354069  1420 AYQHI------NKCDNVVTTLISAGIFSVPTD 1445
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1322-1443 3.19e-04

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 45.17  E-value: 3.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1322 VCFVKGDVIKVarlvEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKnQGVCLVGECYESAGGKL-CKKVLN 1400
Cdd:COG2110     1 IEIVQGDITEL----DVDAIVNAANSSLLGGGGVAGAIHRAAGPELLEECRRLCK-QGGCPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 1401 IVGPDARGQGRQCYSLLERAYQHI------NKCDNVVTTLISAGIFSVP 1443
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaeeLGIRSIAFPAIGTGVGGFP 124
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
5662-5690 1.62e-03

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 44.00  E-value: 1.62e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 381354069 5662 GPPGTGKSHLAIGLAVYYCTA--RVVYTAAS 5690
Cdd:NF038214   97 GPPGTGKTHLAIALGYAACRQgyRVRFTTAA 127
PRK06526 PRK06526
transposase; Provisional
5662-5702 2.97e-03

transposase; Provisional


Pssm-ID: 180607  Cd Length: 254  Bit Score: 43.32  E-value: 2.97e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 381354069 5662 GPPGTGKSHLAIGLAVYYCTA--RVVYTAASHaAVDALCEKAH 5702
Cdd:PRK06526  105 GPPGTGKTHLAIGLGIRACQAghRVLFATAAQ-WVARLAAAHH 146
 
Name Accession Description Interval E-value
HCoV_HKU1-like_RdRp cd21593
human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, ...
4457-5381 0e+00

human coronavirus HKU1 RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the A lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of human coronavirus HKU1, murine hepatitis virus, and similar proteins from betacoronaviruses in the embecovirus subgenera (A lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394897  Cd Length: 925  Bit Score: 2065.32  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKE 4536
Cdd:cd21593     1 TNFLNRVRGTSVNARLVPCASGLSTDVQLRAFDICNANRAGIGLYYKVNCCRFQRLDEDGNKLDKFFVVKRTNLEVYNKE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4537 KECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWY 4616
Cdd:cd21593    81 KECYELTKSCGVVAEHEFFTFDVDGSRVPHIVRKDLSKYTMLDLCYALRHFDRNDCSTLCEILSMYAECDESYFTKKDWY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4617 DFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 4696
Cdd:cd21593   161 DFVENPDIINVYKKLGPIFNRALVNTAKFADTLVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4697 MLTMCHALDSELYVNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFG 4776
Cdd:cd21593   241 MLTMCHALDCELFVNDTYRQFDLVQYDFTDYKLELFNKYFKYWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPNTCFG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4777 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 4856
Cdd:cd21593   321 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4857 QTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYEGGC 4936
Cdd:cd21593   401 QTVKPGNFNQDFYDFILSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGC 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4937 IPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 5016
Cdd:cd21593   481 IPASQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDDIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5017 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 5096
Cdd:cd21593   561 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5097 HTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 5176
Cdd:cd21593   641 HGDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5177 YSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETDIEK 5256
Cdd:cd21593   721 YSNVYRSDYVDPTFVNEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMSESKCWVETDINN 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5257 GPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVYLEY 5336
Cdd:cd21593   801 GPHEFCSQHTMLVKMDGDYVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVYHENEEYQNVFRVYLEY 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 5337 IKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21593   881 IKKLYNDLGNQILDSYSVILSTCDGQKFTDESFYKNMYLRSAVMQ 925
betaCoV_RdRp cd21589
betacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: ...
4457-5381 0e+00

betacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible for replication and transcription of the viral RNA genome; This subfamily contains the RNA-dependent RNA polymerase (RdRp) of betacoronaviruses, including the RdRps from three highly pathogenic human coronaviruses (CoVs) such as Middle East respiratory syndrome (MERS)-related CoV, Severe acute respiratory syndrome (SARS) CoV, and SARS-CoV-2, also known as 2019 novel CoV (2019-nCoV) or COVID-19 virus. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir, which shows potential for the treatment of SARS-CoV-2 viral infections. The structure of SARS-CoV-2 Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438016  Cd Length: 925  Bit Score: 2034.33  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKE 4536
Cdd:cd21589     1 TNFLNRVRGTSVNARLVPCASGLSTDVQLRAFDICNANVAGIGLYYKVNCCRFQRLDEDGNKLDKFFVVKRTNLEVYNKE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4537 KECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWY 4616
Cdd:cd21589    81 KECYELLKDCGVVAEHDFFTFDVDGSRVPHIVRKDLTKYTMLDLCYALRHFDRNDCSTLKEILVTYAECDESYFTKKDWY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4617 DFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 4696
Cdd:cd21589   161 DFVENPDIINVYKKLGPIFNRALLNTAKFADAMVEAGLVGVLTLDNQDLNGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4697 MLTMCHALDSELYVNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFG 4776
Cdd:cd21589   241 MLTMCHALDCELFVNKPYRQFDLVQYDFTDYKLELFNKYFKYWSMTYHPNTVECEDDRCIIHCANFNILFSMVLPNTCFG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4777 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 4856
Cdd:cd21589   321 PLVRQIFVDGVPFVVSIGYHYKELGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4857 QTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYEGGC 4936
Cdd:cd21589   401 QTVKPGNFNQDFYDFILSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGC 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4937 IPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 5016
Cdd:cd21589   481 IPASQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMT 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5017 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCCS 5096
Cdd:cd21589   561 GRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDTCCS 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5097 HTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRL 5176
Cdd:cd21589   641 HSDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVTANVCSLMACNGNKIEDLSIRELQKRL 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5177 YSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETDIEK 5256
Cdd:cd21589   721 YSNVYRSDYVDPTFVNEYYEFLNKHFSMMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMSESKCWVETDINK 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5257 GPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVYLEY 5336
Cdd:cd21589   801 GPHEFCSQHTMLVKMDGDYVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVYHENPEYQNVFRVYLEY 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 5337 IKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21589   881 IKKLYNDLGNQILDSYSVILSTCDGQKFTDESFYKNMYLRSAVMQ 925
MERS-CoV-like_RdRp cd21592
Middle East respiratory syndrome-related coronavirus RNA-dependent RNA polymerase, also known ...
4457-5381 0e+00

Middle East respiratory syndrome-related coronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the C lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of Middle East respiratory syndrome (MERS)-related CoV, bat-CoV HKU5, and similar proteins from betacoronaviruses in the merbecovirus subgenera (C lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir, which has been shown to potently inhibit MERS RdRp. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394896  Cd Length: 931  Bit Score: 1529.98  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICN--ANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYN 4534
Cdd:cd21592     1 SNFLNRVRGSIVNARIEPCASGLSTDVVFRAFDICNykAKVAGIGKYYKTNTCRFVELDDQGHKLDSYFVVKRHTMENYE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4535 KEKECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKD 4614
Cdd:cd21592    81 LEKHCYDLLKDCDAVARHDFFVFDVDKVKTPHIVRQRLTEYTMMDLVYALRHFDQNNCEVLKSILVKYGCCDASYFDNKL 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4615 WYDFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYM 4694
Cdd:cd21592   161 WFDFVENPSVIGVYHKLGERVRQAVLNTVKFCDHMVKAGLVGVLTLDNQDLNGKWYDFGDFVITQPGSGVAIVDSYYSYL 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4695 MPMLTMCHALDSELY----VNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVL 4770
Cdd:cd21592   241 MPVLSMTDCLAAETHrdcdFNKPLIEWPLTEYDFTDYKVQLFEKYFKHWDQTYHANCVNCADDRCVLHCANFNVLFSMTL 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4771 PKTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAI 4850
Cdd:cd21592   321 PKTCFGPIVRKIFVDGVPFVVSCGYHYKELGLVMNMDVSLHRHRLSLKELMMYAADPAMHIASSNALLDLRTSCFSVAAL 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4851 TSGVKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFE 4930
Cdd:cd21592   401 TTGLTFQTVRPGNFNQDFYDFVVSKGFFKEGSSVTLKHFFFAQDGHAAITDYNYYSYNLPTMCDIKQMLFCMEVVNKYFE 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4931 IYEGGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVS 5010
Cdd:cd21592   481 IYDGGCLNASEVVVNNLDKSAGYPFNKFGKARVYYESMSYQEQDELFAMTKRNVIPTITQMNLKYAISAKNRARTVAGVS 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5011 ILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARK 5090
Cdd:cd21592   561 ILSTMTNRQYHQKMLKSMAATRGATCVIGTTKFYGGWDFMLKTLYKDVDNPHLMGWDYPKCDRAMPNMCRIFASLILARK 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5091 HDSCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIR 5170
Cdd:cd21592   641 HGTCCTTRDRFYRLANECAQVLSEYVLCGGGYYVKPGGTSSGDATTAYANSVFNILQATTANVSALMGANGNKIVDKEVK 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5171 ELQKRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWV 5250
Cdd:cd21592   721 DMQFDLYVNVYRNSKPDPKFVDKYYAFLNKHFSMMILSDDGVVCYNSDYAAKGYIAGIQNFKETLYYQNNVFMSEAKCWV 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5251 ETDIEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVF 5330
Cdd:cd21592   801 EPDLKKGPHEFCSQHTLYIKDGDDGYFLPYPDPSRILSAGCFVDDIVKTDGTLMVERFVSLAIDAYPLTKHEDIEYQNVF 880
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|.
gi 381354069 5331 RVYLEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21592   881 WVYLQYIEKLYKDLTGHMLDSYSVMLCGDNSAKFWEESFYRDLYSAPTTLQ 931
CoV_RdRp cd21530
coronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible ...
4458-5381 0e+00

coronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible for replication and transcription of the viral RNA genome; This family contains the RNA-dependent RNA polymerase of alpha-, beta-, gamma-, delta-coronaviruses, including three highly pathogenic human coronaviruses (CoVs) such as Middle East respiratory syndrome (MERS)-related CoV, Severe acute respiratory syndrome (SARS) CoV, and SARS-CoV-2, also known as 2019 novel CoV (2019-nCoV) or COVID-19 virus. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir, which shows potential for the treatment of SARS-CoV-2 viral infections. The structure of SARS-CoV-2 Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438015  Cd Length: 928  Bit Score: 1492.02  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4458 NFLKRVRGTSvNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEK 4537
Cdd:cd21530     2 SYLNRVRGSS-AARLTPLGNGTDPDVVKRAFDIYNDKVAGFFKFLKTNCARFQEKRENDNLIDSYFVVKRCTFSNYEHEE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4538 ECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYD 4617
Cdd:cd21530    81 TCYNLLKDCGALAKHDFFKFRKDGDMVPNISRQRLTKYTMMDLVYALRHFDEGNCDVLKEILVTYGCCDDDYFNKKDWYD 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4618 FVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMPM 4697
Cdd:cd21530   161 PVENPDIYRVYAKLGEIVRRALLKAVQFCDAMVNAGIVGVLTLDNQDLNGNFYDFGDFIQTTPGSGVPVVDSYYSYLMPI 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4698 LTMCHALDSELYVN----GTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKT 4773
Cdd:cd21530   241 MTLTRALAAECHVDtdltKPYKKYDLLKYDFTEEKLKLFDKYFKYWDQTYHPNCVDCLDDRCVLHCANFNVLFSTVIPPT 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4774 CFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSG 4853
Cdd:cd21530   321 SFGPLCRKVFVDGVPFVVTTGYHFKELGVVHNQDVNTHSSRLSLKELLVFVGDPALIAASSNLLLDLRTTCFSVAALSSG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4854 VKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYE 4933
Cdd:cd21530   401 IAFQTVKPGHFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMLDIRQLLFCLEVVDKYFDCYE 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4934 GGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILS 5013
Cdd:cd21530   481 GGCINANQVVVTNLDKSAGFPFNKFGKARLYYDSMSYEEQDALFAYTKRNVLPTITQMNLKYAISAKNRARTVAGVSILS 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5014 TMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDS 5093
Cdd:cd21530   561 TMTNRQFHQKLLKSIVNTRNATVVIGTTKFYGGWDNMLRTLYSGVENPMLMGWDYPKCDRAMPNMLRIAASLVLARKHTN 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5094 CCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQ 5173
Cdd:cd21530   641 CCTLSHRFYRLANECAQVLSEVVMSGGGLYVKPGGTSSGDATTAYANSVFNICQAVSANVNRLLSTDTNSIANKYVRDLQ 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5174 KRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETD 5253
Cdd:cd21530   721 RRLYECLYRNRSVDTDFVNEFYAYLRKHFSMMILSDDGVVCYNSTYAKQGLVADISGFKSILYYQNNVFMSDSKCWTETD 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5254 IEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVY 5333
Cdd:cd21530   801 LTKGPHEFCSQHTMLVEQDDDPYYLPYPDPSRILGAGVFVDDVVKTDPVLMLERYVSLAIDAYPLTKHPNQEYAKVFYLL 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5334 LEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21530   881 LDYIRKLHQELTGGMLDMYSVMLDNDNTSKFWEEEFYEAMYEPSTTLQ 928
alphaCoV_RdRp cd21588
alphacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: ...
4457-5381 0e+00

alphacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible for replication and transcription of the viral RNA genome; This subfamily contains the RNA-dependent RNA polymerase (RdRp) of alphacoronaviruses, including human coronaviruses (HCoVs), HCoV-NL63, and HCoV-229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394892  Cd Length: 924  Bit Score: 1431.43  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4457 TNFLKRVRGTSvNARLVPCaSGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDedgnKLDKFFVVKRTNLEVYNKE 4536
Cdd:cd21588     1 QSYLNRVRGSS-AARLEPC-NGTDTDHVVRAFDIYNKDVACIGKFLKVNCVRFKNLD----KHDAFYVVKRCTKSVMEHE 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4537 KECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWY 4616
Cdd:cd21588    75 QSIYNLLKDSGAVAEHDFFTWKDGRSIYGNVCRQDLTKYTMMDLCYALRNFDEKNCEVLKEILVLTGACDESYFDNKNWF 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4617 DFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMP 4696
Cdd:cd21588   155 DPVENEDIHRVYAKLGKIVANAMLKCVALCDAMVEKGIVGVLTLDNQDLNGNFYDFGDFVKTIPGMGVPCCTSYYSYMMP 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4697 MLTMCHALDSELYVNGT-----YREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLP 4771
Cdd:cd21588   235 VMGMTNCLASECFVKSDifgsdFKTYDLLEYDFTEHKEKLFNKYFKYWGQDYHPNCVDCYDDMCIVHCANFNTLFSTTIP 314
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4772 KTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAIT 4851
Cdd:cd21588   315 NTAFGPLCRKVFIDGVPLVTTAGYHFKQLGIVWNKDLNTHSSRLSINELLRFVTDPALLVASSPALVDQRTVCFSVAALS 394
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4852 SGVKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEI 4931
Cdd:cd21588   395 TGMTYQTVKPGHFNKEFYDFLREQGFFEEGSELTLKHFFFAQKGDAAIKDFDYYRYNRPTVLDICQARVVYKVVQRYFDI 474
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4932 YEGGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSI 5011
Cdd:cd21588   475 YEGGCITAREVVVTNLNKSAGYPLNKFGKAGLYYESLSYEEQDALYALTKRNVLPTMTQLNLKYAISGKERARTVGGVSL 554
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5012 LSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKH 5091
Cdd:cd21588   555 LSTMTTRQYHQKHLKSIVNTRNATVVIGTTKFYGGWDNMLKNLIDGVDNPCLMGWDYPKCDRALPNMIRMISAMILGSKH 634
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5092 DSCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRE 5171
Cdd:cd21588   635 VTCCTHSDRFYRLCNELAQVLTEVVYSNGGFYLKPGGTTSGDATTAYANSVFNIFQAVSANVNRLLSVDSNTCNNLTVKS 714
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5172 LQKRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVE 5251
Cdd:cd21588   715 LQRKLYDNCYRSSSVDDSFVDEYYGYLRKHFSMMILSDDGVVCYNKDYASLGYVADISAFKATLYYQNNVFMSTSKCWVE 794
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5252 TDIEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFR 5331
Cdd:cd21588   795 PDLNKGPHEFCSQHTMQIVDKDGTYYLPYPDPSRILSAGVFVDDIVKTDAVILLERYVSLAIDAYPLSKHPNPEYRKVFY 874
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|
gi 381354069 5332 VYLEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21588   875 VLLDWVKHLYKTLNQGVLESFSVTLLEDSSSKFWDESFYASMYEKSTVLQ 924
batCoV-HKU9-like_RdRp cd21596
Bat coronavirus HKU9 RNA-dependent RNA polymerase, also known as non-structural protein 12, ...
4458-5381 0e+00

Bat coronavirus HKU9 RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the D lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of bat coronavirus HKU9 and similar proteins from betacoronaviruses in the nobecovirus subgenera (D lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394898  Cd Length: 929  Bit Score: 1417.11  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4458 NFLKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEK 4537
Cdd:cd21596     2 CFLNRVRGTSGVARLVPLGSGVQPDVVLRAFDICNTKVAGFGLHLKNNCCRYQELDADGNQLDSYFVVKRHTESNYLLEQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4538 ECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYD 4617
Cdd:cd21596    82 RCYEKLKDCGVVARHDFFKFNIEGVMTPHVSRERLTKYTMADLVYSLRHFDNNNCDTLKEILVLRGCCTVDYFDKKDWYD 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4618 FVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMPM 4697
Cdd:cd21596   162 PVENPDIIRVYHKLGETVRKAVLSAVKMADAMVEQGLIGVLTLDNQDLNGQWYDFGDFIEGPAGAGVAVMDTYYSLAMPV 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4698 LTMCHALDSELYVNGTY----REFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKT 4773
Cdd:cd21596   242 YTMTNMLAAECHVDGDLskpkRVWDICKYDYTQFKYSLFSKYFKYWDMQYHPNCVACADDRCILHCANFNILFSMVLPNT 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4774 CFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSG 4853
Cdd:cd21596   322 SFGPLVQKIYVDGVPFVVSTGYHYRELGVVMNQDVRQHAQRLSLRELLVYAADPAMHVAASNALADKRTVCMSVAAMTTG 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4854 VKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYE 4933
Cdd:cd21596   402 VTFQTVKPGQFNEDFYKFAIKCGFFKEGSSISFKHFFYAQDGNAAISDYDYYRYNLPTMCDIKQLLFSLEVVDKYFDCYD 481
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4934 GGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILS 5013
Cdd:cd21596   482 GGCLQASQVVVANYDKSAGFPFNKFGKARLYYESLSYADQDELFAYTKRNVLPTITQMNLKYAISAKNRARTVAGVSIAS 561
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5014 TMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDS 5093
Cdd:cd21596   562 TMTNRQFHQKMLKSIAAARGASVVIGTTKFYGGWNRMLRTLCEGVENPHLMGWDYPKCDRAMPNLLRIFASLILARKHST 641
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5094 CCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQ 5173
Cdd:cd21596   642 CCNASERFYRLANECAQVLSEMVLCGGGFYVKPGGTSSGDSTTAYANSVFNICQAVSANLNTFLSIDGNKIYTTYVQELQ 721
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5174 KRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETD 5253
Cdd:cd21596   722 RRLYLGIYRSNTVDNELVLDYYNYLRKHFSMMILSDDGVVCYNADYAQKGYVADIQGFKELLYFQNNVFMSEAKCWVEPD 801
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5254 IEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVY 5333
Cdd:cd21596   802 ITKGPHEFCSQHTMLVDMNGEQVYLPYPDPSRILGAGCFVDDLLKTDGTLMMERYVSLAIDAYPLTKHSDPEYQNVFWCY 881
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5334 LEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21596   882 LQYIKKLHEELTGHLLDTYSVMLASDNASKYWEVDFYENMYMESATLQ 929
gammaCoV_RdRp cd21587
gammacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: ...
4458-5381 0e+00

gammacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible for replication and transcription of the viral RNA genome; This subfamily contains the RNA-dependent RNA polymerase (RdRp) of gammacoronaviruses, including the RdRp of avian infectious bronchitis virus (IBV) and similar proteins. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394891  Cd Length: 931  Bit Score: 1318.73  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4458 NFLKRVRGTSvNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNK----LDKFFVVKRTNLEVY 4533
Cdd:cd21587     2 NYLNRVRGSS-EARLIPLANGCDPDVVKRAFDVCNKESAGMFQNLKRNCARFQEVRDTEDGnleyCDSYFVVKQTTPSNY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4534 NKEKECYELTKEcGVVAEHEFFTFDvegSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYaECGESYFQK- 4612
Cdd:cd21587    81 EHEKACYEDLKS-EVTADHDFFVFN---KNIYNISRQRLTKYTMMDFCYALRHFDPKDCEVLKEILVTY-GCIEDYHPKw 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4613 ----KDWYDFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVAD 4688
Cdd:cd21587   156 feenKDWYDPIENPKYYAMLAKMGPIVRRALLNAVEFGNLMVEKGYVGVVTLDNQDLNGKFYDFGDFQKTAPGAGVPVFD 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4689 SYYSYMMPMLTMCHALDSELY----VNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNI 4764
Cdd:cd21587   236 TYYSYMMPIIAMTDALAPERYfeydVHKGYKSYDLLKYDYTEEKQELFQKYFKYWDQEYHPNCRDCSDDRCLIHCANFNI 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4765 LFSMVLPKTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCC 4844
Cdd:cd21587   316 LFSTLIPQTSFGNLCRKVFVDGVPFIATCGYHSKELGVIMNQDNTMSFSKMGLSQLMQFVGDPALLVGTSNNLVDLRTSC 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4845 FSVAAITSGVKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEV 4924
Cdd:cd21587   396 FSVCALASGITHQTVKPGHFNKDFYDFAEKAGMFKEGSSIPLKHFFYPQTGNAAINDYDYYRYNRPTMFDIRQLLFCLEV 475
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4925 VNKYFEIYEGGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEaLSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRAR 5004
Cdd:cd21587   476 TSKYFECYEGGCIPASQVVVNNLDKSAGYPFNKFGKARLYYE-MSLEEQDQLFESTKKNVLPTITQMNLKYAISAKNRAR 554
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5005 TVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSS 5084
Cdd:cd21587   555 TVAGVSILSTMTNRQFHQKVLKSIVNTRNAPVVIGTTKFYGGWDNMLRNLIQGVEDPILMGWDYPKCDRAMPNLLRIAAS 634
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5085 LVLARKHDSCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKI 5164
Cdd:cd21587   635 LVLARKHTNCCTWSERIYRLYNECAQVLSETVLATGGIYVKPGGTSSGDATTAYANSVFNIIQATSANVARLLSVITRDI 714
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5165 EDLSIRELQKRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMS 5244
Cdd:cd21587   715 VYDDIKSLQYELYQQVYRRVNFDPAFVEKFYSYLCKNFSLMILSDDGVVCYNNTLAKQGLVADISGFREILYYQNNVYMA 794
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5245 EAKCWVETDIEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENP 5324
Cdd:cd21587   795 DSKCWVEPDLEKGPHEFCSQHTMLVEVDGEPKYLPYPDPSRILGACVFVDDVDKTEPVAVMERYIALAIDAYPLVHHENE 874
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 5325 EYQNVFRVYLEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21587   875 EYKKVFFVLLSYIRKLYQELSQNMLMDYSFVMDIDKGSKFWEQEFYENMYRAPTTLQ 931
SARS-CoV-like_RdRp cd21591
Severe acute respiratory syndrome coronavirus RNA-dependent RNA polymerase, also known as ...
4458-5381 0e+00

Severe acute respiratory syndrome coronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12, and similar proteins from betacoronaviruses in the B lineage: responsible for replication and transcription of the viral RNA genome; This group contains the RNA-dependent RNA polymerase (RdRp) of Severe acute respiratory syndrome coronavirus (SARS-CoV), SARS-CoV-2 (also known as 2019 novel CoV (2019-nCoV) or COVID-19 virus), and similar proteins from betacoronaviruses in the sarbecovirus subgenera (B lineage). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir, which shows potential for the treatment of SARS-CoV-2 viral infections. The structure of SARS-CoV-2 Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. Recent studies have shown that the SARS-CoV-2 RdRp requires two iron-sulfur clusters to function optimally. Earlier studies had mistakenly identified these iron-sulfur cluster binding sites for zinc-binding sites, likely because iron-sulfur clusters degrade easily under standard experimental conditions.The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394895  Cd Length: 928  Bit Score: 1295.43  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4458 NFLKRVRGTSVnARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEK 4537
Cdd:cd21591     2 SFLNRVCGVSA-ARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDGNLIDSYFVVKRHTFSNYQHEE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4538 ECYELTKECGVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYD 4617
Cdd:cd21591    81 TIYNLLKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYD 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4618 FVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMPM 4697
Cdd:cd21591   161 FVENPDILRVYANLGERVRQALLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPIVDSYYSLLMPI 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4698 LTMCHALDSELYVN----GTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKT 4773
Cdd:cd21591   241 LTLTRALTAESHVDtdltKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPPT 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4774 CFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSG 4853
Cdd:cd21591   321 SFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNN 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4854 VKFQTVKPGNFNQDFYEFILSKGLFKEGSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVNKYFEIYE 4933
Cdd:cd21591   401 VAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYD 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4934 GGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILS 5013
Cdd:cd21591   481 GGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICS 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5014 TMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLARKHDS 5093
Cdd:cd21591   561 TMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTT 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5094 CCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQ 5173
Cdd:cd21591   641 CCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQ 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5174 KRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMSEAKCWVETD 5253
Cdd:cd21591   721 HRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETD 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5254 IEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVHHENPEYQNVFRVY 5333
Cdd:cd21591   801 LTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLY 880
                         890       900       910       920
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5334 LEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21591   881 LQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQ 928
betaCoV_Nsp14 cd21659
nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5984-6500 0e+00

nonstructural protein 14 of betacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394958  Cd Length: 519  Bit Score: 1138.29  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRV 6063
Cdd:cd21659     1 TGLFKDCSKSYVGLHPAYAPTFLSVDDKYKTNGDLCVCLNIIDSVVTYSRLISLMGFKLDLTLPGYPKLFITREEAIKRV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6064 RAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVR 6143
Cdd:cd21659    81 RAWIGFDVEGAHATRDAIGTNFPLQLGFSTGVNFVVEPTGLVDTEDGYMFTKIVAKAPPGEQFKHLIPLMSKGQPWDVVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6144 IRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIVD 6223
Cdd:cd21659   161 IRIVQMLSDTLDDLSDSVVFVTWAHGFELTSLRYFAKIGKERTCCMCTKRATCYSSRTGYYGCWRHSVGCDYVYNPFIVD 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6224 IQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAMLC 6303
Cdd:cd21659   241 VQQWGYTGNLQSNHDRYCSVHKGAHVASSDAIMTRCLAVHDCFCKRVNWDVEYPIISNELSINSSCRLVQRVVLKAALLA 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6304 NRYDVC*DIGNPKGLACVK--GYDFKFYDASPVVKSVKQFVYKYEAHKDQFLDGLCMFW*CNVDKYPANAVVCRFDTRVL 6381
Cdd:cd21659   321 NRFDLCYDIGNPKGIACVKdpVVDWKFYDAQPVVKSVKQLFYTYEAHKDQFKDGLCMFWNCNVDKYPANAIVCRFDTRVL 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6382 NKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPLRSATCITRCNLGGAVCLK 6461
Cdd:cd21659   401 SKLNLPGCNGGSLYVNKHAFHTPAFDKSAFENLKPLPFFYYSDTPCEYHGGNDVKDVDYVPLKSATCITRCNLGGAVCRK 480
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 381354069 6462 HAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:cd21659   481 HAEEYREYLEAYNTATTAGFTLWVYKTFDFYNLWNTFTK 519
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2280-2834 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1120.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2280 VSRGFFLVATVFLLWFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFKTTFGVSTICDFYQVTDLGYRSSFCNGSMVCEL 2359
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2360 CFSGFDMLDSYDAINVVQHVVDRRVSFDYISILKLVVELIIGYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLETMHWSAR 2439
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2440 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2519
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2520 KHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2599
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2600 LLHSKVKGVPETHVVVVENEADKAGFLGAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKS 2679
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2680 LTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAAD 2759
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 2760 LGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
deltaCoV_RdRp cd21590
deltacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: ...
4459-5381 0e+00

deltacoronavirus RNA-dependent RNA polymerase, also known as non-structural protein 12: responsible for replication and transcription of the viral RNA genome; This subfamily contains the RNA-dependent RNA polymerase (RdRp) of deltacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. A key component, the RNA-dependent RNA polymerase (RdRp, also known as Nsp12), catalyzes the synthesis of viral RNA and thus plays a central role in the replication and transcription cycle of CoV, possibly interacting with its co-factors, Nsp7 and Nsp8. RdRp is therefore considered a primary target for nucleotide analog antiviral inhibitors such as remdesivir, which has been shown to inhibit human endemic and zoonotic deltacoronaviruses with a highly divergent RdRp. Nsp12 contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 394894  Cd Length: 928  Bit Score: 1112.24  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4459 FLKRVRGTSvNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDED-----GNKLDKFFVVKRTNLEVY 4533
Cdd:cd21590     3 YLNRVTGSS-DARLEPLQPGTQPDAVKRAFHVHNNTTSGIFLSTKTNCARFKTTRSAlplpnKGEVDLYFVTKQCSAKVF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4534 NKEKECYEL--------TKECGVVAEHEFFTFDvegsRVPHIVRKDLSKFTMLDLCYALRHFDRNDcSTLKEILLTYAEC 4605
Cdd:cd21590    82 EIEEKCYNAlstelyttDDTFGVLAKTEFFKFD----KIPNVNRQYLTKYTLLDLAYALRHLSTSK-DVIQEILITMCGT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4606 GESYFQKKdWYDFVENPDIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVA 4685
Cdd:cd21590   157 PEDWFGEN-WFDPIENPTFYKEFHKLGDILNRCVLNANKFASACIDAGLVGILTPDNQDLLGQIYDFGDFIITQPGNGCV 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4686 VADSYYSYMMPMLTMCHALDSELY-VNGTYREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNI 4764
Cdd:cd21590   236 DLSSYYSYLMPIMSMTHMLKCECMdSDGNPLEYDGFQYDFTDFKLELFEKYFKYWDRPYHPNTVDCPDDRCVLHCANFNV 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4765 LFSMVLPKTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRLSLKDLLLYAADPALHVASASALLDLRTCC 4844
Cdd:cd21590   316 LFAMCIPNTAFGNLCSQATVDGHLVVQTVGVHLKELGIVLNQDVTTHMSNINLNTLLRLVGDPTTIASVSDKCLDLRTPC 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4845 FSVAAITSGVKFQTVKPGNFNQDFYEFILSKGLFKEgSSVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEV 4924
Cdd:cd21590   396 QTLATMSSGITKQSVKPGHFNQHFYKHLLDSNLLDQ-LGIDIRHFYYMQDGEAAITDYSYYRYNTPTMVDIKMFLFCLEV 474
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4925 VNKYFEIYEGGCIPATQVIVNNYDKSAGYPFNKFGKARLYYEaLSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRAR 5004
Cdd:cd21590   475 ADKYLEPYEGGCINAQSVVVSNLDKSAGYPFNKLGKARNYYD-MTYAEQNQLFEYTKRNVLPTLTQMNLKYAISAKDRAR 553
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5005 TVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDSPVLMGWDYPKCDRAMPNILRIVSS 5084
Cdd:cd21590   554 TVAGVSIISTMTNRQYHQKMLKSISLARNQTIVIGTTKFYGGWDNMLRRLMCNINNPILVGWDYPKCDRSMPNMLRIAAS 633
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5085 LVLARKHdSCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKI 5164
Cdd:cd21590   634 CLLARKH-TCCNQSQRFYRLANECCQVLSEVVVSGNNLYVKPGGTSSGDATTAYANSVFNILQVVSANVATFLSTSTTSH 712
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5165 EDLSIRELQKRLYSNVYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIANISAFQQVLYYQNNVFMS 5244
Cdd:cd21590   713 INKDIADLHRSLYEDIYRGDSNDITVINRFYQHLQSYFGLMILSDDGVACIDSDAAKSGAVADLDGFRDILFYQNNVYMA 792
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5245 EAKCWVETDIEKGPHEFCSQHTMLVKMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVhHENP 5324
Cdd:cd21590   793 DSKCWTETDMTVGPHEFCSQHTVLAEHDGKPYYLPYPDVSRILGACIFVDDVNKADPVQNLERYISLAIDAYPLT-KVDP 871
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 5325 EYQNVFRVYLEYIKKLYNDLGNQILDSYSVILSTCDGQKFTDETFYKNMYLRSAVMQ 5381
Cdd:cd21590   872 IKGKVFYLLLDYIRVLAQELQDGILDAFQSLTDMSYVNNFVQEAFYAQMYEQSPTLQ 928
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-831 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1088.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  249 PILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSIDYVGQPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  329 VVDGDVVVRAPAHLLAADALVKRLPRLVETMLYTDSSVTEFCYKTKLCDCGFITQFGYVDCCGDTCDFRGWVPGNMLDGF 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTDDSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMIDGF 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  409 PCPGCSKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNREAFKLYGHAVVPFGSAVYWSPYPGMWLPVVWSSVKSYSGLTY 488
Cdd:cd21519   161 ACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQDSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVKSYDGMVY 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  489 TGVVGCKAIVQETDAICRSLYMDYVQHKCGNLDQRATLGLDDVYHRQLLVNRGDYSLLLENVDLFVKRRAEFACKFATCG 568
Cdd:cd21519   241 TGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFSCETATVC 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  569 D-GFVPLLLDGLVPRSYYLIKSGQAYTSMMVNFSHEVIDMCMDMALLFMHDVKVATKYVKKFTGKLAVRFKALGVAVVRK 647
Cdd:cd21519   321 DeGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKALGAKFVKK 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  648 ITEWFDLAVDIAASAAGWLCYQLVNGLFAVANGVITFVQEAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNR 727
Cdd:cd21519   401 LIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSVFKTGRGR 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  728 VCLAGSKVYEVVQKSLSAYVLPVGC--SEATCLVGESEPAVFEDDVVGVVKTPLTYQGCCKPPTSFEKICIVDKLYMAKC 805
Cdd:cd21519   481 VCFAGNKVYKVSRGLLSGFVLPSDVqeSQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKPPKSAEKICIVDNVYMAKC 560
                         570       580
                  ....*....|....*....|....*.
gi 381354069  806 GDQFYPVVVDNDTVGVLDQCWRFPCA 831
Cdd:cd21519   561 GDKFYPVVVDDDTIGLLDQAWRFPCA 586
CoV_ExoN pfam06471
Coronavirus proofreading exoribonuclease; This region of coronavirus polyproteins encodes the ...
5982-6500 0e+00

Coronavirus proofreading exoribonuclease; This region of coronavirus polyproteins encodes the NSP14 protein. Its N-terminal exoribonuclease (ExoN) domain plays a proofreading role for prevention of lethal mutagenesis, and the C-terminal domain functions as a (guanine-N7) methyl transferase (N7-MTase) for mRNA capping. NSP14 forms the nsp14-nsp10 complex involved in RNA viral proofreading.


Pssm-ID: 399465  Cd Length: 515  Bit Score: 951.90  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5982 CTTNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIK 6061
Cdd:pfam06471    1 NTTGLFKDCSKEYSGLHPAHAPTYLSLDDKFKTSGDLAVCVGVSDKDVTYKRLISLMGFKMSLNVEGYHNMFITRDEAIR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6062 RVRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDV 6141
Cdd:pfam06471   81 HVRAWIGFDVEGAHATGDNVGTNLPLQLGFSTGVDFVVTPEGCVDTENGSVFEPVNAKAPPGEQFKHLIPLMRKGQPWHV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6142 VRIRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLI 6221
Cdd:pfam06471  161 VRIRIVQMLADTLAGLSDRVVFVLWAHGLELTTMRYFVKIGREQVCS-CGKRATCFNSSTDTYACWKHSLGCDYVYNPFL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6222 VDIQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAM 6301
Cdd:pfam06471  240 IDIQQWGYTGSLSSNHDEHCNVHGNAHVASGDAIMTRCLAVHDCFVKRVDWSLEYPIIANELRVNKACRLVQRMVLKAAL 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6302 LCNRYDVC*DIGNPKGLACV--KGYDFKFYDASPVVKSVKQFVYKYEAHKDqFLDGLCMFW*CNVDKYPANAVVCRFDTR 6379
Cdd:pfam06471  320 LADKPPVVHDIGNPKGIKCVrrAGVKWKFYDANPIVKNVKQLEYDYETHKD-KMDGLCLFWNCNVDMYPANAIVCRFDTR 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6380 VLNKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCvymeGMESKQVDYVPLRSATCITRCNLGGAVC 6459
Cdd:pfam06471  399 VLSKLNLPGCNGGSLYVNKHAFHTPAFDRRAFANLKPMPFFYYSDSPC----ESVGKQVDYVPLKSATCITRCNIGGAVC 474
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 381354069  6460 LKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:pfam06471  475 KKHANEYREYVESYNMMTTAGFTFWVPKNFDTYNLWNTFTR 515
TM_Y_betaCoV_Nsp3_C cd21713
C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2280-2834 0e+00

C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409661  Cd Length: 545  Bit Score: 841.00  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2280 VSRGFFLVATVFLLWFNFLYANVILSDFylpnigslPTFVGQIVAWFKTTFGVSTICDFYQVTDLGYRSSFCNGSMVCEL 2359
Cdd:cd21713     1 VSLLLFLCLTVLLLWFNFLYANFILSDS--------PTFVGSIVAWFKYTLGISTICDFYQVTYLGDISEFCTGSMLCSL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2360 CFSGFDMLDSYDAINVVQHVVDRRVSFDYIsiLKLVVELIIGYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLetMHWSAR 2439
Cdd:cd21713    73 CLSGMDSLDNYDALNMVQHTVSSRLSDDYI--FKLVLELFFAYLLYTVAFYVLGLLAILQLFFSYLPLFFML--NSWLVV 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2440 LFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCT 2519
Cdd:cd21713   149 LFVYVINMVPASTLVRMYIVVASLYFVYKLYVHVVYGCNDTACLMCYKRNRATRVECSTVVNGSKRSFYVMANGGTGFCT 228
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2520 KHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNG 2599
Cdd:cd21713   229 KHNWNCVNCDTYGPGNTFICDEVAADLSTQFKRPINPTDSSYYSVTSVEVKNGSVHLYYERDGQRVYERFSLSLFVNLDK 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2600 LLHSKVKGVP--ETHVVVVENEADKAGFLGAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDR 2677
Cdd:cd21713   309 LKHSEVKGSPpfNVIVFDASNRAEENGAKSAAVYYSQLLCKPILLVDKKLVTTVGDSAEVARKMFDAYVNSFLSTYNVTM 388
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2678 KSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVA 2757
Cdd:cd21713   389 DKLKTLVSTAHNSLKEGVQLEQVLKTFIGAARQKAAVESDVETKDIVKCVQLAHQADVDFTTDSCNNLVPTYVKVDTITT 468
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 2758 ADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21713   469 ADLGVLIDNNAKHVNANVAKAANVALIWNVAAFLKLSESLRRQLRSAARKTGLNFKLTTSKLRAVVPILTTPFSLKG 545
CoV_Nsp14 cd21528
nonstructural protein 14 of coronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) ...
5984-6500 0e+00

nonstructural protein 14 of coronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394955  Cd Length: 518  Bit Score: 767.78  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSS-VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKR 6062
Cdd:cd21528     1 TGLFKDCSKIFSGLHPAHAPTHLSLDSNFKTDELLADLVGPGVGKdITYRHLISLMGFKMNLDVEGYHNMFITREEAIRN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6063 VRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVV 6142
Cdd:cd21528    81 VRGWIGFDVEGAHAVGDNVGTNLPLQLGFSTGVNFVVVPEGLVDTESGTEFEPVRAKPPPGEQFKHLIPLMRKALPWSVV 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6143 RIRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIV 6222
Cdd:cd21528   161 RKRIVQMLADTLKGLSDRVVFVLWAHGLELTTMRYFVKIGPEKKCC-CGKRATCYNSSSDTYACWNHSLGCDYVYNPYII 239
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6223 DIQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAML 6302
Cdd:cd21528   240 DVQQWGYSGNLQSNHDEHCNVHGNAHVASADAIMTRCLAIHECFVKRVDWSIEYPIIGNELRLNSACRLVQRNFLNSALL 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6303 CNRYDVC*DIGNPKGLACVK--GYDFKFYDASPVVKSVKQFVYKYEAHKDQFLDGLCMFW*CNVDKYPANAVVCRFDTRV 6380
Cdd:cd21528   320 AYKPKVVYDIGNPKGIKCVRraEVKWKFFDKQPIVSNVKKLFYDYAEHHDKFTDGLCLFWNCNVDRYPANSLVCRFDTRV 399
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6381 LNKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCvYMEGMESKQVDYVPLRSATCITRCNLGGAVCL 6460
Cdd:cd21528   400 LSNLNLPGCNGGSLYVNKHAFHTPAFDKSAFKNLKPLPFFFYDDSPC-ETHQKQVSSIDYVPLSAADCITRCNIGGAVCS 478
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 381354069 6461 KHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTR 6500
Cdd:cd21528   479 KHANEYREYVNAYNLMVSAGFTFWVPKQFDTYNLWKTFTR 518
betaCoV_Nsp13-helicase cd21722
helicase domain of betacoronavirus non-structural protein 13; This model represents the ...
5631-5970 0e+00

helicase domain of betacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from betacoronavirus, including pathogenic human viruses such as Severe acute respiratory syndrome coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409655 [Multi-domain]  Cd Length: 340  Bit Score: 728.14  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5631 RFASVYSVPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDC 5710
Cdd:cd21722     1 GLYPTYNVPEEFQNNVVNYQKIGMKRYCTVQGPPGTGKSHLAIGLAVYYPTARVVYTACSHAAVDALCEKAFKFLNINKC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5711 TRIVPAKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRV 5790
Cdd:cd21722    81 SRIIPAKARVECYDKFKVNDTSRQYVFSTINALPETVTDILVVDEVSMCTNYDLSVINARVRAKHIVYIGDPAQLPAPRT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5791 LLNKGTLEPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNM 5870
Cdd:cd21722   161 LLTKGTLEPEYFNSVTRLMCCLGPDIFLGTCYRCPKEIVDTVSALVYDNKLKAKKDNSGQCFKVYYKGSVTHDSSSAINR 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5871 QQIHLISKFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKK 5950
Cdd:cd21722   241 PQIYLVKKFLKANPAWSKAVFISPYNSQNAVARRVLGLQTQTVDSSQGSEYDYVIYCQTAETAHSVNVNRFNVAITRAKK 320
                         330       340
                  ....*....|....*....|
gi 381354069 5951 GILCVMSSMQLFESLNFTTL 5970
Cdd:cd21722   321 GILCVMSSMQLFESLQFTEL 340
alphaCoV_Nsp14 cd21660
nonstructural protein 14 of alphacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5986-6499 0e+00

nonstructural protein 14 of alphacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394959  Cd Length: 510  Bit Score: 719.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5986 LFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSsVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRA 6065
Cdd:cd21660     3 LFKDCSRNPDYLPPSHATTYMSLSDNFKTSGDLAVQIGVKGP-VTYEHVISFMGFRFDVNVPGYHTLFCTRDFAMRNVRG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6066 WVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIR 6145
Cdd:cd21660    82 WLGFDVEGAHVCGDNVGTNVPLQLGFSNGVDFVVQPEGCVVTENGNSIKPVKARAPPGEQFTHLIPLMRKGQPWSVVRKR 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6146 IVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIVDIQ 6225
Cdd:cd21660   162 IVQMCCDYLKGLSDILIFVLWAGGLELTTMRYFVKIGPVKHCH-CGKEATCYNSVSHAYCCFKHALGCDYLYNPYVIDIQ 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6226 QWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAMLCNR 6305
Cdd:cd21660   241 QWGYTGSLSLNHHEHCNVHRNEHVASGDAIMTRCLAIYDCFVKNVDWSITYPFIANEKAINKSGRVVQSHVMRAALKLYN 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6306 YDVC*DIGNPKGLAC-VKGYDFKFYDASPVVKSVKQFVYKYEAHKdqFLDGLCMFW*CNVDKYPANAVVCRFDTRVLNKL 6384
Cdd:cd21660   321 PKAIHDIGNPKGIRCaVTDASWYCYDKQPINSNVKTLEYDYITHG--QMDGLCLFWNCNVDMYPEFSIVCRFDTRCRSKL 398
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6385 NLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGmeskQVDYVPLRSATCITRCNLGGAVCLKHAE 6464
Cdd:cd21660   399 NLEGCNGGSLYVNNHAFHTPAFDKRAFAKLKPMPFFFYDDSECDKVQD----QVNYVPLRANNCITRCNIGGAVCSKHAA 474
                         490       500       510
                  ....*....|....*....|....*....|....*
gi 381354069 6465 EYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFT 6499
Cdd:cd21660   475 LYHAYVEAYNTFTQAGFTIWVPTSFDLYNLWQTFV 509
gammaCoV_Nsp14 cd21658
nonstructural protein 14 of gammacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5984-6499 0e+00

nonstructural protein 14 of gammacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394957  Cd Length: 518  Bit Score: 678.89  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNV-ADSSVTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKR 6062
Cdd:cd21658     1 TGLFKICNKEFSGVHPAYAVTTKALAATYKVNDELAALVNVeAGSEITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6063 VRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVV 6142
Cdd:cd21658    81 VRGWVGFDVEATHACGTNIGTNLPFQVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRALFKSAKPWHVI 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6143 RIRIVQMLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRATCFNSRTGYYGCWRHSYSCDYLYNPLIV 6222
Cdd:cd21658   161 RPRIVQMLADNLCNVSDCVVFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACWKHCLGFDFVYNPLLV 239
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6223 DIQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFCKSVNWNLEYPIILNEVSVNTSCRLLQRVMFRAAML 6302
Cdd:cd21658   240 DIQQWGYSGNLQFNHDLHCNVHGHAHVASADAIMTRCLAINNAFCQDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVD 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6303 CNRYDVC*DIGNPKGLACVKGYD--FKFYDASPVVKSVKQFVYKYEAHKDQFLDGLCMFW*CNVDKYPANAVVCRFDTRV 6380
Cdd:cd21658   320 ALKVNVVYDIGNPKGIKCVRRGDvsFRFYDKNPIVPNVKQFEYDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRN 399
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6381 LNKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQvDYVPLRSATCITRCNLGGAVCL 6460
Cdd:cd21658   400 LSVFNLPGCNGGSLYVNKHAFHTPKFDRISFRNLKAMPFFFYDSSPCDTIQVDGVAQ-DLVSLATKDCITKCNIGGAVCK 478
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 381354069 6461 KHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFT 6499
Cdd:cd21658   479 KHAQMYAEFVTSYNAAVTAGFTFWVTNNFNPYNLWKSFS 517
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 659.71  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    81 FVDKDPQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   161 WLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGH-RRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYT 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHfRRAFTMPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   240 LLKGYRGVKPILFVDQYGCDYTGCLAKGLEDYGDLTLSEMKELFPVWRESLDNEVVVAWHVDRDPRAVMRLQTLATLRSI 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 381354069   320 DYVGQPTEDVVDGDVVVRAPAHLLAADALVKRLPR 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
CoV_RPol_N pfam06478
Coronavirus RNA-dependent RNA polymerase, N-terminal; This family covers the N-terminal region ...
4467-4815 0e+00

Coronavirus RNA-dependent RNA polymerase, N-terminal; This family covers the N-terminal region of the coronavirus RNA-directed RNA Polymerase which corresponds to the nonstructural protein 12 (NSP12) produced by cleavage of ORF1b. NSP12 contains a polymerase domain that assumes a structure resembling a cupped 'right hand', similar to other polymerases, containing a fingers domain, a palm domain and a thumb domain. Coronavirus NSP12 also contains a nidovirus-unique N-terminal extension that possesses a kinase-like fold allowing the binding of NSP12 to NSP7 and NSP8. NSP12 possesses some minimal activity on its own, but the addition of the NSP7 and NSP8 co-factors greatly stimulates polymerase activity.


Pssm-ID: 461929  Cd Length: 353  Bit Score: 652.22  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4467 SVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYKVNCCRFQRVDEDGNKLDKFFVVKRTNLEVYNKEKECYELTKEC 4546
Cdd:pfam06478    1 SSAARLEPCASGTDPDVVYRAFDIYNKDVAGIGKFLKTNCCRFQEVDKDGNLLDSYFVVKRCTKSVYEHEESCYNLLKDC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4547 GVVAEHEFFTFDVEGSRVPHIVRKDLSKFTMLDLCYALRHFDRNDCSTLKEILLTYAECGESYFQKKDWYDFVENPDIIN 4626
Cdd:pfam06478   81 GVVAEHDFFKFDVGGDMVPNISRQDLTKYTMMDLCYALRHFDEKDCEVLKEILVTYGCCEEDYFEKKDWYDPVENPDIYR 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4627 VYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGCGVAVADSYYSYMMPMLTMCHALDS 4706
Cdd:pfam06478  161 VYAKLGPIVRRALLKTVAFCDAMVEAGLVGVLTLDNQDLNGNFYDFGDFVKTAPGCGVPVVDSYYSYMMPIMTMTHALAS 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4707 ELYVNGT----YREFDLVQYDFTDFKLELFNKYFKHWSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFGPLVRQI 4782
Cdd:pfam06478  241 ECFMDSDlgkdYKKYDLLKYDFTEEKLELFDKYFKYWDQTYHPNCVDCLDDRCILHCANFNVLFSTVIPNTAFGPLVRKV 320
                          330       340       350
                   ....*....|....*....|....*....|...
gi 381354069  4783 FVDGVPFVVSIGYHYKELGVVMNMD*DTHRYRL 4815
Cdd:pfam06478  321 FVDGVPFVVTAGYHFKELGVVMNQDVNTHSSRL 353
CoV_Methyltr_2 pfam06460
Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus ...
6878-7173 0e+00

Coronavirus 2'-O-methyltransferase; This domain covers the NSP16 region of the coronavirus polyprotein. The SARS-CoV RNA cap SAM-dependent (nucleoside-2'-O-)-methyltransferase (2'-O-MTase) is a heterodimer comprising SARS-CoV nsp10 and nsp16. When bound to nsp10, nsp16 is active as a type-0 RNA cap-dependent 2'-O-MTase, ie., active only when the cap guanine is methylated at its N7 position. Nsp10 binds to nsp16 through an activation surface area in nsp10, and the resulting complex exhibits RNA cap (nucleoside-2'-O)-methyltransferase activity. Nsp10 is a double zinc finger protein together with nsp4, nsp5, nsp12, nsp14, and nsp16, nsp10 has been found to be essential in the assembly of a functional replication/transcription complex. Nsp16 adopts a typical fold of the S-adenosylmethionine-dependent methyltransferase (SAM) family as defined initially for the catechol O-MTase but it lacks several elements of the canonical MTase fold, such as helices B and C. The nsp16 topology matches those of dengue virus NS5 N-terminal domain and of vaccinia virus VP39 MTases.


Pssm-ID: 461919  Cd Length: 296  Bit Score: 600.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6878 AADWKPGYVMPVLYKYLESPLERVNLWNYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPG 6957
Cdd:pfam06460    1 SAAWKPGYSMPVLYKYQRMCLERCNLYNYGAGITLPSGIMMNVAKYTQLCQYLNTTTLAVPHNMRVLHLGAGSDKGVAPG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6958 SAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKL 7037
Cdd:pfam06460   81 SAVLRQWLPAGTILVDNDLNDFVSDADFSVTGDCATLYTEDKWDLIISDMYDPRTKNIDGENVSKDGFFTYLCGFIREKL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  7038 ALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGA 7117
Cdd:pfam06460  161 ALGGSIAIKITEFSWNADLYKLMGRFAWWTMFCTNVNASSSEAFLIGINYLGKPKVEIDGNTMHANYIFWRNSTVMQLSA 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069  7118 YSLFDMSKFPLKAAGTAVVSLKPDQINDLVLSLIEKGRLLVRDTRKEVFVGDSLVN 7173
Cdd:pfam06460  241 YSLFDMSKFPLKLKGTAVVNLKEDQINDMVYSLLEKGKLLIRDNGKEVFFSDSLVN 296
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3334-3627 0e+00

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 575.12  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3334 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSDDMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3413
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3414 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3493
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3494 QLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3573
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 3574 AD--LV*DALASMTGVTVEQVLAAIKRLYSGF-QGKQILGSCVLEDELTPSDVYQQL 3627
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
CoV_Nsp13-helicase cd21718
helicase domain of coronavirus non-structural protein 13; This model represents the helicase ...
5638-5970 0e+00

helicase domain of coronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from alpha-, beta-, gamma-, and deltacoronavirus, including pathogenic human viruses such as Severe acute respiratory syndrome coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409652 [Multi-domain]  Cd Length: 341  Bit Score: 568.70  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5638 VPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDCTRIVPAK 5717
Cdd:cd21718     8 IPHDFSNHVPSYQKIGKQKYTTVQGPPGTGKSHFAIGLALYYPGARIVYTACSHAAVDALCEKASKWLPNDKCSRIVPQR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5718 VRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRVLLNKGTL 5797
Cdd:cd21718    88 ARVECFDGFKVNNTNAQYIFSTINALPECSADIVVVDEVSMCTNYDLSVVNARLKYKHIVYVGDPAQLPAPRTLLTEGSL 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5798 EPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNMQQIHLIS 5877
Cdd:cd21718   168 EPKDYNVVTRLMVGSGPDVFLSKCYRCPKEIVDTVSKLVYDNKLKAIKPKSRQCFKTFGKGDVRHDNGSAINRPQLEFVK 247
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5878 KFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKKGILCVMS 5957
Cdd:cd21718   248 RFLDRNPRWRKAVFISPYNAMNNRASRLLGLSTQTVDSSQGSEYDYVIFCQTTDTAHALNINRFNVAITRAKHGILVIMR 327
                         330
                  ....*....|....
gi 381354069 5958 SM-QLFESLNFTTL 5970
Cdd:cd21718   328 DEnDLYNALQFKSL 341
alphaCoV_Nsp13-helicase cd21723
helicase domain of alphacoronavirus non-structural protein 13; This model represents the ...
5636-5970 8.68e-171

helicase domain of alphacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from alphacoronavirus, including Porcine epidemic diarrhea virus and Human coronavirus (CoV) NL63. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409656 [Multi-domain]  Cd Length: 340  Bit Score: 530.46  E-value: 8.68e-171
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5636 YSVPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDCTRIVP 5715
Cdd:cd21723     6 FNISEAYSNLVPYYQLIGKQKITTIQGPPGSGKSHCVIGLGLYYPGARIVFTACSHAAVDSLCVKAATAYSVDKCSRIIP 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5716 AKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRVLLNKG 5795
Cdd:cd21723    86 ARARVECYDGFKPNNTSAQYIFSTVNALPECNADIVVVDEVSMCTNYDLSVINQRVSYKHIVYVGDPQQLPAPRTMITRG 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5796 TLEPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKGQTTHESSSAVNMQQIHL 5875
Cdd:cd21723   166 VLEPKDYNVVTQRMCALGPDVFLHKCYRCPAEIVNTVSELVYENKFKPVHPESKQCFKIFCKGNVQVDNGSSINRRQLDV 245
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5876 ISKFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKKGILCV 5955
Cdd:cd21723   246 VKMFLAKNPKWSKAVFISPYNSQNYVASRVLGLQIQTVDSSQGSEYDYVIYTQTSDTAHACNVNRFNVAITRAKKGILCV 325
                         330
                  ....*....|....*
gi 381354069 5956 MSSMQLFESLNFTTL 5970
Cdd:cd21723   326 MCDKELFDALKFFEL 340
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3359-3633 6.89e-164

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III,,. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad,.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 507.75  E-value: 6.89e-164
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3359 GLWLDDKVYCPRHVICSSDDMTdPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKP 3438
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3439 GETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQLELSTGCHTGTDLSGNFYGPYRDA 3518
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3519 QVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKADLV*DALASMTGVTVEQVLAAIKR 3598
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 381354069  3599 LYSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3633
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1607-1904 4.81e-160

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 497.88  E-value: 4.81e-160
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1607 NKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDAFGFDEP-QLL 1685
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1686 KYYNMLGMCK-WPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNE 1764
Cdd:cd21732    81 RYYSALAHVKkWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1765 PSDSTDFIRVVLREADLSGATCDLEFICK-CGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLVHCTQFNV-PF 1842
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069 1843 LICSYTPEGRKLPD-DVVAANIFTGG-SLGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1904
Cdd:cd21732   241 LLMSNTPTEVPLPTgDFVAANVFTGDeSVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
CoV_Nsp5_Mpro cd21646
coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains ...
3334-3626 2.55e-159

coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394885  Cd Length: 292  Bit Score: 495.40  E-value: 2.55e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3334 VKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSDDmTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQ 3413
Cdd:cd21646     1 KKMAQPSGKVERCMVSVTYGSTTLNGLWLDDTVYCPRHVICKSTT-SGPDYDDLLSRARNHNFSVQSGGVQLRVVGVTMQ 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3414 GSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMH 3493
Cdd:cd21646    80 GALLRLKVDTSNPHTPKYKFVTVKPGDSFTILACYNGSPSGVYGVNMRSNYTIKGSFLNGACGSVGYNIDGGTVEFCYMH 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3494 QLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIK 3573
Cdd:cd21646   160 HLELPNGCHTGTDLTGKFYGPYVDQQVAQVEGADTLITDNVVAWLYAAIINGDRWWLNSSRTTVNDFNEWAMANGFTPVS 239
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 381354069 3574 ADLV*DALASMTGVTVEQVLAAIKRLYSGFQGKQILGSCVLEDELTPSDVYQQ 3626
Cdd:cd21646   240 QVDCLSILAAKTGVSVERLLAAIQQLHQNFGGKQILGSTSLEDEFTPEDVVRQ 292
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2846-3227 2.73e-159

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 499.04  E-value: 2.73e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2846 FVANLICFIVLWALIPTYAVHKSDMQLPLYASFKVIENGVLRDVSVTDATSANKFNQFDQWYESTFGLAYYRTSSCPVVV 2925
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYGSVPTNSKSCPIVV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2926 AVIDqDIGHTLFNVPTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHaDGTPHPYCYTE 3005
Cdd:cd21473    81 GVID-DVRGSVPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYDT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3006 GVMHNASLYSSLVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYraMP 3085
Cdd:cd21473   159 GLVEGAKLYSDLLPHVRYKLVDGN-YIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYY--GP 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3086 GTFCGRNAFDLIHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDyTSVVVINVIVWCINFMMLFVF 3165
Cdd:cd21473   236 GVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYVV 314
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3166 QVYPTLSCLYACFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCIIYVAVVVSNHALWLF 3227
Cdd:cd21473   315 TQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
deltaCoV_Nsp14 cd21657
nonstructural protein 14 of deltacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus ...
5984-6499 7.03e-156

nonstructural protein 14 of deltacoronavirus; Nonstructural protein 14 (Nsp14) of coronavirus (CoV) plays an important role in viral replication and transcription. It consists of 2 domains with different enzymatic activities: an N-terminal exoribonuclease (ExoN) domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The association of Nsp14 with Nsp10 stimulates its ExoN activity; the complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end mimicking an erroneous replication product. The Nsp10/Nsp14 complex may function in a replicative mismatch repair mechanism. N7-MTase functions in mRNA capping. Nsp14 can methylate GTP, dGTP as well as cap analogs GpppG, GpppA and m7GpppG. The accumulation of m7GTP or Nsp14 has been found to interfere with protein translation of cellular mRNAs.


Pssm-ID: 394956  Cd Length: 508  Bit Score: 495.15  E-value: 7.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5984 TNLFKDCSKSYDGYHPAHAPSFLAVDDKYKVGGDLAVCLNVADSS-VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKR 6062
Cdd:cd21657     1 TPLFKRCGYEYNGVHPAHALTWHDCGAEYRCEEPLAKLVGVADGTlISYKTLVSALGFLPSLKIDTYHNMFLTKDACRAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6063 VRAWVGFDAEGAHATRDSIGTNFPLQLGFSTGIDFVVEATGMFAEREGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDVV 6142
Cdd:cd21657    81 VQSWIGIDVEAAHAVKPNVGTNLPLQIGFSTGKNFSVTPEGIWVNEHGSCTEPVPAKIPPGEQFRHLKKDMRQARPWKVV 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6143 RIRIVQMLSDHLVDlADSVVLVTWAASFELTCLRYFAKVGKEVVCSvCNKRAtCFNSRTGYyGCWRH----SYSCDYLYN 6218
Cdd:cd21657   161 RREIAAHLAEVAPH-TDYICFVTWAHQLELATMRYFVKIGMEEKCF-CGRRA-CFTNGTEF-ACKAHhsltTPQCDYVYN 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6219 PLIVDIQQWGYTGSLTSNHDPICS*HKGAHVASSDAIMTRCLAVHDCFcKSVNWNLEYPIILNEVSVNTSCRLLQRVMFR 6298
Cdd:cd21657   237 PFLIDVATWGFSGRLSTNHDAVCTYHANAHVASADAIMTVCLAIHELF-STVDWDLEFPVTPEQSQLNKACRLVQANYLN 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6299 AAMLCNRYDVC*DIGNPKGLACVK--GYDFKFYDASPVVKSVKQFVYK--YEAHkdqFLDGLCMFW*CNVDKYPANAVVC 6374
Cdd:cd21657   316 ILLTTTKATVVHDIGNPKGIPIVRkpGVKYHFYDQAPIVKHVQKLKYKpeMEAR---FTDGLTMFWNCNVDTYPANALVC 392
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6375 RFDTRvlNKLNLPGCNGGSLYVN*HAFHTSPFTRAAFENLKPMPFFYYSDTPCvymegmESKQVDYVPLRSatCITRCNL 6454
Cdd:cd21657   393 RYDTH--RQKHLIGPNGSALYVNKHAFLTPEMHTYATHKLTLAPLVYYSTTDC------SSEQPIVVTYRD--CVTRCNT 462
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 6455 GGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFT 6499
Cdd:cd21657   463 GTTICPTHALEYQEFINAYNLMARHGFNVYIPRNVNVYNCWLTFT 507
capping_2-OMTase_betaCoV_Nsp16 cd23528
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of betacoronavirus, also called ...
6905-7120 2.74e-155

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of betacoronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. The betacoronavirus (betaCoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467740  Cd Length: 216  Bit Score: 480.35  E-value: 2.74e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6905 NYGKPITLPTGCLMNVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDTV 6984
Cdd:cd23528     1 NYGQPATLPTGTMMNVAKYTQLCQYLNTCTLAVPANMRVIHFGAGSDKGVAPGTAVLRQWLPTDAILVDNDLNPFVSDAD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6985 ASYYGNCITLPFDCQWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFA 7064
Cdd:cd23528    81 ATYFGDCVTVPTDCKWDLIISDMYDPRTKNVGGENVSKEGFFTYLCGFIKDKLALGGSVAIKITEHSWSADLYKLMGHFA 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069 7065 FWTIFCTNVNASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRNSTMWNGGAYSL 7120
Cdd:cd23528   161 WWTVFCTNVNASSSEAFLIGINYLGKPKEEIDGNVMHANYIFWRNSTPMNLSSYSL 216
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-241 5.21e-155

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause drastic reduction or elimination of infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 480.35  E-value: 5.21e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    6 KYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDIFVDKD 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   86 PQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGVWLGNG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 381354069  166 RFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGHRRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYTLL 241
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHKRGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3634-3920 1.09e-149

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 467.87  E-value: 1.09e-149
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3634 SKRTRVIKGTCCWILASTFLFCSIIAAFVKWTMFMYVTTHMLGVTLCALCFVSFA-MLLIKHKHLYLTMYIMPVLCTLFY 3712
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSACvMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3713 TNYL*VYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLIAMVfVTMRSINHDVFSIMFLVGRLVSLVSMWYFGaNLE 3792
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTG-SLE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3793 EEVLLFLTSLFGTYTWTT---MLSLATAK-VIAKWLAVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMP 3868
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTgvvTVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3869 LGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2334-2821 7.22e-149

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 472.98  E-value: 7.22e-149
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2334 TICDFYQVtdlGYRSS------FCNGSMVCELCFSGFDMLDSYDAINVVQHVVDRRVSFDYISILKLVVELIIGYSLYTV 2407
Cdd:pfam19218    2 YPCDGYVD---GYSNSsfnksdYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2408 CFYPLFVLIGMQLLTTWLPEFFMLETMHWsarlfvfVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYK 2487
Cdd:pfam19218   79 TFVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSK 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2488 RNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEV 2567
Cdd:pfam19218  152 SARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKV 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2568 KQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVE-NEADKAGFLGAAVFYAQSLYRPMLMVEKK 2646
Cdd:pfam19218  232 EFQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNsNGSNLAQAKNACVYYSQLLCKPIKLVDSN 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2647 LITTANTGLSVSQTMFDLYVDSLLNVLDVDRKSLTSFVNAAHnslkegvqleqvmdtfvgcarrkcAIDSDVETRSITKS 2726
Cdd:pfam19218  312 LLSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNA 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2727 VMSAVNAGVDFTDESCNNLVPTYVKS-DTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKAC 2805
Cdd:pfam19218  368 VLNAHRYDVLLTDDSFNNFVPTYAKPeDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTA 447
                          490
                   ....*....|....*.
gi 381354069  2806 SKTGLKIKLTYNKQEA 2821
Cdd:pfam19218  448 KKKGVTFMLTFNTNRM 463
gammaCoV_Nsp13-helicase cd21720
helicase domain of gammacoronavirus non-structural protein 13; This model represents the ...
5638-5970 6.54e-145

helicase domain of gammacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from gammacoronavirus, including Avian infectious bronchitis virus. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. Coronavirus (CoV) Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409653 [Multi-domain]  Cd Length: 343  Bit Score: 456.30  E-value: 6.54e-145
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5638 VPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDCTRIVPAK 5717
Cdd:cd21720     8 VPECFVNNIPLYHLVGKQKRTTVQGPPGSGKSHFAIGLAAYFSNARVVFTACSHAAVDALCEKAFKFLKVDDCTRIVPQR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5718 VRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRVLLNkGTL 5797
Cdd:cd21720    88 TTVDCFSKFKANDTGKKYIFSTINALPEVSCDILLVDEVSMLTNYELSFINGKINYQYVVYVGDPAQLPAPRTLLN-GSL 166
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5798 EPRYFNSVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYY-KGQTT--HESSSAVNMQQIH 5874
Cdd:cd21720   167 SPKDYNVVTNLMVCVKPDIFLAKCYRCPKEIVDTVSTLVYDGKFIANNPESRQCFKVIVnNGNSDvgHESGSAYNTTQLE 246
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5875 LISKFLKANPSWSNAVFISPYNSQNYVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKKGILC 5954
Cdd:cd21720   247 FVKDFVCRNKEWREATFISPYNAMNQRAYRMLGLNVQTVDSSQGSEYDYVIFCVTADSQHALNINRFNVALTRAKRGILV 326
                         330
                  ....*....|....*..
gi 381354069 5955 VMSSM-QLFESLNFTTL 5970
Cdd:cd21720   327 VMRQRdELYSALKFTEL 343
DEXSMc_CoV_Nsp13 cd22649
DEXSM-box helicase domain of coronavirus Nsp13 helicase; Helicases catalyze the NTP-dependent ...
5623-5823 6.96e-136

DEXSM-box helicase domain of coronavirus Nsp13 helicase; Helicases catalyze the NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified into six superfamilies based on the arrangement of conserved motifs. This family contains coronavirus (CoV) non-structural protein 13 (Nsp13) helicase, including those from highly pathogenic human betaCoVs such as Severe Acute Respiratory Syndrome coronavirus (SARS) and SARS-CoV-2 (also known as 2019 novel CoV (2019-nCoV) or COVID-19 virus). Nsp13 helicase is a component of the viral RNA synthesis replication and transcription complex (RTC). SARS-Nsp13 is strongly inhibited by natural flavonoids, myricetin and scutellarein, and is emerging as a target for development of anti-SARS medications. It contains an N-terminal Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B regulatory domain, and an SF1 helicase core that carries a DEAD-box helicase domain. Nsp13 belongs to the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 438713 [Multi-domain]  Cd Length: 202  Bit Score: 424.12  E-value: 6.96e-136
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5623 PQENYTSIRFAS-VYSVPETFQNNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKA 5701
Cdd:cd22649     1 PQENYVRITGLYpTLNVPEEFSNNVPNYQKIGMQKYTTVQGPPGTGKSHFAIGLALYYPSARVVYTACSHAAVDALCEKA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5702 HKFLNINDCTRIVPAKVRVDCYDKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGD 5781
Cdd:cd22649    81 FKFLNIDKCTRIIPARARVECYDKFKVNDTNAQYVFSTINALPETSADIVVVDEVSMCTNYDLSVINARIRAKHIVYIGD 160
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 381354069 5782 PAQLPAPRVLLNKGTLEPRYFNSVTKLMCCLGPDIFLGTCYR 5823
Cdd:cd22649   161 PAQLPAPRTLLTKGTLEPEYFNSVTRLMCCLGPDIFLGTCYR 202
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2857-3212 2.25e-135

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 429.38  E-value: 2.25e-135
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2857 WALIPTYAVHKSDMQLPLYASFKVIENGVLRDVSVTDATSANKFNQFDQWYESTFGlAYYRTSSCPVVVAVIDQDIGHTL 2936
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG-SPTNSRSCPIVVGVVDEVVGRVV 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  2937 FNVPTKVLRHGFHVLHFITHAFATDSVQCYTPHMQIPYDNFYASGCVLSSLCTMLAHADGTPHPYCYTEGVMHNASLYSS 3016
Cdd:pfam19217   80 PGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYSD 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3017 LVPHVRYNLASSNgYIRFPEVVSEGIVRVVRTRSMTYCRVGLCEEAEEGICFNFNSSWVLNNPYYramPGTFCGRNAFDL 3096
Cdd:pfam19217  160 LVPHVRYKLVDGN-YVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLSL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3097 IHQVLGGLVQPIDFFALTASSVAGAILAIIVVLAFYYLIKLKRAFGDYTSVVVINVIVWCINFMMLFVFQVYPTLSCLYA 3176
Cdd:pfam19217  236 LTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVYA 315
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 381354069  3177 CFYFYTTLYFPSEISVVMHLQWLVMYGAIMPLWFCI 3212
Cdd:pfam19217  316 VLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
betaCoV_Nsp2_SARS_MHV-like cd21515
betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), ...
250-831 1.33e-130

betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), and related proteins; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This family includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2 rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 439198  Cd Length: 562  Bit Score: 424.57  E-value: 1.33e-130
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  250 ILFVDQYGCDYTGCLAKGLEDYGDLTLSE----------MKELFPVWRESLDNEVVVAWHVDRdPRAVMRLQTLATLRSI 319
Cdd:cd21515     2 TRYVDQYFCGPDGYPLECIKDLLAKAGKSsctlsdeqldFKELKRGGYCCRDHEHEIAWYVER-SDAPYELQTPFTIKSA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  320 DYvgqptedvvdGDVVVRAPAHLLAADALVK-RLPRLVETMLYTDSS-VTEFCYKTKLCDCGFITQFGYVDCcgDTCDFR 397
Cdd:cd21515    81 KK----------DTFKGEVPAFVFPLNSKVKvLKPRVVKKKLEGFMGkIRTVYPVASPNECNPMTLSALMKC--DHCDET 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  398 GWVPGNMLdGFPCPgCSKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNRE-----------------AFKLYGHAVVPFG 460
Cdd:cd21515   149 SWQTGNFV-GATCL-CGAEYTLTKEDATSAGYLPPGAVVKMPCPACKNDEvgpehsfadyhnssgikTFLRKGGRTVPFG 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  461 SAVYWSPYP----GMWLPVVWSSVKSYsgltYTGVVGCKAIVQETDAICRSLYmdyvqhkcgNLDQRATLGLDDVYHRQL 536
Cdd:cd21515   227 GCVFAYVGCyngcAYWVPRAWSNIGSN----HTGVVGSGVEVLNDDLLEILLR---------EKVNINIVGDFKLNEEVV 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  537 LVNRGDYSLLLENVDLFVKRRAEFAcKFATCGDGFVPLLLDGLVPRSYYLIKSGQAYTSMMVnfshEVIDMCMDMALLFM 616
Cdd:cd21515   294 IILASFSASVLAFVDTVKGLDFETF-KFIVESCGNFPVTKGKFVPGAWNLGKSKQVLTPLPA----FPSQAAMVVRSIFA 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  617 HDVKVATKYVK----------------KFTGKLAVRFKALGVavvrkitewfdlavDIAASAAGWLCYQLVNGLFA-VAN 679
Cdd:cd21515   369 RTVFTATHSVPalqeaaitiidgispqALRLLDAMRFTADLV--------------TNSVLAMAYVTGGLVQVTSQwLDN 434
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  680 GVITFVQEAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSKVYEVVQKSLSAYVLPVGcseatclv 759
Cdd:cd21515   435 LFGTVVDLLKPVLEWLEEKISSGIEFLIDLWEILKLLVTGAYKIVKGQIVLAGKNVSEVVQSFLSVLNKALG-------- 506
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069  760 gesepavfeddVVGVVKTPLTYQGCckpptsfeKICIVDKLYMAKCGDQFYPVVVDND---TVGVLDQCWRFPCA 831
Cdd:cd21515   507 -----------LLLPLKAPKEELFL--------TEGDTVDTSLTSEEVVVKTGVLEELdtpTSKVVDGPLVGTPV 562
Peptidase_C16 pfam01831
Peptidase C16 family;
1082-1330 1.23e-125

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 397.14  E-value: 1.23e-125
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1082 AFDAIYSKALSAVYAVPSDETHFKVCGFYSPAIERTNCWLRSTLIVMQSLPLEFKDLEMQKLWLSYKAGYDQCFVDKLVK 1161
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1162 SVPRSIILPQGGYVADFAYYFLSQCSFKAHANWRCLKCDMALKLQGLDAMFFYGDVVSHMCKCGSGMTLLSADIPYTLHF 1241
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1242 GVRDDKFCAFYTPRKVFRAACAVDVNDCHSMAVVDGKLIDGKNVTKFTGDKFDFMVGHGMTFSMSPFETAQLYGSCITPN 1321
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 381354069  1322 VCFVKGDVI 1330
Cdd:pfam01831  241 LCFGKGDVI 249
Peptidase_C16 pfam01831
Peptidase C16 family;
1-248 1.59e-124

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 393.67  E-value: 1.59e-124
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069     1 MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGRTLVNHVRVDCSRLPALECCVQSAIIRDI 80
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069    81 FVDKDPQKVEASTMMALQFGSAVLIMPSKRLSIQAWANLGVLPRTPAMGLFKRVCLCNTRGCSCDVHVAFQLFTVQPDGV 160
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   161 WLGNGRFIGWFVPVTAIPEYAKQWLQPWSILLRKGGNKGSVTSGH-RRAVTMPVYDFNVEDACEEVHLNPKGKYSRKAYT 239
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHfFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 381354069   240 LLKGYRGVK 248
Cdd:pfam01831  241 LCFGKGDVI 249
deltaCoV_Nsp13-helicase cd21721
helicase domain of deltacoronavirus non-structural protein 13; This model represents the ...
5644-5970 7.73e-121

helicase domain of deltacoronavirus non-structural protein 13; This model represents the helicase domain of non-structural protein 13 (Nsp13) from deltacoronavirus, including Bulbul coronavirus (CoV) HKU11 and Common moorhen CoV HKU21. Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. CoV Nsp13 is a member of the helicase superfamily 1 (SF1); SF1 and SF2 helicases do not form toroidal structures, while SF3-6 helicases do. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It is a multidomain protein containing a Cys/His rich zinc-binding domain (CH/ZBD), a stalk domain, a 1B domain involved in nucleic acid substrate binding, and a SF1 helicase core.


Pssm-ID: 409654 [Multi-domain]  Cd Length: 342  Bit Score: 387.36  E-value: 7.73e-121
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5644 NNVPNYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYCTARVVYTAASHAAVDALCEKAHKFLNINDCTRIVPAKVRVDCY 5723
Cdd:cd21721    14 QHFKSYNEIAMQKVTTVLGPPGTGKSTFAIGLAKYYPNARICYTASSHAAIDALCEKAFKTLPVGQCSRIVPTRTTVECF 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5724 DKFKVNDTTRKYVFTTINALPELVTDIIVVDEVSMLTNYELSVINSRVRAKHYVYIGDPAQLPAPRVLLNKGTLEPRYFN 5803
Cdd:cd21721    94 QDFVVNNTTAQYIFSTINALPDIKCDIVVVDEVSMLTNYELSSVNARLVYNHIVYVGDPYQLPSPRTMLTTGQLSPADYN 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5804 SVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYNNKLKAKNDNSSMCFKV---YYKGQTTHESSSAVNMQQIHLISKFL 5880
Cdd:cd21721   174 VVTDIMVHAGADVMLDMCYRCPREIVDTVSKLVYDNKLKAAKPNSRQCYKTiinNGNNDIAHEGQSAYNEPQLRFALAFR 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5881 KANpSWSNAVFISPYNSQNyVAKRVLGLQTQTVDSAQGSEYDFVIYSQTAETAHSVNVNRFNVAITRAKKGILCVM-SSM 5959
Cdd:cd21721   254 QYK-RWDNVTFISPYNAMN-VKAAMAGFSTQTVDSSQGSEYDYVIFCVTTDSAHALNMSRLNVALTRAKIGILVVFrQAN 331
                         330
                  ....*....|.
gi 381354069 5960 QLFESLNFTTL 5970
Cdd:cd21721   332 ELYNSLQFESI 342
CoV_PLPro cd21688
Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease ...
1607-1904 1.47e-119

Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409647  Cd Length: 299  Bit Score: 381.83  E-value: 1.47e-119
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1607 NKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSaIHKGKVFFQYSglsEADLVAVKDAFGFDEP-QLL 1685
Cdd:cd21688     1 KTKKVLVTVDGVNFRTIVVTTGDTYGQQLGPVYLDGADVTKGKPD-NHEGETFFVLP---STPDKAALEYYGFLDPsFLG 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1686 KYYNMLGMCKWpVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNEP 1765
Cdd:cd21688    77 RYLSTLAHKWK-VKVVDGLRSLKWSDNNCYVSAVILALQQLKIKFKAPALQEAWNKFLGGDPARFVALIYASGNKTVGEP 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1766 SDSTDFIRVVLREADLSGATCDLEFICK-CGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCG-SKLVHCTQFNVPFL 1843
Cdd:cd21688   156 GDVRETLTHLLQHADLSSATRVLRVVCKhCGIKTTTLTGVEAVMYVGALSYDDLKTGVSIPCPCGgEWTVQVIQQESPFL 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069 1844 ICS-YTPEGRKLP-DDVVAANIFTGGS-LGHYTHVKCKPKYQLYDACNVSKVSEAKGNFTDCLY 1904
Cdd:cd21688   236 LLSaAPPAEYKLQqDTFVAANVFTGNTnVGHYTHVTAKELLQKFDGAKVTKTSEDKGPVTDVLY 299
ps-ssRNAv_Nidovirales_RdRp cd23168
catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the order Nidovirales of ...
4987-5341 3.12e-116

catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the order Nidovirales of positive-sense single-stranded RNA [(+)ssRNA] viruses; This family contains the catalytic core domain of RdRP of Nidovirales, an order of enveloped, (+)ssRNA viruses which infect vertebrates and invertebrates. Host organisms include mammals, birds, reptiles, amphibians, fish, arthropods, mollusks, and helminths. The order Nidovirales currently comprises 88 formally recognized virus species of (+)ssRNA viruses which are classified into nine virus families: Abyssoviridae, Arteriviridae, Coronaviridae, Euroniviridae, Medioniviridae, Mesoniviridae, Mononiviridae, Roniviridae, and Tobaniviridae. Based on the genome size, the members of the order Nidovirales can be divided into two groups, large and small nidoviruses. The genomes of the large nidoviruses are well over 25 kb in length with size differences in the 5 kb range. Planarian secretory cell nidovirus (PSCNV), only member of the Mononiviridae family, has the largest known non-segmented RNA genome of 41.1 kb; its host is the planarian flatworm. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438018 [Multi-domain]  Cd Length: 310  Bit Score: 372.46  E-value: 3.12e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4987 TLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRRLIKDV-DSPVLMG 5065
Cdd:cd23168     1 TLTQVNPKYAIQKKKRARTILGVSIISTDVGRQLHQAVLAAIVNTRSANIVIIGTKFYGGWHKMLRYLYPGViEDPVLMG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5066 WDYPKCDRAMPNILRIVSSLVLARKHDSCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNI 5145
Cdd:cd23168    81 WDYPKCDRSVPNMLRYLANLLLASLYDNCCNLSEIVHLLINECAQVLYDYVVYGGNLYRKPGGVSSGDSTTAISNSIYNY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5146 CQAVSANvcslmacnghkiedlsirelqkrlysnvyradhvdpafvseyyeflnkhFSMMILSDDGVVCYNSEFASKGYI 5225
Cdd:cd23168   161 FQTFIAN-------------------------------------------------VRLAILSDDGVACINPDLIDLGDV 191
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5226 ANISAFQQVLYYQNNVFMSEAKCWVETDIEKGPHEFCSQHTMLVKMdgDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLI 5305
Cdd:cd23168   192 ASVSFFLASYYYTNNKKKYSSTCWVEPHEFCSPHEFKSDDKYQDRV--ERVYLPIPDPSRMLSACLLVDTRTKTDILLMI 269
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 381354069 5306 ERFVSLAIDAYPLVHHE-----NPEYQNVFRVYLEYIKKLY 5341
Cdd:cd23168   270 ERLISILIDAYPLTFHTktlpvNIEYAPLILLLLDYIKKLS 310
alphaCoV_Nsp5_Mpro cd21665
alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3335-3629 5.95e-116

alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in alphacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394886  Cd Length: 296  Bit Score: 371.24  E-value: 5.95e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3335 KMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVIcSSDDMTDPDYPNLLCRVTSSDFCVMSDRMSLTVMSYQMQG 3414
Cdd:cd21665     3 KMAQPSGVVEKCVVRVSYGNMVLNGLWLGDTVYCPRHVI-ASDTTSTIDYDHEYSLMRLHNFSISVGNVFLGVVGVTMRG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3415 SLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQ 3494
Cdd:cd21665    82 ALLVIKVNQNNVNTPKYTFRTLKPGDSFNILACYDGVPSGVYGVNMRTNYTIKGSFINGACGSPGYNLNNGTVEFCYMHQ 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3495 LELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCNWFVQSDSCSLEEFNVWAMTNGFSSIKA 3574
Cdd:cd21665   162 LELGSGCHVGSDLDGVMYGGYEDQPTLQVEGANVLVTENVVAFLYAALLNGCNWWLSSDRVTVEAFNEWAVANGFTTVSS 241
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 3575 DLV*DALASMTGVTVEQVLAAIKRLYSGFQGKQILGSCVLEDELTPSDVYQQLAG 3629
Cdd:cd21665   242 TDCFSILAAKTGVDVERLLAAIQRLSKGFGGKTILGYTSLTDEFTLSEVIKQMYG 296
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4013-4206 3.91e-115

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 364.49  E-value: 3.91e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4013 SEFVNMASFVEYELAKKNLDEAK*SGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4092
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4093 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4172
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 381354069 4173 DGAVKQLNEID---VNSIWPLVIAANRHNEvSTVVLQ 4206
Cdd:cd21831   161 DGKIVQLSDITedsENLAWPLVVTATRANS-SAVKLQ 196
TM_Y_SARS-CoV-like_Nsp3_C cd21717
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe ...
2328-2834 5.97e-109

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe acute respiratory syndrome-related coronavirus and betacoronavirus in the B lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the sarbecovirus subgenus (B lineage), including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and the related murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409665  Cd Length: 531  Bit Score: 360.84  E-value: 5.97e-109
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2328 TTFGVSTICDFYQVtdlGYRSS-------FCNGSMVCELCFSGFDMLDSYDAINVVQHVVDrrvSFDY-ISILKLVVELI 2399
Cdd:cd21717    24 SNLGAPSYCDGVRE---SYLNSsnvttmdFCEGSFPCSVCLSGLDSLDSYPALETIQVTIS---SYKLdLTILGLAAEWF 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2400 IGYSLYTVCFYPLFVLIGMQLLTTWLPEFFMLETmhWSARLFVFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSN 2479
Cdd:cd21717    98 LAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNS--WLMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTS 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2480 PGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDS 2559
Cdd:cd21717   176 STCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFCKTHNWNCLNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQ 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2560 AYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVENEA--DKAGFLGAAVFYAQSLY 2637
Cdd:cd21717   256 SSYVVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSkcDESAAKSASVYYSQLMC 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2638 RPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKcAIDSD 2717
Cdd:cd21717   336 QPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQG-VVDTD 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2718 VETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADL 2797
Cdd:cd21717   415 VDTKDVIECLKLSHHSDLEVTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQL 494
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 381354069 2798 QHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21717   495 RKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKG 531
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
4010-4205 6.78e-104

Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has N- and C-terminal D/ExD/E conserved motifs, being the N-terminal motif critical for RNA polymerase activity as these residues are part of the Mg2-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 332.20  E-value: 6.78e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4010 ALQSEFVNMASFVEYELAKKNLDEAK*SGSAnQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4089
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4090 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSI 4169
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 381354069  4170 QDADGAVKQLNEIDVNS----IWPLVIAANRHNEVstVVL 4205
Cdd:pfam08717  160 KDADGKIVHLKEITMDNspnlAWPLIVTAERANSA--VKL 197
capping_2-OMTase_alphaCoV_Nsp16 cd23527
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of alphacoronavirus, also called ...
6919-7109 2.72e-101

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of alphacoronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. The alphacoronavirus (alphaCoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467739  Cd Length: 193  Bit Score: 324.75  E-value: 2.72e-101
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6919 NVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDC 6998
Cdd:cd23527     1 NVVKYTQLCQYLNSTTMCVPHNMRVLHLGAGSDKGVAPGTAVLRRWLPLDAIIVDNDVNDYVSDADFSITGDCSTLYLED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6999 QWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSS 7078
Cdd:cd23527    81 KFDLVISDMYDGRTKSCDGENVSKDGFFTYINGVITEKLALGGTVAIKITEYSWNKKLYELIQKFEYWTMFCTSVNTSSS 160
                         170       180       190
                  ....*....|....*....|....*....|...
gi 381354069 7079 EGFLIGINWLN--RTRTEIDGKTMHANYLFWRN 7109
Cdd:cd23527   161 EAFLIGVNYLGdfSNKPIIDGNTMHANYIFWRN 193
capping_2-OMTase_CoV_Nsp16 cd23526
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of Coronavirus, also called non-structural ...
6919-7109 8.22e-99

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of Coronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. Coronavirus (CoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467738  Cd Length: 191  Bit Score: 317.48  E-value: 8.22e-99
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6919 NVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDC 6998
Cdd:cd23526     1 NVAKYTQLCQYLNTTTLAVPHNMRVLHFGAGSDKGVAPGTSVLRQWLPTGTILVDNDLNDFVSDADSTIVGDCATYHTEH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6999 QWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSS 7078
Cdd:cd23526    81 KFDLIISDMYDCKTKNVTGENDSKEGFFTYLCRFIKERLALGGSIAVKITEHSWSKDLYELAGHFAWWTMFCTNVNASSS 160
                         170       180       190
                  ....*....|....*....|....*....|.
gi 381354069 7079 EGFLIGINWLNRTRTEIDGKTMHANYLFWRN 7109
Cdd:cd23526   161 EAFLIGINYLGDPKENIDGYTMHANYIFWRN 191
TM_Y_HKU9-like_Nsp3_C cd21715
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus ...
2289-2834 9.81e-97

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus bat coronavirus HKU9 and betacoronavirus in the D lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409663  Cd Length: 526  Bit Score: 325.28  E-value: 9.81e-97
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2289 TVFLLWFNFLYANVILSDFYLPNIgslptfvgqiVAWFKTTFGVSTICDFYQVTDLGyrssfcnGSMVCELCFSGFDMLD 2368
Cdd:cd21715     1 YLWFVWTCLAICGVWLSEPYAPSL----------LTRFKHFLGIVMPCDYVLVNETG-------TGWLHHLCMAGMDGLD 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2369 sYDAINVVQHvvdRRVS-FDYISILkLVVELIIGYSLYTVCFYPLFVLIGMQLLTTWLPefFMLETmHWSARLFVFVANM 2447
Cdd:cd21715    64 -YPALRMQQH---RYGSpYDYTYIL-MLLEAFCAYLLYTPALPIVGILAVLHLLVLYLP--IPLGN-SWLVVFLYYIIRL 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2448 LPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLN 2527
Cdd:cd21715   136 VPFTSMLRMYIVIAFLWLCYKGFVHVRYGCNNVACLMCYKKNVAKRIECSTVVNGVKRMFYVNANGGTYFCTKHNWNCVS 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2528 CDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLLHSKVKG 2607
Cdd:cd21715   216 CDTYTVDSTFISRQVALDLSAQFKRPINHTDEAYYEVTSVEVRNGYVYCYFDSDGQRSYERFPMDAFTNVSKLHYSELKG 295
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2608 VPETHVVVV---ENEADKAGFLGAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKSLTSFV 2684
Cdd:cd21715   296 AAPAFNVLVfdaTNRIEENAVKTAAIYYAQLACKPILLVDKRMVGVVGDDATIAKAMFEAYAQNYLLKYSIAMDKVKHLY 375
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2685 NAAHNSLKEGVQLEQVMDTFVGCARRKCA-IDSDVETRSITKSVMSAVNAGVDFTDESCNNLVPTYVKSDTIVAADLGVL 2763
Cdd:cd21715   376 STALQQIASGMTVESVLKVFVGSTRAEAKdLESDVDTNDLVSCIRLCHQEGWDWTTDSWNNLVPTYIKQDTLSTLEVGQF 455
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 2764 IQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21715   456 MTANARYVNANVAKGAAVNLVWRYADFIKLSESMRRQLRVAARKTGLNLLVTTSSLKADVPCVVTPFKIVG 526
TM_Y_MERS-CoV-like_Nsp3_C cd21716
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle ...
2284-2831 9.47e-94

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409664  Cd Length: 566  Bit Score: 318.29  E-value: 9.47e-94
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2284 FFLVATVFLL---WFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFkttfGVSTICDFYQVTdlgYR------SSFC-NG 2353
Cdd:cd21716     5 LMLCTTGLLLssvYHLYVFNQVLSSDVMLEDATGLKAFYKEVRSYL----GISSACDGLASA---YRansfdvPDFCaNR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2354 SMVCELCFSGFDMLDSYDAINVVQ-HVVDRRVSFDYisiLKLVVELIIGYSLYTVCFYPLFVLIGMQllttwlpeFFMLE 2432
Cdd:cd21716    78 SALCNWCLIGQDSITHYSALKMVQtHLSHYVLNIDW---LWFALELLLAYVLYTSAFNWLLLACTLQ--------YFFAQ 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2433 TMH---WSARLFV-----FVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVGGSL 2504
Cdd:cd21716   147 TSAfvdWRSYNYVvsgifLLFTHIPLDGLVRIYNVLACLWFLRKFYNHVINGCKDTACLLCYKRNRLTRVEASTVVCGGK 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2505 RYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYERDGQR 2584
Cdd:cd21716   227 RTFYITANGGTSFCRRHNWNCVDCDTAGVGNTFICEEVANDLTTSLRRLVKPTDRSHYYVDSVEVKDTVVQLNYRRDGQS 306
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2585 VYDDVSASLFVDMNGLLHSKV----KGVPEtHVVVVENEADKAG---FLGAAVFYAQSLYRPMLMVEKKLITTANTGLSV 2657
Cdd:cd21716   307 CYERFPLCYFTNLDKLKFKEVckttTGIPE-HNFIIYDSSDRGQenlARSACVYYSQVLCKPILLVDSNLVTSVGDSSEI 385
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2658 SQTMFDLYVDSLLNVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCARRKCAIDSDVETRSITKSVMSAVNAGVDF 2737
Cdd:cd21716   386 AIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNFQSVLKTFIDAARGPAGVESDVETNEIVDAVQYAHKHDIQL 465
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2738 TDESCNNLVPTYVKSDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYN 2817
Cdd:cd21716   466 TTESYNNYVPSYVKPDSVATSDLGSLIDCNAASVNQTSMRNANGACIWNAAAYMKLSDSLKRQIRIACRKCNLNFRLTTS 545
                         570
                  ....*....|....
gi 381354069 2818 KQEANVPILTTPFS 2831
Cdd:cd21716   546 KLRANDNILSVKFS 559
gammaCoV_Nsp5_Mpro cd21667
gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3332-3633 2.09e-90

gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in gammacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394888  Cd Length: 306  Bit Score: 298.24  E-value: 2.09e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3332 GIVKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVIcssDDMTDPDYPNLLCRVTSSDFCVMS-DRMSLTVMSY 3410
Cdd:cd21667     1 GFKKLVSPSSAVEKCIVSVSYRGNNLNGLWLGDSIYCPRHVL---GKFSGDQWQDVLNLANNHEFEVVTqNGVTLNVVSR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3411 QMQGSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDSVRFV 3490
Cdd:cd21667    78 RLKGAVLILQTAVANANTPKYKFVKANCGDSFTIACSYGGTVVGLYPVTMRSNGTIRASFLAGACGSVGFNIEKGVVNFF 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3491 YMHQLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILNRCN---WF---VQSDSCSLEEFNVWA 3564
Cdd:cd21667   158 YMHHLELPNALHTGTDLMGEFYGGYVDEEVAQRVPPDNLVTNNIVAWLYAAIISVKEssfSLpkwLESTTVSVEDYNKWA 237
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 381354069 3565 MTNGFSSIKADLV*DALASMTGVTVEQVLAAIKRLYSGFQGKQILGSCVLEDELTPSDVYQQLAGVKLQ 3633
Cdd:cd21667   238 SDNGFTPFSTSTAITKLSAITGVDVCKLLRTIMVKSAQWGSDPILGQYNFEDELTPESVFNQVGGVRLQ 306
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4317-4446 3.07e-85

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.09  E-value: 3.07e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4317 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4396
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 381354069 4397 VEHPDVDGLCKLRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
alpha_betaCoV_Nsp2 cd21511
alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ...
250-752 5.70e-84

alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This alpha- and betacoronavirus family includes alphacoronavirus human coronavirus 229E (HCoV-229E) Nsp2, betacoronavirus Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle. This family may be distantly related to the gammacoronavirus Avian infectious bronchitis virus (IBV) Nsp2; IBV Nsp2 is a weak protein kinase R (PKR) antagonist, which may suggest that it plays a role in interfering with intracellular immunity.


Pssm-ID: 439197  Cd Length: 399  Bit Score: 283.67  E-value: 5.70e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  250 ILFVDQYGCDYTGCLAKGLEDYGDLTLSEM---------KELFPVWRESLDNEVVVAWHVDRdPRAVMRLQTLATLRSI- 319
Cdd:cd21511     2 VTYVDQYGCGPDGKPVECIKDLLDVAKKGSctlseqldgIELKNGVYDLRDHEVVIAWYVER-KDVPYEKQTIFTIKSAk 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  320 --DYVGqptedvvdgdvvvRAPAHLLAADALVK-RLPRLVETMLYTDSSVTE-FCYKTKLCDCGFITQFGYVDCCgdTCD 395
Cdd:cd21511    81 fgTFVG-------------EVPAHVFPLNSIVKeIQPRVKKKKKVTLSGVIRsFYSKASPNECNPITLSALVKCT--HCD 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  396 FRGWVPGNMLDGFPCPgCSKSYMPWELEAQSSGVIPEGGVLFTQSTDTVNREAFKLYGHAVVPFGSAVYWSPYP----GM 471
Cdd:cd21511   146 EKSWQTGDFVDGFTCE-CGAEYLNWKLDAQSSGVLPPGAVVKTQCPACVNRETFLRGGGRIVYFGGAVYSYVGCingvAY 224
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  472 WLPVVWSSVKSysglTYTGVVGckaivqetdaicrslymdyvqhkcgnldqratlglddvyhrqllvnrgdyslllenvd 551
Cdd:cd21511   225 WVPRASSSVGC----FHTGVVG---------------------------------------------------------- 242
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  552 lfvkrraefackfatcgdgfvpllldGLVPRSYYLIKSGQAYTSMMVnfsheviDMCMDMALLFMHDVKVATKYVKK--- 628
Cdd:cd21511   243 --------------------------KIVPGAWGLGASAQKLTPLTT-------GAAVVFVLIFARTLFAAVGSVPQlqa 289
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  629 -------------FTGKLAVRFKALGVAvvrkitewfdlAVDIAASAAGWLCYQLVN---GLFAVANGVItfvqeapelv 692
Cdd:cd21511   290 saptildgivnasDRLVDAMQFSADLVV-----------ATTTSAGAAGYVVAGLVDllkPILEWVLSKI---------- 348
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  693 knfvakfraffkvlidsmsvsilsgltvvktasNRVCLAGSKVYEVVQKSLSAYVLPVGC 752
Cdd:cd21511   349 ---------------------------------GQVCYAGCDVYERVMAFLNVVVKAAGK 375
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3661-3920 1.61e-83

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 276.44  E-value: 1.61e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3661 FVKWTMFMYVTTHML-GVTLCALCFVSFAMLLIKHKHLYLTMYIMPVLCTLFYTNYL*VYK-QSFRGLAYAWlshfvpAV 3738
Cdd:pfam19213    1 LLMYTALYWLPPNLItPVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3739 DYTYMDEVLYGVVLLIAMVFV--TMRSINHDvFSIMFLVGRLVSLVSMWYFGANLEEE-----VLLFLTSLFGTYTWTTM 3811
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVRSK-YSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3812 LSLATAKVIAKWlaVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRP 3891
Cdd:pfam19213  154 IAYKLAKYIVVY--VPPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 381354069  3892 PKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
NendoU_cv_Nsp15-like cd21161
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural ...
6724-6874 4.16e-81

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural Protein 15 (Nsp15) and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Except for turkey coronavirus (TCoV) Nsp15, Mn2+ is generally essential for the catalytic activity of coronavirus Nsp15. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and murine hepatitis virus (MHV) form a functional hexamer while Porcine DeltaCoronavirus (PDCoV) Nsp15 has been shown to exist as a dimer and a monomer in solution. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


Pssm-ID: 439158  Cd Length: 151  Bit Score: 264.89  E-value: 4.16e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6724 FTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSYDS 6803
Cdd:cd21161     1 FTQGRSLEDFKPRSQMERDFLSMDQDVFIQKYGLEDLGFEHIVYGDFSKPTIGGLHLLIGLVRLKKEGKLYVEEFHNSDS 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6804 SIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:cd21161    81 TVQNYFVTDANNGSSKQVCTVVDLLLDDFVDILKSQDLSVVSKVVTVSIDYKPIRFMLWCKDGKVKTFYPQ 151
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2107-2231 3.04e-79

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 258.77  E-value: 3.04e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2107 VTEVHQEPSVSAVDVKEVKLNGVKKPVKVEDSVVVNDPTSDTKVVKSLSIVDVYDMFLTGCKYVVWTANELSRLVNSPTV 2186
Cdd:cd21812     1 GGDVSQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 381354069 2187 REYVKWGMGKIVNSTKLLLLRDERQEFVAPKVVKAKAIACYGAVK 2231
Cdd:cd21812    81 REYVKFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1941-2059 2.87e-78

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 255.45  E-value: 2.87e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1941 GKYYTKPIIKAQFRTFEK*DGVYTNFKL*GHSIAEKL*AKLGFDCDSPFVEYKITEWPTATGDV*LASDDLYVSRYLSGC 2020
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 381354069 2021 ITFGKPVVWLGHEEASLKSLTYFNRPSVVCENKFNVLPV 2059
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKFDVLKV 119
capping_2-OMTase_gammaCoV_Nsp16 cd23529
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of gammacoronavirus, also called ...
6919-7109 2.27e-77

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of gammacoronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. The gammacoronavirus (gammaCoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467741  Cd Length: 196  Bit Score: 256.34  E-value: 2.27e-77
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6919 NVAKYTQLCQYLNTTTIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDTVASYYGNCITLPFDC 6998
Cdd:cd23529     1 NVAKYTQLCQYLSKTTMCVPHNMRVMHFGAGSDKGVAPGSTVLKQWLPEGTLLVDNDIVDYVSDAHVSVLSDCNKYKTEH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6999 QWDLIISDMY-DPLTKNIGEYNVSKDG---FFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVN 7074
Cdd:cd23529    81 KFDLVISDMYtDNDSKRKHEGVIANNGnddVFIYLSNFLRNNLALGGSFAVKVTETSWHESLYDIAQDCAWWTMFCTAVN 160
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 381354069 7075 ASSSEGFLIGINWLNRT-RTEIDGKTMHANYLFWRN 7109
Cdd:cd23529   161 ASSSEAFLVGVNYLGASeKVKVSGKTLHANYIFWRN 196
CoV_NSP15_C pfam19215
Coronavirus replicase NSP15, uridylate-specific endoribonuclease; This entry represents the ...
6722-6874 2.28e-74

Coronavirus replicase NSP15, uridylate-specific endoribonuclease; This entry represents the C-terminal domain of coronavirus non-structural protein 15 (NSP15 or nsp15). NSP15 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. This domain exhibits endoribonuclease activity designated EndoU, highly conserved in all known CoVs and is part of the replicase-transcriptase complex that plays important roles in virus replication and transcription. NSP15 is a Uridylate-specific endoribonuclease that cleaves the 5'-polyuridines from negative-sense viral RNA, termed PUN RNA either upstream or downstream of uridylates, at GUU or GU to produce molecules with 2',3'-cyclic phosphate ends. PUN RNA is a CoV MDA5-dependent pathogen-associated molecular pattern (PAMP).


Pssm-ID: 465999  Cd Length: 155  Bit Score: 246.09  E-value: 2.28e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6722 TIFTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSY 6801
Cdd:pfam19215    2 TLFTQGRTLEDFVPRSTMEKDFLNMDQQQFIQKYGLEDLGFEHIVYGDFSKTTIGGLHLLISLVRLTKMGILKVEEFVPN 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069  6802 -DSSIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:pfam19215   82 dDSTVKNCSVTYANDGSSKAVCTVLDLLLDDFVDILKSLDLSVVSKVVTVNIDFQPVRFMLWCKDGKVQTFYPQ 155
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4328-4446 6.10e-73

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2-O-MTase activities. When binding to the N-terminal of NSP14, nsp10 allows the ExoN active site to adopt a stably closed conformation and is an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other Coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 240.42  E-value: 6.10e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4328 AILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRVEHPDVDGLCK 4407
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 381354069  4408 LRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
deltaCoV_Nsp5_Mpro cd21668
deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3330-3626 2.15e-72

deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in deltacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394889  Cd Length: 302  Bit Score: 246.26  E-value: 2.15e-72
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3330 QSGIVKMVSPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVICSSDDMTDPDYPNLL-CRvtssDFCVMS--DRMSLT 3406
Cdd:cd21668     1 QAGIKILLHPSGVVERCMVSVTYNGSTLNGIWLHNVVYCPRHVIGKYTGSQWQDMVSIAdCR----DFVIFCptQGIQLT 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3407 VMSYQMQGSLLVLTVTLQNPNTPKYSFGVVKPGETFTVLAAYNGRPQGAFHVVMRSSHTIKGSFLCGSCGSVGYVLTGDS 3486
Cdd:cd21668    77 VQSVKMVGAVLQLTVHTKNLHTPDYEFERATPGSSMTIACAYDGIVRNVYHVVLQTNNLIYASFLNGACGSVGYTLKGKT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3487 VRFVYMHQLELSTGCHTGTDLSGNFYGPYRDAQVVQLPVQDYTQTVNVVAWLYAAILN---RCNWFVQSDsCSLEEFNVW 3563
Cdd:cd21668   157 LLLHYMHHLEFNNKTHGGTDLHGHFYGPYVDEEVAQHQTAFQYYTDNVVAQIYAHLLTidaKPKWLASQE-ISVEDFNEW 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 3564 AMTNGFSSIKADL----V*DALASMTGVTVEQVLAAIKRLYSGFQGKQILGSCVLEDELTPSDVYQQ 3626
Cdd:cd21668   236 AANNSFANFPCESsnmaYLEGLAQTTKVSVGRVLNTIIQLTLNRGGALIMGKPDFECDWTPEMVYNQ 302
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4207-4316 3.75e-69

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 229.21  E-value: 3.75e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVVNSGSDMN-CNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 381354069 4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
CoV_Nsp8 cd21816
Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) ...
4013-4200 1.73e-66

Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409256  Cd Length: 194  Bit Score: 225.10  E-value: 1.73e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4013 SEFVNMASFVEYELAKKNLDEAK*SGsANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4092
Cdd:cd21816     1 SEFSHLPSYAAYATAQAAYEQAVKNG-DSPQELKKLTKALNIAKSEFDRDAAVQKKLEKMADQAMTSMYKEARAEDRRAK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4093 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4172
Cdd:cd21816    80 ITSAMHALLFSMLKKLDSDAVNNIFEQARDGVVPLNIIPLTTANKLMVVIPDYETYKKTVDGNTFTYAGALWSIVTVVDA 159
                         170       180       190
                  ....*....|....*....|....*....|..
gi 381354069 4173 DGAVKQLNEIDV----NSIWPLVIAANRHNEV 4200
Cdd:cd21816   160 DGKIVHLSEINMdnspNIAWPLIVTCLRAGAV 191
alphaCoV_Nsp8 cd21830
alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4013-4206 5.37e-66

alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine epidemic diarrhea coronavirus (PEDV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. FCoV Nsp8 forms a 1:2 heterotrimer with Nsp7; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409257  Cd Length: 195  Bit Score: 223.77  E-value: 5.37e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4013 SEFVNMASFVEYELAKKNLDEAK*SGSaNQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4092
Cdd:cd21830     4 STFANMPSFIAYETARQDYEDAVKNGS-SPQLIKQLKKAMNIAKSEFDREASVQRKLDRMAEQAAAQMYKEARAVNRKSK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4093 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDA 4172
Cdd:cd21830    83 VISAMHSLLFGMLRRLDMSSVDTILNLAKDGVVPLSIIPAASATRLVVVVPDLESFSKIRRDGCVHYAGVVWTIVDIKDN 162
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 381354069 4173 DGAVKQLNEIDV----NSIWPLVIAANRhnevsTVVLQ 4206
Cdd:cd21830   163 DGKVVHLKEVTAaneeSLAWPLHLNCER-----IVKLQ 195
CoV_Nsp10 cd21872
coronavirus non-structural protein 10; This model represents the non-structural protein 10 ...
4317-4445 3.62e-61

coronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16, and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409325  Cd Length: 131  Bit Score: 207.32  E-value: 3.62e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4317 AGTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSR 4396
Cdd:cd21872     1 AGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVKPEANMDQESFGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 381354069 4397 VEHPDVDGLCKLRGKFVQVPL-GIKDPVLYVLTHDVCQVCGFWRDGSCSC 4445
Cdd:cd21872    81 IDHPNPDGFCDYKGKFVQIPTtCANDPVGFTLRNTVCTVCQMWKGYGCSC 130
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1945-2059 1.25e-59

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, and described as NSP3e domain. NSP3 is part of Orf1a polyproteins in SARS-CoV. It is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands and a group of residues form a positively charged patch on the protein surface as the binding site responsible for binding affinity for nucleic acids. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. A positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 202.78  E-value: 1.25e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1945 TKPIIKAQFRTFEK*DGVYTNFKL--*GHSIAEKL*AKLGFDCDSPFV-EYKITEWPTATGDV*LASDDLYVSRYLSGCI 2021
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLtcSGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 381354069  2022 TFGKPVVWLGHEEASLKSLTYFNRPSVVC-ENKFNVLPV 2059
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVClECKFNTKPV 129
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4207-4316 7.39e-58

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 196.93  E-value: 7.39e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4207 NNELMPQKLRTQVVNSGS-DMNCNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVtDAHCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 381354069  4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
M_alpha_beta_cv_Nsp15-like cd21167
middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6567-6689 8.22e-57

middle domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain and a C-terminal catalytic (NendoU) domain. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. This middle domain harbors residues involved in hexamer formation and in trimer stability. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of the other coronaviruses; it has been shown to exist as a dimer and a monomer in solution.


Pssm-ID: 439161  Cd Length: 127  Bit Score: 194.47  E-value: 8.22e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6567 PHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKN 6646
Cdd:cd21167     1 PVPELKLLRNLGVDICYKFVLWDYEREAPFTSSTIGVCKYTDIDKKSDLNVLFDGRDPGSLERFRSARNAVLISTTKVKG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 381354069 6647 LSMIKGPQRADLNGVVVEKVGDSDVEFWFAMRSDGDDVIFSRT 6689
Cdd:cd21167    81 LKPIKGPNYASLNGVVVESVDKKKVKFYYYVRKDGEFVDLTDT 123
NendoU_nv cd21158
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural ...
6724-6874 1.10e-56

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural protein 15 (Nsp15), arterivirus Nsp11, torovirus endoribonuclease, and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. This family also includes torovirus NendoUs. Except for turkey coronavirus (TCoV) Nsp15, Mn2+ is generally essential for the catalytic activity of coronavirus Nsp15. Mn2+ is dispensable, and to some extent inhibits the activity of arterivirus (Porcine Reproductive and Respiratory Syndrome virus) PRRSV Nsp11. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and murine hepatitis virus (MHV) form a functional hexamer while Porcine DeltaCoronavirus (PDCoV) Nsp15 has been shown to exist as a dimer and monomer in solution. Nsp11 from the arterivirus PRRSV is a dimer. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


Pssm-ID: 439157  Cd Length: 151  Bit Score: 195.17  E-value: 1.10e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6724 FTQSRFLSSFAPRSEMEKDFMDLDEDVFVAKYSLQDYAFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSYDS 6803
Cdd:cd21158     1 FTQGRNLQEFLPRSDMERDFLPVDMDVFIEKYGLEIYAFEHVVYGDFSHTTLGGLHLVISLYKRFKEGPLPREEFIPNDS 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6804 SIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLNLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYPR 6874
Cdd:cd21158    81 TVKNYGVTSPGTKASKAVCTLIDLLLDDFVEILKSQDLEVVSKVVKVMIDFKEVRFMLWCKDGDVQTFYPQ 151
TM_Y_CoV_Nsp3_C cd21686
C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; ...
2349-2829 1.53e-56

C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409657  Cd Length: 476  Bit Score: 206.66  E-value: 1.53e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2349 SFCNGSMVCELCFSGFDMLDSYDAINVVQHVVDRRVS-------FDYISILKLVVELIIGYslYTVCFYPLFVLIGMQLL 2421
Cdd:cd21686    54 SYCAGDLVCQVCLDGQDSLHLYPHLRVVQQPLQTTDYtvyalslILYLANMTLFMGTFIVT--FFVNFYGVGIPFYGWLL 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2422 TTWLPEFFMLEtmhwsarlfvfvanmlpaftllrfyIVVTAMYKVYCLCRHVMYGCSNPGCLFCYKRNRSVRVKCSTVVG 2501
Cdd:cd21686   132 IDVPQSAFMMT-------------------------FSVFFFYYVLKFFVHVTHGCKIPTCMVCAKLARPPRVEVETVVQ 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2502 GSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKqvgCSMRLFYERD 2581
Cdd:cd21686   187 GRKYSFYVYTNGGFTFCKEHNFYCKNCDLYGPGCTFISDEVAEELSRATKLSVKPTAPAFLLVDDVE---VQNDVVFARA 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2582 GQRVYDDVSASLFVDMNGLLHSKVKGVPETHVVVVENeadkagflgAAVFYAQSLYRPMLMVEKKLITTANTGLSVSQtm 2661
Cdd:cd21686   264 KYNQNAHVSLSKFSDIPDFIIAANFGSNCEQLSTAKN---------AAVYYSQDLCKPILILDQALSRPIDNYQEVAS-- 332
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2662 fdlyvdSLLNVLDVDRksLTSFVNAAHNSlKEGVQlEQVMDTFVgCArrkcaidsdvetrsitksVMSAVNAGVDFTDES 2741
Cdd:cd21686   333 ------RIEKYYPVAK--IKPTGDIFTDI-KQGTD-GEASDSAI-NA------------------AVLAHQRDVEFTGDS 383
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2742 CNNLVPTYVKSDTIVAADLGVlIQNNAKHVQSNVAKAANVACIWSVDAFNQLSADLQHRLRKACSKTGLKIKLTYNKQEA 2821
Cdd:cd21686   384 FNNILPSYAKDESKLTAEDQA-MSVIAESGNANVNVKGTIPVVWLVADFIRLSEQARKYIISAAKKNGVTFALTPSTLRM 462

                  ....*...
gi 381354069 2822 NVPILTTP 2829
Cdd:cd21686   463 RGNIATQP 470
ZBD_cv_Nsp13-like cd21401
Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related ...
5382-5476 1.68e-56

Cys/His rich zinc-binding domain (CH/ZBD) of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. This coronavirus family includes Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) non-structural protein 13 (SARS-Nsp13) and belongs to helicase superfamily 1 (SF1) and to a family of nindoviral replication helicases. SARS-Nsp13 has an N-terminal CH/ZBD, a stalk domain, a 1B regulatory domain, and SF1 helicase core. The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase (RdRp). Structural studies of a stable SARS-CoV-2 RTC which included two molecules of Nsp13, the RdRp holoenzyme (Nsp7, two molecules of Nsp8, Nsp12), and an RNA template product, show that one Nsp13 CH/ZBD domain interacts with Nsp12, and both Nsp13-CH/ZBD domains interact with the Nsp8. This stable SARS-CoV-2 RTC suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching.


Pssm-ID: 439168  Cd Length: 95  Bit Score: 192.22  E-value: 1.68e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5382 SVGACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNSPGCDVNDVTKLYLGGMSYYCEDHKPQYS 5461
Cdd:cd21401     1 AVGLCVVCNSQTVLRCGDCIRRPFLCCKCCYDHVMSTSHKFILSINPYVCNAPGCGVSDVTKLYLGGMSYYCEDHKPSLS 80
                          90
                  ....*....|....*
gi 381354069 5462 FKLVMNGMVFGLYKQ 5476
Cdd:cd21401    81 FPLCANGFVFGLYKN 95
capping_2-OMTase_deltaCoV_Nsp16 cd23530
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of deltacoronavirus, also called ...
6919-7109 1.29e-50

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of deltacoronavirus, also called non-structural protein 16; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. The deltacoronavirus (deltaCoV) 2'OMTase activity is located in the non-structural protein 16 (Nsp16). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Nsp16 requires Nsp10 to bind both m7GpppA-RNA substrate and SAM cofactor; the structure suggests that Nsp10 may stabilize the SAM-binding pocket and extend the substrate RNA-binding groove of Nsp16.


Pssm-ID: 467742  Cd Length: 183  Bit Score: 179.21  E-value: 1.29e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6919 NVAKYTQLCQYLNTT-TIAVPANMRVLHLGAGSDKGVAPGSAVLRQWLPAGSILVDNDVNPFVSDT----VASYYgncIT 6993
Cdd:cd23530     1 NVIKYRQLFNYIVKKdRLAVPHNMTVLHLGAASAEGTAPGTSVIKQMFPEGTVIIDLDIREFTSDAnqiiVTDYR---TY 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6994 LPFDcQWDLIISDMYdpltknigeyNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNV 7073
Cdd:cd23530    78 MPPH-HVDAIFSDLY----------SCDDIHFFDNLIRIVKERLALGGSIFVKITEHSYSPELYSLAGWFDDYQLFCTAV 146
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 381354069 7074 NASSSEGFLIGINWLNRTRTEIDGKTMHANYLFWRN 7109
Cdd:cd23530   147 NASSSEAFLCCFNYLGHAKENVNGFNLHASYIKWRN 182
gammaCoV_Nsp8 cd21832
gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4010-4206 1.56e-50

gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409259  Cd Length: 210  Bit Score: 180.15  E-value: 1.56e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4010 ALQSEFVNMASFVEYELAK----KNLDEAK*SGSANQQQIKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEAR 4085
Cdd:cd21832     1 SVTQEFSHIPSYAEYERAKdlyeKVLADSK-NGGVTQQELAAYRKAANIAKSVFDRDLAVQKKLDSMAERAMTTMYKEAR 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4086 INDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWH 4165
Cdd:cd21832    80 VTDRRAKLVSSLHALLFSMLKKIDSEKLNVLFDQASSGVVPLATVPIVCSNKLTLVIPDPETWVKCVEGMHVTYSTVVWN 159
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 4166 IQSIQDADGavkqlNEIDVNSI--------------WPLVIAANR--HNEVStVVLQ 4206
Cdd:cd21832   160 IDTVIDADG-----TELHPTSTgsgltycisgdniaWPLKVNLTRngHNKVD-AVLQ 210
TM_Y_alphaCoV_Nsp3_C cd21712
C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2328-2834 8.93e-50

C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alphacoronavirus, including Porcine epidemic diarrhea virus and Human coronavirus 229E, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409660  Cd Length: 501  Bit Score: 187.45  E-value: 8.93e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2328 TTFGVSTICDFYqvTDLGYRSSF-----CNGSMVCELCFSGFDMLDSYDAINVV-QHVVDRRVSFDYISILkLVVELIIG 2401
Cdd:cd21712    32 FPPLNSSLCSGY--VDGYANSSFvksevCGNSLLCKACLAGYDELSDFPHLQVVwDHVSDPLFSNVLPLFY-FAFLLIFG 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2402 ySLYTVCFYPLFVLigmQLLTTWLPEFFMLETmhwsarlfVFVANMLPaFTLLRFYIVVT-AMYKVYCLCRHVMYGCSNP 2480
Cdd:cd21712   109 -NNYVRCFLLYFVA---QYINNWGVYFGYQDY--------SWFLHFVP-FDSFSDEIVVIfIVVKVLLFLKHVIFGCDKP 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2481 GCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSA 2560
Cdd:cd21712   176 SCKACSKSARLTRIPVQTIVNGSMKSFYVHANGGGKFCKKHNFFCVNCDSYGVGNTFINDEVARELSNVVKTTVQPTGPA 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2561 YYSVTEVKQVGCSMRLFYERDGQRVYDDVSASLFVDMNGLlhsKVKGVPETHVVVVENEADKAGFLGAAVFYAQSLYRPM 2640
Cdd:cd21712   256 YIEVDKVEFSNGFYYLYSGDTFWRYNFDITEKKYSCKEVL---KNCNLLDDFIVYNNNGSNVAQVKNACVYFSQLLCKPI 332
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2641 LMVEKKLITTantgLSV--SQTMFDLYVDSLLNVLDVDRKSLTSFvnaahNSLKEGVQLEQVMDTFvgcarrkcaidsdv 2718
Cdd:cd21712   333 KLVDSALLSS----LSVdfNGALHKAFVKVLKNSFNKDLSNCKTL-----EECKKALGLDVSDDEF-------------- 389
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2719 etrsitksVMSAVNA---GVDFTDESCNNLVPTYVK-SDTIVAADLGVLIQNNAKHVQSNVAKAANVACIWSVDAFNQLS 2794
Cdd:cd21712   390 --------ESAVSNAhryDVLLTDRSFNNFVTSYAKpEEKLSTHDIAVCMRAGAKVVNHNVLTKENVPIVWLAKDFSALS 461
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 381354069 2795 ADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFSLKG 2834
Cdd:cd21712   462 EEARKYIVKTTKAKGVNFLLTFNDNRMTTTLPAVSIVSKK 501
capping_2-OMTase_Nidovirales cd20762
Cap-0 specific (nucleoside-2'-O-)-methyltransferase of nidovirales; Cap-0 specific ...
6919-7109 1.15e-48

Cap-0 specific (nucleoside-2'-O-)-methyltransferase of nidovirales; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. Nidovirales viruses, which comprise a family of ss(+)RNA viruses, cap their mRNAs. For one member, coronavirus, the 2'OMTase activity is located in the non-structural protein 16 (Nsp16). For others, the 2'OMTase activity may be located in replicase polyprotein 1ab.


Pssm-ID: 467737  Cd Length: 175  Bit Score: 173.27  E-value: 1.15e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6919 NVAKYTQLCQYLNTTtIAVPANMRVLHLGAGSDkgVAPGSAVLRQWLpAGSILVDNDVNPFVSDTVASYYGNCITlPFDC 6998
Cdd:cd20762     1 NITKYVQLCSYINDH-LKVPPKPRVLHLGAAGI--YSPGDEVDYIPV-TGGIVLNHDFNDCVDHADIRPINDCNG-RFGG 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6999 QWDLIISDMYDPLTKNigeynvsKDGFFTYLCHLicdkLALGGSVAIKITEFSWNAELYSLMGKFAFWTIFCTNVNASSS 7078
Cdd:cd20762    76 KYDLIISDIYNPGTDN-------TELLLDYINNH----LALGGSIIWKTTRRSNLTNLNQIAKYFGSWTFFTTRVNASSS 144
                         170       180       190
                  ....*....|....*....|....*....|.
gi 381354069 7079 EGFLIGINWLNRTRTEIDGKTMHANYLFWRN 7109
Cdd:cd20762   145 EVFLVFKYYLLFKEQIDQEQQILHHLAAYRN 175
alphaCoV-Nsp6 cd21558
alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3619-3920 6.34e-47

alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394844  Cd Length: 293  Bit Score: 172.77  E-value: 6.34e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3619 TPSDVYQQLAGVKLQS-KRTRVIKGtccwILASTFLFCSIIAAFVKWTMFMYVTTHMLGVTLCALCFVS-FAMLLIKHKH 3696
Cdd:cd21558     2 TTSEVIKQMYGVNLQSgKVKSAFKN----VLLVGVFLFMFWSELLMYTSFFWINPGLVTPVFLVLVLVSlLLTLFLKHKM 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3697 LYLTMYIMPVLCtlfytnYL*VYKQSFRGLAYAWLS-HFVPAVDYTYMDevLYGVVLLIAMVFV----TMRSINHDVFSI 3771
Cdd:cd21558    78 LFLQTFLLPSVI------VTAFYNLAWDYYVTAVLAeYFDYHVSLMSFD--IQGVLNIFVCLFVfflhTYRFVTSGTSWF 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3772 MFLVGRLVSLVSMWYFGanleEEVLLFLTSLFG-TYTWttMLSLATAKVIAKWLAVNVLYFTDVPQIKLVLLSYLCIGYV 3850
Cdd:cd21558   150 TYVVSLVFVLYNYFYGN----DYLSLLMMVLSSiTNNW--YVGAIAYKLAYYIVYVPPSLVADFGTVKAVMLVYVALGYL 223
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3851 CCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21558   224 CCVYYGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLKAPTGVFDALLLSFKLIGIGGERTIKISTVQ 293
M_cv-Nsp15-like cd21165
middle domain of coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus ...
6567-6689 8.89e-47

middle domain of coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain and a C-terminal catalytic (NendoU) domain. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of the other coronavirus members; it has been shown to exist as a dimer and a monomer in solution.


Pssm-ID: 439160  Cd Length: 126  Bit Score: 165.91  E-value: 8.89e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6567 PHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKN 6646
Cdd:cd21165     1 STPTLKLLKNLGVDATYNFVLWDYERDTPFFNSTNGVCTYTDIDPNSGLTVLYDDRYGGSLERFLQADNAVLISTTKVKG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 381354069 6647 LSMIKGPQRADLNGVVVEKVgDSDVEFWFAMRSDGDDVIFSRT 6689
Cdd:cd21165    81 LSPPKGPNYASLNGVPVEGV-DKGVQLYVYVRKDGQFVTLTDT 122
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3234-3328 1.32e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 163.85  E-value: 1.32e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3234 GTEVRsdGTFEEMALTTFMITKVSYCKLKNSVSDVAFNRYLSLYNKYRYFSGKMDTAAYREA*CSQLAKAMETFNhNNGN 3313
Cdd:pfam16348    1 GDKFV--GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDFS-NSGN 77
                           90
                   ....*....|....*
gi 381354069  3314 DVLYQPPTASVTTSF 3328
Cdd:pfam16348   78 DVLYTPPTVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1532-1606 1.28e-45

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 160.27  E-value: 1.28e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 1532 QLDDDARVFVQANMDCLPTDWRLVNKLDVVDGVRTIKYFECPGEIFVSSQGKKFGYVQNGLFKVASVSQIRALLA 1606
Cdd:cd21524     1 QLDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
1B_cv_Nsp13-like cd21409
1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze ...
5531-5609 5.93e-45

1B domain of coronavirus SARS NSP13 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. Members of this subfamily belong to helicase superfamily 1 (SF1) and include coronavirus helicases such as Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13). SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). Structural studies of a stable RTC which included the RNA-dependent RNA polymerase holoenzyme (Nsp7, two molecules of Nsp82, Nsp12), two molecules of Nsp13 helicase accessory factor and an RNA template product suggests that the Nsp13 helicase may drive RTC backtracking, affecting proofreading and template switching. SARS-Nsp13 is a multidomain protein; its other domains include an N-terminal Cys/His rich zinc-binding domain (CH/ZBD) and a SF1 helicase core. The 1B domain is involved in nucleic acid substrate binding; the 1B domain of the related Equine arteritis virus (EAV) Nsp10 undergoes large conformational change upon substrate binding, and together with the 1A and 2A domains of the helicase core form a channel that accommodates the single stranded nucleic acids.


Pssm-ID: 394817  Cd Length: 79  Bit Score: 158.66  E-value: 5.93e-45
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 381354069 5531 ASATIREIVSDRELILSWEIGKVRPPLNKNYVFTGYHFTNNGKTVLGEYVFDKSELTNGVYYRATTTYKLSVGDVFILT 5609
Cdd:cd21409     1 ASATVKEVVGPRELVLSWEAGKTKPPLNRNYVFTGYHITKNSKTQLGEYTFEKSDYSDSVYYKSTTTYKLQPGDIFVLT 79
gammaCoV_Nsp10 cd21902
gammacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4318-4445 2.06e-43

gammacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of gammacoronaviruses, including Infectious bronchitis virus (IBV)and Bottlenose dolphin coronavirus HKU22(BdCoV HKU22). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409327  Cd Length: 134  Bit Score: 156.60  E-value: 2.06e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4318 GTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDHAGTGMAITIKPEATTNQDSYGGASVCIYCRSRV 4397
Cdd:cd21902     2 GHETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVHNGSGFAITSKPSPTPDQDSYGGASVCLYCRAHI 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 381354069 4398 EHP----DVDGLCKLRGKFVQVPLGIKDPVLYVLTHDVCQVCGFWRDGSCSC 4445
Cdd:cd21902    82 AHPggagNLDGRCQFKGSFVQIPTTEKDPVGFCLRNKVCTVCQCWIGYGCQC 133
CoV_Nsp6 cd21526
coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3623-3920 3.31e-41

coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394843  Cd Length: 287  Bit Score: 156.15  E-value: 3.31e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3623 VYQQLAGVKLQSkrTRVIKGTCCWIlaSTFLFCSIIAAF-VKWTMFMyvttHMLGVTLCALCFVSFAmllIKHKHLYLTM 3701
Cdd:cd21526     1 VYNQAPGVLLQS--VFVVKKTSTFW--SHFLFAAFTMLLaAPLVFPV----HAYVILLMCFTVVTFT---VKHKVAFLTT 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3702 YIMPVLCTL-FYTNYL*VYKQSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLIAMVFVTMRSINHDVFSIMFLvgrLVS 3780
Cdd:cd21526    70 FLLPSLITMvAIANTFWIQVVTFLRTWYDTVFVSPIAQDLYGYTVALYMLIYAGLATNYTLKTLRYRATSFLSF---LMQ 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3781 LVSMWYFGANLEEEVLLFLTSLFGTYTWTTMLSLATAKV--IAKWLaVNVLYFTDVPQIKLVLLSYLCIGYVCCCYWGVL 3858
Cdd:cd21526   147 NFLTLYTAHYAYKLLPWTESLLFTALTMLSSHSLIGAIVfwLARWM-LRVEYPIIFPDLAIRVLAYNVIGYVCTCYFGLM 225
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 3859 SLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21526   226 WLANRFFTLTLGVYDYMVSVEQFRYMMAVKLNPPKNAFEVFILNIKLLGIGGNRNIKVATVQ 287
deltaCoV_Nsp10 cd21903
deltacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4318-4446 8.36e-41

deltacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of deltacoronaviruses, including Thrush coronavirus HKU12-600 and Wigeon coronavirus HKU20. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409328  Cd Length: 128  Bit Score: 148.86  E-value: 8.36e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4318 GTATEYASNSAILSLCAFSVDPKKTYLDYIQQGGVPVTNCVKMLCDhAGTGMAITIKPEATTNQDSYGGASVCIYCRSRV 4397
Cdd:cd21903     2 GTQIEYQENASLLTYLAFAVDPKEAYLKHLADGGKPIQGCIQMIAP-LGPGFAVTTKPQPNEHQYSYGGASICLYCRAHI 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 4398 EHPDVDGLCKLRGKFVQVPLGiKDPVLYVLTHDVCQVCGFWRDGSCSCV 4446
Cdd:cd21903    81 PHPGVDGRCPYKGRFVHIDKD-KEPVSFALTHEPCNSCQRWVNYDCTCG 128
betaCoV_Nsp3_betaSM cd21727
betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model ...
2107-2231 1.19e-38

betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409626  Cd Length: 125  Bit Score: 142.67  E-value: 1.19e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2107 VTEVHQEPSVSAVDVKEVKLNGVKKPVKVEDSVVVNDPTSDTKVVKSLSIVDVYDMFLTGCK-YVVWTANELSRLVNSPT 2185
Cdd:cd21727     1 VEPVTVETSVSASQQKMVILKGLKKPFVVNGNVSVVDNDSGTKVVEELSKTDLYTMYVDGKYqVVVLKANELSRVLGLHT 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 2186 VRE--YVKWGMGKIVNSTKLLLLRDERQefvAPKVVKAKAIACYGAVK 2231
Cdd:cd21727    81 VEShaAVNVLASGSVTRYAKLLLRASFY---FVEFTKATFTATNAVSK 125
Tobaniviridae_RdRp cd23186
catalytic core domain of RNA-dependent RNA polymerase (RdRP) in the Tobaniviridae family of ...
4904-5373 8.10e-37

catalytic core domain of RNA-dependent RNA polymerase (RdRP) in the Tobaniviridae family of positive-sense single-stranded RNA [(+)ssRNA] viruses; This group contains the catalytic core domain of RdRp of RNA viruses belonging to the family Tobaniviridae, order Nidovirales. Tobaniviridae RNA viruses infect vertebrates; their host organisms include mammals, fish, and snakes. Member viruses have a viral envelope and (+)ssRNA genome. The genome size of Tobaniviruses ranges from 20 to 32 kilobases. The family is the only member of the suborder Tornidovirineae. The family Tobaniviridae has four subfamilies (Piscanivirinae, Remotovirinae, Remotovirinae, and Torovirinae) and eight genera (Bafinivirus, Oncotshavirus, Bostovirus, Infratovirus, Pregotovirus, Sectovirus, Tiruvirus, and Torovirus). The Tobaniviridae family belongs to the order Nidovirales, which currently comprises 88 formally recognized virus species of (+)ssRNA viruses, which are classified into nine virus families across seven different suborders. The structure of Tobaniviridae RdRp contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438036  Cd Length: 401  Bit Score: 146.77  E-value: 8.10e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4904 YYKYNLPTMVDIKQLLFVLEVVNKYFEIYeggcipATQVIVNNYDK------SAGYPFNKFgKARLYYEALSFEEQDEIY 4977
Cdd:cd23186     1 YYDYQGPLFLDPHILKFLYEYMLKDFSSY------ATDARFTYHEPgkprlsSMGVGLRGF-KQDAVYQALPEDFIDRLL 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4978 AYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLKSI---AATRGVPVVIGTTKFYGGWDDMLRRL 5054
Cdd:cd23186    74 ELAKKTPLPFSTKIITKFALTKKARARTIAACSFIASTIFRFLHKPVTNNMvkqAQNNIGHCLIGVSKFNLGFDKFLRSR 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5055 IKDVDSPVLMGWDYPKCDRAMPNILRIVSSLVLAR----KHDSccshtdrfYRLANECAQVLSEIVMCGGCYYVKPGGTS 5130
Cdd:cd23186   154 YGGIEDYNVFGSDYTKCDRSFPLVFRALAAALLYElggwDPKN--------HLFVNEIFAFMLDFVFIGGHIFNKPGGTS 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5131 SGDATTAFANSVFNicqavsanvcslmacnghkiedlsirelqkrlysnvyradhvdpAFVSEYYEFLNKHFsMMILSDD 5210
Cdd:cd23186   226 SGDATTAFSNTLYN--------------------------------------------YMVHLYVQFQTFYF-FNFLSDD 260
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5211 GVVCYNSEfASKgyIANISAFQQVLYYQNNVFMSEAKCWVET-DIekgpHEFCSQHTMLVkmdgDEVYLPYPDPSRILGA 5289
Cdd:cd23186   261 SFILSKPE-AFP--IFTTENFSRKLQTILHTTVDQTKAWSASgHI----HEFCSSHIEEV----NGVYQFIPDPNRLLAG 329
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5290 GCFVDDLlkTDSVLLIERFVSLAIDAyPLVHHENPEYQNVfrvyleyIKKLYNDLGNQILDSYSVILSTcdgQKFTDETF 5369
Cdd:cd23186   330 LLITGKA--SDVDLDIWRTVAILAEL-AVYSRVDPAFFNA-------LFQLFQNKHAEFVTKYGVNPLP---DQLLEKDF 396

                  ....
gi 381354069 5370 YKNM 5373
Cdd:cd23186   397 YTNL 400
betaCoV_Nsp3_NAB cd21795
nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents ...
1951-2059 3.37e-36

nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus including highly pathogenic human coronaviruses (CoVs) such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but may not be conserved in the Nsp3 NAB from betacoronaviruses in other lineages.


Pssm-ID: 409347  Cd Length: 110  Bit Score: 135.01  E-value: 3.37e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1951 AQFRTFEK*DGVYTNFKL*G---HSIAEKL*AKLGFDcdSPFVEYKITEWPTATGDV*LASDDLYVSRYLSGCITFGKPV 2027
Cdd:cd21795     1 LDVPAAPKPVTVYDNFKLVScqnQSIADDFNRTLGFT--KPGSELLLTVYPNTSGDVVAVSDDNYTVVYKKGSLLMGKPV 78
                          90       100       110
                  ....*....|....*....|....*....|...
gi 381354069 2028 VWLgHEEASLKSLTYFNRPSVVCENK-FNVLPV 2059
Cdd:cd21795    79 LWV-HKNNTWKKLVPLNKPNVVCLRNlFSVLPI 110
NendoU_XendoU-like cd21144
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural ...
6761-6873 5.00e-36

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of coronavirus Nonstructural protein 15 (Nsp15), arterivirus Nsp11, torovirus endoribonuclease, Xenopus laevis endoribonuclease XendoU, and related proteins; Nidovirus endoribonucleases (NendoUs) and eukaryotic Xenopus laevis-like endoribonucleases (XendoUs) are uridylate-specific endoribonucleases which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. XendoU is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA. Except for turkey coronavirus (TCoV) Nsp15, Mn2+ is generally essential for the catalytic activity of coronavirus Nsp15. Mn2+ is dispensable, and to some extent inhibits the activity of arterivirus (Porcine Reproductive and Respiratory Syndrome virus) PRRSV Nsp11. XendoU also requires Mn2+. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and murine hepatitis virus (MHV) forms a functional hexamer while Porcine DeltaCoronavirus (PDCoV) Nsp15 has been shown to exist as a dimer and a monomer in solution. Nsp11 from the arterivirus PRRSV is a dimer.


Pssm-ID: 439156  Cd Length: 113  Bit Score: 134.67  E-value: 5.00e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6761 AFEHVVYGSFNQKIIGGLHLLIGLARRQRKSNLVIQEFVSYDSSIHSYFITDENSGSSKSVCTVIDLLLDDFVDILKSLN 6840
Cdd:cd21144     1 AFEHVVYGDFSHQELGGLHLLIGLYKREKEKNIDNESRPDEDSTVLNYFITWKQTEMVKPVCSVIDLSLDDFVAIYKSQD 80
                          90       100       110
                  ....*....|....*....|....*....|...
gi 381354069 6841 LNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYP 6873
Cdd:cd21144    81 LQVVSKVVKVRIDETEIQFMLWCKDGYVGTFYP 113
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
851-940 6.18e-36

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 133.47  E-value: 6.18e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  851 KIKIIFALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKEHDViGTKVCALLDRLAEDYVYLFDEGGDE 930
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELPL-GDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 381354069  931 VIAPRMYCSF 940
Cdd:cd21467    80 VLASEMYCSF 89
CoV_NSP15_M pfam19216
Coronavirus replicase NSP15, middle domain; This entry represents the non-catalytic middle ...
6564-6681 6.67e-36

Coronavirus replicase NSP15, middle domain; This entry represents the non-catalytic middle domain from coronavirus non-structural protein 15 (NSP15). NSP15 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. This domain is formed by ten beta strands organized into three beta hairpins.


Pssm-ID: 466000  Cd Length: 118  Bit Score: 134.38  E-value: 6.67e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  6564 SIRPHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCiESLNVLFDGRDNGALEAFKKCRNGVYINTTK 6643
Cdd:pfam19216    1 NVGLTPPLKLLRNLGVTATYNFVLWDYENERPFTNYTINVCKYTDIIN-EDVCVLYDNRIKGSLERFCQLKNAVLISPTK 79
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 381354069  6644 IKNLSMIKGPQRADLNGVVVEKVGDSDVEFWFAMRSDG 6681
Cdd:pfam19216   80 IKKLVAIKIPNYGYLNGVPVSTTEKKPVTFYIYVRKNG 117
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3921-4001 7.49e-36

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 132.95  E-value: 7.49e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskCLASIEEVSDD 4000
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  .
gi 381354069 4001 Y 4001
Cdd:cd21827    79 P 79
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1606-1918 7.89e-36

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode between one and two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encodes only one called PL-PRO. The structure of this protein has shown it adopts a fold similar that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 141.27  E-value: 7.89e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1606 ANKVDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRCSAIHKGKVFFQYSGLSEADLVAVKDA---FGFDEP 1682
Cdd:pfam08715    2 CKQITIYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVPTTDWVEFYGFKSIleyYTLDAS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1683 QLLKYYNMLgmcKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKF 1762
Cdd:pfam08715   82 KYVIYLSAL---TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1763 NEPSDSTDFIRVVLREADLSGATCDLEFI--CKCGVKQDQRKGVDAVMHFGTLDKSDLVKGYNIACTCGSKLV-HCTQFN 1839
Cdd:pfam08715  159 GDFGDANWTLTNLAEHFDAEYTNAFLKKRvcCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGANNTdEVIEAS 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1840 VPFLICSYT--PEGRKLPDDVVAANIFTGG-SLGHYTHVKCKPkyQLYDACNVSKVSEAKGNFTDCLYLKNLKQTFSSVL 1916
Cdd:pfam08715  239 LPYLLLSATdgPAAVDCLEDGVGTVAFVGStNSGHYTYQTAKQ--AFYDGAKDRKFGKKSPYVTAVYTRFAFKNETSLPV 316

                   ..
gi 381354069  1917 AT 1918
Cdd:pfam08715  317 AK 318
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3921-4008 1.56e-35

Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA replication and is predominantly alpha helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 131.80  E-value: 1.56e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4000
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*...
gi 381354069  4001 YVRDNTVL 4008
Cdd:pfam08716   75 YLENRTIL 82
CoV_Nsp9 cd21881
coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) ...
4207-4316 3.67e-34

coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from coronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for CoV replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG at the C-terminus; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409329  Cd Length: 111  Bit Score: 129.17  E-value: 3.67e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVVNSGSDMNCNTPT-QCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDVKG 4285
Cdd:cd21881     1 NNELSPVALKQMSCAAGTDQTCTDDEaKAYYNNSKGGRFVLAITSDKPDLKVARFLKEDGGTIYTELEPPCRFVTDVPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 381354069 4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21881    81 PKVKYLYFIKNLNSLNRGMVLGSISATVRLQ 111
deltaCoV-Nsp6 cd21561
deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3615-3920 1.03e-33

deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394847  Cd Length: 296  Bit Score: 134.41  E-value: 1.03e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3615 EDELTPSDVYQQlAGVKLQSKrtrVIKGTCCWILasTFLFCSIIAAFVKWTMFmyvTTHMLGVTLC-ALCFVSFAMLLIK 3693
Cdd:cd21561     2 ECDWTPEMVYNQ-APINLQSG---VVKKTCMWFF--HFLFMAVIFLLAALHVF---PVHLYPIVLPvFTILAFLLTLTIK 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3694 HKHLYLTMYIMPVL-------CTLFYTNyl*vykqSFRGLAYAWLSHFVPAVDYTYMDEVLYGVVLLIAMVFVTMRSINH 3766
Cdd:cd21561    73 HTVVFTTTYLLPSLlmmvvnaNTFWIPN-------TYLRSIYEYVFGSFISERLYGYTVALYILVYAQLAINYTLRTRRY 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3767 DVFSIMFLVGRlvslvSMWYFganleeEVLLFLTSLFgTYTWT-----TMLSLATAKVIAK----WLAVNVLYFTDVPQI 3837
Cdd:cd21561   146 RATSFISFCMQ-----ALQYG------YVAHIVYRLL-TTPWTegllfTAFSLLTSHPLLAalswWLAGRIPLPLILPDL 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3838 KLVLLSYLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVS 3917
Cdd:cd21561   214 AIRVIVYYVIGYVMCMRFGLFWLINKFTTIPMGTYKYMVSIEQLKYMMAVKMSPPRNAFEVLWANIRLLGLGGNRNIAVS 293

                  ...
gi 381354069 3918 QIQ 3920
Cdd:cd21561   294 TVQ 296
M_cv_Nsp15-NTD_av_Nsp11-like cd21163
middle (M) domain of coronavirus Nonstructural protein 15 (Nsp15) and the N-terminal domain ...
6567-6689 1.24e-33

middle (M) domain of coronavirus Nonstructural protein 15 (Nsp15) and the N-terminal domain (NTD) of arterivirus Nsp11 and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Arterivirus Nsp11 has an N-terminal domain (NTD) and a C-terminal catalytic (NendoU) domain. The NTD of Nsp11 superimposes onto the M-domain of coronavirus Nsp15. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of other coronavirus members; it has been shown to exist as a dimer and a monomer in solution. Nsp11 from the arterivirus PRRSV functions as a dimer.


Pssm-ID: 439159  Cd Length: 123  Bit Score: 128.22  E-value: 1.24e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6567 PHPELKLFRNLNIDVCWSHVLWDYAKDSVFCssTYKVCKYTDLQCIESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKn 6646
Cdd:cd21163     1 STPLPKVLRNLGVDFTPNFVLWDYEDTAPFF--NTTVCKYTPEELCEHLPVLYDDRYGGSLERFLSAPNAVLISLTKVK- 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 381354069 6647 LSMIKGPQRADLNGVVVEKVgDSDVEFWFAMRSDGDDVIFSRT 6689
Cdd:cd21163    78 KYSIPPPAGAYLNGSVVVGT-PKVVSFYLYKRKDGKFVTLPDT 119
NTD_alpha_betaCoV_Nsp15-like cd21171
N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related ...
6503-6563 3.48e-31

N-terminal domain of alpha- and beta-coronavirus Nonstructural protein 15 (Nsp15), and related proteins; Coronavirus (CoV) Nsp15 is a nidovirus endoribonuclease (NendoU). NendoUs are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include CoV Nsp15 and arterivirus Nsp11, both of which may participate in the viral replication process and in the evasion of the host immune system. This small NTD structure, present in coronavirus Nsp15, is missing in Nsp11. CoV Nsp15 has an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Nsp15 from Severe Acute Respiratory Syndrome (SARS)-CoV, human CoV229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Residues in this N-terminal domain are important for hexamer (dimer of trimers) formation.


Pssm-ID: 439163  Cd Length: 61  Bit Score: 118.82  E-value: 3.48e-31
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 6503 SLENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:cd21171     1 SLENVAYNVVKKGHFVGVEGELPVAIVNDKVFVKDGGVDVLVFTNKTSLPTNVAFELYAKR 61
betaCoV_Nsp1 cd21876
non-structural protein 1 from betacoronavirus; This model represents the non-structural ...
57-196 2.98e-30

non-structural protein 1 from betacoronavirus; This model represents the non-structural protein 1 (Nsp1) from betacoronaviruses, including highly pathogenic coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and murine hepatitis virus (MHV) genomes cause drastic reduction or elimination of infectious virus; bovine coronavirus (BCoV) Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409338  Cd Length: 114  Bit Score: 117.90  E-value: 2.98e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   57 HVRVDCSRLPALECCVQSAIIRdifvdkdPQKVEASTMMALQFGSAVLIMPSKRLSiqawanlgVLPRTPAMGLFKRVC- 135
Cdd:cd21876     1 HVSLTLPWLQALENPVQPWIDR-------PEEALESAKAALAEGKLVFVPPYKGLH--------PLLPGPRVFLVRRHGn 65
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 381354069  136 ---LCNTRGCSCDVHVAFqlftvqpdgvWLGNGRFIGWFVPVTAIpeyakQWLQPWSILLRKGG 196
Cdd:cd21876    66 ptrPFDVRELAADADGVN----------YGRSGRTIGVLVPLDGE-----QPYGYINILLRKYG 114
TM_Y_deltaCoV_Nsp3_C cd21711
C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2336-2835 5.07e-30

C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from deltacoronavirus, including Magpie-robin coronavirus HKU18 and Bulbul coronavirus HKU11, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409659  Cd Length: 490  Bit Score: 128.28  E-value: 5.07e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2336 CDFYQVTDLGYrSSFCNGSMVCELCFSGFDMLDSYDAINVVQhvvdrrvsfdyisilklVVELIIGYSLYTVCFYPLFVL 2415
Cdd:cd21711    43 CYYNATQHYDY-NSFCAGDLTCQACFDGQDSLHLYKHLRVNQ-----------------QPVQTTDYTVYALSIVLLLAN 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2416 IGMQLLTTwlpeffmletmhwsarLFVFVANM------------LPAFTLLRFYIVVTAMYKVYCLCRHVMYGCSNPGCL 2483
Cdd:cd21711   105 PTLVLGTL----------------LVVFFVNFygvqipfygtlqLDYQNTLVMVFSVYYFYKVMKFFRHLAKGCKKPTCS 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2484 FCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTKHQWNCLNCDSWKPGNTFITlEAAADLSKELKRPVNPTDSAYYS 2563
Cdd:cd21711   169 ICAKKRIPPTITVETVVQGRKYPSVIETNGGFNICKEHNFYCKNCDSQTPGTFIPT-EAVESLSRKTRLSVKPTAPAYLL 247
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2564 VTEVKqvgCSMRLFYER---DGQRV-----YDDVSASLFVDMNGLLHSKvkgvpeTHVVVVENEADKAGFLGAAVFYAQs 2635
Cdd:cd21711   248 ARDVE---CQTDVVVARathNGNAHvciskYSDIRTVDQLLKPTPLFSY------TPDVIIAADFDNAGSLKTAKELAV- 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2636 lyrPMLMVEKKLITTANTGLSVSQTMFDLYVDSLLNVLDVDRKSLTSfvnaahnslkegvqleqvmDTFVGCARrkcAID 2715
Cdd:cd21711   318 ---VLSMDLKRTIIIIDQAYSRPIDNYQEVKSRIEKYYPFQKITPTG-------------------DIFADIKQ---ATN 372
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2716 SDVeTRSITKSVMSAVNAGVDFTDESCNNLVPTYV-KSDTIVAADLGVLIQNNakhVQSNVAKAANVACIWSVDAFNQLS 2794
Cdd:cd21711   373 GQA-SDSAINAAILAVQRGLDFTIDNPNNILPHYAfDFSTLSAEDQSTLIESG---CAKGNLKGTNVGVVLSANLVTRLS 448
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|..
gi 381354069 2795 ADLQHRLRKACSKTGLKIKLTYNKQEANVPILTTPFS-LKGG 2835
Cdd:cd21711   449 QKAIRVIANAASRNGVTCAVTPSTLVLRGNIATQPLTrIKAG 490
alphaCoV_Nsp9 cd21897
alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4207-4316 1.17e-28

alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) of alphacoronaviruses, including Porcine epidemic diarrhea virus (PEDV), Porcine transmissible gastroenteritis coronavirus (TGEV), and Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409330  Cd Length: 108  Bit Score: 113.18  E-value: 1.17e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVVNSGSDMNCNTpTQCYYNTIGTGKIVYAILSDCDGLKYTKIvKEDGNCVVLELDPPCKFSVQDVKGL 4286
Cdd:cd21897     1 NNEIMPGKLKQRAVKAEGDGFSGD-GKALYNNEGGKTFMYAFIADKPDLKYVKW-EFDGGCNTIELEPPCKFLVDTPNGP 78
                          90       100       110
                  ....*....|....*....|....*....|
gi 381354069 4287 KIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21897    79 QIKYLYFVKNLNTLRRGAVLGYIGATVRLQ 108
Mesoniviridae_RdRp cd23187
catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the Mesoniviridae family of ...
4950-5350 3.77e-28

catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the Mesoniviridae family of positive-sense single-stranded RNA [(+)ssRNA] viruses; This group contains the catalytic core domain of RdRp of RNA viruses belonging to the family Mesoniviridae, order Nidovirales. Member viruses have a viral envelope and (+)ssRNA genome. The family is named after the size of the genomes relative to other nidoviruses, which is intermediate between that of the families Arteriviridae and Coronaviridae, with meso- coming from the Greek word mesos, which means medium, while -ni is an abbreviation of nido. The family Mesoniviridae comprises of mosquito-specific viruses with extensive geographic distribution and host range. The family has only one subfamily, Hexponivirinae, which contains only one genus, Alphamesonivirus. There are 8 subgenera (Casualivirus, Enselivirus, Hanalivirus, Kadilivirus, Karsalivirus, Menolivirus, Namcalivirus, and Ofalivirus) and 10 species in Alphamesonivirus. The structure of Mesoniviridae RdRp contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438037  Cd Length: 424  Bit Score: 121.54  E-value: 3.77e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4950 SAGYPFNKFGKARLYYEaLSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLKSIA 5029
Cdd:cd23187     1 SAGTPYRKFGDSEFMRE-LYGNYRDAIVYHKRHSADQQLTLTINKVAPSKNHRDRTILAISINKSEPGRSLYRWNLDKIK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5030 ATR--GVPVVIGTTKFYGGWDDMLRRLIKD--VDSP------VLMGWDYPKCDRAMPNILRIVSSLVL-------ARKHD 5092
Cdd:cd23187    80 YTSslGGPILIGFTAQYGGWDKLYKYLYKNspADNPdtaehaVLGGKDYPKWDRRISNMLQLTTTTVLyslidpnTQRKL 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5093 SCCSHTDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGhkiEDLSI-RE 5171
Cdd:cd23187   160 NNATPAQTWHEYMAETTQVLYDYLVFGNELYQKPGGVTSGNSRTADGNSLLHLLIDFYAIISQLIQSTP---ENVHLeVN 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5172 LQKRLYSNVYR---ADHVDPAFVSEYYEFLNKHFSMMI-----LSDDGVVCYNSEfaskgyIANISAFQQVLYYQNNVFM 5243
Cdd:cd23187   237 LRNALCKTVFTripSDYIDSSCVTLRNTDTLHTIRRRVakgayLSDDGLIVIDPR------IIRYDDFMSVSHLISHYMI 310
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5244 SEAKCWVETD-IEKGPHEFCSQHTMLVkmdGDEVYlPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPL---- 5318
Cdd:cd23187   311 AQNKHKYHIDaIQRYAREFLSQDTIKF---GDMVF-PIPEFGRMYTAMLLSDNKNTLDPQINITRLLALFSYLYIYyfky 386
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 381354069 5319 ---VHHENPEYQNVFRVYLEYIKKLYNDlgnQILD 5350
Cdd:cd23187   387 edqPTHPTLKFLDALRTYIENKLNTTDE---IFLD 418
ZBD_nv_SF1_Hel-like cd21399
Cys/His rich zinc-binding domain (CH/ZBD) of nidovirus helicases including coronavirus Nsp13 ...
5384-5456 1.14e-24

Cys/His rich zinc-binding domain (CH/ZBD) of nidovirus helicases including coronavirus Nsp13 and arterivirus Nsp10, and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. This nidovirus family includes Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13) and equine arteritis virus (EAV) Nsp10 helicase, and belongs to helicase superfamily 1 (SF1). The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase. SARS-Nsp12 can enhance the helicase activity of SARS-Nsp13. SARS-Nsp13 and EAV Nsp10 are multidomain proteins; their other domains include a 1B regulatory domain and a SF1 helicase core.


Pssm-ID: 394806  Cd Length: 71  Bit Score: 100.73  E-value: 1.14e-24
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 381354069 5384 GACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNspGCDVNDVTKLYLGGMSYYCEDH 5456
Cdd:cd21399     1 GVCYVCGSQTSLRCGTCIRRPFFCCKCCYDHVIQTCHKTVLLASPYVCA--GCGESDITLLYTGGDSYRCVDH 71
gammaCoV_Nsp9 cd21899
gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4205-4316 1.66e-24

gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from gammacoronaviruses such as Avian infectious bronchitis virus (IBV). CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409332  Cd Length: 113  Bit Score: 101.47  E-value: 1.66e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4205 LQNNELMPQKLRTQVVNSGSDM-NCNTPTQCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNCVVLELDPPCKFSVQDV 4283
Cdd:cd21899     1 LQNNELMPHGVKTKACVAGVDQaHCSVESKCYYTNISGNSVVAAITSSNPNLKVASFLNEAGNQIYVDLDPPCKFGMKVG 80
                          90       100       110
                  ....*....|....*....|....*....|...
gi 381354069 4284 KGLKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21899    81 DKVEVVYLYFIKNTRSIVRGMVLGAISNVVVLQ 113
gammaCoV-Nsp6 cd21559
gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3623-3920 2.07e-24

gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394845  Cd Length: 307  Bit Score: 107.55  E-value: 2.07e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3623 VYQQLAGVKLQSKrtrVIKGTCCWILASTFLFC-----SIIAAFVKWTMFMYVttHMLGVTLCALCFVSFAmllIKHKHL 3697
Cdd:cd21559     3 VFNQVGGVRLQSS---FVKKATSWFWSRCVLACflfvlCAIVLFTAVPLKYYV--HAAVILLVAVLFISFT---VKHVMA 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3698 YLTMYIMPVLCTLFYTNYL*VyKQSFRGLAYAWLSHFVPAVDYTYMDeVLYGVVLLIAMVFVTMR------SINHDVFSI 3771
Cdd:cd21559    75 FMDTFLLPTLCTVIIGVCAEV-PFIYNTLISQVVIFFSQWYDPVVFD-TVVPWMFLPLVLYTAFKcvqgcySINSFSTSL 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3772 MFLVgRLVSLVSMWYFGANLEEEVLLFLTSLFGTYTWTTMLSLATAKV--------IAKWLaVNVLYFTDVPQIKLVLLS 3843
Cdd:cd21559   153 LVLY-QFMKLGFVIYTSSNTLTAYTEGNWELFFELVHTTVLANFSSNSliglivfkIAKWM-LYYCNATYFNSYVLMAVM 230
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 3844 YLCIGYVCCCYWGVLSLLNSIFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALVLNFKLLGIGGVPVIEVSQIQ 3920
Cdd:cd21559   231 VNVIGWLFTCYFGLYWWLNKVFGLTLGKYNYKVSVEQYKYMCLHKIRPPKSVWDVFSTNMLIQGIGGERVLPIATVQ 307
deltaCoV_Nsp8 cd21833
deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
4016-4175 2.14e-24

deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of deltacoronaviruses that include White-eye coronavirus HKU16 and Quail coronavirus UAE-HKU30, among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409260  Cd Length: 189  Bit Score: 103.94  E-value: 2.14e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4016 VNMASFVEYELAKKNLDEAK*SGSANQQQIKQLeKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSKVVS 4095
Cdd:cd21833     7 INLDSYRIYKEADAAYKKSVELNEPPQEQKKKL-KAVNIAKAEWEREAASQRKLEKLADAAMKSMYLAERAEDRRIKLTS 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4096 ALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLTSNTLTIIVPDKQVFDQVVDNVYVTYAGNVWHIQSIQDADGA 4175
Cdd:cd21833    86 GLTAMLYHMLRRLDSDRVKALFECAKQQILPIHAIVGVSNDNLKVIFNDKESYLQYVDGNTLIYKGVRYTIVKKLSLDNA 165
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
670-831 4.25e-23

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 99.26  E-value: 4.25e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   670 LVNGLFAVANGVITFVqeAPELVKNFVAKFRAFFKVLIDSMSVSILSGLTVVKTASNRVCLAGSkVYEVVQKSLSAYVLP 749
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSNA-LVKVVSVKLKGKKQA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   750 V--GCSEATCLVGESEPAVfeDDVVGVVKTPLTYQGCCKPPTSFEKICIVDKLYMAKCGDQFYPVVvdnDTVGVLDQCWR 827
Cdd:pfam19212   78 GlkGAKEATVFVGATVPVT--PTRVEVVTVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPAS---TDGVVVPPVFK 152

                   ....
gi 381354069   828 FPCA 831
Cdd:pfam19212  153 LKGG 156
SUD_C_DPUP_CoV_Nsp3 cd21513
C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This ...
1535-1606 6.58e-23

C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This family contains the SUD-C of Nsp3 from Severe Acute Respiratory Syndrome (SARS) coronavirus (CoV), Middle East respiratory syndrome-related (MERS) CoV, and Rousettus bat CoV HKU9, as well as the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) Nsp3. Though structurally similar, there is little sequence similarity between these four domain subfamilies: SARS SUD-C, MERS SUD-C, HKU9 SUD-C, and MHV DPUP. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Nsp3 of SARS coronavirus includes a SARS-unique domain (SUD) consisting of three globular domains separated by short linker peptide segments: SUD-N, SUD-M, and SUD-C. SUD-N and SUD-M are macro domains which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). The SUD-C domain adopts a frataxin-like fold and has structural similarity to DNA-binding domains of DNA-modifying enzymes. It binds to single-stranded RNA and recognizes purine bases more strongly than pyrimidine bases. SUD-C also regulates the RNA binding behavior of the SUD-M macrodomain. SUD-C is not as specific to SARS CoV Nsp3 as originally thought, and is conserved in the Nsp3s of all four lineages (A-D) of betacoronavirus.


Pssm-ID: 394838  Cd Length: 71  Bit Score: 95.70  E-value: 6.58e-23
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 1535 DDARVFVQANMDCLPTDWRLVNKLDVVDGVRTIKYFECPGeIFVSSQGKKFGYVQNGLFKVASVSQIRALLA 1606
Cdd:cd21513     1 TDERVFVQAVMLNGPRDWRLVNKFDSVDGVRYKKYLKRGG-IFVCSQDKKFYYVQNDVFLEFSVSKIRALLA 71
stalk_CoV_Nsp13-like cd21689
stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the ...
5480-5527 8.93e-23

stalk domain of coronavirus Nsp13 helicase and related proteins; This model represents the stalk domain of coronavirus non-structural protein 13 (Nsp13) helicase, found in the Nsp3s of alpha-, beta-, gamma-, and deltacoronaviruses, including Severe Acute Respiratory Syndrome coronavirus (SARS-CoV), SARS-CoV-2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome coronavirus (MERS-CoV). Helicases are classified based on the arrangement of conserved motifs into six superfamilies; coronavirus helicases in this family belong to superfamily 1 (SF1). Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands. Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). It consists of an N-terminal ZBD (Cys/His rich zinc-binding domain), a stalk domain, a 1B regulatory domain, and SF1 helicase core. The stalk domain lies between the ZBD domain and the 1B domain; a short loop connects the ZBD to the stalk domain. The stalk domain is comprised of three tightly-interacting alpha-helices connected to the 1B domain, transferring the effect from the ZBD domain onto the helicase core domains. The ZBD and stalk domains are critical for the helicase activity of SARS-CoV Nsp13.


Pssm-ID: 410205  Cd Length: 48  Bit Score: 94.21  E-value: 8.93e-23
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 5480 GSPYIEDFNKIASCKWTEVDDYALANECTERLKLFAAETQKATEEAFK 5527
Cdd:cd21689     1 GSPDVDDFNRLATSDWSDVEDYKLANTCKDSLKLFAAETIKAKEESVK 48
TM_Y_gammaCoV_Nsp3_C cd21710
C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2288-2815 9.61e-23

C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from gammacoronavirus, including Infectious bronchitis virus. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409658  Cd Length: 525  Bit Score: 106.76  E-value: 9.61e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2288 ATVFLLWFNFLYANVILSDFYLPNIGSLPTFVGQIVAWFKTTFGVsticdfyqvtdLGYrssfCNGSMVCELCFSGFDML 2367
Cdd:cd21710    14 TALLILWFVYTSNPVMFTGIRVLDFLFEGSFCGPYNDYGKDSFDV-----------LRY----CGDDFTCRVCLHDKDSL 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2368 DSYDAINVVQHVVDRRVS---FD----YISILKLVVELIIGYSLytVCFYPLFVLIGMQLLTTWLpeffmlETMHWsarl 2440
Cdd:cd21710    79 HLYKHAYSVEQFYKDAVSgisFNwnwlYLVFLILFVKPVAGFVI--ICYCVKYLVLSSTVLQTGV------GFLDW---- 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2441 fvFVANMLPAFTLLRFYIVVTAMYKVYCLCRHVMYgCSNPGCLFCYKRNRSVRVKCSTVVGGSLRYYDVMANGGTGFCTK 2520
Cdd:cd21710   147 --FIQTVFTHFNFMGAGFYFWLFYKIYIQVHHILY-CKDITCEVCKRVARSNRHEVSVVVGGRKQLVHVYTNSGYNFCKR 223
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2521 HQWNCLNCDSWKPGNTFITLEAAADLSKELKRPVNPTDSAYYSVTEVKQVGCSMRLFYE-----RDGQR-VYDDVSASLF 2594
Cdd:cd21710   224 HNWYCRNCDKYGHQNTFMSPEVAGELSEKLKRHVKPTAHAYHVVDDACLVDDFVNLKYKaatpgKDGAHsAVKCFSVSDF 303
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2595 VDMNGLLHSKVKG---------VPETHVVVVENEADKagflgAAVFYAQSLYRPMLMVEKKLITTANTGlSVSQTMFDLY 2665
Cdd:cd21710   304 LKKAVFLKDALKCeqisndsfiVCNTQSAHALEEAKN-----AAIYYAQYLCKPILILDQALYEQLVVE-PVSKSVVDKV 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 2666 VDSLLNVLDVDRKSLTSFVNAAHNSLKEGVQLEQVMDTFVGCArrkcaidsdvetrsitksvmsavNAGVDFTDESCNNL 2745
Cdd:cd21710   378 CSILSNIISVDTAALNYKAGTLRDALLSVTKDEEAVDMAIFCH-----------------------NNDVEYTSDGFTNV 434
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069 2746 VPTY-VKSDTIVAADLGVLIQNNAKHVQSNVAKAANVacIWSVDAFNQLSADLQHRLRKACSKTGLKIKLT 2815
Cdd:cd21710   435 VPSYgIDTDKLTPRDRGFLINADASIANLRVKNAPPV--VWKFSDLIKLSDSCLKYLISATVKSGGRFFIT 503
DNA2 COG1112
Superfamily I DNA and/or RNA helicase [Replication, recombination and repair];
5749-5970 2.05e-22

Superfamily I DNA and/or RNA helicase [Replication, recombination and repair];


Pssm-ID: 440729 [Multi-domain]  Cd Length: 819  Bit Score: 107.14  E-value: 2.05e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5749 DIIVVDEVSMLTNYE-LSVInsrVRAKHYVYIGDPAQLPaPRVLLNKGTLEPRYF--NSVTKLMCCLGPD--IFLGTCYR 5823
Cdd:COG1112   557 DLVIIDEASQATLAEaLGAL---ARAKRVVLVGDPKQLP-PVVFGEEAEEVAEEGldESLLDRLLARLPErgVMLREHYR 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5824 CPKEIVDTVSALVYNNKLKA---------KNDNSSMCFkVYYKGQTTHESSSAVNMQQIHLISKFLKAN-----PSWSNA 5889
Cdd:COG1112   633 MHPEIIAFSNRLFYDGKLVPlpspkarrlADPDSPLVF-IDVDGVYERRGGSRTNPEEAEAVVELVRELledgpDGESIG 711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5890 VfISPYNSQ-NYVAKRVLGLQTQ--------TVDSAQGSEYDFVI----YSQTAETAHSV-----NVNRFNVAITRAKKG 5951
Cdd:COG1112   712 V-ITPYRAQvALIRELLREALGDglepvfvgTVDRFQGDERDVIIfslvYSNDEDVPRNFgflngGPRRLNVAVSRARRK 790
                         250
                  ....*....|....*....
gi 381354069 5952 iLCVMSSMQLFESLNFTTL 5970
Cdd:COG1112   791 -LIVVGSRELLDSDPSTPA 808
ZBD_UPF1_nv_SF1_Hel-like cd21343
Cys/His rich zinc-binding domain (CH/ZBD) of eukaryotic UPF1 helicase, nidovirus SF1 helicases ...
5385-5456 4.08e-22

Cys/His rich zinc-binding domain (CH/ZBD) of eukaryotic UPF1 helicase, nidovirus SF1 helicases including coronavirus Nsp13 and arterivirus Nsp10, and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands, and are classified based on the arrangement of conserved motifs into six superfamilies. Members of this family belong to helicase superfamily 1 (SF1) and include nidoviral helicases such as Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13) and equine arteritis virus (EAV) Nsp10, as well as eukaryotic UPF1 helicase. The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. UPF1 participates in nonsense-mediated mRNA decay (NMD), a pathway which degrades transcripts with premature termination codons. The CH/ZBD of UPF1 interacts with UPF2, a factor also involved in NMD. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase. SARS-Nsp12 can enhance the helicase activity of SARS-Nsp13. UPF1, SARS-Nsp13 and EAV Nsp10 are multidomain proteins; their other domains include a 1B regulatory domain and a SF1 helicase core.


Pssm-ID: 439166  Cd Length: 70  Bit Score: 93.33  E-value: 4.08e-22
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 5385 ACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVSPYVCNspGCDVNDVTKLYLGGMSYYCEDH 5456
Cdd:cd21343     1 ACYVCGSHTVVRCGTCIRRPWFCNSCIYDHLIRTKHKEVLLASPYVCA--GCGESDITLLYFGGVSYRCVDH 70
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1339-1463 1.34e-21

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, activity was modest and was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 93.77  E-value: 1.34e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1339 EVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETaDMVKNQGVCLVGECYESAGGKLCKKVLNIVGPDARGQgrQCYSLLE 1418
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKES-DYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRKG--QDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 1419 RAYQHINK-CDNVVTTLISAGIFSVPTDVSLTYLLGVVTK---NVILVS 1463
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
capping_2-OMTase_viral cd20754
viral Cap-0 specific (nucleoside-2'-O-)-methyltransferase; Cap-0 specific (nucleoside-2'-O-) ...
6920-7109 2.35e-20

viral Cap-0 specific (nucleoside-2'-O-)-methyltransferase; Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) catalyzes the methylation of Cap-0 (m7GpppNp) at the 2'-hydroxyl of the ribose of the first nucleotide, using S-adenosyl-L-methionine (AdoMet) as the methyl donor. This reaction is the fourth and last step in mRNA capping, the creation of the stabilizing five-prime cap (5' cap) on mRNA. Some dsDNA and dsRNA viruses, like the bluetongue virus (BTV), a member of the Reoviridae family, and Vaccinia virus, a member of the Poxviridae family, as well as some ss(+)RNA viruses, like Flaviviridae and Nidovirales, cap their mRNAs and encode their own 2'OMTase. In BTV, all four reactions are catalyzed by a single protein, VP4. In Vaccinia, the activity is located in the processing factor of the poly(A) polymerase, VP39.


Pssm-ID: 467730  Cd Length: 179  Bit Score: 92.12  E-value: 2.35e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6920 VAKYTQLCQYLNTttiaVPANMRVLHLGAGSdkgvAPGSAVLRQWLPAG----------SILVDNDVNpFVSDTVASYYG 6989
Cdd:cd20754     1 QAKLLQLEEYFLY----KPEKMRVIYIGCAP----GGWLYYLRDWFEGTlwvgfdprdtDPLGYNNVI-TVNKFFDHEHT 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6990 NCITLPFDcqWDLIISDMYDPLTKNIGEYNVSKDGFFTYLCHLICDKLALGGSVAIKITEFSWNAELYSLmgkfaFWTIF 7069
Cdd:cd20754    72 KLKFLPNK--KDLLICDIRSDRSSHVTKEEDTTESFLTLQEGYIATKLAKVGSICVKVRAPDLKDDGHFS-----SGTLF 144
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 381354069 7070 CTNVNASSSEGFLIGINWlNRTRTEIDgktmHANYLFWRN 7109
Cdd:cd20754   145 PQPYAASSSEMRLFSANY-DASQIKVV----KADVEKYEN 179
CoV_NSP15_N pfam19219
Coronavirus replicase NSP15, N-terminal oligomerization; This is the N-terminal domain of the ...
6503-6563 3.96e-20

Coronavirus replicase NSP15, N-terminal oligomerization; This is the N-terminal domain of the coronavirus nonstructural protein 15 (NSP15), which is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP15, is a nidoviral RNA uridylate-specific endoribonuclease (NendoU) carrying C-terminal catalytic domain belonging to the EndoU family. The SARS-CoV-2 NendoU monomers assemble into a double-ring hexamer, generated by a dimer of trimers. The hexamer is stabilized by the interactions of N-terminal oligomerization domain.


Pssm-ID: 466003  Cd Length: 61  Bit Score: 87.36  E-value: 3.96e-20
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 381354069  6503 SLENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:pfam19219    1 SLENLAYNVVKKGHFVGVDGELPVAIVNDKVFVKVGGVDVLLFENKTSLPTNVAFELYAKR 61
gammaCoV_PLPro cd21733
gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1614-1872 7.36e-20

gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of gammacoronavirus, including Avian coronavirus, Canada goose coronavirus, and Beluga whale coronavirus SW1. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in several CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409650  Cd Length: 304  Bit Score: 94.42  E-value: 7.36e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1614 TVDGVNFRSCCVTEGEVFGKtLGSVFCDGINVTKVRcsAIHKGKVFFqysgLSEADlVAVKDAFGFDEPQLLKYYNMLGM 1693
Cdd:cd21733    10 TEDGVKYRSVVVKPGDSLSQ-FGQVFARNKTVFTAD--DVEDKEILF----IPTTD-KAVLEYYGLDAQKYVIYLQTLAQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1694 cKWPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFpKWQWQEAWNEFRSGKPLRFVSLVLAKGSFKFNEPSDStDFIR 1773
Cdd:cd21733    82 -KWNVQYRDNFLILEWRDGNCWISSAIVLLQAAKIRF-KGFLAEAWAKFLGGDPTEFVAWCYASCNAKVGDFSDA-NWLL 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1774 VVLRE---ADLSGATCDLEFICKCGVKQDQRKGVDA---------VMHFGTldksdlvkGYNIACTCGSKLV-HCTQFNV 1840
Cdd:cd21733   159 ANLAEyfdADYTNAFLKRRVSCNCGVKNYELRGLEAciqpvrapnLLHFKT--------QYSNCPTCGANSVdEVVEASL 230
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 381354069 1841 PFLICSYT--PEGRKLPDDVVAANIFTGG-SLGHY 1872
Cdd:cd21733   231 PYLLLLATdgPATVDCDENAVGNVVFIGStNSGHC 265
DEXXQc_Upf1-like cd17934
DEXXQ-box helicase domain of Upf1-like helicase; The Upf1-like helicase family includes UPF1, ...
5657-5823 1.37e-19

DEXXQ-box helicase domain of Upf1-like helicase; The Upf1-like helicase family includes UPF1, HELZ, Mov10L1, Aquarius, IGHMBP2 (SMUBP2), coronavirus Nsp13, and similar proteins. They belong to the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 438708 [Multi-domain]  Cd Length: 121  Bit Score: 88.06  E-value: 1.37e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5657 YCTVQGPPGTGKSHLAIGLAVYY----CTARVVYTAASHAAVDALcekahkflnindctrivpakvrvdcydkfkvndtt 5732
Cdd:cd17934     1 ISLIQGPPGTGKTTTIAAIVLQLlkglRGKRVLVTAQSNVAVDNV----------------------------------- 45
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5733 rkyvfttinalpelvtDIIVVDEVSMLTNYELSVInsRVRAKHYVYIGDPAQLPAPRVLLNKGTLEP---RYFNSVTKLM 5809
Cdd:cd17934    46 ----------------DVVIIDEASQITEPELLIA--LIRAKKVVLVGDPKQLPPVVQEDHAALLGLsfiLSLLLLFRLL 107
                         170
                  ....*....|....
gi 381354069 5810 CCLGPDIFLGTCYR 5823
Cdd:cd17934   108 LPGSPKVMLDTQYR 121
deltaCoV_Nsp9 cd21900
deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4207-4316 2.41e-19

deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from deltacoronaviruses such as the Porcine delta coronavirus (PDCoV) Porcine coronavirus HKU15. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409333  Cd Length: 109  Bit Score: 86.72  E-value: 2.41e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4207 NNELMPQKLRTQVvNSGSDMNCNTPT-QCYYNTIGTGKIVYAILSDCDGLKYTKIVKEDGNcVVLELDPPCKFSVQDVKG 4285
Cdd:cd21900     1 NNELCLRNVFTAQ-NTASDGNGNESTaKSFYVSRTGKKILVAVTSTKDNLKTVTCDTDTGK-VVLNLDPPMRFSHVVGGK 78
                          90       100       110
                  ....*....|....*....|....*....|.
gi 381354069 4286 LKIKYLYFVKGCNTLARGWVVGTLSSTVRLQ 4316
Cdd:cd21900    79 QSVVYLYFIQNISSLNRGMVIGHISGTTILQ 109
alphaCoV_Nsp7 cd21826
alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3921-4009 5.03e-19

alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine transmissible gastroenteritis coronavirus (TGEV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. FCoV Nsp7 forms a 2:1 heterotrimer with Nsp8; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409252  Cd Length: 83  Bit Score: 84.73  E-value: 5.03e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4000
Cdd:cd21826     1 SKLTDIKCTNVVLLGCLSSMNVAANSKEWAYCVDLHNKINLCDDPEKAQEMLLALLAFFLSKQKDFG------LDDLLDS 74

                  ....*....
gi 381354069 4001 YVRDNTVLH 4009
Cdd:cd21826    75 YFDNNSILQ 83
Medioniviridae_RdRp cd23188
catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the Medioniviridae family of ...
4950-5319 1.36e-17

catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the Medioniviridae family of positive-sense single-stranded RNA [(+)ssRNA] viruses; This group contains the catalytic core domain of RdRp of RNA viruses belonging to the family Medioniviridae, order Nidovirales. Member viruses have a viral envelope and (+)ssRNA genome. The Medioniviridae subgenera includes Turrinivirus and Balbicanovirus. The structure of Medioniviridae RdRp contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438038  Cd Length: 391  Bit Score: 88.98  E-value: 1.36e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4950 SAGYPFNKFGKARLYyEALSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLKSIA 5029
Cdd:cd23188     1 SAGQPYVKVGDSDVV-RGVLGDDRDTMIKHRCHSHHQTLVTANAKLAVGGKFKCRPISGINVLESDVGRTLFTAILEAIK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5030 AT-RGVPVVIGTTKFYggWDDMLRRLIKDVD----SPVLMGWDYPKCDRAMPNILRIVSSLVLARKHD-----SCCSHTD 5099
Cdd:cd23188    80 HCcYENMIVIGWSKFT--GFDRLFRNFLNSRldhiDYRLSGKDFPQWDRSVESNMQLLTNFLIFCSYDwalcrEFCSLQE 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5100 RFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVSANVCSLMACNGHKIEDLSIRELQKRLYSN 5179
Cdd:cd23188   158 ALHLFCTEFTNTVYSYFICDNLVMRKSGGVCSGNSKTAPGNSIMHAIWEYAAIIEHLHYYRGEDPELIELRQFFMLYESH 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5180 VYRADHVDPAFVSEYYEFLNKHFSMMILSDDGVVCYNSEFASKGYIanisAFQQVLYYQNNVFMSEAkcWVETDIEKGPH 5259
Cdd:cd23188   238 SLSALREHDHLLDTNLLRLQSHHLLRVLSDDGMVLHDKELLFDYSS----LFPYFYLYSNYHFTNDK--HYSCAPLHGPH 311
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5260 EFCSQHTMLVkmdgDEVYLPYPDPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLV 5319
Cdd:cd23188   312 EFCSAEAIIV----DDKYYLCPEPGRHLGALFYSSRTTRFDINVRIALLSSYILEGIPLL 367
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
967-1073 4.85e-17

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 86.92  E-value: 4.85e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   967 SVVLVADAQE-DGVAKEQVE-VDSEICVAH---TGGqdELTEPDAVGSQTPIASAEKTEVGEAS--DREGIAEAKR---- 1035
Cdd:pfam11963  238 AYALLRGYRGvKPVLFVDQYgCDYTGCLADgleAYG--DYTLQDMKQLQPVWLANLDFDVVVAWhvVRDPRAVMRLqtia 315
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 381354069  1036 TVCADDLDACP--DQVEAFEIEEVEDSILDELQTELNAPS 1073
Cdd:pfam11963  316 TICGIAYVAQPteDVVDGDVVIKEPVHLLSADAIVLRLPS 355
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1321-1451 1.17e-16

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson _ Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 80.04  E-value: 1.17e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069   1321 NVCFVKGDVIKVarlvEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADmVKNQGVCLVGECYESAGGKL-CKKVL 1399
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKALSKEEVR-KLAGGECPVGTAVVTEGGNLpAKYVI 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 381354069   1400 NIVGPDARGQGRQCYSLLERAYQ------HINKCDNVVTTLISAGIFSVPTDVSLTYL 1451
Cdd:smart00506   76 HAVGPRASGHSKEGFELLENAYRnclelaIELGITSVALPLIGTGIYGVPKDRSAQAL 133
CoV_Nsp7 cd21811
coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) ...
3921-4008 1.24e-16

coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409251  Cd Length: 83  Bit Score: 77.91  E-value: 1.24e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4000
Cdd:cd21811     1 SKLTDVKCTAVVLLSLLQKLRVESNSKLWKQCVQLHNDILLAKDTTEVFEKLVSLLSVLLSMQGAVD------LNRLCEE 74

                  ....*...
gi 381354069 4001 YVRDNTVL 4008
Cdd:cd21811    75 MLENRAVL 82
NTD_CoV_Nsp15-like cd21170
N-terminal domain of coronavirus Nonstructural protein 15 (Nsp15) and related proteins; ...
6504-6563 1.93e-16

N-terminal domain of coronavirus Nonstructural protein 15 (Nsp15) and related proteins; Coronavirus (CoV) Nsp15 is a nidovirus endoribonuclease (NendoU). NendoUs are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include CoV Nsp15 and arterivirus Nsp11, both of which may participate in the viral replication process and in the evasion of the host immune system. This NTD structure (approximately 60 residues) present in CoV Nsp15, is missing in Nsp11. CoV Nsp15 has an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Nsp15 from Severe Acute Respiratory Syndrome (SARS)-CoV, human CoV 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from the Nsp15 of these alpha- and beta-coronavirus; it has been shown to exist as dimers and monomers in solution, and to function as a dimer. Nsp15 from Turkey CoV (TCoV), a gammaCoV, has been reported to be a homohexamer.


Pssm-ID: 439162  Cd Length: 60  Bit Score: 76.66  E-value: 1.93e-16
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6504 LENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:cd21170     1 LENLAYNVVKKNHFDGVKGELPVVITGDKVFVKDDGVDVLLFENKTTLPTSVAFELYAKR 60
gammaCoV_Nsp7 cd21828
gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3921-4008 2.13e-14

gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409254  Cd Length: 83  Bit Score: 71.74  E-value: 2.13e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 3921 SRLTDVKCANVVLLNCLQHLHIASNSKLWQYCSTLHNEILATSDLSVAFDKLAQLLVVLFANPAAVDskclasIEEVSDD 4000
Cdd:cd21828     1 SKLTDVKCTTVVLMQLLTKLNVEANSKMHKYLVELHNKILASDDVVECMDNLLGMLVTLLCIDSTID------LSEYCDD 74

                  ....*...
gi 381354069 4001 YVRDNTVL 4008
Cdd:cd21828    75 ILKRSTVL 82
RdRP_1 pfam00680
Viral RNA-dependent RNA polymerase; This family represents the RNA-directed RNA polymerase ...
4917-5223 7.43e-14

Viral RNA-dependent RNA polymerase; This family represents the RNA-directed RNA polymerase found in many positive strand RNA eukaryotic viruses. Structural studies indicate that these proteins form the "right hand" structure found in all oligonucleotide polymerases, containing thumb, finger and palm domains, and also the additional bridging finger and thumb domains unique to RNA-directed RNA polymerases.


Pssm-ID: 425815  Cd Length: 450  Bit Score: 78.22  E-value: 7.43e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4917 QLLFVLEVVNKYFEIYEGgCIPATQVIVNNYDKSAGYPFNKFG--KARLYYEALSFEEQDEIY--AYTKRNVLPTLTQMN 4992
Cdd:pfam00680   69 ELRGVPKKANSTLIVYRA-IDGVEQIDPLNWDTSAGYPYVGLGgkKGDLIEHLKDGTEARELAerLAADWEVLQNGTPLK 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  4993 LKYAIS-----------AKNRARTVAGVSILSTMTGRMFHQKCLKSIAATRGV-PVVIGTTKFYGGWDDMLRRLIKDvdS 5060
Cdd:pfam00680  148 LVYQTClkdelrplekvEKGKTRLVWGEPVEYLLLERAFFDPFNQAFMLNNGFhPIQVGINPFDRGWPRLLRRLARF--G 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5061 PVLMGWDYPKCDRAMPNILRIVSSLVLARKHDSCcSHTDRFYRLANECaqVLSEIVMCGGCYYVKPGGTSSGDATTAFAN 5140
Cdd:pfam00680  226 DYVYELDYSGFDSSVPPWLIRFAFEILRELLGFP-SNVKEWRAILELL--IYTPIALPNGTVFKKTGGLPSGSPFTSIIN 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5141 SVFNicqavsanvCSLMACnghkiedLSIRELQKRLYSNVyradhvdpafvseyyeFLNKHFSMMILSDDGVVCYNSEFA 5220
Cdd:pfam00680  303 SIVN---------YLLILY-------ALLKSLENDGPRVC----------------NLDKYFDFFTYGDDSLVAVSPDFD 350

                   ...
gi 381354069  5221 SKG 5223
Cdd:pfam00680  351 PVL 353
AAA_12 pfam13087
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
5807-5955 2.34e-13

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins.


Pssm-ID: 463780 [Multi-domain]  Cd Length: 196  Bit Score: 72.58  E-value: 2.34e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5807 KLMCCLGPD--IFLGTCYRCPKEIVDTVSALVYNNKLKA-KNDNSSMCFKVYY----------------KGQTTHESSSA 5867
Cdd:pfam13087    7 ERLQELGPSavVMLDTQYRMHPEIMEFPSKLFYGGKLKDgPSVAERPLPDDFHlpdplgplvfidvdgsEEEESDGGTSY 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  5868 VNMQQI----HLISKFLKANPS-WSNAVFISPYNSQNYVAKRVL--------GLQTQTVDSAQGSEYDFVIYSqT--AET 5932
Cdd:pfam13087   87 SNEAEAelvvQLVEKLIKSGPEePSDIGVITPYRAQVRLIRKLLkrklggklEIEVNTVDGFQGREKDVIIFS-CvrSNE 165
                          170       180
                   ....*....|....*....|....*..
gi 381354069  5933 AHSV----NVNRFNVAITRAKKGiLCV 5955
Cdd:pfam13087  166 KGGIgflsDPRRLNVALTRAKRG-LII 191
NendoU_tv_PToV-like cd21162
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of Porcine torovirus (PToV) ...
6764-6873 4.42e-12

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of Porcine torovirus (PToV) endoribonuclease and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. The Porcine torovirus (PToV) strain PToV-NPL/2013 NendoU domain is located at the N-terminus of the ORF1ab replicase polyprotein, between regions annotated as Nonstructural proteins 11 (Nsp11) and 13 (Nsp13). This subfamily belongs to a family which includes Nsp15 from coronaviruses and Nsp11 from arteriviruses, which may participate in the viral replication process and in the evasion of the host immune system. These vary in their requirement for Mn2+. Coronavirus Nsp15 generally form functional hexamers, with the exception of Porcine DeltaCoronavirus (PDCoV) Nsp15 which exists as a dimer and a monomer in solution. Arterivirus (Porcine Reproductive and Respiratory Syndrome virus) PRRSV Nsp11 is a dimer. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


Pssm-ID: 394913  Cd Length: 133  Bit Score: 66.84  E-value: 4.42e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6764 HVVYGSFNQK--IIGGLHLLIGLarrQRKSNLVIQefvSYDSSIHSYFItdeNSGSS-KSVCTVIDLLLDDFVDILKS-L 6839
Cdd:cd21162    27 HVFLGEFTEVstTIGGVHHVPAL---NGTKGSIIP---SYVKPIHTGLI---NVGKGvKRCTTLVDVCANQLYELVKQqI 97
                          90       100       110
                  ....*....|....*....|....*....|....
gi 381354069 6840 NLNCVSKVVNVNVDFKDFQFMLWCNEEKVMTFYP 6873
Cdd:cd21162    98 NGVTVSKVIFINIDFQEVQFMVFASEGDIQTAYP 131
SF1_C_Upf1 cd18808
C-terminal helicase domain of Upf1-like family helicases; The Upf1-like helicase family ...
5824-5964 6.38e-12

C-terminal helicase domain of Upf1-like family helicases; The Upf1-like helicase family includes UPF1, HELZ, Mov10L1, Aquarius, IGHMBP2 (SMUBP2), and similar proteins. They are DEAD-like helicases belonging to superfamily (SF)1, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF2 helicases, SF1 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350195 [Multi-domain]  Cd Length: 184  Bit Score: 68.03  E-value: 6.38e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5824 CPKEIVDTVSALVYNNKLKAKNDNSSMCFKVYYKG--------------QTTHESSSAVNMQQIHLISKFLKAN-----P 5884
Cdd:cd18808     1 MHPEISEFPSKLFYEGKLKAGVSVAARLNPPPLPGpskplvfvdvsggeEREESGTSKSNEAEAELVVELVKYLlksgvK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5885 SWSNAVfISPYNSQ-----NYVAKRVLGLQT---QTVDSAQGSEYDFVIYS--QTAETAHSV----NVNRFNVAITRAKK 5950
Cdd:cd18808    81 PSSIGV-ITPYRAQvalirELLRKRGGLLEDvevGTVDNFQGREKDVIILSlvRSNESGGSIgflsDPRRLNVALTRAKR 159
                         170
                  ....*....|....
gi 381354069 5951 GiLCVMSSMQLFES 5964
Cdd:cd18808   160 G-LIIVGNPDTLSK 172
NTD_gammaCoV_Nsp15-like cd22650
N-terminal domain of gammacoronavirus Nonstructural protein 15 (Nsp15), and related proteins; ...
6504-6563 2.27e-11

N-terminal domain of gammacoronavirus Nonstructural protein 15 (Nsp15), and related proteins; Coronavirus (CoV) Nsp15 is a nidovirus endoribonuclease (NendoU). NendoUs are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include coronavirus Nsp15 and arterivirus Nsp11, both of which may participate in the viral replication process and in the evasion of the host immune system. This NTD structure (approximately 60 residues) present in coronavirus Nsp15, is missing in Nsp11. CoV Nsp15 has an N-terminal domain, a middle (M) domain, and a C-terminal catalytic (NendoU) domain. Nsp15 from alpha- and betaCoVs such as Severe Acute Respiratory Syndrome (SARS)-CoV, human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. The active form of the Nsp15 of Turkey CoV (TCoV), a gammaCoV, has been reported to be a homohexamer. Residues in this N-terminal domain may be important for hexamer formation.


Pssm-ID: 439165  Cd Length: 60  Bit Score: 62.19  E-value: 2.27e-11
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6504 LENVVYNLVNAGHFDGRAGELPCAIIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFAKR 6563
Cdd:cd22650     1 IDNIAYNMYKGGHYDAIAGEMPTVITGDKVFVIDQGVEKAVFVNQTTLPTSVAFELYAKR 60
alphaCoV_PLPro cd21731
alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1609-1754 2.58e-10

alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alphacoronavirus, including Swine acute diarrhea syndrome coronavirus (SADS-CoV) which causes severe diarrhea in piglets, and Human coronavirus 229E which infects humans and bats and causes the common cold. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in SADS-CoV and many others has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409648  Cd Length: 289  Bit Score: 65.34  E-value: 2.58e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1609 VDVLCTVDGVNFRSCCVTEGEVFGKTLGSVFCDGINVTKVRC-SAIHKG-KV-----FFQYSGLSEADLVAVKD--AFGF 1679
Cdd:cd21731     3 VVVKVTEDGRNVKDVVVDTDKTFGEQLGVCSVNDKDVTGVVPpDDSDKVvSVapdvdWDSHYGFPNAAVFHTLDhsAYAF 82
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 1680 DepqllkyynmlgmckwpVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFPKWQWQEAWNEFRSGKPLRFVSLV 1754
Cdd:cd21731    83 E-----------------SDIVNGKRVLKQSDNNCWVNAVCLQLQFAKPTFKSEGLQALWNKFLTGDVAGFVHWL 140
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1342-1445 5.26e-10

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 60.27  E-value: 5.26e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  1342 VNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKnqGVCLVGECYESAGGKL-CKKVLNIVGPDARGQGRQ-CYSLLER 1419
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELKK--GGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHgEEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 381354069  1420 AYQHI------NKCDNVVTTLISAGIFSVPTD 1445
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
ZBD_tv_SF1_Hel-like cd21403
Cys/His rich zinc-binding domain (CH/ZBD) of tornidovirus SF1 helicase and related proteins; ...
5382-5465 9.99e-09

Cys/His rich zinc-binding domain (CH/ZBD) of tornidovirus SF1 helicase and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. This tornidovirus group includes White bream virus (WBV) SF1 helicase encoded on ORF1b and belongs to helicase superfamily 1 (SF1). The CH/ZBD has 3 zinc-finger (ZnF1-3) motifs. Members of this group belong to a family of nindoviral replication helicases which include includes Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) non-structural protein 13 (SARS-Nsp13), a component of the viral RNA synthesis replication and transcription complex (RTC). The SARS-Nsp13 CH/ZBD is indispensable for helicase activity and interacts with SARS-Nsp12, the RNA-dependent RNA polymerase.


Pssm-ID: 394810  Cd Length: 95  Bit Score: 56.20  E-value: 9.99e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5382 SVGACVVCSSQTSLRCGSCIRKPLLCCKCSYDHVMATDHKYVLSVspyVCNSPGCDVNDVTKLYL----GGMSYYCEDHK 5457
Cdd:cd21403     1 SDQQCYCCPNPTVSTCTSCPVPYPLCAYCAYEHYVQTGHLVTHLP---KCHHPGCGESDPRNLNFclvnGGFTTRCDEHV 77

                  ....*...
gi 381354069 5458 PQYSFKLV 5465
Cdd:cd21403    78 TGFSIPLL 85
M_gcv_Nsp15-like cd21168
middle domain of gammacoronavirus Nonstructural protein 15 (Nsp15), and related proteins; ...
6569-6655 2.39e-08

middle domain of gammacoronavirus Nonstructural protein 15 (Nsp15), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Coronavirus Nsp15 NendoUs have an N-terminal domain, a middle (M) domain and a C-terminal catalytic (NendoU) domain. Coronavirus Nsp15 from Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), human Coronavirus 229E (HCoV229E), and Murine Hepatitis Virus (MHV) form a functional hexamer. This middle domain harbors residues involved in hexamer formation and in trimer stability. Oligomerization of Porcine DeltaCoronavirus (PDCoV) Nsp15 differs from that of the other coronaviruses; it has been shown to exist as a dimer and a monomer in solution.


Pssm-ID: 394906  Cd Length: 123  Bit Score: 56.02  E-value: 2.39e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 6569 PELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCiESLNVLFDGRDNGALEAFKKCRNGVYINTTKIKNLS 6648
Cdd:cd21168     3 PNTAILYGLGVDVTAGFTIWDYENSQPVFRNTVKVCKYTDIEP-NGLCVLYDDRYKGDYQRFLAADNAVLISTQCYKVYS 81

                  ....*..
gi 381354069 6649 MIKGPQR 6655
Cdd:cd21168    82 SVRIPSS 88
Euroniviridae_RdRp cd23191
catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the family Eunroniviridae of ...
4987-5323 3.99e-08

catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the family Eunroniviridae of positive-sense single-stranded RNA [(+)ssRNA] viruses; This group contains the catalytic core domain of RdRp of RNA viruses belonging to the family Eunroniviridae, order Nidovirales. Member viruses have a viral envelope and (+)ssRNA genome. Eunroniviridae is a closely related family of crustacean nidoviruses, within the suborder Ronidovirineae, which also includes the family Roniviridae. Ronidovirineae, named "rod-shaped nidovirus", is 150-200 nm long and approximately 60 nm thick. There are 3 viral species in the Euroniviridae family, all of which have been detected in crustaceans. The structure of Euroniviridae RdRp contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438041  Cd Length: 345  Bit Score: 59.53  E-value: 3.99e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4987 TLTQMNLKYAISAKNRA--RTVAGVSILSTMTgRMFHQKCLKSIAATRGVpvVIGTTKFYGGWDDMLRrLIKDVDSPVLM 5064
Cdd:cd23191     1 FITQVRPKIAVQPQEKPlrSIISGSPVITDCI-RHVTQNMMRIMVSLRHL--FIGNRADPRGFTEMLQ-FLEESPADYQV 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5065 GWDYPKCDRAMPNILRIVSSLVLARKHDSCCSHTDRFYRL-ANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVf 5143
Cdd:cd23191    77 SLDHSKFDRRVDSLLSYAGHLATMDLTDLCGHDPQLVHNImASHFMTYTYNLLLFDGMLYIKNGGVSSGNSITALNNSL- 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5144 nICQAVSANVCSLMACNGHKIeDLSIRELQKRLYSNVYRADHVDPafvseyyEFLNKHFSMMILSDDGVVCYNSEFAS-K 5222
Cdd:cd23191   156 -AAQQHTFICCMREALKGPKI-QWEYQKYQFDLFMDPMELIDIEP-------NKIWKYFRIAGLSDDVVASVPSMLIDpD 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5223 GYIANISAFQQVLYYQNNVFMSEAkcwvetdiEKGPHEFCSQHTMLVkMDGDEVYLPYPDPSRILGAGCFVDDLLKTDSV 5302
Cdd:cd23191   227 DLMAQFKSFGYIMVKDKKYFVSGK--------DEPPTELMSRWPERV-PVGPEIEMPHPTVDRVLSSMLLIEKRSSLDPL 297
                         330       340
                  ....*....|....*....|.
gi 381354069 5303 LLIERFVSLAIDAYPLVHHEN 5323
Cdd:cd23191   298 VKRMRTISILLDGITLVFSKQ 318
DEXSc_RecD-like cd17933
DEXS-box helicase domain of RecD and similar proteins; RecD is a member of the RecBCD (EC 3.1. ...
5656-5787 8.37e-08

DEXS-box helicase domain of RecD and similar proteins; RecD is a member of the RecBCD (EC 3.1.11.5, Exonuclease V) complex. It is the alpha chain of the complex and functions as a 3'-5' helicase. The RecBCD enzyme is both a helicase that unwinds, or separates the strands of DNA, and a nuclease that makes single-stranded nicks in DNA. RecD is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350691 [Multi-domain]  Cd Length: 155  Bit Score: 55.25  E-value: 8.37e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5656 RYCTVQGPPGTGKSHLAIGLAVYYC--TARVVYTAASHAAVDALCEKA-------HKFLNINDcTRIVPAKVRVDCYDKf 5726
Cdd:cd17933    13 RVSVLTGGAGTGKTTTLKALLAALEaeGKRVVLAAPTGKAAKRLSESTgieastiHRLLGINP-GGGGFYYNEENPLDA- 90
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069 5727 kvndttrkyvfttinalpelvtDIIVVDEVSMLTNYELSVINSRVRAK-HYVYIGDPAQLPA 5787
Cdd:cd17933    91 ----------------------DLLIVDEASMVDTRLMAALLSAIPAGaRLILVGDPDQLPS 130
Macro_Af1521_BAL-like cd02907
macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse ...
1321-1446 9.51e-08

macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. The macrodomains in this family show similarity to Af1521, a protein from Archaeoglobus fulgidus containing a stand-alone macrodomain. Af1521 binds ADP-ribose and exhibits phosphatase activity toward ADP-ribose-1"-monophosphate (Appr-1"-p). Also included in this family are the N-terminal (or first) macrodomains of BAL (B-aggressive lymphoma) proteins which contain multiple macrodomains, such as the first macrodomain of mono-ADP-ribosyltransferase PARP14 (PARP-14, also known as ADP-ribosyltransferase diphtheria toxin-like 8, ATRD8, B aggressive lymphoma protein 2, or BAL2). Most BAL proteins also contain a C-terminal PARP active site and are also named as PARPs. Human BAL1 (or PARP-9) was originally identified as a risk-related gene in diffuse large B-cell lymphoma that promotes malignant B-cell migration. Some BAL family proteins exhibit PARP activity. Poly (ADP-ribosyl)ation is an immediate DNA-damage-dependent post-translational modification of histones and other nuclear proteins. BAL proteins may also function as transcriptional repressors.


Pssm-ID: 394877 [Multi-domain]  Cd Length: 158  Bit Score: 55.19  E-value: 9.51e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1321 NVCFVKGDVIKVarlvEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKNQGVCLVGECYE-SAGGKLCKKVL 1399
Cdd:cd02907     3 KVSVYKGDITKE----KVDAIVNAANERLKHGGGVAGAISKAGGPEIQEECDKYIKKNGKLRVGEVVVtSAGKLPCKYVI 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 381354069 1400 NIVGPDARGQG-RQCYSLLERA-YQHINKCDNV-VTTL----ISAGIFSVPTDV 1446
Cdd:cd02907    79 HAVGPRWSGGSkEECEDLLYKAvLNSLEEAEELkATSIaipaISSGIFGFPLDL 132
1B_UPF1_nv_SF1_Hel-like cd21344
1B domain of eukaryotic UPF1 helicase, nidovirus SF1 helicases including coronavirus Nsp13 and ...
5531-5609 1.05e-07

1B domain of eukaryotic UPF1 helicase, nidovirus SF1 helicases including coronavirus Nsp13 and arterivirus Nsp10, and related proteins; Helicases catalyze NTP-dependent unwinding of nucleic acid duplexes into single strands and are classified based on the arrangement of conserved motifs into six superfamilies. Members of this family belong to helicase superfamily 1 (SF1) and include nidoviral helicases such as Severe Acute Respiratory Syndrome coronavirus (SARS) non-structural protein 13 (SARS-Nsp13), Equine arteritis virus (EAV) Nsp10, and eukaryotic UPF1 RNA helicase. SARS-Nsp13 is a component of the viral RNA synthesis replication and transcription complex (RTC). UPF1 participates in nonsense-mediated mRNA decay (NMD), a pathway which degrades transcripts with premature termination codons. UPF1, EAV Nsp10 and SARS-Nsp13 are multidomain proteins with an N-terminal Cys/His rich zinc-binding domain (CH/ZBD), a 1B domain and a SF1 helicase core. The 1B domain is involved in nucleic acid substrate binding; the 1B domain of EAV Nsp10 undergoes large conformational change upon substrate binding, and together with the 1A and 2A domains of the helicase core form a channel that accommodates the single stranded nucleic acids.


Pssm-ID: 439170  Cd Length: 86  Bit Score: 52.70  E-value: 1.05e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5531 ASATIREIV---SDRELILSWEIGK--VRPPLNKNYVFTGYHFTNNGKTVLGEYV--FDKSELTNGVYYRATTTYKLSVG 5603
Cdd:cd21344     1 LIITVRWRLalnDFRGAYFSLEKGKsqCKPPLGDEIVLTYYGDTVPLWEGIGEVIdlPNTGNDDDALELKGSTTYPLTVT 80

                  ....*.
gi 381354069 5604 DVFILT 5609
Cdd:cd21344    81 HIFVLT 86
RecD COG0507
ATPase/5#-3# helicase helicase subunit RecD of the DNA repair enzyme RecBCD (exonuclease V) ...
5660-5787 4.24e-07

ATPase/5#-3# helicase helicase subunit RecD of the DNA repair enzyme RecBCD (exonuclease V) [Replication, recombination and repair];


Pssm-ID: 440273 [Multi-domain]  Cd Length: 514  Bit Score: 56.91  E-value: 4.24e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5660 VQGPPGTGKSHLAIGLAVYYCTA--RVVYTAASHAAVDALCEKA-------HKFLNINDctrivpakvrvdcydkfkvnd 5730
Cdd:COG0507   145 LTGGAGTGKTTTLRALLAALEALglRVALAAPTGKAAKRLSESTgieartiHRLLGLRP--------------------- 203
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 381354069 5731 TTRKYVFTTINALPELvtDIIVVDEVSMLTNYELSVINSRVRAKHY--VYIGDPAQLPA 5787
Cdd:COG0507   204 DSGRFRHNRDNPLTPA--DLLVVDEASMVDTRLMAALLEALPRAGArlILVGDPDQLPS 260
NendoU_av_Nsp11-like cd21160
Nidoviral uridylate-specific endoribonuclease (NendoU) domain of arterivirus PRRSV ...
6817-6873 1.29e-06

Nidoviral uridylate-specific endoribonuclease (NendoU) domain of arterivirus PRRSV Nonstructural protein 11 (Nsp11), and related proteins; Nidovirus endoribonucleases (NendoUs) are uridylate-specific endoribonucleases, which release a cleavage product containing a 2',3'-cyclic phosphate at the 3' terminal end. NendoUs include Nsp15 from coronaviruses and Nsp11 from arteriviruses, both of which may participate in the viral replication process and in the evasion of the host immune system. Mn2+ is dispensable, and to some extent inhibits the activity of arterivirus (Porcine Reproductive and Respiratory Syndrome virus) PRRSV Nsp11. This Nsp11 exists as a dimer. NendoUs are distantly related to Xenopus laevis Mn(2+)-dependent uridylate-specific endoribonuclease (XendoU) which is involved in the processing of intron-encoded box C/D U16 small, nucleolar RNA.


Pssm-ID: 394911  Cd Length: 120  Bit Score: 50.78  E-value: 1.29e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 6817 SSKSVCTVIDLLLDDFVDILKSLNlncVSKVVNVNVDFKDFQFMLWcneeKVMTFYP 6873
Cdd:cd21160    69 AAKALCTVTDVYLPYLEPYLNPPT---QSKVYKVNIDFKPVRLMVW----KDATMYF 118
DEXXQc_DNA2 cd18041
DEXXQ-box helicase domain of DNA2; DNA2 (DNA Replication Helicase/Nuclease 2) possesses ...
5648-5786 1.37e-06

DEXXQ-box helicase domain of DNA2; DNA2 (DNA Replication Helicase/Nuclease 2) possesses different enzymatic activities, such as single-stranded DNA (ssDNA)-dependent ATPase, 5-3 helicase, and endonuclease activities, and is involved in DNA replication and DNA repair in the nucleus and mitochondrion. It is involved in Okazaki fragment processing by cleaving long flaps that escape FEN1: flaps that are longer than 27 nucleotides are coated by replication protein A complex (RPA), leading to recruit DNA2 which cleaves the flap until it is too short to bind RPA and becomes a substrate for FEN1. It is also involved in 5-end resection of DNA during double-strand break (DSB) repair; it is recruited by BLM and mediates the cleavage of 5-ssDNA, while the 3-ssDNA cleavage is prevented by the presence of RPA. DNA2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350799 [Multi-domain]  Cd Length: 203  Bit Score: 52.62  E-value: 1.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5648 NYQHIGMKR------YCTVQGPPGTGKSHLAIGL--AVYYCTARVVYTAASHAAVDALCEKAHK----FLNINDCTRIVP 5715
Cdd:cd18041     4 KDQRQAIKKvlnakdYALILGMPGTGKTTTIAALvrILVALGKSVLLTSYTHSAVDNILLKLKKfgvnFLRLGRLKKIHP 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5716 -------AKVRVDCYDkfkVNDTTRKY-----VFTT---IN--ALPELVTDIIVVDEVSMLTnyELSVINSRVRAKHYVY 5778
Cdd:cd18041    84 dvqeftlEAILKSCKS---VEELESKYesvsvVATTclgINhpIFRRRTFDYCIVDEASQIT--LPICLGPLRLAKKFVL 158

                  ....*...
gi 381354069 5779 IGDPAQLP 5786
Cdd:cd18041   159 VGDHYQLP 166
SF1_C cd18786
C-terminal helicase domain of superfamily 1 DEAD/H-box helicases; Superfamily (SF)1 family ...
5912-5952 2.70e-06

C-terminal helicase domain of superfamily 1 DEAD/H-box helicases; Superfamily (SF)1 family members include UvrD/Rep, Pif1-like, and Upf-1-like proteins. Similar to SF2 helicases, they do not form toroidal, predominantly hexameric structures like SF3-6. SF1 helicases are a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Their helicase core is surrounded by C- and N-terminal domains with specific functions such as nucleases, RNA or DNA binding domains, or domains engaged in protein-protein interactions. The core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350173 [Multi-domain]  Cd Length: 89  Bit Score: 48.97  E-value: 2.70e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 381354069 5912 TVDSAQGSEYDFVIYSqtAETAHSVNVNRFNVAITRAKKGI 5952
Cdd:cd18786    47 TIDSSQGLTFDVVTLY--LPTANSLTPRRLYVALTRARKRL 85
Arteriviridae_RdRp cd23189
catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the family Arteriviridae of ...
4982-5176 9.50e-06

catalytic core domain of RNA-dependent RNA polymerase (RdRp) in the family Arteriviridae of positive-sense single-stranded RNA [(+)ssRNA] viruses; This group contains the catalytic core domain of RdRp of RNA viruses belonging to the family Arteriviridae, order Nidovirales. Member viruses have a viral envelope and (+)ssRNA genome. The overall genome organization of the Arteriviruses are highly similar to the Coronaviruses; however, they lack the spike proteins of the coronaviruses. The family members include equine arteritis virus (EAV), porcine reproductive and respiratory syndrome virus (PRRSV), lactate dehydrogenase elevating virus of mice, and simian hemorrhagic fever virus (SHFV). The structure of Arteriviridae RdRp contains a RdRp domain as well as a large N-terminal extension that adopts a nidovirus RdRp-associated nucleotidyltransferase (NiRAN) architecture. The RdRp domain displays a right hand with three functional subdomains, called fingers, palm, and thumb. All RdRps contain conserved polymerase motifs (A-G), located in the palm (A-E motifs) and finger (F-G) subdomains. All these motifs have been implicated in RdRp fidelity such as processes of correct incorporation and reorganization of nucleotides.


Pssm-ID: 438039 [Multi-domain]  Cd Length: 323  Bit Score: 51.87  E-value: 9.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 4982 RNVLPTLTQMNLKYAISAKNRARTVAGV---------SILSTMTGRmFHQKCLKSiaatrgvPVVIGTTKFyggwddmlR 5052
Cdd:cd23189     2 RENWQTVTPCTLKKQYCSKKKTRTILGTnnlialalrAALSGVTQG-FMKAGFNS-------PIALGKNKF--------K 65
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 5053 RLIKDVDSPVLMGwDYPKCDRAMPNILRIVSSLVLARKhdSCCSHTDRFYrLANECAQVLSeiVMCGGCyyVKPGGTSSG 5132
Cdd:cd23189    66 PLQTPVLGRCLEA-DLASCDRSTPAIVRWFAANLLFEL--ACAEECLPSY-VLNCCHDLLV--TQSGAF--TKRGGLSSG 137
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 381354069 5133 DATTAFANSVFNICQAVSANVCSLMACnGHKIedlSIRELQKRL 5176
Cdd:cd23189   138 DPVTSISNTIYSLVIYTQHMVLSALKE-GHPI---GLKFLQDQL 177
AAA_11 pfam13086
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
5660-5721 1.08e-04

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins.


Pssm-ID: 404072 [Multi-domain]  Cd Length: 248  Bit Score: 47.72  E-value: 1.08e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 381354069  5660 VQGPPGTGKSHLAIGLAVYYCT---------ARVVYTAASHAAVDALCEK--AHKFLNINDCTRIVPAKVRVD 5721
Cdd:pfam13086   18 IQGPPGTGKTTTIVELIRQLLSypatsaaagPRILVCAPSNAAVDNILERllRKGQKYGPKIVRIGHPAAISE 90
deltaCoV_PLPro cd21734
deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1684-1756 2.07e-04

deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in the non-structural protein 3 (Nsp3) region of deltacoronavirus, including Porcine deltacoronavirus, Bulbul coronavirus HKU11, and Common moorhen coronavirus HKU21. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409651  Cd Length: 313  Bit Score: 47.42  E-value: 2.07e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 381354069 1684 LLKYYNMLGMC--KWPVVVCGNYFAFKQSNNNCYINVACLMLQHLNLKFpKWQWQEAWNEFRSGKPLRFVSLVLA 1756
Cdd:cd21734    75 LSQYCVYLKYChhKWSVSRTNGLMHLKQKDNNCFVSAAINLFQNTHYQL-RPAIDALYQEYLNGNPSRFVAWIYA 148
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1322-1443 3.19e-04

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 45.17  E-value: 3.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1322 VCFVKGDVIKVarlvEAEVIVNPANGRMAHGAGVAGAIAKAAGKFFIKETADMVKnQGVCLVGECYESAGGKL-CKKVLN 1400
Cdd:COG2110     1 IEIVQGDITEL----DVDAIVNAANSSLLGGGGVAGAIHRAAGPELLEECRRLCK-QGGCPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 381354069 1401 IVGPDARGQGRQCYSLLERAYQHI------NKCDNVVTTLISAGIFSVP 1443
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaeeLGIRSIAFPAIGTGVGGFP 124
DnaC COG1484
DNA replication protein DnaC [Replication, recombination and repair];
5662-5690 1.34e-03

DNA replication protein DnaC [Replication, recombination and repair];


Pssm-ID: 441093 [Multi-domain]  Cd Length: 242  Bit Score: 44.39  E-value: 1.34e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 381354069 5662 GPPGTGKSHLAIGLAVYYCTA--RVVYTAAS 5690
Cdd:COG1484   106 GPPGTGKTHLAIALGHEACRAgyRVRFTTAP 136
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
5662-5690 1.62e-03

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 44.00  E-value: 1.62e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 381354069 5662 GPPGTGKSHLAIGLAVYYCTA--RVVYTAAS 5690
Cdd:NF038214   97 GPPGTGKTHLAIALGYAACRQgyRVRFTTAA 127
MERS-CoV-like_Nsp3_NAB cd21823
nucleic acid binding domain of non-structural protein 3 from Middle East respiratory ...
1942-2059 1.91e-03

nucleic acid binding domain of non-structural protein 3 from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), and appears to be partially conserved in the Nsp3 NAB from betacoronaviruses in the C lineage.


Pssm-ID: 409349  Cd Length: 123  Bit Score: 41.66  E-value: 1.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069 1942 KYYT-KPIIKAQFRTFEK*DGVYTNFKL*GH-------SIAEKL*AKLGFDCDSPFVE-YKITEWPTATGDV*LASDDLY 2012
Cdd:cd21823     1 KYFTsKPPIEYSPATVLA-GSVYTNSCLVASdgtpggdAISLAFNNLLGFDESKPVSKkLTYSLLPNEDGDVLLAEFSTY 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 381354069 2013 VSRYLSGCITFGKPVVWLGHeeASLKS-LTYFNRPSVvcENKFNVLPV 2059
Cdd:cd21823    80 DPIYKNGAMLKGKPILWVNN--GLFDSaLNKFNRASL--RQIYDVAPV 123
betaCoV_Nsp2_MERS-like cd21517
betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins ...
253-436 2.84e-03

betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins from betacoronaviruses in the C lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Middle East respiratory syndrome-related coronavirus (MERS-CoV) and betacoronaviruses in the merbecovirus subgenus (C lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394868  Cd Length: 660  Bit Score: 44.72  E-value: 2.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  253 VDQYGCDYTGclaKGLEDYGDLTLSEM----------------KELFPVWRESLDNevvVAWHVDRDPRAVMRlQTLATL 316
Cdd:cd21517     5 IDQYMCGKDG---KPIADYAALAAKEGltkladveadvssradSDGFITFKNKLYR---IVWHVERKDVPYPK-QTIFTI 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  317 RSIdyvgqptedvVDGDVVVRAPAHLLAADALVKRL-PR----------LVETMLYTdssvteFCYKTKLCDCGFITQFG 385
Cdd:cd21517    78 NSV----------VQKDGIEDVPPHSFTLGGKVLVLvPRnkwggksdltLKQKLLYT------FYGKDAVENPSYIYHSA 141
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 381354069  386 YVDCCGdtCDFRGWVPGNMLDGFPCpGCSKSYMPWELEAQSSGVIPEGGVL 436
Cdd:cd21517   142 FVDCTS--CGNGSWLTGNAVQGFAC-DCGASYSANDVELQSSGLVKPNALF 189
PRK06526 PRK06526
transposase; Provisional
5662-5702 2.97e-03

transposase; Provisional


Pssm-ID: 180607  Cd Length: 254  Bit Score: 43.32  E-value: 2.97e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 381354069 5662 GPPGTGKSHLAIGLAVYYCTA--RVVYTAASHaAVDALCEKAH 5702
Cdd:PRK06526  105 GPPGTGKTHLAIGLGIRACQAghRVLFATAAQ-WVARLAAAHH 146
DEXXQc_UPF1 cd18039
DEXXQ-box helicase domain of UPF1; UPF1 (also called RNA Helicase And ATPase, Regulator Of ...
5655-5703 3.46e-03

DEXXQ-box helicase domain of UPF1; UPF1 (also called RNA Helicase And ATPase, Regulator Of Nonsense Transcripts, or ATP-Dependent Helicase RENT1) is an RNA-dependent helicase and ATPase required for nonsense-mediated decay (NMD) of mRNAs containing premature stop codons. It is recruited to mRNAs upon translation termination and undergoes a cycle of phosphorylation and dephosphorylation; its phosphorylation appears to be a key step in NMD. It is recruited by release factors to stalled ribosomes together with the SMG1C protein kinase complex to form the transient SURF (SMG1-UPF1-eRF1-eRF3) complex. In EJC-dependent NMD, the SURF complex associates with the exon junction complex (EJC) located downstream from the termination codon through UPF2 and allows the formation of an UPF1-UPF2-UPF3 surveillance complex which is believed to activate NMD. Diseases associated with UPF1 include juvenile amyotrophic lateral sclerosis and epidermolysis bullosa, junctional, non-Herlitz type. UPF1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350797 [Multi-domain]  Cd Length: 234  Bit Score: 43.00  E-value: 3.46e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 381354069 5655 KRYCTVQGPPGTGKS--------HLaiglaVYYCTARVVYTAASHAAVDALCEKAHK 5703
Cdd:cd18039    16 RPLSLIQGPPGTGKTvtsativyHL-----VKQGNGPVLVCAPSNVAVDQLTEKIHQ 67
UvrD_C_2 pfam13538
UvrD-like helicase C-terminal domain; This domain is found at the C-terminus of a wide variety ...
5912-5950 4.05e-03

UvrD-like helicase C-terminal domain; This domain is found at the C-terminus of a wide variety of helicase enzymes. This domain has a AAA-like structural fold.


Pssm-ID: 463913 [Multi-domain]  Cd Length: 52  Bit Score: 38.71  E-value: 4.05e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 381354069  5912 TVDSAQGSEYDFVIYSQTAETAHSVNVNRFN---VAITRAKK 5950
Cdd:pfam13538    6 TVHKAQGSEFPAVFLVDPDLTAHYHSMLRRRllyTAVTRARK 47
betaCoV_Nsp2_SARS-like cd21516
betacoronavirus non-structural protein 2 (Nsp2) similar to SARS-CoV Nsp2, and related proteins ...
252-777 4.94e-03

betacoronavirus non-structural protein 2 (Nsp2) similar to SARS-CoV Nsp2, and related proteins from betacoronaviruses in the B lineage; Non-structural proteins (Nsps) from Severe acute respiratory syndrome coronavirus (SARS-CoV) and betacoronaviruses in the sarbecovirus subgenus (B lineage) are encoded in ORF1a and ORF1b. Post infection, the SARS-CoV genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. The function of Nsp2 remains unknown. Deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. Rather than playing a role in viral replication, SARS-CoV Nsp2 may be involved in altering the host cell environment; it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis.


Pssm-ID: 439199  Cd Length: 637  Bit Score: 43.61  E-value: 4.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  252 FVDQYGCDYTG----CLAKGLEDYG--DLTLSEMKELFPVWRESL---DNEVVVAWHVDRDpRAVMRLQTLATLRSIdyv 322
Cdd:cd21516     4 YVDNNFCGPDGypleCIKDLLARAGksSCPLSEQLDFIGLKRGVYccrEHEHEIAWYTERS-EKSYELQTPFEIKSA--- 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  323 gqptedVVDGDVVVRAPAHLLAADALVKRL-PRLVET----------MLYTDSSVTEF-------------CYKTKLCDC 378
Cdd:cd21516    80 ------KKFDTFKGECPHFVFPLNSTVKVIqPRVEKKktegfmgrirSVYPVASPGECnpmalstlmkcnhCGETSWQTS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  379 GFITQF------GYVDCCG-DTCdfrGWVPGNMLDGFPCPGCsksympweleaQSSGVIPEGGVlfTQSTDTVNREAFKL 451
Cdd:cd21516   154 DFLKATcefcgtENLTKEGpTTC---GYLPQNAVVKMPCPAC-----------KNDEVGPEHSL--ADYHNHSGIETRLR 217
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  452 YGHAVVPFGSAVYwsPYPG------MWLPVVWSSVksysGLTYTGVVG---------CKAIVQ---------------ET 501
Cdd:cd21516   218 KGGRTVCFGGCVF--AYVGcynkcaYWVPRASANI----GSNHTGVVGedvetlnddLLEILQrekvninivgdfklnEE 291
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  502 DAI---------------CRSLymDYVQHK-----CGNLdqRATLGlddvYHRQLLVNRGDYSLLLENVDLFVKR----- 556
Cdd:cd21516   292 VAIilasfsastsafietVKGL--DYKTFKqivesCGNF--KVTKG----KAKKGAWNIGTQKSVLTPLLAFPSQaagvv 363
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  557 RAEFACKFATCGDGFVPL------LLDGLVPRSYYLIKSGQAYTSMMVNfshEVIDMCMDMALLfmhdVKVATKYVKKFT 630
Cdd:cd21516   364 RSIFSRTLDTAGHSLRALqraaitILDGISPQSLRLLDAMVFTSDLATN---SVLVMAYDTGGL----VQVTSQWLDNLF 436
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 381354069  631 GKLAVRFKalgvAVVRKITEWFDLAVDIAASAAGWLCYqLVNGLFAVANG-VITFVQEAPELVKNFVAKFRAFFKVLIDS 709
Cdd:cd21516   437 GTCADKLK----PVLTWLEEKLKEGVDFLRDAWEILKF-LVTGAYKIVKGqIVLAAKNIKECVQSFVAVVNKVLSLCYDQ 511
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 381354069  710 MsvsILSGLTVVKTASNRVCLAGSKVYEVVQ---KSLSAYVLPVGCS-EATCLVGESEPAVFEDDVVgVVKT 777
Cdd:cd21516   512 I---QIAGAKVGALNLGETFIAQSKGLYRVCvraREIQQLLMPLKAPkELTFLEGDTLDTELTSEEV-VLKT 579
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH