BACKGROUND: In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well of regulating translation of endogenous mRNAs. The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the target. Key components of the RNAi systems are proteins of the Argonaute-PIWI family some of which function as slicers, the nucleases that cleave the target RNA that is base-paired to a guide RNA. Numerous prokaryotes possess the CRISPR-associated system (CASS) of defense against phages and plasmids that is, in part, mechanistically analogous but not homologous to eukaryotic RNAi systems. Many prokaryotes also encode homologs of Argonaute-PIWI proteins but their functions remain unknown. RESULTS: We present a detailed analysis of Argonaute-PIWI protein sequences and the genomic neighborhoods of the respective genes in prokaryotes. Whereas eukaryotic Ago/PIWI proteins always contain PAZ (oligonucleotide binding) and PIWI (active or inactivated nuclease) domains, the prokaryotic Argonaute homologs (pAgos) fall into two major groups in which the PAZ domain is either present or absent. The monophyly of each group is supported by a phylogenetic analysis of the conserved PIWI-domains. Almost all pAgos that lack a PAZ domain appear to be inactivated, and the respective genes are associated with a variety of predicted nucleases in putative operons. An additional, uncharacterized domain that is fused to various nucleases appears to be a unique signature of operons encoding the short (lacking PAZ) pAgo form. By contrast, almost all PAZ-domain containing pAgos are predicted to be active nucleases. Some proteins of this group (e.g., that from Aquifex aeolicus) have been experimentally shown to possess nuclease activity, and are not typically associated with genes for other (putative) nucleases. Given these observations, the apparent extensive horizontal transfer of pAgo genes, and their common, statistically significant over-representation in genomic neighborhoods enriched in genes encoding proteins involved in the defense against phages and/or plasmids, we hypothesize that pAgos are key components of a novel class of defense systems. The PAZ-domain containing pAgos are predicted to directly destroy virus or plasmid nucleic acids via their nuclease activity, whereas the apparently inactivated, PAZ-lacking pAgos could be structural subunits of protein complexes that contain, as active moieties, the putative nucleases that we predict to be co-expressed with these pAgos. All these nucleases are predicted to be DNA endonucleases, so it seems most probable that the putative novel phage/plasmid-defense system targets phage DNA rather than mRNAs. Given that in eukaryotic RNAi systems, the PAZ domain binds a guide RNA and positions it on the complementary region of the target, we further speculate that pAgos function on a similar principle (the guide being either DNA or RNA), and that the uncharacterized domain found in putative operons with the short forms of pAgos is a functional substitute for the PAZ domain. CONCLUSION: The hypothesis that pAgos are key components of a novel prokaryotic immune system that employs guide RNA or DNA molecules to degrade nucleic acids of invading mobile elements implies a functional analogy with the prokaryotic CASS and a direct evolutionary connection with eukaryotic RNAi. The predictions of the hypothesis including both the activities of pAgos and those of the associated endonucleases are readily amenable to experimental tests.
BACKGROUND: In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well of regulating translation of endogenous mRNAs.
The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the target.
Key components of the RNAi systems are proteins of the Argonaute-PIWI family some of which function as slicers,
the nucleases that cleave the target RNA that is base-paired to a guide RNA.
Numerous prokaryotes possess the CRISPR-associated system (CASS) of defense against phages and plasmids that is, in part,
mechanistically analogous but not homologous to eukaryotic RNAi systems.
Many prokaryotes also encode homologs of Argonaute-PIWI proteins but their functions remain unknown.
RESULTS: We present a detailed analysis of Argonaute-PIWI protein sequences and the genomic neighborhoods of the respective genes in prokaryotes.
Whereas eukaryotic Ago/PIWI proteins always contain PAZ (oligonucleotide binding) and PIWI (active or inactivated nuclease) domains, the prokaryotic Argonaute homologs (pAgos) fall into two major groups in which the PAZ domain is either present or absent. The monophyly of each group is supported by a phylogenetic analysis of the conserved PIWI-domains. Almost all pAgos that lack a PAZ domain appear to be inactivated, and the respective genes are associated with a variety of predicted nucleases in putative operons. An additional, uncharacterized domain that is fused to various nucleases appears to be a unique signature of operons encoding the short (lacking PAZ) pAgo form. By contrast, almost all PAZ-domain containing pAgos are predicted to be active nucleases. Some proteins of this group (e.g., that from Aquifex aeolicus) have been experimentally shown to possess nuclease activity, and are not typically associated with genes for other (putative) nucleases. Given these observations, the apparent extensive horizontal transfer of pAgo genes, and their common, statistically significant over-representation in genomic neighborhoods enriched in genes encoding proteins involved in the defense against phages and/or plasmids, we hypothesize that pAgos are key components of a novel class of defense systems. The PAZ-domain containing pAgos are predicted to directly destroy virus or plasmid nucleic acids via their nuclease activity, whereas the apparently inactivated, PAZ-lacking pAgos could be structural subunits of protein complexes that contain, as active moieties, the putative nucleases that we predict to be co-expressed with these pAgos. All these nucleases are predicted to be DNA endonucleases, so it seems most probable that the putative novel phage/plasmid-defense system targets phage DNA rather than mRNAs. Given that in eukaryotic RNAi systems, the PAZ domain binds a guide RNA and positions it on the complementary region of the target, we further speculate that pAgos function on a similar principle (the guide being either DNA or RNA), and that the uncharacterized domain found in putative operons with the short forms of pAgos is a functional substitute for the PAZ domain. CONCLUSION: The hypothesis that pAgos are key components of a novel prokaryotic immune system that employs guide RNA or DNA molecules to degrade nucleic acids of invading mobile elements implies a functional analogy with the prokaryotic CASS and a direct evolutionary connection with eukaryotic RNAi. The predictions of the hypothesis including both the activities of pAgos and those of the associated endonucleases are readily amenable to experimental tests.
Argonaute proteins comprise a diverse protein family and can be found in both prokaryotes and eukaryotes :ref{doi=10.1016/j.mib.2023.102313}. Despite low sequence conservation, eAgos and long pAgos generally have a conserved domain architecture and share a common mechanism of action; they use a 5′-phosphorylated single stranded nucleic acid guide (generally 15-22 nt in length) to target complementary nucleic acid sequences :ref{doi=10.1038/nsmb.2879} eAgos strictly mediate RNA-guided RNA silencing, while pAgos show higher mechanistic diversification, and can make use of guide RNAs and/or single-stranded guide DNAs to target RNA and/or DNA targets :ref{doi=10.1016/j.mib.2023.102313}. Depending on the presence of catalytic residues and the degree of complementarity between the guide and target sequences, eAgo and pAgos either cleave the target, or recruit and/or activate accessory proteins. This can result in degradation of the target nucleic acid, but might also trigger alternative downstream effects, ranging from poly(A) tail shortening and RNA decapping :ref{doi=10.1016/J.CELL.2018.03.006} or chromatin formation in eukaryotes :ref{doi=10.1038/s41580-022-00528-0}, to abortive infection in prokaryotes :ref{doi=10.1016/j.tcb.2022.10.005}.
Argonaute proteins comprise a diverse protein family and can be found in both prokaryotes and eukaryotes :ref{doi=10.1016/j.mib.2023.102313}. Despite low sequence conservation, eAgos and long pAgos generally have a conserved domain architecture and share a common mechanism of action; they use a 5'-phosphorylated single stranded nucleic acid guide (generally 15-22 nt in length) to target complementary nucleic acid sequences :ref{doi=10.1038/nsmb.2879} eAgos strictly mediate RNA-guided RNA silencing, while pAgos show higher mechanistic diversification, and can make use of guide RNAs and/or single-stranded guide DNAs to target RNA and/or DNA targets :ref{doi=10.1016/j.mib.2023.102313}. Depending on the presence of catalytic residues and the degree of complementarity between the guide and target sequences, eAgo and pAgos either cleave the target, or recruit and/or activate accessory proteins. This can result in degradation of the target nucleic acid, but might also trigger alternative downstream effects, ranging from poly(A) tail shortening and RNA decapping :ref{doi=10.1016/J.CELL.2018.03.006} or chromatin formation in eukaryotes :ref{doi=10.1038/s41580-022-00528-0}, to abortive infection in prokaryotes :ref{doi=10.1016/j.tcb.2022.10.005}.
## Molecular mechanism
...
...
@@ -33,7 +41,7 @@ Based on their phylogeny, Agos have been subdivided in various (sub)clades. eAgo
### Long-A pAgos
Akin to eAgos, most long A-pAgos characterized to date have a N-L1-PAZ-L2-MID-PIWI domain architecture :ref{doi=10.1038/nsmb.2879}. In contrast to eAgos, however, certain long-A pAgos use a single stranded guide DNA to bind and cleave complementary target DNA sequences :ref{doi=10.1093/nar/gkz306,10.1093/nar/gkz379,10.1038/s41586-020-2605-1,10.1093/nar/gkv415,10.1038/nature12971,10.1038/nmicrobiol.2017.35}. Long-A pAgos are preferentially programmed with guide DNAs targeting invading DNA through a poorly understood mechanism, which might involve DNA repair proteins :ref{doi=10.1038/s41586-020-2605-1} or the pAgo itself :ref{doi=10.1016/j.molcel.2017.01.033,10.1038/nmicrobiol.2017.34}. Most long-A pAgos have an intact catalytic site in the PIWI domain which allows to cleave their targets :ref{doi=10.1073/pnas.1321032111}. As such, they act as an innate immune system that clear plasmid and phage DNA from the cell :ref{doi=10.1093/nar/gkz379,10.1038/s41586-020-2605-1,10.1093/nar/gkv415,10.1038/nature12971,10.1093/nar/gkad290}.
Within the long-A pAgo clade various subclades of other pAgos exist that rely on distinct function mechanisms. For example, various long-A pAgo can (additionally) use guide RNAs and/or cleave RNA targets. Furthermore, CRISPR-associated pAgos use 5′-OH guide RNAs to target DNA :ref{doi=10.1073/pnas.1524385113}, and PliAgo-like pAgos use small DNA guides to target RNA :ref{doi=10.1038/s41467-022-32079-5}. Certain long-A pAgos genetically co-localize with other putative enzymes including (but not limited to) putative nucleases, helicases, DNA-binding proteins, or PLD-like proteins :ref{doi=,10.1038/nsmb.2879,10.1128/mBio.01935-18}. The relevance of these associations is currently unknown.
Within the long-A pAgo clade various subclades of other pAgos exist that rely on distinct function mechanisms. For example, various long-A pAgo can (additionally) use guide RNAs and/or cleave RNA targets. Furthermore, CRISPR-associated pAgos use 5'-OH guide RNAs to target DNA :ref{doi=10.1073/pnas.1524385113}, and PliAgo-like pAgos use small DNA guides to target RNA :ref{doi=10.1038/s41467-022-32079-5}. Certain long-A pAgos genetically co-localize with other putative enzymes including (but not limited to) putative nucleases, helicases, DNA-binding proteins, or PLD-like proteins :ref{doi=10.1038/nsmb.2879,10.1128/mBio.01935-18}. The relevance of these associations is currently unknown.
### Long-B pAgos
Akin to long-A pAgogs, long B-pAgos have a N-L1-PAZ-L2-MID-PIWI domain composition, but most have a shorter PAZ* domain, and in contrast to long-A pAgos all long-B pAgos are catalytically inactive :ref{doi=10.1128/mBio.01935-18}. Long-B pAgos characterized to date use guide RNAs to bind invading DNA :ref{doi=10.1038/s41598-023-32600-w,10.1016/j.molcel.2013.08.014,10.1038/s41467-023-42793-3}. In absence of co-encoded proteins, long-B pAgos repress invader activity :ref{doi=10.1016/j.molcel.2013.08.014}. In addition, most long-B pAgos are co-encoded with effector proteins including (but not limited to) SIR2, nucleases, membrane proteins, and restriction endonucleases :ref{doi=10.1038/nsmb.2879,10.1128/mBio.01935-18,10.1186/1745-6150-4-29,10.1038/s41467-023-42793-3}. These effector proteins are activated upon pAgo-mediated invader detection, and generally catalyze reactions that result in cell death :ref{doi=10.1038/s41467-023-42793-3}. As such, long-B pAgo together with their associated proteins mediate abortive infection.