Skip to content
Snippets Groups Projects

Compare revisions

Changes are shown as if the source revision was being merged into the target revision. Learn more about comparing revisions.

Source

Select target project
No results found

Target

Select target project
  • mdm-lab/wiki
  • hvaysset/wiki
  • jsousa/wiki
  • tclabby/wiki
4 results
Show changes
Showing
with 264 additions and 137 deletions
......@@ -28,63 +28,62 @@ Lamassu systems function through abortive infection (Abi), but their molecular m
## Example of genomic structure
The majority of the Lamassu-Fam systems are composed of 3 proteins: LmuA, LmuB and, an accessory LmuC proteins.
LmuA is the effector and encode for different domains depending on the subsystem.
Here is an example of a Lamassu-Fam_Cap4_nuclease found in the RefSeq database:
A total of 11 subsystems have been described for the Lamassu-Fam system.
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Cap4_nuclease.svg){max-width=750px}
Here is some examples found in the RefSeq database:
Lamassu-Fam_Cap4_nuclease subsystem in the genome of *Pseudomonas sp.* (GCF_016925675.1) is composed of 3 proteins: LmuB_SMC_Hydrolase_protease (WP_205519025.1), LmuC_acc_Cap4_nuclease (WP_205478326.1)and, LmuA_effector_Cap4_nuclease_II (WP_205478325.1).
![lamassu-amidase](/lamassu-fam/Lamassu-Amidase.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Mrr.svg){max-width=750px}
The Lamassu-Amidase system in *Variovorax sp. PBL-H6* (GCF_901827155.1, NZ_LR594659) is composed of 2 proteins LmuA_effector_Amidase (WP_162575880.1) LmuB_SMC_Amidase (WP_162575881.1)
Lamassu-Fam_Mrr subsystem in the genome of *Escherichia coli* (GCF_011404895.1) is composed of 2 proteins: LmuA_effector_Mrr (WP_044864610.1)and, LmuB_SMC_Cap4_nuclease_II (WP_226199836.1).
![lamassu-cap4_nuclease](/lamassu-fam/Lamassu-Cap4_nuclease.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Hydrolase.svg){max-width=750px}
The Lamassu-Cap4_nuclease system in *Opitutus terrae* (GCF_000019965.1, NC_010571) is composed of 3 proteins LmuA_effector_Cap4_nuclease_II (WP_012377261.1) LmuC_acc_Lipase (WP_012377262.1) LmuB_SMC_Lipase (WP_012377263.1)
Lamassu-Fam_Hydrolase subsystem in the genome of *Caldisphaera lagunensis* (GCF_000317795.1) is composed of 2 proteins: LmuA_effector_Hydrolase (WP_015232255.1)and, LmuB_SMC_Hydrolase_protease (WP_015232260.1).
![lamassu-fmo](/lamassu-fam/Lamassu-FMO.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Lipase.svg){max-width=750px}
The Lamassu-FMO system in *Mesorhizobium sp. B2-1-8* (GCF_006442545.2, NZ_CP083952) is composed of 3 proteins LmuA_effector_FMO (WP_181178269.1) LmuC_acc_FMO (WP_140755520.1) LmuB_SMC_FMO (WP_181178271.1)
Lamassu-Fam_Lipase subsystem in the genome of *Bradyrhizobium elkanii* (GCF_012871055.1) is composed of 2 proteins: LmuA_effector_Lipase (WP_172647146.1)and, LmuB_SMC_Lipase (WP_172647148.1).
![lamassu-hydrolase_protease](/lamassu-fam/Lamassu-Hydrolase_Protease.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Hydrolase_protease.svg){max-width=750px}
The Lamassu-Hydrolase_Protease system in *Deefgea piscis* (GCF_013284055.1, NZ_CP054143) is composed of 4 proteins LmuA_effector_Hydrolase (WP_173533336.1) LmuA_effector_Protease (WP_173533337.1) LmuC_acc_hydrolase_protease (WP_173533338.1) LmuB_SMC_Hydrolase_protease (WP_173533339.1)
Lamassu-Fam_Hydrolase_protease subsystem in the genome of *Klebsiella pneumoniae* (GCF_022453565.1) is composed of 3 proteins: LmuB_SMC_Cap4_nuclease_II (WP_023301569.1), LmuA_effector_Protease (WP_023301563.1)and, LmuA_effector_Hydrolase (WP_023301562.1).
![lamassu-hypothetical](/lamassu-fam/Lamassu-Hypothetical.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Hypothetical.svg){max-width=750px}
The Lamassu-Hypothetical system in *Dehalococcoides mccartyi* (GCF_002007905.1, NZ_CP019866) is composed of 3 proteins LmuA_effector_hypothetical (WP_077975422.1) LmuC_acc_hypothetical (WP_077975423.1) LmuB_SMC_hypothetical (WP_077975424.1)
Lamassu-Fam_Hypothetical subsystem in the genome of *Streptococcus constellatus* (GCF_016127875.1) is composed of 2 proteins: LmuB_SMC_Cap4_nuclease_II (WP_198458038.1)and, LmuA_effector_hypothetical (WP_198458040.1).
![lamassu-lipase](/lamassu-fam/Lamassu-Lipase.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Protease.svg){max-width=750px}
The Lamassu-Lipase system in *Klebsiella michiganensis* (GCF_022869885.1, NZ_CP094999) is composed of 3 proteins LmuB_SMC_Lipase (WP_016157240.1) LmuC_acc_Lipase (WP_032931759.1) LmuA_effector_Lipase (WP_042945865.1)
Lamassu-Fam_Protease subsystem in the genome of *Azospirillum brasilense* (GCF_022023855.1) is composed of 2 proteins: LmuA_effector_Protease (WP_237905456.1)and, LmuB_SMC_Cap4_nuclease_II (WP_237905457.1).
![lamassu-mrr](/lamassu-fam/Lamassu-Mrr.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_PDDEXK.svg){max-width=750px}
The Lamassu-Mrr system in *Agrobacterium tumefaciens* (GCF_022760295.1, NZ_CP092895) is composed of 3 proteins LmuA_effector_Mrr (WP_223215665.1) LmuC_acc_Mrr (WP_220318798.1) LmuB_SMC_Mrr (WP_220318797.1)
Lamassu-Fam_PDDEXK subsystem in the genome of *Janthinobacterium sp.* (GCF_000013625.1) is composed of 2 proteins: LmuA_effector_PDDEXK (WP_012078862.1)and, LmuB_SMC_Cap4_nuclease_II (WP_012078864.1).
![lamassu-pddexk](/lamassu-fam/Lamassu-PDDEXK.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Sir2.svg){max-width=750px}
The Lamassu-PDDEXK system in *Corynebacterium pseudodiphtheriticum* (GCF_023509295.1, NZ_CP091864) is composed of 3 proteins LmuC_acc_PDDEXK (WP_249620350.1) LmuB_SMC_PDDEXK (WP_249620351.1) LmuA_effector_PDDEXK (WP_249620352.1)
Lamassu-Fam_Sir2 subsystem in the genome of *Paenibacillus polymyxa* (GCF_022492955.1) is composed of 4 proteins: LmuB_SMC_Cap4_nuclease_II (WP_240753063.1), LmuB_SMC_Sir2 (WP_240753064.1), LmuC_acc_Sir2 (WP_240753066.1)and, LmuA_effector_Sir2 (WP_240753072.1).
![lamassu-protease](/lamassu-fam/Lamassu-Protease.svg){max-width=750px}
The Lamassu-Protease system in *Enterococcus hirae* (GCF_014218175.1, NZ_CP042289) is composed of 3 proteins LmuA_effector_Protease (WP_034866244.1) LmuC_acc_hydrolase_protease (WP_181916367.1) LmuB_SMC_Hydrolase_protease (WP_034866249.1)
![lamassu-fam](/lamassu-fam/Lamassu-Fam_FMO.svg){max-width=750px}
![lamassu-sir2](/lamassu-fam/Lamassu-Sir2.svg){max-width=750px}
Lamassu-Fam_FMO subsystem in the genome of *Acinetobacter johnsonii* (GCF_021496365.1) is composed of 2 proteins: LmuA_effector_FMO (WP_234965678.1)and, LmuB_SMC_FMO (WP_234965680.1).
![lamassu-fam](/lamassu-fam/Lamassu-Fam_Amidase.svg){max-width=750px}
Lamassu-Fam_Amidase subsystem in the genome of *Bradyrhizobium arachidis* (GCF_015291705.1) is composed of 2 proteins: LmuA_effector_Amidase (WP_143130692.1)and, LmuB_SMC_Amidase (WP_092217687.1).
The Lamassu-Sir2 system in *Enterococcus faecalis* (GCF_023375525.1, NZ_CP091235) is composed of 3 proteins LmuA_effector_Sir2 (WP_231436660.1) LmuC_acc_Sir2 (WP_231436659.1) LmuB_SMC_Sir2 (WP_231436658.1)
## Distribution of the system among prokaryotes
The Lamassu-Fam system is present in a total of 1189 different species.
Among the 22,803 complete genomes of RefSeq, the Lamassu-Fam is detected in 2867 genomes (12.57 %).
The system was detected in 1220 different species.
Among the 22k complete genomes of RefSeq, this system is present in 3939 genomes (17.3 %).
![lamassu-fam](/lamassu-fam/Distribution_Lamassu-Fam.svg){max-width=750px}
![lamassu-fam](/lamassu-fam/Distribution_Lamassu-Fam.svg){max-width=750}
Proportion of genome encoding the Lamassu-Fam system for the 14 phyla with more than 50 genomes in the RefSeq database.
*Proportion of genome encoding the Lamassu-Fam system for the 14 phyla with more than 50 genomes in the RefSeq database.*
## Structure
......
......@@ -15,23 +15,24 @@ tableColumns:
# Lit
## Example of genomic structure
The Lit system is composed of one protein: Lit.
The Lit is composed of 1 protein: Lit.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![lit](/lit/Lit.svg){max-width=750px}
Lit system in the genome of *Stenotrophomonas maltophilia* (GCF_012647025.1) is composed of 1 protein: Lit (WP_061201506.1).
The Lit system in *Klebsiella pneumoniae* (GCF_002180175.1, NZ_CP021686) is composed of 1 protein: Lit (WP_223861284.1)
## Distribution of the system among prokaryotes
The Lit system is present in a total of 193 different species.
Among the 22,803 complete genomes of RefSeq, the Lit is detected in 222 genomes (0.97 %).
Among the 22k complete genomes of RefSeq, this system is present in 455 genomes (2.0 %).
The system was detected in 78 different species.
![lit](/lit/Distribution_Lit.svg){max-width=750px}
*Proportion of genome encoding the Lit system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Lit system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -13,6 +13,27 @@ tableColumns:
## To do
## Example of genomic structure
The MADS is composed of 8 proteins: mad1, mad2, mad3, mad4, mad5, mad6, mad7 and mad8.
Here is an example found in the RefSeq database:
![mads](/mads/MADS.svg){max-width=750px}
The MADS system in *Pseudomonas otitidis* (GCF_019879085.1, NZ_CP082244) is composed of 8 proteins mad1 (WP_058876992.1) mad2 (WP_208273447.1) mad3 (WP_208273444.1) mad4 (WP_208273442.1) mad5 (WP_208273440.1) mad6 (WP_208273438.1) mad7 (WP_208273430.1) mad8 (WP_233473447.1)
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the MADS is detected in 72 genomes (0.32 %).
The system was detected in 63 different species.
![mads](/mads/Distribution_MADS.svg){max-width=750px}
Proportion of genome encoding the MADS system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
### MADS
......
......@@ -13,6 +13,27 @@ tableColumns:
## To do
## Example of genomic structure
The MazEF is composed of 2 proteins: MazE and MazF.
Here is an example found in the RefSeq database:
![mazef](/mazef/MazEF.svg){max-width=750px}
The MazEF system in *Pantoea vagans* (GCF_004792415.1, NZ_CP038853) is composed of 2 proteins MazE (WP_135910538.1) MazF (WP_135908176.1)
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the MazEF is detected in 4457 genomes (19.55 %).
The system was detected in 993 different species.
![mazef](/mazef/Distribution_MazEF.svg){max-width=750px}
Proportion of genome encoding the MazEF system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Experimental validation
<mermaid>
graph LR;
......
......@@ -28,23 +28,24 @@ As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The Menshen system is composed of 3 proteins: NsnA, NsnB and, NsnC_2623244837.
The Menshen is composed of 3 proteins: NsnA, NsnB and NsnC.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![menshen](/menshen/Menshen.svg){max-width=750px}
Menshen system in the genome of *Citrobacter freundii* (GCF_003937345.2) is composed of 3 proteins: NsnA (WP_125363058.1), NsnB (WP_197964486.1)and, NsnC_2617187710 (WP_125363056.1).
The Menshen system in *Streptomyces marincola* (GCF_020410765.1, NZ_CP084541) is composed of 3 proteins NsnA (WP_226077564.1) NsnB (WP_226077566.1) NsnC_2507451963 (WP_226077568.1)
## Distribution of the system among prokaryotes
The Menshen system is present in a total of 247 different species.
Among the 22,803 complete genomes of RefSeq, the Menshen is detected in 430 genomes (1.89 %).
Among the 22k complete genomes of RefSeq, this system is present in 446 genomes (2.0 %).
The system was detected in 263 different species.
![menshen](/menshen/Distribution_Menshen.svg){max-width=750px}
*Proportion of genome encoding the Menshen system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Menshen system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -12,6 +12,27 @@ tableColumns:
## To do
## Example of genomic structure
The MMB_gp29_gp30 is composed of 2 proteins: gp29 and gp30.
Here is an example found in the RefSeq database:
![mmb_gp29_gp30](/mmb_gp29_gp30/MMB_gp29_gp30.svg){max-width=750px}
The MMB_gp29_gp30 system in *Mycolicibacterium insubricum* (GCF_010731615.1, NZ_AP022618) is composed of 2 proteins gp29 (WP_234805905.1) gp30 (WP_083031848.1)
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the MMB_gp29_gp30 is detected in 5 genomes (0.02 %).
The system was detected in 4 different species.
![mmb_gp29_gp30](/mmb_gp29_gp30/Distribution_MMB_gp29_gp30.svg){max-width=750px}
Proportion of genome encoding the MMB_gp29_gp30 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
### MMB_gp29_gp30
......
......@@ -15,23 +15,24 @@ tableColumns:
# Mok_Hok_Sok
## Example of genomic structure
The Mok_Hok_Sok system is composed of 2 proteins: Mok and, Hok.
The Mok_Hok_Sok is composed of 2 proteins: Mok and Hok.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![mok_hok_sok](/mok_hok_sok/Mok_Hok_Sok.svg){max-width=750px}
Mok_Hok_Sok system in the genome of *Raoultella terrigena* (GCF_015571975.1) is composed of 2 proteins: Hok (WP_227629320.1)and, Mok (WP_227699927.1).
The Mok_Hok_Sok system in *Escherichia fergusonii* (GCF_013819565.1, NZ_CP057221) is composed of 1 protein: Hok (WP_096937776.1)
## Distribution of the system among prokaryotes
The Mok_Hok_Sok system is present in a total of 57 different species.
Among the 22,803 complete genomes of RefSeq, the Mok_Hok_Sok is detected in 1656 genomes (7.26 %).
Among the 22k complete genomes of RefSeq, this system is present in 1687 genomes (7.4 %).
The system was detected in 83 different species.
![mok_hok_sok](/mok_hok_sok/Distribution_Mok_Hok_Sok.svg){max-width=750px}
*Proportion of genome encoding the Mok_Hok_Sok system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Mok_Hok_Sok system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -27,27 +27,44 @@ As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The Mokosh system have been describe in a total of 2 subsystems.
A total of 7 subsystems have been described for the Mokosh system.
Here is some example found in the RefSeq database:
Here is some examples found in the RefSeq database:
![mokosh](/mokosh/Mokosh_TypeI.svg){max-width=750px}
![mokosh_typeii](/mokosh/Mokosh_TypeII.svg){max-width=750px}
Mokosh_TypeI subsystem in the genome of *Vibrio alginolyticus* (GCF_022343125.1) is composed of 2 proteins: MkoB2 (WP_238970063.1)and, MkoA2 (WP_238970065.1).
The Mokosh_TypeII system in *Massilia sp. NP310* (GCF_019443485.1, NZ_CP080379) is composed of 1 protein: MkoC (WP_184237977.1)
![mokosh](/mokosh/Mokosh_TypeII.svg){max-width=750px}
![mokosh_type_i_a](/mokosh/Mokosh_Type_I_A.svg){max-width=750px}
Mokosh_TypeII subsystem in the genome of *Shigella flexneri* (GCF_022354205.1) is composed of 1 protein: MkoC (WP_000344091.1).
The Mokosh_Type_I_A system in *Cupriavidus pauculus* (GCF_019931045.1, NZ_CP082863) is composed of 2 proteins MkoA_A (WP_170292610.1) MkoB_A (WP_150981546.1)
![mokosh_type_i_b](/mokosh/Mokosh_Type_I_B.svg){max-width=750px}
The Mokosh_Type_I_B system in *Collimonas pratensis* (GCF_001584185.1, NZ_CP013234) is composed of 2 proteins MkoB_B (WP_061945389.1) MkoA_B (WP_061945390.1)
![mokosh_type_i_c](/mokosh/Mokosh_Type_I_C.svg){max-width=750px}
The Mokosh_Type_I_C system in *Klebsiella sp. WP7-S18-CRE-02* (GCF_014169515.1, NZ_AP022189) is composed of 2 proteins MkoB_C (WP_227674162.1) MkoA_C (WP_227674163.1)
![mokosh_type_i_d](/mokosh/Mokosh_Type_I_D.svg){max-width=750px}
The Mokosh_Type_I_D system in *Geomonas paludis* (GCF_023221575.1, NZ_CP096574) is composed of 2 proteins MkoA_D (WP_248647094.1) MkoB_D (WP_248647095.1)
![mokosh_type_i_e](/mokosh/Mokosh_Type_I_E.svg){max-width=750px}
The Mokosh_Type_I_E system in *Planococcus faecalis* (GCF_002009235.1, NZ_CP019401) is composed of 2 proteins MkoA_E (WP_078080177.1) MkoB_E (WP_071154541.1)
## Distribution of the system among prokaryotes
The Mokosh system is present in a total of 605 different species.
Among the 22,803 complete genomes of RefSeq, the Mokosh is detected in 1607 genomes (7.05 %).
Among the 22k complete genomes of RefSeq, this system is present in 2540 genomes (11.1 %).
The system was detected in 577 different species.
![mokosh](/mokosh/Distribution_Mokosh.svg){max-width=750px}
*Proportion of genome encoding the Mokosh system for the 14 phyla with more than 50 genomes in the RefSeq database.* *Pie chart of the repartition of all the subsystems found in the RefSeq database.*
Proportion of genome encoding the Mokosh system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -25,23 +25,24 @@ As far as we are aware, the molecular mechanism of MqsRAC is unknown.
## Example of genomic structure
The MqsRAC system is composed of 2 proteins: mqsR and, mqsC.
The MqsRAC is composed of 2 proteins: mqsC and mqsR.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![mqsrac](/mqsrac/MqsRAC.svg){max-width=750px}
MqsRAC system in the genome of *Escherichia coli* (GCF_900636115.1) is composed of 2 proteins: mqsR (WP_024222007.1)and, mqsC (WP_021568458.1).
The MqsRAC system in *Shigella dysenteriae* (GCF_002949935.1, NZ_CP026827) is composed of 2 proteins mqsR (WP_024222007.1) mqsC (WP_157899945.1)
## Distribution of the system among prokaryotes
The MqsRAC system is present in a total of 18 different species.
Among the 22,803 complete genomes of RefSeq, the MqsRAC is detected in 26 genomes (0.11 %).
Among the 22k complete genomes of RefSeq, this system is present in 26 genomes (0.1 %).
The system was detected in 20 different species.
![mqsrac](/mqsrac/Distribution_MqsRAC.svg){max-width=750px}
*Proportion of genome encoding the MqsRAC system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the MqsRAC system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -29,23 +29,24 @@ Nhi contains two domains, a nuclease and a helicase domain that are both needed
## Example of genomic structure
The Nhi system is composed of one protein: Nhi.
The Nhi is composed of 1 protein: Nhi.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![nhi](/nhi/Nhi.svg){max-width=750px}
Nhi system in the genome of *Enterococcus avium* (GCF_003711125.1) is composed of 1 protein: Nhi (WP_148712513.1).
The Nhi system in *Staphylococcus sp. MZ9* (GCF_018622975.1, NZ_CP076029) is composed of 1 protein: Nhi (WP_045177897.1)
## Distribution of the system among prokaryotes
The Nhi system is present in a total of 56 different species.
Among the 22,803 complete genomes of RefSeq, the Nhi is detected in 199 genomes (0.87 %).
Among the 22k complete genomes of RefSeq, this system is present in 202 genomes (0.9 %).
The system was detected in 61 different species.
![nhi](/nhi/Distribution_Nhi.svg){max-width=750px}
*Proportion of genome encoding the Nhi system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Nhi system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -24,23 +24,24 @@ The NixI endonuclease cleaves the ICP1 genome at the GNAANCTT motif :ref{doi=10.
## Example of genomic structure
The NixI system is composed of 2 proteins: NixI and, Stix.
The NixI is composed of 2 proteins: NixI and Stix.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![nixi](/nixi/NixI.svg){max-width=750px}
NixI system in the genome of *Vibrio cholerae* (GCF_009646135.1) is composed of 2 proteins: NixI (WP_001147214.1)and, Stix (WP_000628297.1).
The NixI system in *Apilactobacillus kunkeei* (GCF_001314945.1, NZ_CP012920) is composed of 1 protein: NixI (WP_158520138.1)
## Distribution of the system among prokaryotes
The NixI system is present in a total of 8 different species.
Among the 22,803 complete genomes of RefSeq, the NixI is detected in 8 genomes (0.04 %).
Among the 22k complete genomes of RefSeq, this system is present in 19 genomes (0.1 %).
The system was detected in 5 different species.
![nixi](/nixi/Distribution_NixI.svg){max-width=750px}
*Proportion of genome encoding the NixI system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the NixI system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,27 +15,28 @@ tableColumns:
# NLR
## Example of genomic structure
The NLR system have been describe in a total of 2 subsystems.
A total of 2 subsystems have been described for the NLR system.
Here is some example found in the RefSeq database:
Here is some examples found in the RefSeq database:
![nlr](/nlr/NLR_like_bNACHT01.svg){max-width=750px}
![nlr_like_bnacht01](/nlr/NLR_like_bNACHT01.svg){max-width=750px}
NLR_like_bNACHT01 subsystem in the genome of *Pseudomonas psychrotolerans* (GCF_001913135.1) is composed of 1 protein: NLR_like_bNACHT01 (WP_074528296.1).
The NLR_like_bNACHT01 system in *Colwellia sp. Arc7-D* (GCF_003061515.1, NZ_CP028924) is composed of 1 protein: NLR_like_bNACHT01 (WP_162533747.1)
![nlr](/nlr/NLR_like_bNACHT09.svg){max-width=750px}
![nlr_like_bnacht09](/nlr/NLR_like_bNACHT09.svg){max-width=750px}
NLR_like_bNACHT09 subsystem in the genome of *Escherichia coli* (GCF_900636105.1) is composed of 1 protein: NLR_like_bNACHT09 (WP_089572057.1).
The NLR_like_bNACHT09 system in *Parabacteroides johnsonii* (GCF_020735865.1, NZ_CP085975) is composed of 1 protein: NLR_like_bNACHT09 (WP_009859882.1)
## Distribution of the system among prokaryotes
The NLR system is present in a total of 186 different species.
Among the 22,803 complete genomes of RefSeq, the NLR is detected in 453 genomes (1.99 %).
Among the 22k complete genomes of RefSeq, this system is present in 453 genomes (2.0 %).
The system was detected in 205 different species.
![nlr](/nlr/Distribution_NLR.svg){max-width=750px}
*Proportion of genome encoding the NLR system for the 14 phyla with more than 50 genomes in the RefSeq database.* *Pie chart of the repartition of all the subsystems found in the RefSeq database.*
Proportion of genome encoding the NLR system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -27,23 +27,24 @@ The old_exonuclease is dsDNA exonuclease that digest in the 5' to 3' direction :
## Example of genomic structure
The Old_exonuclease system is composed of one protein: Old_exonuclease.
The Old_exonuclease is composed of 1 protein: Old_exonuclease.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![old_exonuclease](/old_exonuclease/Old_exonuclease.svg){max-width=750px}
Old_exonuclease system in the genome of *Escherichia coli* (GCF_016904335.1) is composed of 1 protein: Old_exonuclease (WP_015979595.1).
The Old_exonuclease system in *Shewanella xiamenensis* (GCF_022453805.1, NZ_CP092630) is composed of 1 protein: Old_exonuclease (WP_240293412.1)
## Distribution of the system among prokaryotes
The Old_exonuclease system is present in a total of 53 different species.
Among the 22,803 complete genomes of RefSeq, the Old_exonuclease is detected in 102 genomes (0.45 %).
Among the 22k complete genomes of RefSeq, this system is present in 102 genomes (0.4 %).
The system was detected in 56 different species.
![old_exonuclease](/old_exonuclease/Distribution_Old_exonuclease.svg){max-width=750px}
*Proportion of genome encoding the Old_exonuclease system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Old_exonuclease system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -33,23 +33,24 @@ To our knowledde, the molecular mechanism is unknown. Please update.
## Example of genomic structure
The Olokun system is composed of 2 proteins: OloA and, OloB.
The Olokun is composed of 2 proteins: OloA and OloB.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![olokun](/olokun/Olokun.svg){max-width=750px}
Olokun system in the genome of *Vibrio cyclitrophicus* (GCF_023206035.1) is composed of 2 proteins: OloA (WP_016800143.1)and, OloB (WP_029203700.1).
The Olokun system in *Klebsiella quasipneumoniae* (GCF_015286025.1, NZ_CP063902) is composed of 2 proteins OloA (WP_227667429.1) OloB (WP_194418165.1)
## Distribution of the system among prokaryotes
The Olokun system is present in a total of 132 different species.
Among the 22,803 complete genomes of RefSeq, the Olokun is detected in 242 genomes (1.06 %).
Among the 22k complete genomes of RefSeq, this system is present in 252 genomes (1.1 %).
The system was detected in 137 different species.
![olokun](/olokun/Distribution_Olokun.svg){max-width=750px}
*Proportion of genome encoding the Olokun system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the Olokun system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,23 +15,44 @@ tableColumns:
# pAgo
## Example of genomic structure
The pAgo system is composed of one protein: pAgo_Short.
A total of 6 subsystems have been described for the pAgo system.
Here is an example found in the RefSeq database:
Here is some examples found in the RefSeq database:
![pago](/pago/pAgo.svg){max-width=750px}
![pago_longa](/pago/pAgo_LongA.svg){max-width=750px}
pAgo system in the genome of *Ensifer adhaerens* (GCF_020405145.1) is composed of 1 protein: pAgo_LongB (WP_218685258.1).
The pAgo_LongA system in *Halosimplex pelagicum* (GCF_013415905.1, NZ_CP058909) is composed of 1 protein: pAgo_LongA (WP_179918860.1)
![pago_longb](/pago/pAgo_LongB.svg){max-width=750px}
The pAgo_LongB system in *Serratia fonticola* (GCF_019252525.1, NZ_CP072742) is composed of 2 proteins pAgo_LongB (WP_218520044.1) EcAgaN (WP_235784821.1)
![pago_s1a](/pago/pAgo_S1A.svg){max-width=750px}
The pAgo_S1A system in *Parabacteroides merdae* (GCF_020735605.1, NZ_CP085927) is composed of 2 proteins pAgo_S1A (WP_227945673.1) pAgo_S1A (WP_227945674.1)
![pago_s1b](/pago/pAgo_S1B.svg){max-width=750px}
The pAgo_S1B system in *Comamonas flocculans* (GCF_007954405.1, NZ_CP042344) is composed of 2 proteins SIR2APAZ (WP_146914209.1) pAgo_S1B (WP_146913473.1)
![pago_s2b](/pago/pAgo_S2B.svg){max-width=750px}
The pAgo_S2B system in *Granulicella tundricola* (GCF_000178975.2, NC_015064) is composed of 2 proteins XAPAZ (WP_013581437.1) pAgo_S2B (WP_013581438.1)
![pago_sparta](/pago/pAgo_SPARTA.svg){max-width=750px}
The pAgo_SPARTA system in *Roseivivax sp. THAF30* (GCF_009363575.1, NZ_CP045389) is composed of 2 proteins TIRAPAZ (WP_152461295.1) pAgo_SPARTA (WP_152461296.1)
## Distribution of the system among prokaryotes
The pAgo system is present in a total of 435 different species.
Among the 22,803 complete genomes of RefSeq, the pAgo is detected in 575 genomes (2.52 %).
Among the 22k complete genomes of RefSeq, this system is present in 598 genomes (2.6 %).
The system was detected in 464 different species.
![pago](/pago/Distribution_pAgo.svg){max-width=750px}
*Proportion of genome encoding the pAgo system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the pAgo system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -13,6 +13,27 @@ tableColumns:
## To do
## Example of genomic structure
The Panchino_gp28 is composed of 1 protein: gp28.
Here is an example found in the RefSeq database:
![panchino_gp28](/panchino_gp28/Panchino_gp28.svg){max-width=750px}
The Panchino_gp28 system in *Burkholderia pseudomallei* (GCF_018228585.1, NZ_CP073731) is composed of 1 protein: gp28 (WP_249212015.1)
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the Panchino_gp28 is detected in 16 genomes (0.07 %).
The system was detected in 5 different species.
![panchino_gp28](/panchino_gp28/Distribution_Panchino_gp28.svg){max-width=750px}
Proportion of genome encoding the Panchino_gp28 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
### Panchino_gp28
......
......@@ -21,35 +21,29 @@ This system relies on an unknown [Abortive infection](/general-concepts/abortive
## Example of genomic structure
The Paris system have been describe in a total of 4 subsystems.
A total of 2 subsystems have been described for the Rst_PARIS system.
The classical Paris is composed of two proteins AriA and AriB. The other type of Paris is the fused version with one AriAB protein.
Here is some example found in the RefSeq database:
Here is some examples found in the RefSeq database:
![paris](/paris/PARIS_I.svg){max-width=750px}
![paris_i](/paris/PARIS_I.svg){max-width=750px}
PARIS_I subsystem in the genome of *Salmonella enterica* (GCF_020715485.1) is composed of 2 proteins: AAA_15 (WP_001520831.1)and, DUF4435 (WP_010989064.1).
The PARIS_I system in *Streptomyces sampsonii* (GCF_001704195.1, NZ_CP016824) is composed of 2 proteins AriA_AAA15 (WP_079165307.1) AriB_DUF4435_I (WP_143067273.1)
![paris](/paris/PARIS_II.svg){max-width=750px}
![paris_i_merge](/paris/PARIS_I_merge.svg){max-width=750px}
PARIS_II subsystem in the genome of *Enterobacter cloacae* (GCF_023238665.1) is composed of 2 proteins: DUF4435 (WP_071830092.1)and, AAA_21 (WP_061772587.1).
![paris](/paris/PARIS_II_merge.svg){max-width=750px}
PARIS_II_merge subsystem in the genome of *Desulfovibrio desulfuricans* (GCF_017815575.1) is composed of 1 protein: AAA_21_DUF4435 (WP_209818471.1).
![paris](/paris/PARIS_I_merge.svg){max-width=750px}
PARIS_I_merge subsystem in the genome of *Sideroxydans lithotrophicus* (GCF_000025705.1) is composed of 1 protein: AAA_15_DUF4435 (WP_013030315.1).
The PARIS_I_merge system in *Rhizobium sp. SL42* (GCF_021729845.1, NZ_CP063397) is composed of 1 protein: AAA_15_DUF4435 (WP_237370054.1)
## Distribution of the system among prokaryotes
The Paris system is present in a total of 463 different species.
Among the 22,803 complete genomes of RefSeq, the Paris is detected in 1128 genomes (4.95 %).
Among the 22k complete genomes of RefSeq, this system is present in 1145 genomes (5.0 %).
The system was detected in 518 different species.
![paris](/paris/Distribution_Paris.svg){max-width=750px}
*Proportion of genome encoding the Paris system for the 14 phyla with more than 50 genomes in the RefSeq database.* *Pie chart of the repartition of all the subsystems found in the RefSeq database.*
Proportion of genome encoding the Paris system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -29,23 +29,24 @@ As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The PD-Lambda-1 system is composed of one protein: PD-Lambda-1.
The PD-Lambda-1 is composed of 1 protein: PD-Lambda-1.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![pd-lambda-1](/pd-lambda-1/PD-Lambda-1.svg){max-width=750px}
PD-Lambda-1 system in the genome of *Olleya sp.* (GCF_002831645.1) is composed of 1 protein: PD-Lambda-1 (WP_101017806.1).
The PD-Lambda-1 system in *Gemmata massiliana* (GCF_901538265.1, NZ_LR593886) is composed of 1 protein: PD-Lambda-1 (WP_197909633.1)
## Distribution of the system among prokaryotes
The PD-Lambda-1 system is present in a total of 445 different species.
Among the 22,803 complete genomes of RefSeq, the PD-Lambda-1 is detected in 1301 genomes (5.71 %).
Among the 22k complete genomes of RefSeq, this system is present in 1323 genomes (5.8 %).
The system was detected in 483 different species.
![pd-lambda-1](/pd-lambda-1/Distribution_PD-Lambda-1.svg){max-width=750px}
*Proportion of genome encoding the PD-Lambda-1 system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the PD-Lambda-1 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,23 +15,24 @@ tableColumns:
# PD-Lambda-2
## Example of genomic structure
The PD-Lambda-2 system is composed of 3 proteins: PD-Lambda-2_C, PD-Lambda-2_B and, PD-Lambda-2_A.
The PD-Lambda-2 is composed of 3 proteins: PD-Lambda-2_A, PD-Lambda-2_B and PD-Lambda-2_C.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![pd-lambda-2](/pd-lambda-2/PD-Lambda-2.svg){max-width=750px}
PD-Lambda-2 system in the genome of *Pseudomonas aeruginosa* (GCF_013341295.1) is composed of 3 proteins: PD-Lambda-2_C (WP_023115149.1), PD-Lambda-2_B (WP_014833925.1)and, PD-Lambda-2_A (WP_003116393.1).
The PD-Lambda-2 system in *Vibrio parahaemolyticus* (GCF_013393845.1, NZ_CP040100) is composed of 3 proteins PD-Lambda-2_C (WP_116869984.1) PD-Lambda-2_B (WP_029809306.1) PD-Lambda-2_A (WP_029809305.1)
## Distribution of the system among prokaryotes
The PD-Lambda-2 system is present in a total of 52 different species.
Among the 22,803 complete genomes of RefSeq, the PD-Lambda-2 is detected in 122 genomes (0.54 %).
Among the 22k complete genomes of RefSeq, this system is present in 122 genomes (0.5 %).
The system was detected in 59 different species.
![pd-lambda-2](/pd-lambda-2/Distribution_PD-Lambda-2.svg){max-width=750px}
*Proportion of genome encoding the PD-Lambda-2 system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the PD-Lambda-2 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -26,23 +26,24 @@ As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The PD-Lambda-3 system is composed of 2 proteins: PD-Lambda-3_B and, PD-Lambda-3_A.
The PD-Lambda-3 is composed of 2 proteins: PD-Lambda-3_A and PD-Lambda-3_B.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![pd-lambda-3](/pd-lambda-3/PD-Lambda-3.svg){max-width=750px}
PD-Lambda-3 system in the genome of *Serratia rubidaea* (GCF_900478395.1) is composed of 2 proteins: PD-Lambda-3_A (WP_227673955.1)and, PD-Lambda-3_B (WP_047063039.1).
The PD-Lambda-3 system in *Enterobacter sp. CRENT-193* (GCF_002787395.1, NZ_CP024812) is composed of 2 proteins PD-Lambda-3_A (WP_001542736.1) PD-Lambda-3_B (WP_039268887.1)
## Distribution of the system among prokaryotes
The PD-Lambda-3 system is present in a total of 24 different species.
Among the 22,803 complete genomes of RefSeq, the PD-Lambda-3 is detected in 72 genomes (0.32 %).
Among the 22k complete genomes of RefSeq, this system is present in 74 genomes (0.3 %).
The system was detected in 28 different species.
![pd-lambda-3](/pd-lambda-3/Distribution_PD-Lambda-3.svg){max-width=750px}
*Proportion of genome encoding the PD-Lambda-3 system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genome encoding the PD-Lambda-3 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......