Compare revisions

Changes are shown as if the source revision was being merged into the target revision. Learn more about comparing revisions.

Showing
with 281 additions and 128 deletions
......@@ -13,6 +13,27 @@ tableColumns:
## To do
## Example of genomic structure
The FS_Sma system is composed of 1 protein: Sma.
Here is an example found in the RefSeq database:
![fs_sma](/fs_sma/FS_Sma.svg){max-width=750px}
The FS_Sma system in *Staphylococcus aureus* (GCF_022869625.1, NZ_CP064365) is composed of 1 protein: Sma (WP_000041883.1).
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the FS_Sma system is detected in 578 genomes (2.53 %).
The system was detected in 20 different species.
![fs_sma](/fs_sma/Distribution_FS_Sma.svg){max-width=750px}
Proportion of genomes encoding the FS_Sma system for the 14 phyla with more than 50 genomes in the RefSeq database.
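
The detection percentage quoted for each system is the number of genomes carrying the system divided by the 22,803 complete RefSeq genomes. As a minimal sketch of that arithmetic (the function name `detection_rate` is illustrative and is not part of the wiki tooling), the FS_Sma figure can be reproduced as follows:

```python
def detection_rate(genomes_with_system: int, total_genomes: int) -> float:
    """Return the percentage of genomes encoding a system, rounded to 2 decimals."""
    if total_genomes <= 0:
        raise ValueError("total_genomes must be positive")
    return round(100 * genomes_with_system / total_genomes, 2)


# Counts taken from the text: 578 FS_Sma hits among 22,803 complete genomes.
TOTAL_REFSEQ_GENOMES = 22803
print(detection_rate(578, TOTAL_REFSEQ_GENOMES))  # 2.53
```

The same call with the counts given for the other systems yields their reported percentages, for example 2999 genomes for Gabija gives 13.15.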
## Structure
### FS_Sma
......
......@@ -10,38 +10,49 @@ tableColumns:
Activator: Direct
Effector: Degrading nucleic acids
PFAM: PF00580, PF11398, PF13175, PF13245, PF13304, PF13361, PF13476
contributors:
- Nathalie Bechon
relevantAbstracts:
- doi: 10.1093/nar/gkab277
- doi: 10.1126/science.aar4120
- doi: 10.1016/j.chom.2023.06.014
- doi: 10.1101/2023.05.01.538945
- doi: 10.1101/2023.05.01.538930
---
# Gabija
## Description
According to recent studies, GajA is a sequence-specific DNA nicking endonuclease whose activity is inhibited by nucleotide concentration. Accordingly, GajA would be fully inhibited at cellular nucleotide concentrations. It was hypothesized that upon nucleotide depletion during phage infection, GajA would become activated (2).
## Description
Another study suggests that the *gajB* gene could encode an NTPase, which would form a complex with GajA to achieve anti-phage defense (3).
Gabija is named after the Lithuanian spirit of fire, protector of home and family. It is a two-gene defense system found in 8.5% of the 4360 bacterial and archaeal genomes that were initially analyzed :ref{doi=10.1126/science.aar4120}. Both proteins are necessary for defense and form a heteromeric octamer complex: GajA forms a central tetramer surrounded by two GajB dimers :ref{doi=10.1101/2023.05.01.538945,10.1093/nar/gkad951}. A phage protein that inhibits Gabija function, Gabija anti-defense 1 (Gad1), has been described :ref{doi=10.1101/2023.05.01.538945,10.1101/2023.05.01.538930}.
## Molecular mechanism
The precise mechanism of the Gabija system remains to be fully described, yet studies suggest that it could act either as a nucleic acid degrading system or as an abortive infection system.
The precise mechanism of the Gabija system remains to be fully described, yet studies suggest that it could act through a dual phage inhibition mechanism.
GajA was shown to be a sequence-specific DNA nicking endonuclease whose activity is inhibited by nucleotide concentration. This nucleotide sensing is mediated by the ATPase-like domain of GajA. Accordingly, GajA would be fully inhibited at cellular nucleotide concentrations. It was hypothesized that upon nucleotide depletion during phage infection, GajA would become activated :ref{doi=10.1093/nar/gkab277}.
Moreover, a later study suggests that the *gajB* gene encodes an NTPase, which would form a complex with GajA to achieve anti-phage defense. GajB is activated by DNA termini produced by GajA activity and then hydrolyzes (d)A/(d)GTP, depleting essential nucleotides and increasing GajA activity :ref{doi=10.1016/j.chom.2023.06.014}.
Therefore, both proteins would cooperate to achieve both nucleotide depletion and DNA cleavage, causing abortive infection.
## Example of genomic structure
The Gabija system is composed of 2 proteins: GajA and GajB_2.
The Gabija system is composed of 2 proteins: GajA and GajB.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gabija](/gabija/Gabija.svg){max-width=750px}
The Gabija system in the genome of *Vibrio parahaemolyticus* (GCF_009883895.1) is composed of 2 proteins: GajA (WP_085576823.1) and GajB_1 (WP_031856308.1).
The Gabija system in *Roseomonas fluvialis* (GCF_022846615.1, NZ_AP025637) is composed of 2 proteins: GajA (WP_244458879.1) and GajB_2 (WP_244458880.1).
## Distribution of the system among prokaryotes
The Gabija system is present in a total of 1200 different species.
Among the 22,803 complete genomes of RefSeq, the Gabija system is detected in 2999 genomes (13.15 %).
Among the 22k complete genomes of RefSeq, this system is present in 3762 genomes (16.5 %).
The system was detected in 1375 different species.
![gabija](/gabija/Distribution_Gabija.svg){max-width=750px}
*Proportion of genomes encoding the Gabija system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gabija system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......@@ -105,14 +116,3 @@ end
style Title3 fill:none,stroke:none,stroke-width:none
style Title4 fill:none,stroke:none,stroke-width:none
</mermaid>
## Relevant abstracts
::relevant-abstracts
---
items:
- doi: 10.1093/nar/gkab277
- doi: 10.1126/science.aar4120
---
::
......@@ -14,23 +14,24 @@ tableColumns:
# Gao_Ape
## Example of genomic structure
The Gao_Ape system is composed of one protein: ApeA.
The Gao_Ape system is composed of 1 protein: ApeA.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_ape](/gao_ape/Gao_Ape.svg){max-width=750px}
The Gao_Ape system in the genome of *Klebsiella sp.* (GCF_018388785.1) is composed of 1 protein: ApeA (WP_213292831.1).
The Gao_Ape system in *Enterobacter mori* (GCF_018638795.1, NZ_CP071064) is composed of 1 protein: ApeA (WP_215231222.1).
## Distribution of the system among prokaryotes
The Gao_Ape system is present in a total of 76 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Ape system is detected in 76 genomes (0.33 %).
Among the 22k complete genomes of RefSeq, this system is present in 199 genomes (0.9 %).
The system was detected in 45 different species.
![gao_ape](/gao_ape/Distribution_Gao_Ape.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Ape system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Ape system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,27 +15,28 @@ tableColumns:
# Gao_Her
## Example of genomic structure
The Gao_Her system has been described in a total of 2 subsystems.
A total of 2 subsystems have been described for the Gao_Her system.
Here are some examples found in the RefSeq database:
Here are some examples found in the RefSeq database:
![gao_her](/gao_her/Gao_Her_DUF.svg){max-width=750px}
![gao_her_duf](/gao_her/Gao_Her_DUF.svg){max-width=750px}
The Gao_Her_DUF subsystem in the genome of *Enterobacter roggenkampii* (GCF_014524505.1) is composed of 2 proteins: DUF4297 (WP_188074283.1) and HerA_DUF (WP_063614829.1).
The Gao_Her_DUF system in *Escherichia coli* (GCF_023657935.1, NZ_CP098183) is composed of 2 proteins: DUF4297 (WP_064484828.1) and HerA_DUF (WP_064484829.1).
![gao_her](/gao_her/Gao_Her_SIR.svg){max-width=750px}
![gao_her_sir](/gao_her/Gao_Her_SIR.svg){max-width=750px}
The Gao_Her_SIR subsystem in the genome of *Escherichia coli* (GCF_012221565.1) is composed of 2 proteins: SIR2 (WP_167839366.1) and HerA_SIR2 (WP_021577682.1).
The Gao_Her_SIR system in *Xanthomonas citri* (GCF_018831325.1, NZ_CP029270) is composed of 2 proteins: SIR2 (WP_046831681.1) and HerA_SIR2 (WP_046831680.1).
## Distribution of the system among prokaryotes
The Gao_Her system is present in a total of 127 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Her system is detected in 231 genomes (1.01 %).
Among the 22k complete genomes of RefSeq, this system is present in 233 genomes (1.0 %).
The system was detected in 134 different species.
![gao_her](/gao_her/Distribution_Gao_Her.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Her system for the 14 phyla with more than 50 genomes in the RefSeq database.* *Pie chart of the distribution of all the subsystems found in the RefSeq database.*
Proportion of genomes encoding the Gao_Her system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -10,28 +10,40 @@ tableColumns:
Activator: Unknown
Effector: Unknown
PFAM: PF04480, PF13086, PF13087, PF13195, PF18741
contributors:
- Marian Dominguez-Mirazo
relevantAbstracts:
- doi: 10.1126/science.aba0372
---
# Gao_Hhe
## Description
The Gao_Hhe system is composed of a single protein. It was predicted through a guilt-by-association approach independent of domain annotations and validated in a heterologous system :ref{doi=10.1093/nar/gkad317}. It contains a predicted helicase and a Vsr (very short patch repair) endonuclease domain :ref{doi=10.1093/nar/gkad317,10.1128/jvi.00599-23}.
## Molecular mechanisms
As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The Gao_Hhe system is composed of one protein: HheA.
The Gao_Hhe system is composed of 1 protein: HheA.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_hhe](/gao_hhe/Gao_Hhe.svg){max-width=750px}
The Gao_Hhe system in the genome of *Klebsiella pneumoniae* (GCF_011742415.2) is composed of 1 protein: HheA (WP_021314612.1).
The Gao_Hhe system in *Thioalkalivibrio paradoxus* (GCF_000227685.2, NZ_CP007029) is composed of 1 protein: HheA (WP_006745815.1).
## Distribution of the system among prokaryotes
The Gao_Hhe system is present in a total of 49 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Hhe system is detected in 276 genomes (1.21 %).
Among the 22k complete genomes of RefSeq, this system is present in 279 genomes (1.2 %).
The system was detected in 56 different species.
![gao_hhe](/gao_hhe/Distribution_Gao_Hhe.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Hhe system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Hhe system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......@@ -71,13 +83,5 @@ end
style Title3 fill:none,stroke:none,stroke-width:none
style Title4 fill:none,stroke:none,stroke-width:none
</mermaid>
## Relevant abstracts
::relevant-abstracts
---
items:
- doi: 10.1126/science.aba0372
---
::
......@@ -15,23 +15,24 @@ tableColumns:
# Gao_Iet
## Example of genomic structure
The Gao_Iet system is composed of 2 proteins: IetS and IetA.
The Gao_Iet system is composed of 2 proteins: IetA and IetS.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_iet](/gao_iet/Gao_Iet.svg){max-width=750px}
The Gao_Iet system in the genome of *Escherichia coli* (GCF_014169855.1) is composed of 2 proteins: IetS (WP_001551050.1) and IetA (WP_000385105.1).
The Gao_Iet system in *Salmonella sp. SJTUF14076* (GCF_015534835.1, NZ_CP064674) is composed of 2 proteins: IetA (WP_023226994.1) and IetS (WP_024148175.1).
## Distribution of the system among prokaryotes
The Gao_Iet system is present in a total of 189 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Iet system is detected in 389 genomes (1.71 %).
Among the 22k complete genomes of RefSeq, this system is present in 436 genomes (1.9 %).
The system was detected in 185 different species.
![gao_iet](/gao_iet/Distribution_Gao_Iet.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Iet system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Iet system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,23 +15,24 @@ tableColumns:
# Gao_Mza
## Example of genomic structure
The Gao_Mza system is composed of 5 proteins: MzaB, MzaC, MzaA, MzaD and MzaE.
The Gao_Mza system is composed of 5 proteins: MzaA, MzaB, MzaC, MzaD and MzaE.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_mza](/gao_mza/Gao_Mza.svg){max-width=750px}
The Gao_Mza system in the genome of *Enterobacter roggenkampii* (GCF_023023065.1) is composed of 5 proteins: MzaE (WP_045418899.1), MzaD (WP_045418897.1), MzaC (WP_025912266.1), MzaB (WP_045418895.1) and MzaA (WP_045418893.1).
The Gao_Mza system in *Massilia sp. Se16.2.3* (GCF_014171595.1, NZ_CP050451) is composed of 6 proteins: MzaA (WP_182990577.1), MzaB (WP_229425381.1), MzaC (WP_182990578.1), MzaD (WP_182990579.1), MzaE (WP_182990580.1) and MzaE (WP_182990581.1).
## Distribution of the system among prokaryotes
The Gao_Mza system is present in a total of 57 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Mza system is detected in 68 genomes (0.3 %).
Among the 22k complete genomes of RefSeq, this system is present in 99 genomes (0.4 %).
The system was detected in 32 different species.
![gao_mza](/gao_mza/Distribution_Gao_Mza.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Mza system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Mza system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -14,23 +14,24 @@ tableColumns:
# Gao_Ppl
## Example of genomic structure
The Gao_Ppl system is composed of one protein: PplA.
The Gao_Ppl system is composed of 1 protein: PplA.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_ppl](/gao_ppl/Gao_Ppl.svg){max-width=750px}
The Gao_Ppl system in the genome of *Klebsiella pneumoniae* (GCF_002787755.1) is composed of 1 protein: PplA (WP_015059139.1).
The Gao_Ppl system in *Enterobacter sp. E76* (GCF_008931465.1, NZ_CP042499) is composed of 1 protein: PplA (WP_032676599.1).
## Distribution of the system among prokaryotes
The Gao_Ppl system is present in a total of 106 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Ppl system is detected in 276 genomes (1.21 %).
Among the 22k complete genomes of RefSeq, this system is present in 364 genomes (1.6 %).
The system was detected in 104 different species.
![gao_ppl](/gao_ppl/Distribution_Gao_Ppl.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Ppl system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Ppl system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -15,23 +15,24 @@ tableColumns:
# Gao_Qat
## Example of genomic structure
The Gao_Qat system is composed of 4 proteins: QatA, QatB, QatC and QatD.
The Gao_Qat system is composed of 4 proteins: QatA, QatB, QatC and QatD.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_qat](/gao_qat/Gao_Qat.svg){max-width=750px}
The Gao_Qat system in the genome of *Raoultella ornithinolytica* (GCF_002214825.1) is composed of 4 proteins: QatA (WP_088883811.1), QatB (WP_127146083.1), QatC (WP_088883813.1) and QatD (WP_088883814.1).
The Gao_Qat system in *Ralstonia wenshanensis* (GCF_021173085.1, NZ_CP076413) is composed of 4 proteins: QatA (WP_232041545.1), QatB (WP_232041546.1), QatC (WP_232041547.1) and QatD (WP_232043074.1).
## Distribution of the system among prokaryotes
The Gao_Qat system is present in a total of 246 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Qat system is detected in 621 genomes (2.72 %).
Among the 22k complete genomes of RefSeq, this system is present in 645 genomes (2.8 %).
The system was detected in 267 different species.
![gao_qat](/gao_qat/Distribution_Gao_Qat.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Qat system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Qat system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -28,23 +28,24 @@ As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The Gao_RL system is composed of 4 proteins: RL_D, RL_C, RL_B and RL_A.
The Gao_RL system is composed of 4 proteins: RL_A, RL_B, RL_C and RL_D.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_rl](/gao_rl/Gao_RL.svg){max-width=750px}
The Gao_RL system in the genome of *Morganella morganii* (GCF_020790175.1) is composed of 4 proteins: RL_D (WP_064483389.1), RL_C (WP_064483388.1), RL_B (WP_064483387.1) and RL_A (WP_064483386.1).
The Gao_RL system in *Methylocystis heyeri* (GCF_004802635.2, NZ_CP046053) is composed of 4 proteins: RL_A (WP_136498122.1), RL_B (WP_136498123.1), RL_C (WP_154331773.1) and RL_D (WP_136498124.1).
## Distribution of the system among prokaryotes
The Gao_RL system is present in a total of 77 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_RL system is detected in 131 genomes (0.57 %).
Among the 22k complete genomes of RefSeq, this system is present in 133 genomes (0.6 %).
The system was detected in 74 different species.
![gao_rl](/gao_rl/Distribution_Gao_RL.svg){max-width=750px}
*Proportion of genomes encoding the Gao_RL system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_RL system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -14,23 +14,24 @@ tableColumns:
# Gao_TerY
## Example of genomic structure
The Gao_TerY system is composed of 3 proteins: TerYC, TerYB and TerYA.
The Gao_TerY system is composed of 3 proteins: TerYA, TerYB and TerYC.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_tery](/gao_tery/Gao_TerY.svg){max-width=750px}
The Gao_TerY system in the genome of *Burkholderia contaminans* (GCF_018223785.1) is composed of 3 proteins: TerYA (WP_039364687.1), TerYB (WP_039364686.1) and TerYC (WP_039364684.1).
The Gao_TerY system in *Arachidicoccus terrestris* (GCF_020042345.1, NZ_CP083387) is composed of 3 proteins: TerYA (WP_224070248.1), TerYB (WP_224070249.1) and TerYC (WP_224070250.1).
## Distribution of the system among prokaryotes
The Gao_TerY system is present in a total of 69 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_TerY system is detected in 126 genomes (0.55 %).
Among the 22k complete genomes of RefSeq, this system is present in 126 genomes (0.6 %).
The system was detected in 74 different species.
![gao_tery](/gao_tery/Distribution_Gao_TerY.svg){max-width=750px}
*Proportion of genomes encoding the Gao_TerY system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_TerY system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
This diff is collapsed.
......@@ -22,23 +22,24 @@ The Gao_upx system is composed by a single protein. It was predicted through a g
As far as we are aware, the molecular mechanism is unknown.
## Example of genomic structure
The Gao_Upx system is composed of one protein: UpxA.
The Gao_Upx system is composed of 1 protein: UpxA.
Here is an example found in the RefSeq database:
Here is an example found in the RefSeq database:
![gao_upx](/gao_upx/Gao_Upx.svg){max-width=750px}
The Gao_Upx system in the genome of *Salmonella sp.* (GCF_020268625.1) is composed of 1 protein: UpxA (WP_060647174.1).
The Gao_Upx system in *Serratia sp. JSRIV001* (GCF_019968745.1, NZ_CP074147) is composed of 1 protein: UpxA (WP_223499068.1).
## Distribution of the system among prokaryotes
The Gao_Upx system is present in a total of 31 different species.
Among the 22,803 complete genomes of RefSeq, the Gao_Upx system is detected in 39 genomes (0.17 %).
Among the 22k complete genomes of RefSeq, this system is present in 39 genomes (0.2 %).
The system was detected in 33 different species.
![gao_upx](/gao_upx/Distribution_Gao_Upx.svg){max-width=750px}
*Proportion of genomes encoding the Gao_Upx system for the 14 phyla with more than 50 genomes in the RefSeq database.*
Proportion of genomes encoding the Gao_Upx system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
......
......@@ -12,6 +12,27 @@ tableColumns:
## To do
## Example of genomic structure
The GAPS1 system is composed of 1 protein: GAPS1.
Here is an example found in the RefSeq database:
![gaps1](/gaps1/GAPS1.svg){max-width=750px}
The GAPS1 system in *Hydrogenophaga sp. BPS33* (GCF_009859475.1, NZ_CP044549) is composed of 1 protein: GAPS1 (WP_159591124.1).
## Distribution of the system among prokaryotes
Among the 22,803 complete genomes of RefSeq, the GAPS1 system is detected in 434 genomes (1.9 %).
The system was detected in 56 different species.
![gaps1](/gaps1/Distribution_GAPS1.svg){max-width=750px}
Proportion of genomes encoding the GAPS1 system for the 14 phyla with more than 50 genomes in the RefSeq database.
## Structure
### GAPS1
......
This diff is collapsed.