- 30 Oct, 2019 1 commit
-
-
Blaise Li authored
-
- 28 Oct, 2019 2 commits
- 14 Oct, 2019 1 commit
-
-
Blaise Li authored
-
- 16 Apr, 2019 1 commit
-
-
Blaise Li authored
Made genome configurable for GRO-seq, tried to debug rpy2 uncachable exceptions.
-
- 05 Oct, 2018 1 commit
-
-
Blaise Li authored
-
- 04 Oct, 2018 1 commit
-
-
Blaise Li authored
This is to limit clutter from old analyses.
-
- 22 Jun, 2018 1 commit
-
-
Blaise Li authored
Added function to compute seed disposition from bowtie2 settings.
-
- 14 Jun, 2018 1 commit
-
-
Blaise Li authored
From the results, it seems that these settings are OK for the smallest size range, but not for longer reads.
-
- 13 Jun, 2018 1 commit
-
-
Blaise Li authored
The ValueError was: `Invalid file path or buffer object type: <class 'snakemake.io.Namedlist'>` This was due to an extra comma at the end of the return of `source_norm_file`.
-
- 08 Jun, 2018 2 commits
- 04 Jun, 2018 1 commit
-
-
Blaise Li authored
-
- 01 Jun, 2018 1 commit
-
-
Blaise Li authored
The remapping is done with one mismatch allowed in the seed. It happened that very little mapped reads had mismatches, although read-through reads can have some.
-
- 11 Apr, 2018 1 commit
-
-
Blaise Li authored
-
- 08 Mar, 2018 1 commit
-
-
Blaise Li authored
-
- 05 Mar, 2018 1 commit
-
-
Blaise Li authored
-
- 21 Feb, 2018 1 commit
-
-
Blaise Li authored
-
- 16 Feb, 2018 1 commit
-
-
Blaise Li authored
-
- 02 Feb, 2018 1 commit
-
-
Blaise Li authored
Currently goes from demultiplexing to mapping, via trimming and deduplicating. The mapping is performed on 3 read type: - adapt_nodedup (the adaptor was found, and the reads were trimmed but not deduplicated) - adapt_deduped (the adaptor was found, and the reads were trimmed and deduplicated) - noadapt_deduped (the adaptor was not found, and the reads were trimmed and deduplicated) The trim_and_dedup script currenly assumes that two low-diversity zones are present, and ignores them for deduplication: NNNNNGCACTANNNWWW[YYYY]NNNN 1---5 : 5' UMI 6--11: barcode (lower diversity) 12-14: UMI 15-17: AT(or GC?)-rich (low diversity) [fragment] -4 -> -1: 3' UMI It may be a problem to deduplicate taking into account the end of the reads, which tends to be of lower quality. The reads with errors will be over-represented. That is why we decided to also look at the non-deduplicated reads.
-