Skip to content
Snippets Groups Projects
Commit 71468ece authored by Blaise Li's avatar Blaise Li
Browse files

Removed the low-quality zones removal.

Hopefully iCLIP data now has better quality and deduplication will be
more efficient taking into account those zones.
parent 0924ade8
No related branches found
No related tags found
No related merge requests found
...@@ -67,9 +67,13 @@ count_fastq_reads() ...@@ -67,9 +67,13 @@ count_fastq_reads()
# 15-17: AT(or GC?)-rich (low diversity) # 15-17: AT(or GC?)-rich (low diversity)
# [fragment] # [fragment]
# -4 -> -1: 3' UMI # -4 -> -1: 3' UMI
#strip_low_qual_zones()
#{
# bioawk -c fastx '{print $name"\t"substr($seq, 1, 5)""substr($seq, 12, 3)""substr($seq, 18)"\t"substr($qual, 1, 5)""substr($qual, 12, 3)""substr($qual, 18)}' | mawk '{print "@"$1"\n"$2"\n+\n"$3}'
#}
strip_low_qual_zones() strip_low_qual_zones()
{ {
bioawk -c fastx '{print $name"\t"substr($seq, 1, 5)""substr($seq, 12, 3)""substr($seq, 18)"\t"substr($qual, 1, 5)""substr($qual, 12, 3)""substr($qual, 18)}' | mawk '{print "@"$1"\n"$2"\n+\n"$3}' bioawk -c fastx '{print "@"$name"\n"$seq"\n+\n"$qual}'
} }
# This script performs 2 sorting and deduplicating operations, depending on the # This script performs 2 sorting and deduplicating operations, depending on the
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment