I've been wanting to filter out a DNA sequence within a line of text using awk, and have been unsuccessful. I was trying:
gawk '{gsub(/[ACTG]{10,}/,""); print}'
which I expected to work, but it was not. I've found that gawk (GNU awk) has an extra setting that allows this syntax:
gawk --re-interval '{gsub(/[ACTG]{10,}/,""); print}'
This works.
No comments:
Post a Comment