Barcode tagging
Analogy with other tools¶
1) UMI-tools¶
Read groupping¶
The --per-cell
and --per-gene
attributes are used to tell UMI-tools to group reads within their respective cells and genes. But what they eactly used for read groupping is needed to be specified separately via --cell-tag
and --gene-tag
, which use a 2-character tag (e.g., XT
) found in the tag
field of each read in the .bam
file.
-
For
--per-gene
, it must be used in combination with--gene-tag
. -
For
--per-cell
, it can optionally be used in combination with--cell-tag
.
--per-gene
--gene-tag
--gene-tag
--cell-tag
--umi-tag
gene
If you have used --gene-tag
, then you also need to use --assigned-status-tag
too, which is a bool parameter, specified in tag
region as well and first introduced by FeatureCounts.
--assigned-status-tag=XS
UMI retrieval¶
It uses the --umi-tag
attribute to extract a UMI per read, with its respective tag stored in a .bam
file. For example, to extract UMIs tagged with the MB
tag, it is used in combination with the --extract-umi-method
attribute.
--extract-umi-method=tag --umi-tag=MB
When this is not available, it uses --umi-separator
to get UMIs from read names. For example, to extract UMIs separated with _
, it does
--umi-separator=_