Skip to content

Format

Input

The input file in BAM format is only acceptable by mclUMI with each read umi attached to the read name. If you hold FASTQ file, you need to map the *.fastq.gz to a reference genome to yield a bam file using a mapping tool, such as Minimap2, HISAT2, or STAR, according to different sequencing reads.

Note

In single cell sequencing read analysis, barcodes will be attached to the read name yet prior to their UMIs.

Output

mclUMI outputs a series of files after UMI deduplication or collapsing. It has two files for recording statistics involved and one bam file as deduplicated reads. Please see details at 4 different UMI deduplication levels.