New
MiXCR v4.6.0
🖇️ Combined Heavy+Light Somatic Hypermutation Trees from Single-Cell data
- A special step is added in
findShmTreesto combine heavy and light SHM trees utilizing information added to clonotypes bygroupClonescommand. Nodes in resulting tree will contain both light and heavy chains. If there is no connection to a clone from a companion chain, a reconstructed sequence will be added. - Behaviour can be disabled with
--dont-combine-tree-by-cellsoption to reconstruct separate heavy and light SHM trees - Added
exportShmSingleCellTreescommand that export one node per line. It there is several roots in a tree, data will be exported in a different columns. - Added
-subtreeIdto tree exports to differentiate part of trees from different chains exportShmTreesWithNodesandexportShmTreescommands will export subtrees with different chains at separate rows.
🚀 Other major upgrades
Changes in groupClones command
- Previous algorithm replaced with a new one that have better way of working with contamination, can detect multi-mappers (when one cell barcode marks two different cells) and can work with non-functional clones
- Some clones are now explicitly marked as contamination. This information is available as a separate label in
exportClonesingroupIdcolumn. Such clones can be filtered out from export by--filter-out-group-types contamination - More important algorithm performance metrics are added to the report
- Fix for behaviour leading to clones with
undefienedgroup being split by cell barcodes
New characteristics in SHM trees exports
-subtreeIdfor determination of different chains in the same tree-numberOfClonesInTree [forChain]Number of uniq clones in the SHM tree.-numberOfNodesWithClonesNumber of nodes with clones, i.e. nodes with different clone sequences.-totalReadsCountInTree [forChain]Total sum of read counts of clones in the SHM tree.-totalUniqueTagCountInTree (Molecule|Cell|Sample) [forChain]Total count of unique tags in the SHM tree with specified type.-chainsChain type of the tree-treeHeightHeight of the tree-vGene,-jGene,-vFamily,-jFamily- in previous version thous were exported only for nodes with clones-vBestIdentityPercent,-jBestIdentityPercent,-isOOFand-isProductivenow exported for reconstructed nodes too
New characteristics in clonotype export
-aaLengthand-allAALengthis available alongside-nLengthand-allNLength-aaMutationsRateis available alongside-nMutationsRate- Added optional arg
germlinein-nFeature,-aaFeature,-nLength,-aaLengthinexportClones,exportAlignmentsandexportCloneGroups. It allows to export a sequence of the germline instead of a sequence of the gene. - For all mutation exports (excluding
-mutationsDetailed) added optional filter by mutation type:... [(substitutions|indels|inserts|deletions)] - Added
-nMutationsCount,-aaMutationsCount,-allNMutationsCount,-allAAMutationsCountfor all relatable exports - For mutation exports in
exportShmTreesWithNodes(germline|mrca|parent)option is now optional. Will be export mutations fromgermlineby default - Added
--export-clone-groups-sort-chains-bymixin - Nucleotide mutations now could be exported for features that contain
VCDR3Part,DCDR3PartorJCDR3Part - Now
-nLength,-nMutationsCount,-nMutationsRatecan be calculated for multiple gene features (e.g.-nMutationsRate VRegionTrimmed,JRegionTrimmed) - Added
--export-clone-groups-sort-chains-bymixin with type of sorting of clones for determination of the primary and the secondary chains. It applies toexportCloneGroupscommand. By default, it'sAuto(by UMI if it's available, by Read otherwise; previous default value wasRead) - Added
--filter-out-group-typesmixin to filter-out clones having certain clone group assignment kind:found,undefinedorcontamination. It applies toexportClonescommand - Now
exportCloneGroupsby default will export groups in separate files forIG,TRAB,TRGDandmixed. This behaviour could be switched off by using--reset-export-clone-table-splittingor single--export-clone-groups-for-cell-type. In case of several--export-clone-groups-for-cell-typeevery cell type will be exported in separate file. - In case of
--export-clone-groups-for-cell-typeinexportCloneGroupsall mixed or unmatched groups will be filtered out. - Added read and Molecule fraction columns to single cell exportClones output.
🧬 Reference library upgrades
- Previous
TRADmeta-chain split intoTRAandTRDas it should be. Chain assignment for clonotypes based on J genes. - Rebuild allelic reference for human
IGH,TRB,TRAandTRDchains. Now allelic names correspond to the IUIS nomenclature. - Human
IGKVendcoordinates corrected. UTR5Begincoordinates added to the following mouse genes: IGKV23-1, IGKV20-101-2, IGKV14-130, IGKV8-28, TRGV2
📚 Preset updates
- The
milab-human-rna-tcr-umi-racepreset has been updated: now clones are assembled by default based on the CDR3, in line with the manufacturer's recommended read length. - The
flairr-seq-bcrpreset has been updated: now the preset sets species tohumanby default according to a built-in tag pattern with primer sequences. - The following presets have been added to cover Ivivoscribe assay panels:
invivoscribe-human-dna-trg-lymphotrack,invivoscribe-human-dna-trb-lymphotrack,invivoscribe-human-dna-igk-lymphotrack,invivoscribe-human-dna-ighv-leader-lymphotrack,invivoscribe-human-dna-igh-fr3-lymphotrack,invivoscribe-human-dna-igh-fr2-lymphotrack,invivoscribe-human-dna-igh-fr1-lymphotrack,invivoscribe-human-dna-igh-fr123-lymphotrack. - The following presets have been added for mouse Thermofisher assays:
thermofisher-mouse-rna-tcb-ampliseq-sr,thermofisher-mouse-dna-tcb-ampliseq-sr,thermofisher-mouse-rna-igh-ampliseq-sr,thermofisher-mouse-dna-igh-ampliseq-sr. - Preset for SMARTer Human scTCR a/b Profiling Kit:
takara-sc-human-rna-tcr-smarter - The
milab-human-rna-ig-umi-multiplexpreset has been updated: the pattern now trims fewer nucleotides, which facilitates CDR1 identification. The splits by V and J genes have been removed as redundant due to the full-length assembling feature.
🛠️ Minor improvements & fixes
- More strict
Combining treesstep infindShmTreescommand - Better calculation of indel mutations between nodes in process of building shm trees
- Increased percent of successful alignment-aided overlaps by removing unnecessary overlap region quality sum threshold
- Impossible export of germline sequence for
VJJunctioninshmTreesexports now produces an error - Parameter validation fix in
-nMutationsRate - Fix for
-nMutationsRateif region is not covered for the clone - Fix for the formal of
exportAlignmentsPrettybroken in the previous version - Fix for IllegalArgumentException in
exportAlignmentsPrettyfor cases where translation can't be performed - Fix error for
analyzeexecuted with-fand--output-not-used-readsat the same time - Resolutions of wildcards are excluded from calculation of
-nMutationsRatefor CDR3 inexportShmTreesWithNodes - Fix OutOfMemory exception in command
extendwith.vdjcainput - In
findShmTreesfilter for productive only clones now check for stop codons in all features, not only in CDR3 - Change default value for filter for productive clones in
findShmTreesto false (was true before) - Add option
--productive-onlytofindShmTrees - Fixed parsing of
--export-clone-groups-for-cell-typeparameter - Fixed usage of
slicecommand on clnx files that weren't ordered by id. - In
slicenow default behaviour is to keep original ids. Previous behaviour available with--reassign-idsoption - Fixed parsing of composite gene features with offsets like
--assemble-clonotypes-by [VDJRegion,CBegin(0,10)] - Fixed parent directory creation for output of
exportClonesOverlap - Fixed
exportAirrin case of a clone with CDR3 that don't have VCDR3Part and JCDR3Part - Optimize calculation of ranks in clone set. Speeds up export with tags and several other places.
- Added
clone_idcolumn inexportAirr - Fixed
exportClonesin case of splitting file bytag:...if there is a clone that have several tags of requested level - Fixed calculation of
-nMutationsCount,-nMutationsRate,-aaMutationsCountand-aaMutationsRate. Previously in some cases it was calculated on different region, from what was requested. - Added
CellBarcodesWithFoundGroupsforgroupClonesQC checks - New filter
--no-featureinexportAlignmentsPretty - Fixed reporting in
align, now coverage takes into account alignment-aided overlap
❗ Breaking changes
- Option
--build-from <path>was removed fromfindShmTreescommand
Deprecations of export options
-lengthOfnow is deprecated, use-nLengthinstead-allLengthOfnow is deprecated, use-allNLengthinstead-mutationRatenow is deprecated, use-nMutationsRateinstead