Reassign alleles and Genotype Inference improvement#48
Open
ayeletperes wants to merge 5 commits into
Open
Conversation
…fidence plot in the genotype, added depth per loci for the genotype inference, update the genotypeFasta function to include genes that were not seen in the genotype inference, included the genotyped_alleles function for the bayesian inference
- Added `genotyped_alleles` parameter to `inferGenotypeBayesian` for including most likely alleles in the output. - Implemented locus-specific checks in `inferGenotype` and `reassignAlleles` to handle mixed loci warnings. - Updated `genotypeFasta` to support unseen alleles inclusion. - Enhanced tests for genotype inference and reassignment functions to validate new features.
…ot function. Added a variable to choose the allele column to plot for the genotyped_alleles column in the bayesian
…for gene annotation handling
…ization with color bins and tile geometry
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Speeds up allele reassignment and moved genotype inference functions from enchantr
Changes
reassignAlleles'hamming'distance function. The call is guarded by aruntime
exists(..., envir = asNamespace("alakazam"))check and verified toreproduce the old result bit-for-bit, so it degrades gracefully to pure R if
alakazam hasn't been updated yet.
strip_dandreassign_uncalled. These preserve the old functionbehavior, where it annotates empty cells and strips the D of duplicated genes
(e.g.
IGHV1-69D).genotypeFastastrip_dto avoid stripping the D of duplicated genes. Default preservesold function behavior.
include_unseento include germline DB alleles for genes not present inthe genotype. Default preserves old function behavior.
inferGenotypeBayesiangenotyped_allelescomputes the alleles that passed the genotype and addsa new
genotyped_allelescolumn (moved from enchantr).locuscolumn.
plotGenotypeConfidencek_diffcolumn of the Bayesian inference, based on the oldplot function.
plotGenotypeallele_colto plot thegenotyped_allelescolumn from the Bayesianinference; previously hard-coded to
alleles.Notes
Please approve after the alakazam PR (
add-mutation-count-rcpp) is approved.reassignAllelesrelies on the Rcpp hamming-distance functions exported there for its speedup. It is functionally non-breaking without them (falls back to pure R), but the intended path needs alakazam in first.