Skip to content

Possble improvements in segregation outputs #54

@XingerTang

Description

@XingerTang

AlphaImpute2 version: v0.0.2

Current segregation outputs are based on the function Heuristic_Peeling.fillPointEstimates, which only uses known haplotypes to estimate the segregation probabilities. This ensures the quality of the data, such that we transmit the genotype probabilities between the parents and children only when we are confident in how the haplotypes are inherited.

But if we are using this for the outputs, we are holding back a lot of information which we are possibly able to obtain.
For the accuracy test case with 5 generations of input data, AlphaPeel is able to give the segregation probabilities of:

  • Generation 0 - 1
  • Generation 1 - 2
  • Generation 2 - 3
  • Generation 3 - 4

In total, 4 sets of data.

While the restriction of AlphaImpute2 leads to segregation probabilities of:

  • Generation 1 - 2
  • Generation 2 - 3
  • Generation 3 - 4

In total, 3 sets of data, which is 1 less than the AlphaPeel.

Proposed fix:

  • Write a new function that generates segregation probabilities based on genotype probabilities instead of haplotypes to produce the output segregation probabilities.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions