Skip to content

SBMA_AR hg38 catalog region yields inflated NA12878 AR CAG sizing #340

Description

@jamiequint

In STRchive v2.17.1, the hg38 Stranger catalog entry for SBMA_AR uses ReferenceRegion chrX:67545316-67545419 with LocusStructure=(GCA)*, NormalMax=34, PathologicMin=38. Running ExpansionHunter 5.0.0 + Stranger 0.10.0 on the GIAB NA12878/HG001 30x BAM with the STRchive v2.17.1 catalog produced SBMA_AR allele sizes 34/38 and Stranger classified it as full_mutation/pathogenic.

This is discordant with the expected NA12878 AR CAG sizing (~20-25; our prior seed-catalog run over chrX:67545316-67545385 reported 21/25) and suggests the longer region may be including flanking sequence or otherwise measuring a different tract than the canonical AR CAG repeat used for SBMA interpretation.

Catalog entry observed locally:

{
  "LocusId": "SBMA_AR",
  "ReferenceRegion": "chrX:67545316-67545419",
  "LocusStructure": "(GCA)*",
  "DisplayRU": "GCA",
  "NormalMax": 34,
  "PathologicMin": 38,
  "Gene": "AR",
  "Disease": "SBMA"
}

For safety in our downstream clinical-adjacent pipeline, we are marking SBMA_AR not-callable pending upstream validation rather than surfacing the NA12878 pathogenic call. Could you confirm whether this hg38 region is intended to include only the canonical AR CAG/GCA tract for SBMA?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions