About the reward settings and playing game #16

@xwxahu

Hello! I recently read the FINDER paper, and two points puzzled me.

  1. In the article, the reward is defined as the decrease in ANC, but computing ANC requires a node removal sequence. How should I obtain that sequence? Should I remove nodes using FINDER, HDA, or some other method?
  2. In the supplementary material, Algorithm S3 (FINDER) shows SGD being performed after each experience is stored. However, the last paragraph of Ⅱ.D.2 (Train algorithm) seems to say that SGD is performed after each episode. What does "episode" mean here: removing a single node, or removing nodes from a graph until the terminal state?

Thank you very much for your work! I look forward to your answers.
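
To make question 1 concrete, here is a minimal sketch of how I understand ANC to be computed once a removal sequence is given. It assumes the pairwise-connectivity measure, i.e. the sum of s(s-1)/2 over connected components of size s, and takes ANC as the average over removal steps of the remaining connectivity divided by the initial connectivity. The function names and the adjacency-dict graph representation are my own illustrative choices, not code from the FINDER repository.

```python
from collections import deque


def pairwise_connectivity(adj, removed):
    """Sum of s*(s-1)/2 over connected components of the graph without `removed`.

    `adj` is a hypothetical adjacency dict {node: [neighbours]}.
    """
    seen = set(removed)  # removed nodes are never visited
    total = 0
    for start in adj:
        if start in seen:
            continue
        # Breadth-first search to measure the size of one component.
        seen.add(start)
        queue = deque([start])
        size = 0
        while queue:
            u = queue.popleft()
            size += 1
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    queue.append(v)
        total += size * (size - 1) // 2
    return total


def anc(adj, sequence):
    """ANC of a removal sequence (assumed definition, see the text above)."""
    n = len(adj)
    base = pairwise_connectivity(adj, set())
    if n == 0 or base == 0:
        return 0.0  # graph has no connected pairs to begin with
    removed = set()
    acc = 0.0
    for v in sequence:
        removed.add(v)
        # Each term depends only on the nodes removed so far.
        acc += pairwise_connectivity(adj, removed) / base
    return acc / n


# Path graph 0-1-2-3: removing an interior node first gives a lower ANC.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
print(anc(path, [1, 2, 0, 3]))  # interior nodes first
print(anc(path, [0, 1, 2, 3]))  # end node first
```

Under this reading, each term in the sum depends only on the nodes removed up to that step, which is why I am unsure whether the sequence is meant to be the one the agent itself produces during training or one taken from a baseline such as HDA.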
