CAGNIR applies Graph Attention Networks (GAT) on the Click Graph and extends it with the multi-view attention mechanism. So queries and documents can aggregate relevant information from their neighbors, and the relationship between nodes can be measured from multiple perspectives, results in a more proper and refined way than traditional methods. Through the principle close to pseudo relevance feedback (PRF), CAGNIR can get representation with complete semantics and reduce sparsity in the click log, thereby improving retrieval performance. And finally, given a query, documents are clicked on for it or close to it in the Click Graph will have a better and more reasonable ranking.
Model Architecture
| CAGNIR | Click-Attention Graph |
|---|---|
![]() |
![]() |
Retrieval Performance
| Models | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
|---|---|---|---|---|
| VSM a | 0.5081 - | 0.4674 - | 0.4399 - | 0.3941 - |
| VSM b | 0.5484 - | 0.5260 - | 0.5074 - | 0.4755 - |
| DSSM a | 0.5437 - | 0.5033 - | 0.4740 - | 0.4254 - |
| VPCG b | 0.7195 - | 0.7168 - | 0.7208 - | 0.7337 - |
| CAGNIR a | 0.7684 | 0.7622 | 0.7551 | 0.7463 |
Note. The dataset Sogou-QCL is used for experiments, and the results marked with "-" are significantly weaker than CAGNIR at the same position under the Student's Paired t-test (p-value<0.01).
a,b The vocabularies used for the model are derived respectively from data through SentencePiece (BPE) and Jieba.
expt/: Scripts for experiments, including model training, model testing and performance measurement.models/: Implementation of CAGNIR and other related models (i.e. VSM, DSSM, VPCG).utils/:common/: Common utilities like interface of database, logger, etc.data/: Utilities for data processing.models/: Components of models.
Python 3.6.8TensorFlow 1.13.0Scikit-learn 0.21.2Scipy 1.3.0Numpy 1.16.4

