rmhsiao/CAGNIR

Click-Attention Graph for Neural Information Retrieval (CAGNIR)

Overview

CAGNIR applies Graph Attention Networks (GAT) to the Click Graph and extends them with a multi-view attention mechanism. Queries and documents can thus aggregate relevant information from their neighbors, and the relationship between nodes is measured from multiple perspectives, which yields a more refined estimate than traditional methods. Following a principle similar to pseudo relevance feedback (PRF), CAGNIR obtains representations with more complete semantics and reduces the sparsity of the click log, thereby improving retrieval performance. As a result, given a query, documents that were clicked for it, or that lie close to it in the Click Graph, receive a better and more reasonable ranking.
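
To make the neighbor aggregation concrete, here is a minimal sketch of one single-head, GAT-style update on a toy click graph. It is not the repository's implementation: the node names, feature vectors, projection matrix `W` and attention vector `a` are all made up for illustration, the output nonlinearity is omitted, and the multi-view extension is not reproduced.

```python
import math

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x

def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gat_aggregate(node, features, neighbors, W, a):
    """One single-head GAT-style update for `node`.

    Returns the aggregated vector and the attention weight per neighbor.
    The node attends to itself as well (self-loop), an assumption that is
    common in GAT implementations.
    """
    h_i = matvec(W, features[node])
    nbrs = [node] + neighbors[node]
    projected = [matvec(W, features[n]) for n in nbrs]
    # e_ij = LeakyReLU(a . [W h_i || W h_j]); `+` concatenates the lists
    scores = [leaky_relu(sum(ak * v for ak, v in zip(a, h_i + h_j)))
              for h_j in projected]
    alphas = softmax(scores)
    out = [sum(al * h_j[d] for al, h_j in zip(alphas, projected))
           for d in range(len(h_i))]
    return out, dict(zip(nbrs, alphas))

# Hypothetical toy click graph: an edge means "document clicked for query".
NEIGHBORS = {"q1": ["d1", "d2"], "q2": ["d2"], "d1": ["q1"], "d2": ["q1", "q2"]}
FEATURES = {"q1": [1.0, 0.0], "q2": [0.0, 1.0], "d1": [1.0, 1.0], "d2": [0.0, 2.0]}
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]   # stand-in for a learned projection
A = [0.5, -0.5, 1.0, 0.25]            # stand-in for a learned attention vector

new_q1, weights_q1 = gat_aggregate("q1", FEATURES, NEIGHBORS, IDENTITY, A)
```

In this sketch `q2` never clicked `d1`, yet after one update of `d2` and a second update of `q2`, information from `q1` and its clicks can reach `q2`, which is the sparsity-reducing effect the overview describes.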

Model Architecture

[Figures: CAGNIR; Click-Attention Graph]

Retrieval Performance

Models        NDCG@1      NDCG@3      NDCG@5      NDCG@10
VSM (a)       0.5081 -    0.4674 -    0.4399 -    0.3941 -
VSM (b)       0.5484 -    0.5260 -    0.5074 -    0.4755 -
DSSM (a)      0.5437 -    0.5033 -    0.4740 -    0.4254 -
VPCG (b)      0.7195 -    0.7168 -    0.7208 -    0.7337 -
CAGNIR (a)    0.7684      0.7622      0.7551      0.7463

Note. Experiments use the Sogou-QCL dataset. Results marked with "-" are significantly weaker than CAGNIR at the same cutoff under a paired Student's t-test (p-value < 0.01).
(a), (b): The model's vocabulary is derived from the data with SentencePiece (BPE) or Jieba, respectively.
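
For readers unfamiliar with the metric in the table, the following is a small sketch of NDCG@k for a single ranked list. The README does not say which gain function the experiments use; the exponential gain `2^rel - 1` below is an assumption, and the function names are illustrative rather than taken from `expt/`.

```python
import math

def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the top-k relevance labels.

    Assumes exponential gain (2^rel - 1) and a log2(rank + 1) discount.
    """
    return sum((2 ** rel - 1) / math.log2(rank + 1)
               for rank, rel in enumerate(relevances[:k], start=1))

def ndcg_at_k(relevances, k):
    """DCG@k normalized by the DCG@k of the ideal (sorted) ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal == 0:
        return 0.0  # no relevant document at all for this query
    return dcg_at_k(relevances, k) / ideal

# `relevances` lists the graded labels of the documents in ranked order.
score = ndcg_at_k([2, 0, 3, 1], k=3)
```

A reported NDCG@k is then typically the mean of this per-query value over all test queries.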

File Structure

  • expt/: Scripts for experiments, including model training, model testing and performance measurement.
  • models/: Implementation of CAGNIR and other related models (i.e. VSM, DSSM, VPCG).
  • utils/:
    • common/: Common utilities such as the database interface and logger.
    • data/: Utilities for data processing.
    • models/: Components of models.

Dependencies

  • Python 3.6.8
  • TensorFlow 1.13.0
  • Scikit-learn 0.21.2
  • Scipy 1.3.0
  • Numpy 1.16.4
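
One way to compare a local environment against the versions listed above is a short check like the one below. This is only a sketch: it assumes each package exposes a `__version__` attribute under the import names shown (`sklearn` for Scikit-learn), and the helper names are hypothetical.

```python
import importlib

# Versions listed under Dependencies, keyed by import name (assumed).
EXPECTED = {
    "tensorflow": "1.13.0",
    "sklearn": "0.21.2",
    "scipy": "1.3.0",
    "numpy": "1.16.4",
}

def installed_version(module_name):
    """Return the module's __version__, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)

def report(expected=EXPECTED):
    """Return (name, wanted, found, status) for every expected package."""
    rows = []
    for name, wanted in expected.items():
        found = installed_version(name)
        if found is None:
            status = "missing"
        elif found == wanted:
            status = "ok"
        else:
            status = "mismatch"
        rows.append((name, wanted, found, status))
    return rows

# Usage: for row in report(): print(row)
```

A "mismatch" does not necessarily mean the code will fail, only that the environment differs from the one listed here.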

About

Implementation of CAGNIR, a neural information retrieval model that aggregates relevant semantics by applying Graph Attention Networks to the Click Graph.
