Skip to content

IslamHisham/Domain_Sampler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Domain_Sampler

Get a representative sample of articles for a domain that can be used for further studies like credibility assestment of the domain or any other type of analysis.

The project is made up of two stages:

  1. Size Reduction Stage: where the number of the article is reduced baed on the statistical limited population theory
  2. Topic sampling: A representative sample from each topic is taken to ensure the diversity and representativeness of the sample. We use BERTopic.

To use the domain sample clone this git repo and follow these steps:

  1. On your terminal type: pip install -r requirements.txt
  2. Create a new .py file and import the DomainSampler class and use it.

About

Get a representative sample of articles for a domain

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages