Skip to content

alexruco/kurt-plugin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“œ Kurt Plugin

πŸ“ Description

The Kurt Plugin is a minimalistic WordPress plugin designed to internally crawl a website, generate a JSON file containing links present in each page, post, CPT, or sitemap, along with the URLs where each link was found. It also checks if each URL is available, adding a βœ… true or ❌ false value to indicate its availability.

🌟 Features

  • πŸ•΅οΈ Crawls all posts, pages, and custom post types.
  • πŸ—‚οΈ Checks the robots.txt file for sitemaps.
  • πŸ” Crawls hypothetical sitemap URLs if they exist.
  • 🚫 Ignores irrelevant links such as mailto:, javascript:, tel:, whatsapp:, and Google Maps links.
  • πŸ“Š Outputs a JSON file with the links found, where they were discovered, and their availability.

πŸ“₯ Installation

  1. Upload the plugin to the wp-content/plugins directory.
  2. Activate the plugin through the 'Plugins' menu in WordPress.

πŸš€ Usage

To generate the JSON file with the links, access your site with the URL parameter ?generate_links_json, e.g., https://yoursite.com/?generate_links_json.

πŸ“„ JSON Structure

The generated JSON file will have the following structure:

{
    "https://example.com/somepage": {
        "found_in": [
            "https://example.com/otherexample",
            "https://example.com/someotherexample",
            "https://example.com/sitemap.xml"
        ],
        "available": true
    },
    ...
}
  • https://example.com/somepage: The URL of the link.
  • found_in: An array of URLs where the link was discovered.
  • available: A boolean indicating whether the link is available.

πŸ“œ License

This plugin is licensed under the GPL v2 or later.

About

Internal webcrawler for WP websites

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages