Support ignoring URL params #90 #111

Open
bluemindset wants to merge 3 commits into s0rg:master from bluemindset:master

@bluemindset

I have implemented a new feature by adding a flag called ignore-query to the project. This flag strips query parameters from the URI during scraping. To achieve this, I utilized the NormalizeURL function in the utils.go file to ensure consistency in URL normalization.

I would love to hear your feedback on this implementation and contribute more to the project!

Best regards,
Stefanos
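
For readers following along, here is a minimal sketch of what stripping query parameters from a URL could look like in Go, using only the standard `net/url` package. The helper name `stripQuery` is hypothetical; it is not the project's `NormalizeURL` function, whose signature is not shown in this conversation.

```go
package main

import (
	"fmt"
	"net/url"
)

// stripQuery is a hypothetical helper (not the project's NormalizeURL).
// It parses raw and returns it with the query string removed.
// Scheme, host, path, and fragment are left untouched.
func stripQuery(raw string) (string, error) {
	u, err := url.Parse(raw)
	if err != nil {
		return "", err
	}
	u.RawQuery = ""      // drop everything after '?'
	u.ForceQuery = false // avoid emitting a trailing '?'
	return u.String(), nil
}

func main() {
	clean, err := stripQuery("https://example.com/search?q=go&page=2")
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println(clean)
}
```

Clearing `RawQuery` on the parsed URL, rather than cutting the string at `?`, lets the standard parser handle fragments and edge cases.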

@s0rg

s0rg commented Jan 2, 2025

Owner

Thank you for your submission, but I think you did it wrong, not in terms of code but in logic: this flag strips params only from the starting URL, not from the crawled ones, so it will not work as expected.

best regards,
Alexey
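
The point of the review can be illustrated with a small sketch: the query has to be stripped from every discovered link before de-duplication, not only from the starting URL. Everything here is an assumption made for illustration (the `crawl` and `dropQuery` names, the `ignoreQuery` parameter, and the in-memory `site` map standing in for real page fetching); it is not the project's crawler code.

```go
package main

import (
	"fmt"
	"net/url"
)

// dropQuery (hypothetical) returns raw without its query string.
// Input that fails to parse is returned unchanged.
func dropQuery(raw string) string {
	u, err := url.Parse(raw)
	if err != nil {
		return raw
	}
	u.RawQuery = ""
	u.ForceQuery = false
	return u.String()
}

// crawl (hypothetical) walks pages breadth-first. The links callback stands
// in for fetching a page and extracting its hrefs.
func crawl(start string, ignoreQuery bool, links func(page string) []string) []string {
	norm := func(s string) string {
		if ignoreQuery {
			return dropQuery(s)
		}
		return s
	}

	start = norm(start) // normalize the starting URL...
	seen := map[string]bool{start: true}
	queue := []string{start}
	var visited []string

	for len(queue) > 0 {
		page := queue[0]
		queue = queue[1:]
		visited = append(visited, page)

		for _, l := range links(page) {
			l = norm(l) // ...and every crawled URL as well
			if !seen[l] {
				seen[l] = true
				queue = append(queue, l)
			}
		}
	}
	return visited
}

func main() {
	// Toy site: two links that differ only in their query string.
	site := map[string][]string{
		"https://example.com/":  {"https://example.com/a?x=1", "https://example.com/a?x=2"},
		"https://example.com/a": {"https://example.com/"},
	}
	pages := crawl("https://example.com/?ref=home", true,
		func(p string) []string { return site[p] })
	fmt.Println(pages)
}
```

With normalization applied inside the loop, the two `/a` links collapse into a single visit; applying it only to the starting URL would leave them as separate entries.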

@bluemindset

Author

Hello, and thank you for the review. I believe I've now corrected the logic.
