We should have a CI that validates that each URL added in this repository are parseable returning not None output.
See example http:///culture.gouv.fr was added and the resulting host None is not expected:
### 0 links from living/vame.md
http:///culture.gouv.fr <--- triple slash
Traceback (most recent call last):
File "/Volumes/ExtremePro/sciencialab/cc/web-languages/../web-languages-code/extract_links.py", line 116, in <module>
links = list(map(normalize_url,
^^^^^^^^^^^^^^^^^^^^^^^
File "/Volumes/ExtremePro/sciencialab/cc/web-languages/../web-languages-code/extract_links.py", line 57, in normalize_url
if not h.isascii():
^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'isascii'
We should have a CI that validates that each URL added in this repository are parseable returning not None output.
See example
http:///culture.gouv.frwas added and the resulting hostNoneis not expected: