DownloadAllContent

Lightweight web scraping script. Fetch and download main textual content from the current page, provide special support for novels

< Commentaires sur DownloadAllContent

Question / commentaire

§
Posté le: 27/05/2023

Add Markdown support.

Please look into my script which supports Plain Text, Markdown and HTML.

hoothinAuteur
§
Posté le: 28/05/2023

Thanks for suggestion, Great work!
I think it's not a good idea to add Turndown to this project. As this script is for novel sites, and most of them are crammed with advertisements. If I convert the content with full-supported markdown, the obfuscation will be inevitable.

§
Posté le: 28/05/2023
Édité le: 28/05/2023

Thank you.

Sometimes, I do happen to manually edit markdown files produced by Turndown sue to javascript and css script that were catched in the process.

The HTML seems to work as expected, most of the time, though I should improve it.

Poster une réponse

Connectez-vous pour poster une réponse.