DownloadAllContent

Lightweight web scraping script. Fetch and download main textual content from the current page, provide special support for novels

< Feedback on DownloadAllContent

سؤال / تعليق

§
Posted: 27-05-2023

Add Markdown support.

Please look into my script which supports Plain Text, Markdown and HTML.

hoothinمؤلف
§
Posted: 28-05-2023

Thanks for suggestion, Great work!
I think it's not a good idea to add Turndown to this project. As this script is for novel sites, and most of them are crammed with advertisements. If I convert the content with full-supported markdown, the obfuscation will be inevitable.

§
Posted: 28-05-2023
Edited: 28-05-2023

Thank you.

Sometimes, I do happen to manually edit markdown files produced by Turndown sue to javascript and css script that were catched in the process.

The HTML seems to work as expected, most of the time, though I should improve it.

Post reply

تسجيل الدخول إلى مرحلة ما بعد الرد.