DownloadAllContent

Lightweight web scraping script. Fetch and download main textual content from the current page, provide special support for novels

< Feedback on DownloadAllContent

Review: OK - script works, but has bugs

§
Posted: 05/05/2024

最近69书吧应该是更新了算法,使用脚本爬69书吧的会出现段落混乱的情况
举例:https://www.69shu.top/txt/56138/35870903
网页看的内容顺序是:““嗯?”
来到宿舍门前,推门而入,宿舍门是打开的。”
爬下来的内容顺序是:““嗯?”
  半晌后,才回过神来,抬头看着一脸冷漠的陈枫”
求解决

Post reply

Sign in to post a reply.