Skip to content

jfxia/url2pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

url2pdf

Extract the content contained in multiple web page links and convert to PDF files, especially those containing math formulas and webp images.

-The web page URLs are saved in links.txt.

-Before running, please install wkhtmltopdf (https://wkhtmltopdf.org/downloads.html) and other dependencies.

pip install base64 requests bs4 urllib tqdm PIL

Usage:

python url2pdf.py  links.txt

Note: Some websites have blocked crawler programs, and this script may not work.

About

Transform the content of web links to PDF files

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages