How can we convert HTML to PDF using Python? Is there any code you can share?

Problem description

I have tried a library called pdftotree, but I did not get any result. Here is the code:

import pdftotree
file= open('C:/Users/chaitanya.naidu/Downloads/test.pdf', 'rb')
f = pdftotree.parse(file)

I get this error:

Traceback (most recent call last):
  File "<ipython-input-4-4a9a6b72801d>", line 1, in <module>
    f = pdftotree.parse(file)
  File "C:\Users\chaitanya.naidu\AppData\Local\Continuum\Anaconda3\lib\site-packages\pdftotree\core.py", line 63, in parse
    if not extractor.is_scanned():
  File "C:\Users\chaitanya.naidu\AppData\Local\Continuum\Anaconda3\lib\site-packages\pdftotree\TreeExtract.py", line 121, in is_scanned
    self.parse()
  File "C:\Users\chaitanya.naidu\AppData\Local\Continuum\Anaconda3\lib\site-packages\pdftotree\TreeExtract.py", line 91, in parse
    for page_num, layout in enumerate(analyze_pages(self.pdf_file)):
  File "C:\Users\chaitanya.naidu\AppData\Local\Continuum\Anaconda3\lib\site-packages\pdftotree\utils\pdf\pdf_utils.py", line 117, in analyze_pages
    with open(os.path.realpath(file_name), "rb") as fp:
  File "C:\Users\chaitanya.naidu\AppData\Local\Continuum\Anaconda3\lib\ntpath.py", line 542, in abspath
    path = os.fspath(path)
TypeError: expected str, bytes or os.PathLike object, not _io.BufferedReader

Tags: python, pdf, data-science, converters, html2pdf

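Solution

You can use pdfkit, a Python wrapper around the wkhtmltopdf command-line tool (which must be installed separately), for example: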

import pdfkit

# Convert a web page, fetched by URL
pdfkit.from_url('http://google.com', 'out.pdf')
# Convert a local HTML file
pdfkit.from_file('test.html', 'out.pdf')
# Convert an HTML string
pdfkit.from_string('Hello!', 'out.pdf')
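
A side note on the traceback in the question: its last frames show pdftotree calling os.path.realpath(file_name) on the argument it was given, so parse appears to expect a path string rather than an open file object. pdftotree also reads PDFs rather than producing them, so it works in the opposite direction from what the title asks. The sketch below reproduces the same TypeError using only the standard library; as_path is a hypothetical helper written for illustration and is not part of pdftotree.

```python
import os
import tempfile


def as_path(source):
    """Return a filesystem path for a path-like object or an open file.

    Hypothetical helper for illustration; it is not part of pdftotree.
    """
    if hasattr(source, "read"):
        # File objects created by open(path) expose that path as .name
        return source.name
    return os.fspath(source)


# Create a throwaway file so the example runs anywhere.
with tempfile.NamedTemporaryFile(suffix=".pdf", delete=False) as tmp:
    tmp.write(b"%PDF-1.4\n")
    pdf_path = tmp.name

with open(pdf_path, "rb") as handle:
    try:
        # Same kind of call that fails inside pdftotree in the traceback.
        os.path.realpath(handle)
        raised = False
    except TypeError:
        raised = True
    # Passing the path instead of the file object works.
    resolved = os.path.realpath(as_path(handle))

os.remove(pdf_path)
```

For the code in the question, this would presumably mean passing the path string 'C:/Users/chaitanya.naidu/Downloads/test.pdf' directly to pdftotree.parse instead of the object returned by open, assuming its first parameter is the PDF file name as the traceback suggests.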
