Convert PDF to HTML without losing text or format.

masmx64/pdf2htmlEX

This branch is up to date with coolwanglu/pdf2htmlEX:master.

Latest commit: 1b454bc · Aug 5, 2022

pdf2htmlEX is no longer under active development. New maintainers are wanted.

# pdf2htmlEX

A beautiful demo is worth a thousand words

  • Bible de Genève, 1564 (fonts and typography): HTML / PDF
  • Cheat Sheet (math formulas): HTML / PDF
  • Scientific Paper (text and figures): HTML / PDF
  • Full Circle Magazine (read while downloading): HTML / PDF
  • Git Manual (CJK support): HTML / PDF

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies. Academic papers with lots of formulas and figures? Magazines with complicated layouts? No problem!

pdf2htmlEX is also a flexible online publishing tool that suits many different use cases.

Learn more about who should use pdf2htmlEX and why.

Features

  • Native HTML text with precise font and location.
  • Flexible output: all-in-one HTML or on-demand page loading (needs JavaScript).
  • Moderate file size, sometimes even smaller than the source PDF.
  • Supports links, outlines (bookmarks), printing, SVG backgrounds, Type 3 fonts, and more.

Compare to others

Portals

LICENSE

pdf2htmlEX, as a whole package, is licensed under GPLv3+. Some resource files are released under more permissive licenses; read LICENSE for details.

Acknowledgements

pdf2htmlEX is made possible thanks to the following projects:

pdf2htmlEX is inspired by the following projects:

  • pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Docs

Special Thanks

  • Hongliang Tian
  • Wanmin Liu

Languages

  • HTML 82.7%
  • C++ 12.6%
  • JavaScript 1.4%
  • Python 1.1%
  • Roff 0.6%
  • C 0.6%
  • Other 1.0%