We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好~我按照PDF項目中進行Document Content Extraction, 步驟如 https://pdf-extract-kit.readthedocs.io/en/latest/project/pdf_extract.html 所示。
在output的部分是能夠解析出JSON的,但在Markdown輸出的部分會有UnicodeEncodeError: UnicodeEncodeError: 'cp950' codec can't encode character '\u5706' in position 78: illegal multibyte sequence 測試demo中的資料發現是中文的問題,猜測要使用UTF-8 (但我還沒有debug成功), 故來請問有沒有解決方法,謝謝!
UnicodeEncodeError: 'cp950' codec can't encode character '\u5706' in position 78: illegal multibyte sequence
(此外想請問使用繁體中文會影響嗎?)
The text was updated successfully, but these errors were encountered:
No branches or pull requests
您好~我按照PDF項目中進行Document Content Extraction,
步驟如 https://pdf-extract-kit.readthedocs.io/en/latest/project/pdf_extract.html 所示。
在output的部分是能夠解析出JSON的,但在Markdown輸出的部分會有UnicodeEncodeError:
UnicodeEncodeError: 'cp950' codec can't encode character '\u5706' in position 78: illegal multibyte sequence
測試demo中的資料發現是中文的問題,猜測要使用UTF-8 (但我還沒有debug成功),
故來請問有沒有解決方法,謝謝!
(此外想請問使用繁體中文會影響嗎?)
The text was updated successfully, but these errors were encountered: