Skip to content
This repository was archived by the owner on Aug 4, 2023. It is now read-only.
This repository was archived by the owner on Aug 4, 2023. It is now read-only.

emoji unicode error #5

Description

@cole-dda

when pdf include emoji,such as:😄
unicode=0x1f604

when use ms word to generate pdf
screenshot_5883

the unicode include space

https://pdfium.googlesource.com/pdfium/+/refs/heads/main/core/fpdfapi/font/cpdf_tounicodemap.cpp
screenshot_5884

when space is break,so get unicode=0xd83d

but right is =[d8,3d,de,04], then [d8,3d,de,04].decode('utf-16-be') => '😄'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions