Improve PDF parsing #21

@cbldev

Description

Actual behavior

I noticed that on some complex PDFs, especially those with tables, pdftotext produces better results than the pdf-reader gem.

pdftotext: https://www.xpdfreader.com/pdftotext-man.html

Issue in Langchainrb: patterns-ai-core/langchainrb#682

Expected behavior

Complex PDFs, including those with tables, should be parsed with good results.
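
As a rough illustration of the idea, here is a minimal sketch of shelling out to pdftotext from Ruby. It assumes the `pdftotext` binary is installed and on the `PATH`. The helper names `pdftotext_command` and `extract_text_with_pdftotext` are hypothetical and are not part of this project, Langchainrb, or the pdf-reader gem. The `-layout` flag asks pdftotext to keep the physical layout of the page, which may be what helps with tables.

```ruby
require "open3"

# Hypothetical helper: builds the argument list for the pdftotext CLI.
# "-layout" keeps the physical page layout; "-" as the output file
# tells pdftotext to write the extracted text to stdout.
def pdftotext_command(pdf_path, layout: true)
  cmd = ["pdftotext"]
  cmd << "-layout" if layout
  cmd + [pdf_path, "-"]
end

# Hypothetical helper: runs pdftotext and returns the extracted text.
# Returns nil when the pdftotext binary is not installed, so the caller
# can fall back to another parser.
def extract_text_with_pdftotext(pdf_path, layout: true)
  stdout, stderr, status = Open3.capture3(*pdftotext_command(pdf_path, layout: layout))
  raise "pdftotext failed: #{stderr.strip}" unless status.success?

  stdout
rescue Errno::ENOENT
  nil
end
```

Returning `nil` when the binary is missing is one possible design choice: it would let the existing pdf-reader based parsing remain the fallback on systems without pdftotext.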

Metadata

Labels

enhancement (New feature or request)
