Skip to content

Pagewise Markdown output #122

@bahtman

Description

@bahtman

Hi guys,

Love the work. In our current approach we convert documents to a list of markdown, where each element consists of the markdown for that specific page. This is really useful in RAG when providing citations and enriching the chunks during completion.

I'm unsure how feasible it is for PDF, docx and other "paginated" content.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions