techflare641/pdf-parsing-with-llm

This project is a library for extracting data from PDF files and generating a summary from the extracted content.

Features

  • Extracts data from the objects embedded in PDF documents (images, tables, charts)
  • Generates a concise summary from the extracted information
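
The two features above can be pictured as an extract-then-summarize pipeline. The following is only an illustrative sketch, not this project's actual API: `ExtractedObject`, `to_json`, and `summarize` are hypothetical names, the schema is an assumption, and the model call is passed in as a plain callable so that no real PDF or OpenAI library calls are implied.

```python
import json
from dataclasses import dataclass, asdict
from typing import Callable


@dataclass
class ExtractedObject:
    """One item pulled from a PDF page (hypothetical schema)."""
    kind: str      # "image", "table", or "chart"
    page: int      # 1-based page number
    content: str   # textual description or serialized data


def to_json(objects: list[ExtractedObject]) -> str:
    """Serialize extracted objects to a JSON document."""
    return json.dumps({"objects": [asdict(o) for o in objects]}, indent=2)


def summarize(objects: list[ExtractedObject], llm: Callable[[str], str]) -> str:
    """Build a prompt from the extracted content and pass it to an LLM callable."""
    lines = [f"[page {o.page}] {o.kind}: {o.content}" for o in objects]
    prompt = "Summarize the following PDF content concisely:\n" + "\n".join(lines)
    return llm(prompt)


if __name__ == "__main__":
    sample = [
        ExtractedObject("table", 1, "Quarterly revenue by region"),
        ExtractedObject("chart", 2, "Bar chart of monthly active users"),
    ]
    print(to_json(sample))
    # Stand-in for a real model call: reports how many content lines it received.
    fake_llm = lambda prompt: f"{len(prompt.splitlines()) - 1} objects received"
    print(summarize(sample, fake_llm))
```

Passing the model in as a callable keeps the sketch runnable offline; a real client call could be substituted for `fake_llm` without changing the rest.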

Getting Started

Requirements

  • Python 3.12.10

Installation

Clone the repository:

git clone https://github.com/techflare641/pdf-parsing-with-llm.git
cd pdf-parsing-with-llm

Create a virtual environment, activate it, and install the dependencies:

python -m venv venv
./venv/Scripts/activate
pip install -r requirements.txt

Run

Activate the virtual environment (if it is not already active) and run the test script, a Windows batch file:

./venv/Scripts/activate
./test.bat

About

Converting PDF to JSON with OpenAI
