Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/DOalZRh

Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o I've developed a Python API service that uses GPT-4o for OCR on PDFs. It features parallel processing and batch handling for improved performance. Not only does it convert PDF to markdown, but it also describes the images within the PDF using captions like `[Image: This picture shows 4 people waving]`. In testing with NASA's Apollo 17 flight documents, it successfully converted complex, multi-oriented pages into well-structured Markdown. The project is open-source and available on GitHub. Feedback is welcome. https://ift.tt/va28i6t September 22, 2024 at 05:05AM

हमरु उत्तराखण्ड

Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/DOalZRh

Post a Comment

0 Comments

Popular Posts

भरत नाट्य शास्त्र गढवाली अनुवाद

Show HN: I made a Telegram bot to get Raspberry Pi “in-stock” notification https://ift.tt/GtsFfAl

Show HN: Stratup.ai – Startup Idea Machine https://ift.tt/7RfCINq

Subscribe Us

Technology

Comments

Facebook

Categories

Menu Footer Widget

हमरु उत्तराखण्ड

Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/DOalZRh

You may like these posts

Post a Comment

0 Comments

Social Plugin

Popular Posts

भरत नाट्य शास्त्र गढवाली अनुवाद

Show HN: I made a Telegram bot to get Raspberry Pi “in-stock” notification https://ift.tt/GtsFfAl

Show HN: Stratup.ai – Startup Idea Machine https://ift.tt/7RfCINq

Subscribe Us

Technology

Comments

Facebook

Categories

Menu Footer Widget