AISecKit

No OCR

A simple AI tool for PDF processing without OCR, enabling search and questioning across multiple document collections.

Visit Website

Visit Website

Introduction

No OCR

No OCR is an AI-based tool designed for simplifying the exploration of documents without relying on traditional OCR methods. Users can easily upload PDFs to perform searches or ask questions about content across various collections.

Key Features

No OCR Requirement: Process PDF pages without requiring Optical Character Recognition.
Text and Visual Queries: Perform advanced queries using text embeddings and visual questioning.
Automated Ingestion: Seamlessly creates using Hugging Face-style datasets from uploaded documents.
Vector-Based Search: Leverages LanceDB for efficient search functionalities over document collections.
Advanced Question-Answering: Utilize open-source models for intelligent inquiries on diagrams and texts.

Benefits

Effortless Document Management: Simplifies the management and exploration of large volumes of documents.
Flexible Deployment: Deployable via Docker for both backend and user interface.
Customizable Workflows: Allow unique training per case to enhance model performance.