Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
-
Updated
Jun 3, 2025 - Python
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
A Repo For Document AI
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
[CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
SnapDoc AI processes everything on-device, ensuring your sensitive information never leaves your control. Use voice and text on-device processing in organizations.
Smartloop is an open-source SLM platform to train and run models on an edge device
SamKenX applications and Document AI, the end-to-end document processing platform on Cloudstorage warehouse.
Extracting Data from Document PDF and Converting to EDI211 Files Using GCP and Google Document AI
This Flask application Google Cloud Document AI to extract name, IPK (GPA), university details, etc.
AI-powered PDF extraction suite for structured insights from contracts, forms, and documents. Built with Streamlit, LangChain, GPT-4o, and PDFPlumber.
Add a description, image, and links to the document-ai topic page so that developers can more easily learn about it.
To associate your repository with the document-ai topic, visit your repo's landing page and select "manage topics."