Research And Development
R&D Overview
.avif)
Advancing Generative AI through Innovation
The R&D team at LightOn plays a pivotal role in advancing the field of generative AI through continuous innovation and development. Their expertise spans across creating and fine-tuning large language models (LLMs) that form the backbone of the Paradigm platform, a comprehensive AI solution designed for enterprise use. This platform simplifies the integration of generative AI into business workflows, offering both on-premise and cloud options to ensure flexibility and scalability for various business needs.
r&d publicationsRecent R&D Posts

Multi-vector retrieval now on the infrastructure you already run
ColBERT-grade retrieval quality on the infrastructure teams already operate, at a fraction of the storage and serving cost.
CTA Title
Lorem Ipsum

LightOn Demonstrates the Flexibility of Its OCR Model by Adapting It to Arabic Through Targeted Training
LightOn demonstrates the flexibility of LightOnOCR-2, its document understanding model, by adapting it to Arabic through fine-tuning.
CTA Title
Lorem Ipsum

Adaptive Chunking: Reasoning Starts Before the LLM Sees a Token
Document-aware chunking selection for production RAG systems
CTA Title
Lorem Ipsum

Deep Research is now Open
Agent-ModernColBERT adds ~10% over Reason-ModernColBERT on BrowseComp-Plus, stays at 149M parameters, and brings GPT-5 + Qwen3-8B-level retrieval performance to a fully open stack.
CTA Title
Lorem Ipsum

🔴 The Retriever You Actually Need
Introducing LateOn and DenseOn, two Apache 2.0 retrievers: SOTA on BEIR, built to generalize.
CTA Title
Lorem Ipsum

Document Intelligence at First Sight
OriOn-Qwen-SR1: Fast Implicit Reasoning for Long Documents
CTA Title
Lorem Ipsum

Open-Source LightOnOCR-2 Just Outscored Claude, GPT-5, Qwen3, Mistral and Mathpix at Table Extraction
The most valuable information in enterprise documents doesn't live in paragraphs. It lives in tables
CTA Title
Lorem Ipsum


