DocumentLens
Detect forged documents & compare document versions

About DocumentLens
Introduction to DocumentLens
DocumentLens is an AI-powered document intelligence suite developed by TurboLens, designed specifically for processing, analyzing, and verifying documents common in Southeast Asia. It addresses the unique challenges posed by regional document formats, multilingual content, and cultural conventions—such as Philippine BIR Forms, Vietnamese invoices, Thai receipts, and Tagalog-KYC documents. The platform serves teams in banking, insurance, legal, healthcare, and logistics that handle high volumes of structured and unstructured documents and require reliable, context-aware automation.
Unlike generic global document AI tools, DocumentLens is built from the ground up with training data drawn from thousands of localized document types. Its models incorporate linguistic nuance, layout complexity, and regional regulatory requirements, enabling accurate interpretation where standard OCR and NLP systems fall short. The solution supports both cloud-based workflows and enterprise API integrations, with security and scalability as foundational design principles.
Key Takeaways
- Detects image-level tampering and forgery in scanned documents, providing tamper probability scores and heatmap visualizations
- Compares two versions of a document to highlight insertions, deletions, and modifications—even across mixed-language or complex-layout documents
- Performs schema-first, context-aware data extraction tailored to Southeast Asian document structures (e.g., local naming conventions, form fields, stamps, and watermarks)
- Supports true multilingual OCR for Vietnamese, Thai, Bahasa, Tagalog, Malay, Hindi, and other regional languages—with awareness of cultural context and formatting norms
- Extracts structured data from tables, charts, checkboxes, multi-column layouts, and official seals with high fidelity
- Removes background noise and watermarks to improve downstream extraction accuracy
- Integrates natively with enterprise systems via secure, scalable APIs for high-volume document processing
How DocumentLens Works
DocumentLens operates through four primary functional modules: document extraction, version comparison, image forgery detection, and parsing. Users begin by uploading a document (PDF, JPG, or PNG), selecting the primary language, and optionally specifying a custom schema for targeted field extraction. The system applies language-specific OCR combined with layout understanding and semantic modeling to extract key-value pairs, structured records, and contextual relationships between fields.
For version comparison, users upload two documents (e.g., draft and final contract), and the AI aligns content semantically—not just textually—to detect changes across heterogeneous formats and languages. Forgery detection analyzes pixel-level inconsistencies, lighting anomalies, and copy-move artifacts in document images, outputting both quantitative risk scores and region-specific heatmaps. All workflows follow a consistent three-step pattern: upload → configure (language/schema/parameters) → review structured output or visual diff.
Core Benefits and Applications
DocumentLens delivers measurable value in regulated, document-intensive industries across Southeast Asia. In banking and finance, it accelerates KYC verification by detecting forged identity documents and extracting data from regional bank statements and tax forms. Insurance providers use it to automate claims processing by validating submitted receipts and comparing amended policy documents. Legal teams reduce manual review time by precisely identifying contractual changes across revisions. Healthcare organizations digitize patient records while preserving local terminology and form structure. Logistics firms automate bills of lading and customs documentation with support for multilingual shipping labels and regulatory forms. Across all use cases, the platform reduces reliance on rule-based templates and improves accuracy without requiring extensive fine-tuning.