KerdosInfrasoft
Building Tomorrow

Now Live — REST API + Gradio Demo on Hugging Face

Your Documents. Your AI. Your Privacy.

Upload your company documents and ask questions. Kerdos AI answers strictly from your data — powered by LLaMA 3.1, FAISS vector search, and enterprise-grade RAG. Your data never leaves your environment.

Free demo, no sign-up · Files processed in-memory only · Open REST API
Live Demo

Try It Right Now

Use the embedded Gradio UI or walk through the REST API yourself — step by step.

huggingface.co/spaces/kerdosdotio/Custom-LLM-Chat

Demo requires a Hugging Face API token with write access. For a fully private deployment, contact us.

Capabilities

Everything You Need

Built for enterprise document intelligence — not just a chatbot.

PDF · DOCX · TXT · CSV
Multi-format Ingestion

Upload PDF, DOCX, TXT, MD, and CSV files — all parsed and indexed automatically.

Grounded Only
Strictly Grounded

Answers are generated only from your uploaded documents, not from the model's general or internet-derived knowledge.

Bulk Upload
Multi-document

Upload and query across multiple files simultaneously with a unified index.

Context-aware
Multi-turn Chat

Maintains full conversation context across questions — natural dialogue, not one-shot Q&A.

CPU-optimised
Fast & Efficient

CPU-friendly embeddings with all-MiniLM-L6-v2 + FAISS. Runs without expensive GPU hardware.

Zero Persistence
Session-only Privacy

Files are processed in-memory and never stored after your session ends.

How It Works

RAG Pipeline Architecture

A retrieval-augmented generation pipeline that grounds every answer in your documents.

Upload Files: PDF / DOCX / TXT
Parse & Chunk: 512 chars, 64 overlap
Embed: all-MiniLM-L6-v2
FAISS Index: In-memory vector store
Similarity Search: Top-K retrieval
LLaMA 3.1 8B: Grounded answer

1. Ingest

Upload documents → parsed & chunked into 512-char segments with 64-char overlap

2. Embed

Chunks are embedded using all-MiniLM-L6-v2 and stored in a FAISS in-memory index

3. Retrieve

Your question is embedded → Top-K most relevant chunks fetched via cosine similarity

4. Generate

LLaMA 3.1 8B receives only the retrieved chunks and generates a grounded, cited answer
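
The four steps above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the Kerdos AI implementation: the 512-character chunks and 64-character overlap come from the page, but the function names, the prompt wording, and the brute-force cosine search (a stand-in for the FAISS index) are hypothetical. The embedding step is left out, since it needs the all-MiniLM-L6-v2 model; `top_k` simply accepts vectors that have already been computed.

```python
import math

CHUNK_SIZE = 512  # characters per chunk, as stated on the page
OVERLAP = 64      # characters shared between neighbouring chunks


def chunk_text(text, size=CHUNK_SIZE, overlap=OVERLAP):
    """Split text into fixed-size character windows that overlap."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap  # 448 with the defaults
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # this window already reached the end of the text
    return chunks


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, chunk_vecs, k=3):
    """Return (index, score) pairs for the k most similar chunk vectors.

    Brute-force search; a stand-in for the FAISS index named on the page.
    """
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(chunk_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def build_prompt(question, retrieved_chunks):
    """Hypothetical prompt layout: numbered context chunks, then the question."""
    context = "\n\n".join(f"[{n}] {c}" for n, c in enumerate(retrieved_chunks, 1))
    return (
        "Answer using only the context below. Cite chunk numbers.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```

With the default settings each new chunk starts 448 characters after the previous one, so a 1,000-character document yields three chunks. Numbering the chunks in the prompt is one simple way to let the model cite where an answer came from.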

Enterprise Use Cases

One API. Every Industry. Any Document.

The Kerdos AI RAG API integrates into your existing workflows in under an hour. Upload your proprietary documents and get hallucination-free, grounded answers — privately, at scale.

Clinical & Regulatory

Healthcare & Pharmaceuticals

Instant answers from clinical trial documents and drug monographs

Challenge: Clinical teams spend hours searching regulatory submissions, pharmacovigilance reports, and RCT data.
Solution: Index clinical trial documents and drug monographs once. Query them in milliseconds with natural language — strictly grounded in your own data.

What to upload

Clinical trial PDFs · Drug monographs · Pharmacovigilance reports · CDSCO / FDA filings

POST /sessions/{id}/chat

What are the contraindications for Drug X in diabetic patients?

Contract Intelligence

Legal & Compliance

AI-powered contract review with zero data leakage

Challenge: Reviewing hundreds of pages of contracts, NDAs, and regulatory filings is slow and error-prone.
Solution: Upload contracts and compliance documents. Get instant, citation-grounded answers on obligations, risk clauses, and deadlines. Your data never leaves your environment.

What to upload

NDAs and contracts · Compliance filings · Court orders · Regulatory notices

POST /sessions/{id}/chat

Summarise the indemnity clause in Schedule 3

BFSI

Banking, Financial Services & Insurance

Query RBI circulars, Basel III frameworks, and internal audit reports

Challenge: Internal teams need quick access to policy documents, investment frameworks, and regulatory circulars — accurately.
Solution: A private, grounded LLM that answers only from your internal documents. No hallucinations. No external API leakage. Full audit trail for compliance.

What to upload

RBI circulars · Basel III framework docs · Audit reports · Investment policy statements

POST /sessions/{id}/chat

What is the capital adequacy ratio threshold per this policy?

Industrial Operations

Manufacturing, EPC & Infrastructure

Field engineers query SOPs and safety manuals in natural language

Challenge: Field engineers need operational manuals, safety SOPs, and equipment specs on demand — often on-site without reliable internet.
Solution: Deploy the RAG API on-premise. Engineers query manuals using plain English — even in air-gapped environments — and get step-by-step, grounded answers.

What to upload

Equipment SOPs · Safety manuals · Engineering specs · Maintenance runbooks

POST /sessions/{id}/chat

What is the emergency shutdown procedure for Boiler Unit 4?

GovTech

Government & Public Sector

Instantly search tender specifications and statutory documents

Challenge: Government departments manage large volumes of statutory documents, tender specifications, and RTI responses that are slow to navigate.
Solution: Index tender documents and policy circulars. Enable officers to query eligibility conditions, submission procedures, and compliance requirements in seconds.

What to upload

Tender specifications · Policy circulars · RTI responses · Budget documents

POST /sessions/{id}/chat

What is the EMD amount and submission deadline for this tender?

People & Culture

HR, L&D & Organisational Knowledge

Give every employee a self-service AI assistant for HR queries

Challenge: Employees repeatedly ask the same questions on leave policies, onboarding procedures, and compliance norms — wasting HR bandwidth.
Solution: Index your employee handbook, POSH policy, payroll SOPs, and HR circulars. Deploy a private internal chatbot that answers only from approved HR documents.

What to upload

Employee handbook · POSH policy · Payroll SOPs · Training materials

POST /sessions/{id}/chat

How many earned leaves can I carry forward this year?
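
Every use case above sends its question to the same endpoint, `POST /sessions/{id}/chat`. The sketch below only builds such a request with Python's standard library and does not send it. Only the path and HTTP method come from the page; the base URL is a placeholder, and the JSON field name `question`, the session id value, and the helper name are assumptions, so check the actual API documentation for the real request schema and any authentication headers.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://example.invalid"  # placeholder host, not the real API


def build_chat_request(session_id, question, base_url=BASE_URL):
    """Build (but do not send) a POST /sessions/{id}/chat request."""
    # The "question" field name is an assumption, not taken from the page.
    body = json.dumps({"question": question}).encode("utf-8")
    safe_id = urllib.parse.quote(session_id, safe="")
    return urllib.request.Request(
        url=f"{base_url}/sessions/{safe_id}/chat",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_chat_request(
    "demo-session",  # hypothetical session id
    "How many earned leaves can I carry forward this year?",
)
```

Passing the result to `urllib.request.urlopen` would send it once `BASE_URL` points at a real deployment.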

Interested in a tailored enterprise pilot? Email partnership@kerdos.in

Tech Stack

Open-Source, Battle-Tested

LLaMA 3.1 8B Instruct · FAISS · all-MiniLM-L6-v2 · FastAPI · PyMuPDF · python-docx · HuggingFace · Sentence Transformers · Docker

Coming Soon

Embed Kerdos AI on Any Website

A lightweight JavaScript widget backed by the Kerdos AI RAG API. Add a floating document Q&A chatbot to your product, knowledge base, or internal portal with a single <script> tag.

Fully branded — your colours, your logo
Backed by your private document index via the REST API
Works on any site — Next.js, React, plain HTML
<!-- Kerdos AI Widget -->
<script
  src="https://cdn.kerdos.in/ai-widget.js"
  data-session="auto"
  data-theme="dark"
  data-primary="#0ea5e9"
>
</script>

One tag. Instant document Q&A.

Enterprise Edition

Full Power. Full Privacy. Full Control.

The demo gives you a taste. The enterprise edition gives your organisation complete data sovereignty.

Private LLM Hosting

On-premise or private-cloud deployments — your data never reaches any external API.

Custom Model Fine-tuning

Models fine-tuned on your domain data for dramatically higher accuracy on your content.

Data Privacy Guarantees

Complete isolation. Zero external data transfer. Full audit logs.

White-label Deployments

Fully branded for your organisation — your name, your logo, your product.

Seeking Investment & Strategic Partnerships

Help Us Build the Enterprise Edition

We're raising investment to build the fully customisable enterprise edition — private on-premise LLM deployments, custom fine-tuning, and white-label solutions for Indian and global enterprises.

Kerdos Infrasoft Pvt. Ltd. (CIN: U62099KA2023PTC182869) · Bengaluru, Karnataka · Est. December 2023
