Ship
intelligence.
Docs in. Agent deployed.
import { Agent, RAG } from '@schift-io/sdk' const rag = new RAG({ bucket: 'support-docs', topK: 5 }) export const agent = new Agent({ name: 'support-bot', instructions: 'Answer questions from the knowledge base.', rag, tools: [searchDocs, collectLead], model: 'gpt-4.1-nano', }) const result = await agent.run('How do I reset my password?') console.log(result.output)
TypeScript | RAG | Tools | Any LLM | Open Source
97%+
retrieval accuracy across BEIR benchmarks
<300us
p50 search latency at 1M vectors
5min
to first agent deploy
3
templates ready to use
Available today
Everything an agent needs. Nothing it doesn't.
Five primitives. Zero boilerplate. Your agent ships with RAG, tools, and memory out of the box.
Agent Runtime
ReAct loop with automatic tool dispatch. Define instructions, register tools, call agent.run(). Schift handles the rest.
import { Agent } from '@schift-io/sdk' const agent = new Agent({ name: 'support-bot', instructions: 'Answer from the knowledge base.', tools: [searchDocs], model: 'gpt-4.1-nano', }) const result = await agent.run('How do I reset?')
Built-in RAG
Upload docs, search by meaning, get answers with citations. OCR, chunking, embedding, and reranking — all managed.
import { RAG } from '@schift-io/sdk' const rag = new RAG({ bucket: 'support-docs', topK: 5, }) await rag.upload(['manual.pdf', 'faq.docx']) const results = await rag.search('password reset')
Tool Calling
Register any async function as a tool. Type-safe parameters, automatic validation, error recovery. Works with OpenAI and Anthropic formats.
import type { AgentTool } from '@schift-io/sdk' export const collectLead: AgentTool = { name: 'collect_lead', description: 'Collect visitor contact info', parameters: { type: 'object', properties: { email: { type: 'string' } }, }, handler: async ({ email }) => saveLead(email), }
Any LLM, One Endpoint
GPT, Claude, Gemini — switch models without changing code. OpenAI-compatible routing with automatic fallback.
const agent = new Agent({ model: 'gpt-4.1-nano', // swap to 'claude-sonnet-4-6' // or 'gemini-2.5-flash' // — zero code change needed }) // Automatic fallback if primary model fails const result = await agent.run('Summarize this contract')
Start from a template. Ship faster.
Production-ready agent templates. Pick one, customize, deploy.
Customer Support
RAG-powered support agent with lead collection.
- FAQ answering
- Source citations
- Lead capture
Legal Q&A
Contract analysis and legal document search.
- Clause extraction
- Case law search
- Korean law support
Research Assistant
Paper Q&A with cross-reference citations.
- PDF ingestion
- Cross-paper synthesis
- Citation tracking
Scaffold. Build. Deploy.
Scaffold your agent
npx create-schift — pick a template, enter your API key. You get a working agent project in seconds.
$ npx create-schift@latest Upload your data
Push documents via SDK or dashboard. Schift handles OCR, chunking, and embedding. Your agent can search them instantly.
await rag.upload(['manual.pdf', 'faq.docx']) Deploy to production
npm run dev for local testing. schift deploy when ready. Your agent is live with built-in monitoring and logging.
$ schift deploy Code. Or Cloud.
Write your agent for full control, or use Schift Cloud to skip the infrastructure.
Schift SDK
npx create-schift, connect your data, deploy your agent. TypeScript-first, open source.
Schift Cloud
Managed RAG pipeline, embedding, vector search, LLM routing, observability. Zero infrastructure.
Agents for every use case.
Live Demo
Try it yourself
Ask our AI agent anything about Schift. No signup required.
Hi! Ask me anything about Schift — pricing, features, or how to get started.
Try asking
2.3x faster than FAISS.
3.3x better tail latency.
Same HNSW algorithm, same machine, same vectors. Schift SQ8 delivers sub-300us p50 with 3.3x tighter p99 than FAISS — at 4x less memory, with persistence, knowledge graphs, and compaction included.
3.3x better tail latency, 2.3x higher throughput
Schift SQ8 p99 is 502us vs FAISS 1,653us at 1M vectors, using 4x less memory. Unlike pure in-memory libraries, Schift includes disk persistence, WAL crash recovery, knowledge graph search, and live compaction.
74K
vectors/sec projection
4x
memory savings (SQ8)
0
re-embedding needed
Apple Silicon, single-thread search, same machine. HNSW M=32, efConstruction=200, efSearch=50. Full benchmark source available on GitHub.
Pricing
Simple. Transparent.
Subscription-first plans with included quota, credits for overage, and dedicated instances for larger teams.
Metric
Price
Deploy your first agent
in 5 minutes.
Free tier included. No credit card required.
Need help getting started?