Documentation v2.3 · Updated May 2026

Build private AI. Own everything.

The complete reference for fine-tuning, deploying, and scaling domain-specific AI models — without giving your data to anyone.

Quick Start API Reference Python SDK LoRA Fine-tuning

100+

Base Models

Llama · Mistral · Qwen · more

< 38 min

First Model Trained

LoRA on A10G, 20K pairs

99.9%

API Uptime

SLA-backed inference

100%

Data Private

Zero egress · Air-gap ready

Get running fast

From data to deployed model in 4 steps.

Full guide

terminal — step 01

Install the SDK

$ pip install langtrain

Collecting langtrain...

Successfully installed langtrain-2.3.0

$ langtrain --version

Langtrain v2.3.0 · Python 3.11 · CUDA 12.1

All data stays in your environment · Zero egress

Popular guides

Where most engineers start.

IntroductionStart here

Understand how Langtrain works and why private fine-tuning matters.

Read guide

Quick Start10 min

Go from zero to a running fine-tuned endpoint in under 10 minutes.

Read guide

LoRA & QLoRATraining

Parameter-efficient training on any GPU — laptop to A100.

Read guide

API ReferenceReference

OpenAI-compatible REST API — chat completions, embeddings, and more.

Read guide

Cloud DeploymentDeploy

Deploy to AWS, GCP, or Azure private VPC in one command.

Read guide

Python SDKSDK

Manage training jobs, call inference, and monitor models from Python.

Read guide

Integration options

Connect from any language or tool.

View Python SDK docs

pip install langtrain

Python SDK

from langtrain import Langtrain

client = Langtrain(api_key='lt_...')

resp = client.chat(

model="cs-agent-v1",

messages=[{"role": "user",

"content": "Hello"}]

)

print(resp.choices[0].message.content)

Full-featured SDK for training, inference, and monitoring.Docs

All documentation

Browse by topic.

Training

LoRA, QLoRA, full fine-tune — pick your method and GPU budget.

LoRA & QLoRA fine-tuning Full fine-tuning Data formats & ingestion Training configuration

Deployment

Push to AWS, GCP, Azure private VPC or your own hardware.

Cloud (AWS / GCP / Azure)On-premise server Air-gap setup Docker & GGUF export

SDKs & APIs

Python, Node.js, and a fully OpenAI-compatible REST interface.

Python SDK Node.js SDK REST API reference OpenAI-compatible API

Agents

Build RAG pipelines, tool-calling agents, and multi-step workflows.

Agent builder overview RAG & knowledge retrieval Tool calling Multi-agent workflows

Monitoring

Track latency, hallucination rate, and model drift in production.

LangVision overview Latency & throughput Drift detection Audit logs

Security

HIPAA, SOC 2, GDPR — compliance built in, not bolted on.

HIPAA compliance guide SOC 2 Type II Encryption at rest & transit GDPR & data residency

Free to start · No credit card required

Ready to train your first model?

Your model. Your data. Your weights. Running in under 38 minutes.

Follow the Quick Start Open Dashboard

Documentation v2.3 · Updated May 2026

Build private AI. Own everything.

The complete reference for fine-tuning, deploying, and scaling domain-specific AI models — without giving your data to anyone.

Quick Start API Reference Python SDK LoRA Fine-tuning

100+

Base Models

Llama · Mistral · Qwen · more

< 38 min

First Model Trained

LoRA on A10G, 20K pairs

99.9%

API Uptime

SLA-backed inference

100%

Data Private

Zero egress · Air-gap ready

Get running fast

From data to deployed model in 4 steps.

Full guide

terminal — step 01

Install the SDK

$ pip install langtrain

Collecting langtrain...

Successfully installed langtrain-2.3.0

$ langtrain --version

Langtrain v2.3.0 · Python 3.11 · CUDA 12.1

All data stays in your environment · Zero egress

Popular guides

Where most engineers start.

IntroductionStart here

Understand how Langtrain works and why private fine-tuning matters.

Read guide

Quick Start10 min

Go from zero to a running fine-tuned endpoint in under 10 minutes.

Read guide

LoRA & QLoRATraining

Parameter-efficient training on any GPU — laptop to A100.

Read guide

API ReferenceReference

OpenAI-compatible REST API — chat completions, embeddings, and more.

Read guide

Cloud DeploymentDeploy

Deploy to AWS, GCP, or Azure private VPC in one command.

Read guide

Python SDKSDK

Manage training jobs, call inference, and monitor models from Python.

Read guide

Integration options

Connect from any language or tool.

View Python SDK docs

pip install langtrain

Python SDK

from langtrain import Langtrain

client = Langtrain(api_key='lt_...')

resp = client.chat(

model="cs-agent-v1",

messages=[{"role": "user",

"content": "Hello"}]

)

print(resp.choices[0].message.content)

Full-featured SDK for training, inference, and monitoring.Docs

All documentation

Browse by topic.

Training

LoRA, QLoRA, full fine-tune — pick your method and GPU budget.

LoRA & QLoRA fine-tuning Full fine-tuning Data formats & ingestion Training configuration

Deployment

Push to AWS, GCP, Azure private VPC or your own hardware.

Cloud (AWS / GCP / Azure)On-premise server Air-gap setup Docker & GGUF export

SDKs & APIs

Python, Node.js, and a fully OpenAI-compatible REST interface.

Python SDK Node.js SDK REST API reference OpenAI-compatible API

Agents

Build RAG pipelines, tool-calling agents, and multi-step workflows.

Agent builder overview RAG & knowledge retrieval Tool calling Multi-agent workflows

Monitoring

Track latency, hallucination rate, and model drift in production.

LangVision overview Latency & throughput Drift detection Audit logs

Security

HIPAA, SOC 2, GDPR — compliance built in, not bolted on.

HIPAA compliance guide SOC 2 Type II Encryption at rest & transit GDPR & data residency

Free to start · No credit card required

Ready to train your first model?

Your model. Your data. Your weights. Running in under 38 minutes.

Follow the Quick Start Open Dashboard