ServiceLabs API Platform

Enterprise-grade AI API with intelligent routing, model management, and usage analytics

View Documentation Go to Dashboard

🔗

Multi-Model Access

Access 11+ AI models through a single unified API endpoint

⚡

Smart Routing

Intelligent request routing for optimal performance and availability

📊

Usage Analytics

Real-time token tracking and usage insights across all models

🔑

API Key Management

Create and manage keys with granular access control per model

11

Models

99.9%

Uptime

<1s

Latency

24/7

Monitoring

Getting Started

ServiceLabs provides a unified OpenAI-compatible API for accessing multiple AI models. All requests use the base URL:

https://router.servicelabs.biz.id/v1

Quick Start

Send your first request in seconds. Select a tab to see the example:

            
            
            
          
curl https://router.servicelabs.biz.id/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sv2-your-api-key" \
  -d '{
    "model": "glm-5.2",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
import requests

response = requests.post(
    "https://router.servicelabs.biz.id/v1/chat/completions",
    headers={
        "Authorization": "Bearer sv2-your-api-key",
        "Content-Type": "application/json"
    },
    json={
        "model": "glm-5.2",
        "messages": [{"role": "user", "content": "Hello!"}]
    }
)
print(response.json())
const response = await fetch("https://router.servicelabs.biz.id/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer sv2-your-api-key",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    model: "glm-5.2",
    messages: [{ role: "user", content: "Hello!" }]
  })
});
const data = await response.json();
console.log(data);

Authentication

All API requests require a Bearer token in the Authorization header. API keys start with sv2-.

Authorization: Bearer sv2-your-api-key

You can create and manage API keys from the Dashboard.

Endpoints

GET /v1/models

List all available models. Returns an OpenAI-compatible model list.

// Response
{
  "object": "list",
  "data": [
    { "id": "glm-5.2", "object": "model", "owned_by": "servicelabs" },
    { "id": "minimax-m3", "object": "model", "owned_by": "servicelabs" },
    ...
  ]
}

POST /v1/chat/completions

Create a chat completion. Supports all standard OpenAI parameters.

// Request Body
{
  "model": "glm-5.2",
  "messages": [
    { "role": "user", "content": "Hello!" }
  ],
  "temperature": 0.7,
  "max_tokens": 1024
}

// Response (OpenAI format)
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "model": "glm-5.2",
  "choices": [{
    "index": 0,
    "message": { "role": "assistant", "content": "Hello! How can I help?" },
    "finish_reason": "stop"
  }],
  "usage": { "prompt_tokens": 8, "completion_tokens": 9, "total_tokens": 17 }
}

POST /v1/chat/completions (Streaming)

Set "stream": true to receive Server-Sent Events (SSE). Each chunk is a data: line with a partial delta.

// Request
{
  "model": "glm-5.2",
  "messages": [{ "role": "user", "content": "Write a poem" }],
  "stream": true
}

// Response (SSE stream)
data: {"choices":[{"delta":{"content":"Roses"}}]}
data: {"choices":[{"delta":{"content":" are"}}]}
data: {"choices":[{"delta":{"content":" red"}}]}
data: [DONE]

Available Models

Model ID	Description	Category
glm-5.2	Advanced language model for general tasks	Language
minimax-m3	High-performance multi-modal model	Language
kimi-k2.7-code	Specialized code generation model	Code
qwen3.7-plus	Enhanced reasoning and analysis	Reasoning
deepseek-v4-pro	Professional-grade deep reasoning	Reasoning
glm-5.2-cf	Fast edge-optimized language model	Language
kimi-k2.6-cf	Lightweight code assistant	Code
kimi-k2.7-code-cf	Edge-optimized code model	Code
mimo-v2.5-pro	Advanced visual understanding	Vision
mimo-v2.5	Efficient visual analysis	Vision
qwen3.7-max	Maximum capability language model	Language

Rate Limits

Rate limits are applied per API key. Limits include:

Token Limit — Maximum total tokens the key can consume (configurable per key)
Credit Limit — Maximum credits the key can use (configurable per key)
Rate Limiting — Requests are throttled to prevent abuse

When a limit is reached, the API returns a 429 Too Many Requests response. Use the dashboard to adjust limits or reset usage.

Error Codes

Code	Meaning
401	Invalid or missing API key
403	Model not allowed for this key
429	Rate limit exceeded
502	Upstream service error

Available Models

All models are accessible through the unified ServiceLabs API.

Simple, Transparent Pricing

Start free. Scale when you need to.

Free

For getting started

$0/mo

100K tokens per day
All models included
Basic support
Community access

Get Started

Pro

For production workloads

Custom

Custom token limits
Priority support
Advanced analytics
Multiple API keys

Get Started

Enterprise

For large-scale deployments

Custom

Unlimited tokens
Dedicated support
SLA guarantee
Custom integrations

Contact Sales

⚡ Admin Login

Create API Key

ServiceLabs API Platform

Multi-Model Access

Smart Routing

Usage Analytics

API Key Management

Getting Started

Quick Start

Authentication

Endpoints

GET /v1/models

POST /v1/chat/completions

POST /v1/chat/completions (Streaming)

Available Models

Rate Limits

Error Codes

Available Models

Simple, Transparent Pricing

Free

Pro

Enterprise