Deployment Overview

NeMo Guardrails can be deployed in various environments depending on your requirements. This guide provides an overview of the available deployment options.

Deployment Options

You can deploy NeMo Guardrails in the following ways:

Local Server

The quickest way to get started is running the server locally on your development machine.

Install NeMo Guardrails

Install the package using pip:

pip install nemoguardrails

Start the Server

Run the server with your configuration:

nemoguardrails server --config=/path/to/config

By default, the server runs on port 8000.

Test the Deployment

Test your deployment with a simple curl command:

curl -X POST http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

The local server is ideal for development and testing but not recommended for production use.

Docker Container

Docker provides a consistent, isolated environment for deploying NeMo Guardrails. Benefits:

Consistent environment across development and production
Easy dependency management
Simplified deployment and scaling
Isolation from host system

See the Docker Deployment guide for detailed instructions.

Production Deployment

For production workloads, you’ll need additional considerations:

Scalability: Load balancing and horizontal scaling
Security: API authentication, TLS/SSL, secrets management
Monitoring: Logging, metrics, and observability
High Availability: Redundancy and failover mechanisms

See the Production Deployment guide for best practices.

Choosing a Deployment Method

Use this guide to select the appropriate deployment method:

Deployment Type	Use Case	Complexity
Local Server	Development, testing, demos	Low
Docker	Staging, consistent environments	Medium
Production	Production workloads, enterprise	High

Server Configuration

The NeMo Guardrails server supports several configuration options:

nemoguardrails server \
  --config=/path/to/config \
  --port=8000 \
  --verbose

Common Options:

--config: Path to your guardrails configuration directory
--port: Port to run the server on (default: 8000)
--verbose: Enable verbose logging for debugging

API Endpoints

The server exposes the following main endpoints: Chat completions endpoint compatible with OpenAI API format. Retrieve the current rails configuration.

Documentation Index

​Deployment Overview

​Deployment Options

​Local Server

​Docker Container

​Production Deployment

​Choosing a Deployment Method

​Server Configuration

​API Endpoints

​Next Steps

Docker Deployment

Production Deployment

Deployment Overview

Deployment Options

Local Server

Docker Container

Production Deployment

Choosing a Deployment Method

Server Configuration

API Endpoints

Next Steps