Performance Tuning
Benchmark and Profiling
Measuring Model Accuracy in SGLang