Documentation Index
Fetch the complete documentation index at: https://docs.zylon.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
The Zylon AI inferencing engine is the core component that runs artificial intelligence models on your hardware. To ensure optimal performance and prevent startup failures, you must configure the system with the correct preset based on your available GPU (Graphics Processing Unit) memory.What are AI Presets?
AI presets are pre-configured settings that optimize the AI models and memory allocation for your specific hardware setup. Each preset is carefully tuned to:- Load the appropriate AI model size for your GPU/RAM memory
- Allocate memory efficiently to prevent crashes
- Balance performance with available resources
- Enable specific capabilities when needed
Understanding GPU Memory Requirements
Your GPU (Graphics Processing Unit) has a specific amount of VRAM (Video Random Access Memory) that determines which AI models can run effectively. AI models require substantial memory to operate, and larger models with better capabilities need more VRAM.How to Check Your GPU Memory
You can verify your GPU memory using:- Command line: Run
nvidia-smicommand - Hardware documentation: Refer to your GPU manufacturer specifications
nvidia-smi will show your GPU model and total memory capacity.
Quick Start Guide
Step 1: Identify Your GPU Memory
Run the following command to check your available GPU memory:Step 2: Select the Appropriate Preset
Based on your GPU memory, choose the matching preset:| GPU Memory | Preset to Use | Example Hardware |
|---|---|---|
| 32GB | baseline-32g | RTX 5090 |
| 48GB | baseline-48g | RTX A6000, A40, L40 |
| 96GB | baseline-96g | A100 80GB, H100 |
Step 3: Configure Your System
Edit your Zylon configuration file at/etc/config/zylon-config.yaml:
Step 4: Apply Configuration
After modifying the configuration file, restart the Zylon services to apply changes:Step 5: Verify Installation
Check that the inference engine started successfully:What’s Next?
- Explore available presets to find specialized configurations
- Learn about configuration options for multi-GPU and capabilities
- Dive into advanced customization for custom models
- Troubleshoot common issues in the troubleshooting guide