SUSE AI 1.0
- WHAT?
This document focuses on techniques for gathering telemetry data from all SUSE AI components, including metrics, logs and traces.
- WHY?
To observe, analyze and maintain the behavior, performance and health of your SUSE AI environment, and to troubleshoot issues effectively.
- EFFORT
Setting up the recommended monitoring configurations with SUSE Observability is straightforward. Advanced setups for more granular control may require additional time for specialized analysis and fine-tuning.
- GOAL
To visualize the complete topology of your services and operations, providing deep insights and clarity into your SUSE AI environment.
Publication Date: 2026-03-26
- 1 Introduction
- 2 Monitoring GPU usage
- 3 Monitoring Open WebUI
- 4 Monitoring Milvus
- 5 Monitoring OpenSearch
- 6 Monitoring vLLM
- 7 Monitoring user-managed applications
- 8 Monitoring with the OpenTelemetry Operator
- A Cost estimation
- B Instrument applications with OpenLIT SDK
- Glossary
- C Copyright
- D GNU Free Documentation License
- D1 0. PREAMBLE
- D2 1. APPLICABILITY AND DEFINITIONS
- D3 2. VERBATIM COPYING
- D4 3. COPYING IN QUANTITY
- D5 4. MODIFICATIONS
- D6 5. COMBINING DOCUMENTS
- D7 6. COLLECTIONS OF DOCUMENTS
- D8 7. AGGREGATION WITH INDEPENDENT WORKS
- D9 8. TRANSLATION
- D10 9. TERMINATION
- D11 1. FUTURE REVISIONS OF THIS LICENSE
- D12 ADDENDUM: How to use this License for your documents
List of Figures
List of Examples