Jump to contentJump to page navigation: previous page [access key p]/next page [access key n]
documentation.suse.com / Introduction to SUSE AI

Introduction to SUSE AI

Publication Date: 17 Apr 2025
WHAT?

SUSE AI is an open generative AI solution. It offers the customers the freedom to run private AI workloads both on premises, in the cloud or even in air-gapped environments. SUSE AI provides secure, auditable AI capabilities that ensure complete control over data and compliance with regulatory requirements.

WHY?

To learn more about AI and the benefits of running your private AI service inside your company or in the cloud.

EFFORT

To understand the basics and benefits of SUSE AI requires less than 30 minutes of your time.

GOAL

To make you realize that SUSE AI is the right choice to run private and secure AI workloads.

1 What are the benefits of using SUSE AI?

Running the SUSE AI service brings the following benefits:

  • Cloud AI services raise concerns about data security and accessibility. AI applications running on premises, however, increase data privacy, security and compliance with regulatory standards.

  • SUSE AI provides a user-friendly interface for managing and deploying AI models.

  • Pre-built NVIDIA drivers and operator provide increased AI performance.

  • SUSE AI components are dynamically scalable, from the Web user interface to the underlying data store.

  • SUSE AI cares about security—it uses TLS certificates for Web UI and API access. Data stored in a database are encrypted as well.

  • High level of SUSE quality control on the whole software production chain used by the AI stack.

2 What are typical scenarios for using SUSE AI?

Your private instance of SUSE AI can help you with the following tasks:

  • Implement AI-driven user interface to handle customer inquiries, providing continuous support and reducing the load on human agents.

  • Build a knowledge base where users can easily interact with an AI model to ask questions about the company policies, processes and workflows.

  • Generate reports and summaries on business performance or sales metrics with minimal manual input.

  • Automate the generation of blog posts or social media content, ensuring consistency and saving time.

  • Offer personalized product or content recommendations based on customer preferences.

  • Forecast trends, customer behaviors and market shifts, enabling more informed strategic decisions.

3 SUSE AI architecture

SUSE AI is a cloud native solution that comprises multiple software building blocks. These blocks include the Linux operating system running on bare metal or virtualized, Kubernetes cluster with a Web UI management layer, supportive tools to utilize GPU capabilities, and other containerized applications that care for monitoring and security. The SUSE Application Collection includes a collection of AI-related applications calledAI Library.

SUSE AI building blocks
SLES (https://documentation.suse.com/sles)

The underlying operating system with the optional NVIDIA driver installed. If you require an immutable operating system, SLE Micro is the recommended alternative.

SUSE Rancher Prime: RKE2 (https://docs.rke2.io/)

Kubernetes cluster managed by SUSE Rancher Prime ensuring container and application lifecycle management.

NVIDIA GPU Operator

Utilizes the NVIDIA GPU computing power and capabilities for processing AI-related tasks.

SUSE Security (https://www.suse.com/solutions/security/)

For security and compliance.

SUSE Observability (https://www.suse.com/solutions/observability/)

Provides advanced performance and data monitoring.

SUSE Storage (https://www.suse.com/solutions/storage/)

Enterprise-grade storage solution.

SUSE Virtualization (https://www.suse.com/solutions/virtualization/)

For virtualized workloads.

SUSE Multi-Linux Manager (https://www.suse.com/products/multilinux-manager/)

For managing multiple Linux distributions.

SUSE Application Collection (https://www.suse.com/products/rancher/application-collection)

As a source of Helm charts and container images for the AI Library applications.

AI Library applications
Milvus (https://milvus.io)

A vector database built for generative AI applications with minimal performance loss.

Ollama (https://ollama.com)

A platform that simplifies installation and management of large language models (LLM) on local devices.

cert-manager (https://cert-manager.io/)

An extensible X.509 certificate controller for Kubernetes workloads.

Open WebUI (https://openwebui.com)

An extensible Web user interface for the Ollama LLM runner.

Basic schema of SUSE AI
Figure 1: Basic schema of SUSE AI

4 Next steps

After you understand the basics of SUSE AI, follow these links to study hardware and software requirements and understand the installation procedure.