Cloudera AI January Release

We’re excited to announce the latest release of Cloudera AI. This update introduces Production-Grade App Serving (Technical Preview), elevating the Cloudera AI Inference service beyond simple model serving.

This new capability provides a unified environment where your custom applications and agents can live, run, and scale directly alongside your model endpoints.

Key Features

1. Robust AI Inference and Serving

Production-Grade App Serving (Tech Preview): Host applications and agents directly within the Inference service. Apps now scale dynamically with model endpoints for a unified AI architecture.
Extended Model Support: Direct deployment for XGBoost, PyTorch, and TensorFlow models straight from the Cloudera AI Registry.
Advanced vLLM Task Specification: Manually specify model tasks (e.g., EMBED, RANK, or CLASSIFICATION) via the API during deployment to unlock broader vLLM architecture support.
Guaranteed Compute (AWS): Support for AWS On-Demand Capacity Reservations and capacity blocks ensures compute availability for critical workloads.
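
To make the vLLM task specification item above more concrete, here is a minimal sketch of how a client might validate a task name before sending a deployment request. It is only an illustration: the function name, the payload keys (`model_id`, `runtime`, `task`), and the model ID are hypothetical placeholders, not the documented Cloudera AI Inference API schema, and the task set contains only the examples named in this post.

```python
import json

# Task names quoted as examples in this post; the real API may accept others.
EXAMPLE_TASKS = {"EMBED", "RANK", "CLASSIFICATION"}


def build_deploy_request(model_id: str, task: str) -> dict:
    """Assemble a hypothetical deployment payload with an explicit vLLM task."""
    normalized = task.strip().upper()
    if normalized not in EXAMPLE_TASKS:
        raise ValueError(
            f"unknown task {task!r}; expected one of {sorted(EXAMPLE_TASKS)}"
        )
    # Every key below is a placeholder, not the documented request schema.
    return {"model_id": model_id, "runtime": "vllm", "task": normalized}


if __name__ == "__main__":
    payload = build_deploy_request("my-org/my-embedding-model", "embed")
    print(json.dumps(payload, indent=2))
```

Validating the task on the client side is a design choice for the sketch; consult the Release Notes and API documentation for the actual field names and accepted values.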

2. Accelerated AI Workbench Performance

Enhanced UI: Faster page load times due to memory and query optimizations across projects and jobs.
Auto-Rollback Upgrades: Safer, “one-click” side-by-side upgrades for the workbench with built-in automatic rollback support in case of failure.

3. Enhanced AI Registry and Catalog

Deep Lineage Tracking: The Registry now displays structured metadata (provider, model ID, checksum) for all models imported from Hugging Face and NVIDIA NGC.
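
One way to put the recorded checksum to use is to compare it against a locally downloaded copy of the model artifact. The sketch below assumes the checksum is a hex-encoded SHA-256 digest; this post does not state which algorithm the Registry uses, so treat that as an assumption. The helper names are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes (EOF).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_recorded_checksum(path: Path, recorded: str) -> bool:
    """Compare a local file against a checksum copied from the Registry.

    Assumes the recorded value is a hex SHA-256 digest (not confirmed here).
    """
    return sha256_of(path) == recorded.strip().lower()
```

Reading in chunks keeps memory use flat even for multi-gigabyte model files.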

Platform Updates

Unified Administration: Consistent admin experience across the Workbench, Registry, and Inference services.
Latest Kubernetes Support: Official support added for Amazon EKS 1.33 and Azure AKS 1.33.

Public Links

Release Notes: Review the complete list of changes.
Cloudera AI Portfolio: Deep dive into the Cloudera AI Inference Service and AI Workbench.
Press Release: Learn more about Cloudera AI Inference on-premises (GA).

As always, check out the full documentation for Cloudera AI.

If you would like a deeper dive, hands-on experience, or demos, or are interested in speaking with me further about Cloudera AI, please reach out to schedule a discussion.

Steven Matison