Trino SQL Engine in Cloudera Data Warehouse (CDW) Now Generally Available (GA) for On-Premises

We’re excited to announce the general availability of the Trino SQL Engine in Cloudera Data Warehouse for On-Premises environments (Data Services 1.5.5 SP2).

This release brings a fully managed, containerized Trino experience to your data estate. It delivers a high-performance, distributed SQL query engine wrapped in an enterprise-grade Virtual Warehouse, eliminating the complexity of manual configuration while offering auto-scaling, auto-suspend, and seamless Cloudera integration.


Key Benefits

  • Seamless Data Federation: Connect to 30+ data sources to analyze data where it lives without complex ETL.
  • Elastic Scalability & Cost Control: Automatically scale workers based on load and utilize auto-suspend to eliminate idle compute costs.
  • Unified Security: Full integration with Apache Ranger ensures rigorous data protection policies.
  • Simplified Operations: New UI for connector management and built-in diagnostic tools.

Feature Highlights

1. Universal Connectivity & Storage

  • Certified Enterprise Connectors: Fully supported for Oracle, Snowflake, AWS Redshift, PostgreSQL, MySQL, Hive (HMS), Iceberg, and MariaDB.
  • Expansive Ecosystem: Support for Kafka, MongoDB, Delta Lake, Google BigQuery, Druid, and MS SQL Server.
  • Teradata Connector: Now available in Tech Preview.
  • Connector Management UI: User-friendly interface to create, configure, and test connectors before deployment.
  • Hybrid Storage: Full read/write to Ozone (OFS protocol) with local SSD caching for maximum performance.
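To make the federation idea concrete: Trino addresses every table by a three-part name (catalog.schema.table), so a single statement can join tables that live in different systems. The sketch below composes such a query in Python. The catalog, schema, table, and column names are hypothetical placeholders, not names that ship with CDW; substitute whatever catalogs your connectors define.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a Trino three-part table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def federated_join_sql(orders: str, customers: str) -> str:
    """Compose one SQL statement that joins tables from two catalogs."""
    return (
        "SELECT c.region, sum(o.amount) AS revenue\n"
        f"FROM {orders} AS o\n"
        f"JOIN {customers} AS c ON o.customer_id = c.customer_id\n"
        "GROUP BY c.region"
    )


# Hypothetical names: one table behind a PostgreSQL catalog,
# one behind a Snowflake catalog.
ORDERS = qualified("postgresql", "public", "orders")
CUSTOMERS = qualified("snowflake", "sales", "customers")
FEDERATED_SQL = federated_join_sql(ORDERS, CUSTOMERS)

if __name__ == "__main__":
    print(FEDERATED_SQL)
```

The resulting statement can be pasted into any SQL client connected to the Virtual Warehouse; the join runs in Trino without first copying either table into the other system.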

2. Automated Lifecycle Management

  • Virtual Warehouses: Easily manage the lifecycle of Trino coordinators and workers.
  • Intelligent Auto-Scaling: Custom scaling parameters (min/max workers) to handle peak loads.
  • Graceful Shutdown: Auto-suspend, available for “T-shirt size” warehouses (XS to L), releases resources without interrupting running queries.
  • Disaster Recovery: Full Backup and Restore support for virtual warehouses and connectors.
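As a rough mental model of how min/max worker bounds shape auto-scaling, the toy function below clamps a demand-based worker count to the configured limits. This is an illustrative sketch only: the post does not describe the signal CDW actually uses to scale, and `queries_per_worker`, `ScalingLimits`, and `target_workers` are hypothetical names invented for this example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingLimits:
    """User-configured bounds on the number of Trino workers."""
    min_workers: int
    max_workers: int


def target_workers(active_queries: int, queries_per_worker: int,
                   limits: ScalingLimits) -> int:
    """Return a worker count sized to the load, clamped to the limits.

    Assumption: load is approximated by the number of active queries,
    and each worker comfortably handles `queries_per_worker` of them.
    """
    if queries_per_worker <= 0:
        raise ValueError("queries_per_worker must be positive")
    wanted = math.ceil(active_queries / queries_per_worker)
    return max(limits.min_workers, min(limits.max_workers, wanted))


if __name__ == "__main__":
    limits = ScalingLimits(min_workers=2, max_workers=10)
    for load in (0, 17, 100):
        print(load, "->", target_workers(load, 4, limits))
```

With limits of 2 and 10, an idle warehouse stays at the minimum, a moderate load lands somewhere in between, and a burst is capped at the maximum, which is the behavior the min/max parameters are meant to guarantee.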

3. Enterprise Security & Governance

  • Fine-Grained Access: Ranger integration for authorization, dynamic column masking, and row filtering.
  • Secure Connectivity: Support for LDAP authentication and a dedicated Secrets Management system for external credentials.

4. Empowering All Users

  • Talk-to-Your-Data: AI SQL Assistance allows users to generate optimized Trino SQL from natural language.
  • Analyst Ready: Full Trino syntax support and autocomplete within the Hue SQL Editor.
  • Developer Friendly: Includes CDW CLI and a built-in Coordinator Web UI for deep-dive monitoring.
  • Visual Insights: Native integration with Cloudera Data Visualization.

Primary Use Cases

  • Federated Data Access: Connect to external sources like Snowflake and PostgreSQL and query them in place, without copying the data.
  • Interactive Reporting: Use distributed SQL and SSD caching to power low-latency dashboards.
  • Cost-Efficient Ad Hoc Analytics: Spin up “right-sized” warehouses that automatically shut down after analysis is complete.

As always, check out the full documentation for Cloudera Data Warehouse.

If you would like a deeper dive, hands-on experience, or demos, or are interested in speaking with me further about Cloudera, please reach out to schedule a discussion.