Introduction

How Unravel works

Unravel connects to your data platforms as a read-first observer, then applies optimizations through tightly controlled, validated actions — with you in control of how much autonomy to grant at each step.

Deployment model

Unravel runs as a SaaS-hosted control plane. There's nothing to install in your production environment. A lightweight connector per platform handles telemetry collection and action execution. Your data never leaves your infrastructure.

Databricks

REST API + DBFS connector

Snowflake

Native app + INFORMATION_SCHEMA

BigQuery

IAM-scoped service account

Cloudera

On-prem agent + CDP API

Architecture overview

Key concepts

⚙

Arvix

Unravel's autonomous optimization engine. Identifies inefficiencies, generates validated fixes, and applies them — with configurable automation levels.

🔄

AutoApply vs Human-in-Loop

Set per workload. AutoApply handles routine, low-risk actions continuously. Human-in-Loop queues recommendations for engineer approval before execution.

📊

Efficiency Rating

A composite 0–100 score across Cost, Performance, and Reliability. Your operational baseline — tracked over time across all connected platforms.

🔗

Context Graph

Unravel builds a relationship map across code, compute, data, and users. Fixes are informed by system-wide context, not isolated query analysis.

Tip: Most teams start in Human-in-Loop mode to build confidence in Unravel's recommendations, then progressively enable AutoApply per workload class. Typical ramp: 4–6 weeks to full AutoApply on routine infrastructure actions.

Databricks

Databricks integration

Unravel connects to Databricks via the Databricks REST API and workspace-level service principal. No agents run inside your workspace. Jobs, notebooks, pipelines, and clusters are observed continuously; optimizations are applied through the same API surface Databricks exposes to your own tools.

Architecture

Installation

Choose authentication method

Unravel supports two auth methods. Use SPN / OAuth M2M (recommended for production) — create a Databricks Service Principal and generate a client secret. Or use a PAT (Personal Access Token) for quick trials. Enter credentials in the Unravel onboarding flow.

https://<workspace-id>.azuredatabricks.net

Configure data sharing mechanism

Select how Unravel pulls telemetry from your workspace — choose one or more:

Unity Catalog + SQL Warehouse — Unravel queries system.* tables via a dedicated SQL Warehouse. Requires Unity Catalog enabled and SELECT on system.* tables.

Delta Share — Unravel receives telemetry data shared via the Delta Sharing protocol. Provide the share name and the recipient profile. No direct workspace access required.

DBFS Direct Share — Unravel reads log and metrics files written to DBFS paths. Requires DBFS read scope on the configured paths.

Grant REST API permissions

The SPN or PAT must have: Jobs read/edit Clusters read/edit(Optional edit for Auto Actions), Pipelines read, and Notebooks read (optional,for code analysis). Unity Catalog access is required for the Unity Catalog telemetry mechanism.

Select workspaces and clusters

Choose which workspaces to monitor. Scope coverage to specific clusters or job namespaces — useful for phased rollouts in large environments.

Set automation policy

Choose Human-in-Loop (default) or AutoApply per action category. Most teams start with Human-in-Loop and enable AutoApply for low-risk infrastructure actions after a few weeks.

First insights in ~30 minutes

Unravel begins ingesting job run history. Initial cost and performance insights appear within 30 minutes. Full workload profiling completes after 24–48 hours of data.

Required permissions

Resource	Access	Purpose
Jobs API	read edit	Job run history, durations, costs
Clusters API	read edit	Cluster config reading + rightsizing actions
DBFS / Delta	read	Storage profiling and Data Temperature analysis
Unity Catalog	read	Table lineage and storage attribution
Notebooks / Repos	read	Code analysis for Arvix rewrites (optional)

Key features

Workload Studio

Deep job and notebook profiling — stage-level metrics, shuffle analysis, PySpark diffs, and historical run comparisons.

Cluster rightsizing

Arvix recommends and applies autoscaling config corrections, worker type changes, and idle-cluster policies.

Code rewrites

Inefficient PySpark and SQL detected in production gets a one-click Arvix-generated fix — repartition, reorder, hint injection.

Delta / storage optimization

Data Temperature classification (Hot/Warm/Cold) drives archival and vacuum recommendations, with AutoApply support.

Cost attribution

Job-level and team-level cost breakdown. Chargeback-ready reports per business unit, with MoM trend tracking.

CI/CD integration

Unravel's CI plugin flags cost and performance regressions in PRs before they reach production. Supports GitHub Actions and Jenkins.

Snowflake

Snowflake integration

Unravel connects to Snowflake via a dedicated role with read access to INFORMATION_SCHEMA and ACCOUNT_USAGE views. Optimization actions — warehouse rightsizing, query rewrite suggestions, idle-suspend policies — are applied through a controlled write role that you define and audit separately.

Architecture

Installation

Run the Unravel setup script

Unravel provides a Snowflake SQL script that creates a dedicated role, warehouse, and grants. Run it as ACCOUNTADMIN. The script is fully auditable — no black boxes.

Generate connection credentials

Use key-pair authentication (recommended) or username/password. Provide the account identifier, role name, and warehouse to the Unravel onboarding UI.

Configure action permissions

Warehouse resize and auto-suspend actions require an additional MODIFY WAREHOUSE grant. These are gated behind a separate approval step in the UI.

Monitoring begins

All virtual warehouses in the account are discovered and monitored automatically. Unravel begins building workload profiles and usage patterns across every warehouse.

Required permissions

Resource	Access	Purpose
ACCOUNT_USAGE	read	Query history, warehouse usage, storage metadata, credit usage
INFORMATION_SCHEMA	read	Real-time query history
Virtual warehouses	read modify	Read configs, update configs, apply resize

Key features

Warehouse rightsizing

Arvix detects oversized warehouses, idle time patterns, and multi-cluster contention — then right-sizes or adjusts auto-suspend policies automatically.

Query optimization

Identifies expensive queries and generates rewritten SQL — clustering changes, join reorders, partition pruning, predicate pushdown, etc.

Storage optimisation

Identifies cold tables, transient table opportunities, and excessive time-travel/fail-safe retention — recommends cleanup and reclassification to reduce storage costs.

Credit attribution

Breaks credit spend down to individual queries, users, and business units. Chargeback-ready exports to common FinOps tools.

SLA tracking

Monitors query SLAs in real time. Arvix can proactively reschedule or scale to prevent missed windows before they occur.

BigQuery

BigQuery integration

Unravel connects to BigQuery using a GCP service account with project-scoped IAM roles. Telemetry is collected from Information schema views and BigQuery APIs. Optimization actions — slot reservation management, query rewrites, storage recommendations — are applied through the same service account with write roles you control.

Architecture

Installation

Create a service account

Create a GCP service account in the target project. Assign the roles listed in the table below. Download the JSON key file.

Upload credentials to Unravel

Provide the service account JSON key to Unravel via the onboarding UI or a secure handoff.

Enable billing export

Setup billing export to a dataset in your project.

Select projects to monitor

Specify the list of projects to be monitored by unravel. Specify the GCP project ID, the dataset ID, and the target table name for billing export. Unravel builds a unified view across all the projects.

Required IAM roles

Role	Access	Purpose
BigQuery Resource Viewer	read	Projects metadata, job history and query stats
BigQuery Metadata Viewer	read	Storage metadata
BigQuery Data Viewer (For billing export table only)	read	Billing export dataset access
BigQuery User (for one project only)	create	Execute BQ queries by Unravel
BigQuery Resource Editor (optional, for AutoApply)	write	Slot reservation management
BigQuery Resource Admin (optional, for AutoApply)	write	Slot Capacity management

Key features

Slot optimization

Continuously analyzes slot demand patterns to find the optimal balance between on-demand and committed capacity — then automatically adjusts reservations so you pay less without sacrificing query performance.

Query optimisation

Identifies expensive queries and generates rewritten SQL — clustering changes, join reorders, partition pruning, predicate pushdown, etc.

Storage optimisation

Detects unused partitions, unqueried tables, and suboptimal clustering — recommends cleanup, partition expiration, and storage type changes to cut costs. Classifies tables as Hot/Warm/Cold based on access patterns. Auto-identifies long-tail tables generating storage cost with no query activity.

Multi-project attribution

Unified cost view across projects and teams. Chargeback reports at the project, dataset, user, and label level.

Cloudera

Cloudera integration

Unravel's Cloudera integration covers both on-premises CDH/CDP environments and Cloudera Data Platform on public cloud. A lightweight Unravel server is deployed within your network boundary — no data leaves your infrastructure. Unravel reads from Cloudera Manager API and YARN, Spark, and Hive event streams.

Note: The Cloudera integration uses an on-premises agent model rather than SaaS connector. The Unravel server runs in your environment and communicates outbound only for UI access and license management. All telemetry stays within your network.

Architecture

Installation

Provision the Unravel server

Deploy the Unravel server on a dedicated host (or VM) within your cluster network. Minimum: 16 CPU cores, 64 GB RAM, 500 GB SSD. The server communicates with cluster nodes over your internal network only.

Install the Unravel agent package

Download the Unravel RPM or tarball from the customer portal. Run the installer script as root on the Unravel server host.

sudo ./unravel-install.sh --cluster-manager cloudera

Configure Cloudera Manager credentials

Provide your Cloudera Manager hostname, port, and a read-only API account. Unravel auto-discovers cluster services and begins registering event listeners.

Enable Spark and YARN instrumentation

Unravel adds a Spark listener JAR via Cloudera Manager parcel or classpath injection. A rolling restart of affected services is required. YARN history server integration requires no restart.

Access the Unravel UI

The UI is served from the Unravel server on port 3000 by default. It's accessible only within your network unless you configure a proxy or VPN. No inbound connections from Unravel SaaS infrastructure.

Key features

Spark job analysis

Stage-level profiling, DAG visualization, executor skew detection, and Arvix-generated PySpark optimization recommendations.

YARN queue optimization

Queue utilization analysis, capacity planning recommendations, and workload scheduling optimization across YARN resource pools.

Hive / Impala query tuning

Query plan analysis, statistics freshness checks, and partition/bucket recommendation for slow-running Hive and Impala workloads.

Migration readiness

Workload inventory and compatibility scoring for teams planning migration from Cloudera to Databricks, Snowflake, or BigQuery.

Cloudera to cloud migration: Unravel's Cloudera integration is often the starting point for teams modernizing to cloud data platforms. The workload inventory and cost attribution data generated here carries forward into your Databricks or Snowflake environment.

How Unravel works

Deployment model

Architecture overview

Key concepts

Databricks integration

Architecture

Installation

Required permissions

Key features

Snowflake integration

Architecture

Installation

Required permissions

Key features

BigQuery integration

Architecture

Installation

Required IAM roles

Key features

Cloudera integration

Architecture

Installation

Key features

Technical FAQ