From oracle-ai-data-platform-workbench-engineer-agent
Answers business questions over the AIDP lakehouse by translating natural language to Spark SQL, grounded in a catalog and semantic model.
How this skill is triggered — by the user, by Claude, or both
Slash command
/oracle-ai-data-platform-workbench-engineer-agent:aidp-analyzing-dataThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Answer business questions by grounding in the catalog/semantic model, reusing verified queries when
aidp-analyzing-data — natural language → Spark SQLAnswer business questions by grounding in the catalog/semantic model, reusing verified queries when
possible, then executing Spark SQL via the bundled scripts/aidp_sql.py helper.
Source is an external / non-lakehouse system (Fusion, EPM, Oracle ADB/ExaCS, Snowflake, S3, …)? This skill is lakehouse-native Spark SQL. To pull from an external source, use the
oracle-ai-data-platform-workbench-spark-connectorsplugin'saidp-<source>skill (install it if absent; run itsaidp-connectors-bootstrapskill once to push the helper package to the cluster), oraidp-federateto join across sources.
.aidp/verified-queries.md; if a verified: true entry closely matches
the question (similar text + table overlap), reuse its SQL (adapt only dates/bind values) and say so..aidp/catalog.md + .aidp/semantic.md: map concepts→tables via Quick
Reference/synonyms, use recorded join keys (don't guess joins), use value dictionaries for WHERE
literals, prefer metric SQL expressions from the semantic model. If the catalog cache is missing, run
aidp-catalog-init first.python "$PLUGIN_DIR/scripts/aidp_sql.py" \
--region <region> --datalake <DATALAKE_OCID> --workspace <ws> --cluster <cluster-key> \
--code "spark.sql('''<SQL>''').show(50, truncate=False)"
Returns JSON {status, execution_count, outputs, spark_job_ids, error}. Each invocation runs the cell;
keep the same <SQL> shape across follow-ups. Smoke-test connectivity with --code "spark.sql('SELECT 1').show()"..aidp/catalog.md and/or register
the working query via aidp-verified-queries (which validates before marking it verified).SHOW COLUMNS / DESCRIBE cell) or ask.catalog.schema.table). Default catalog/schema only when the user implies them.
This includes metadata commands: use SHOW TABLES IN <catalog>.<schema> (e.g. SHOW TABLES IN default.default),
not the unqualified SHOW TABLES IN default — the bare form raises AnalysisException: [SCHEMA_NOT_FOUND]
because default resolves as a catalog, not a schema.error field, fix grounded in the catalog, and
retry — don't guess repeatedly.aidp-cluster-ops); the helper attaches to the
cluster you pass via --cluster.ai_generate) see aidp-ai-sql; for cross-source joins see aidp-federate.2plugins reuse this skill
First indexed Jun 12, 2026
npx claudepluginhub anthropics/claude-plugins-official --plugin oracle-ai-data-platform-workbench-engineer-agentRuns LLM functions inside Spark SQL on AIDP via ai_generate() for summarizing, classifying, extracting, enriching rows, generating narratives, or grounded RAG analysis in the lakehouse.
Analyze lakehouse data interactively via Fabric Lakehouse Livy API sessions using PySpark/Spark SQL for DataFrames, cross-lakehouse joins, Delta time-travel, and unstructured/JSON data.
Analyzes lakehouse data interactively using Fabric Lakehouse Livy API sessions and PySpark/Spark SQL for DataFrames, joins, Delta time-travel, and JSON analysis.