Databricks Naming Compliance Review¶
Instructions¶
Use this skill to run a repeatable naming-standards review of a Databricks ETL repository.
- Read references/review-prompt.md before inspecting the target repo. Treat the rules in that file as the default naming standard; no PDF is required.
- If the user explicitly supplies an updated naming standard, use it as an override or addendum and call out where it differs from the embedded rules.
- Inspect the requested branch without disturbing the user's working tree. If
the review target is
main, prefergit -C <repo> ls-tree mainandgit -C <repo> show main:<path>style reads. - If the output format is not already specified, ask the user:
Should I create a canvas-like visual artifact if this IDE or agent supports one, and otherwise generate a regular Markdown (.md) report? - If the user agrees, use the richest canvas-like output available in the
current environment. If that capability is unavailable, generate a regular
.mdreport with the same sections and evidence. - Include a run metadata block at the top of the output:
- last updated by reviewer name
- run date/time
- reviewed branch/ref
- reviewed head commit SHA
- reviewed head commit date
- delay since last run
- Compute "delay since last run" by finding the newest prior naming review
artifact for the same repo and parsing its run date. If no previous artifact
is available, write
First recorded run. - Calculate a weighted roll-up compliance score out of 100. Use 50% weight for Databricks object references, 25% for code files, 15% for folders, and the remaining 10% for job YAML files. Explain the weighting briefly in the output.
- Produce counts, percentages, representative compliant examples, representative violations, anti-patterns, and ambiguity/assumptions.
- Keep evidence concrete: cite paths, object names, job names, task keys, branch refs, and command-derived counts.
- Do not edit the target repository unless the user explicitly asks for remediation.
Output¶
Prefer a standalone analytical artifact when the environment supports one and the user has accepted that output mode. Otherwise produce Markdown suitable for a living review document.
Always include:
- executive summary
- standards interpreted
- weighted roll-up score and scoring explanation
- compliance dashboard
- findings ordered by severity
- hotspots by project/module
- progress or delay since last run
- recommended remediation order
- assumptions and standards gaps
Troubleshooting¶
Error: The embedded rules reference another standard.
Cause: Catalog, user/group, or compute naming is intentionally delegated to
separate standards.
Solution: Mark those areas as not fully assessable from embedded rules;
still report concrete drift and ask for the missing standard before calling it
non-compliant.
Error: Counts differ from a previous run. Cause: Branch, file-extension scope, compound extensions, or generated files were treated differently. Solution: State the branch/ref and scope rules in the metadata and assumptions, then use the same scope on future reruns.
Error: No previous review artifact can be found.
Cause: The review has not been recorded in a parseable location.
Solution: Set delay to First recorded run and include parseable run
metadata so the next rerun can compute the delay.