Skip to main content

Semantic structure

Files and Folders

The first thing you need to establish is how you’re going to consistently structure your code. There are two recommend best practices to choose from:

  • 🏡 Co-locate your semantic layer code in a one-YAML-file-per-marts-model system.
    • Puts documentation, data tests, unit tests, semantic models, and metrics into a unified file that corresponds to a dbt-modeled mart.
    • Trades larger file size for less clicking between files.
    • Simpler for greenfield projects that are building the Semantic Layer alongside dbt models.
  • 🏘️Create a sub-folder called models/semantic_models/.
    • Create a parallel file and folder structure within that specifically for semantic layer code.
    • Gives you more targeted files, but may involves switching between files more often.
    • Better for migrating large existing projects, as you can quickly see what marts have been codified into the Semantic Layer.

It’s not terribly difficult to shift between these (it can be done with some relatively straightforward shell scripting), and this is purely a decision based on your developers’ preference (i.e. it has no impact on execution or performance), so don’t feel locked in to either path. Just pick the one that feels right and you can always shift down the road if you change your mind.

Naming

Next, establish your system for consistent file naming:

  • 1️⃣ If you’re doing one-YAML-file-per-mart then you’d have an orders.sql and an orders.yml.
  • 📛 If you’re using a parallel subfolder approach, for the sake of unique file names it’s recommended to use the prefix sem_ e.g. sem_orders.yml for the dedicated semantic model and metrics that build on orders.sql and orders.yml.

Can't decide?

Start with a dedicated subfolder for your semantic models and metrics, and then if you find that you’re spending a lot of time clicking between files, you can always shift to a one-YAML-file-per-mart system. Our internal data team has found that the dedicated subfolder approach is more manageable for migrating existing projects, and this is the approach our documentation uses, so if you can't pick go with that.

0