<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Stefan Koch - Data Engineering Blog</title><description>Data Engineering, Databricks &amp; DevOps Blog by Stefan Koch</description><link>https://stefanko.ch/</link><item><title>Nexus Stack: Your Data. Your Rules. Your Flow</title><link>https://stefanko.ch/p/nexus-stack/</link><guid isPermaLink="true">https://stefanko.ch/p/nexus-stack/</guid><description>Deploy your Data Stack with Docker and Cloudflare in an automated, secure, and scalable way.</description><pubDate>Thu, 22 Jan 2026 17:12:11 GMT</pubDate><category>DevOps</category><category>DataStack</category><category>Hetzner</category><category>OpenTofu</category></item><item><title>Secure Hetzner Docker Deployment via Cloudflare Zero Trust Tunnel</title><link>https://stefanko.ch/p/zero-entry-docker/</link><guid isPermaLink="true">https://stefanko.ch/p/zero-entry-docker/</guid><description>Self-hosting is awesome – you have full control over your data, no subscription fees, and you learn a lot about system administration along the way. But it comes with real challenges: security concern</description><pubDate>Tue, 06 Jan 2026 18:00:00 GMT</pubDate><category>howto</category><category>self-hosting</category><category>infrastructure-as-code</category></item><item><title>Building a Live Data Simulator for Data Engineering Practice</title><link>https://stefanko.ch/p/adventureworks-simulator/</link><guid isPermaLink="true">https://stefanko.ch/p/adventureworks-simulator/</guid><description>If you&apos;ve ever tried to learn Change Data Capture (CDC), streaming pipelines, or incremental data loading, you&apos;ve probably hit the same frustrating wall I did: **static sample databases don&apos;t teach yo</description><pubDate>Tue, 09 Dec 2025 06:00:00 GMT</pubDate><category>Data Engineering</category><category>Azure</category></item><item><title>From SharePoint to Databricks – and Back: Seamless Bidirectional Integration</title><link>https://stefanko.ch/p/databricks-sharepoint-integration/</link><guid isPermaLink="true">https://stefanko.ch/p/databricks-sharepoint-integration/</guid><description>In modern data workflows, it’s essential to collect data where it originates and deliver it to where it’s needed. In this blog post, I’ll show how to connect **SharePoint directly with Databricks** – </description><pubDate>Wed, 28 May 2025 07:20:17 GMT</pubDate><category>howto</category><category>databricks</category><category>integration</category><category>SharePoint</category></item><item><title>From Snapshots to CDC: How to load Snapshot-Data with Databricks Delta Live Tables</title><link>https://stefanko.ch/p/delta-live-table-full-snapshot-source/</link><guid isPermaLink="true">https://stefanko.ch/p/delta-live-table-full-snapshot-source/</guid><description>In this article I describe how to load data from recurring full snapshots with Delta Live Tables relatively easily and elegantly into a bronze table without the amount of data exploding.</description><pubDate>Sun, 16 Mar 2025 12:31:27 GMT</pubDate><category>howto</category><category>databricks</category><category>Business Intelligence</category></item><item><title>MS SQL Server Meets Lakehouse: Integrating Databricks SQL Warehouse as a Linked Server</title><link>https://stefanko.ch/p/databricks-sql-warehouse-as-a-linked-server/</link><guid isPermaLink="true">https://stefanko.ch/p/databricks-sql-warehouse-as-a-linked-server/</guid><description>In this blog post, I explore how to set up Databricks SQL Warehouse as a Linked Server in MS SQL Server to seamlessly query data from your Lakehouse directly within SQL Server. </description><pubDate>Thu, 09 Jan 2025 17:15:57 GMT</pubDate><category>Databricks</category><category>SQL Server</category></item><item><title>Effortless Databricks Asset Bundle Deployments with Azure DevOps</title><link>https://stefanko.ch/p/effortless-databricks-asset-bundle-deployments-with-azure-devops/</link><guid isPermaLink="true">https://stefanko.ch/p/effortless-databricks-asset-bundle-deployments-with-azure-devops/</guid><description>In this step-by-step tutorial, you’ll learn how to set up an Azure DevOps pipeline for seamless deployment of Databricks asset bundles. The post walks you through the entire process—from configuring your Azure DevOps project to automating the deployment of Notebooks, Workflows, and other Databricks resources.</description><pubDate>Wed, 08 Jan 2025 19:47:57 GMT</pubDate><category>databricks</category><category>DevOps</category><category>CI/CD</category></item><item><title>Step-by-Step in Databricks: Creating a Date and Time dimension for BI Analytics</title><link>https://stefanko.ch/p/create-dim-date-time-in-databricks/</link><guid isPermaLink="true">https://stefanko.ch/p/create-dim-date-time-in-databricks/</guid><description>In this article, you will learn how to set up date and time dimensions in Databricks to enable precise time-based analyses and reports.</description><pubDate>Sun, 10 Nov 2024 07:51:57 GMT</pubDate><category>howto</category><category>databricks</category><category>Business Intelligence</category></item><item><title>Trigger and Monitor Data Factory Jobs from Databricks Workflows</title><link>https://stefanko.ch/p/trigger-and-monitor-adf-from-databricks/</link><guid isPermaLink="true">https://stefanko.ch/p/trigger-and-monitor-adf-from-databricks/</guid><description>In data engineering in the Azure Cloud, a common setup is to use Azure Data Factory to orchestrate data pipelines. If you wanted to orchestrate Databricks pipelines, you had a powerful tool at hand wi</description><pubDate>Fri, 01 Nov 2024 07:11:21 GMT</pubDate><category>howto</category><category>databricks</category><category>data factory</category></item><item><title>Databricks CLI Step by Step - Part 1 - The basics</title><link>https://stefanko.ch/p/databricks-cli-tutorial-1/</link><guid isPermaLink="true">https://stefanko.ch/p/databricks-cli-tutorial-1/</guid><description>In the upcoming blog series, I will highlight different areas of the Databricks CLI with various practical examples.</description><pubDate>Tue, 29 Oct 2024 19:04:16 GMT</pubDate><category>howto</category><category>databricks</category></item><item><title>Authentication Methods with Databricks CLI on Azure Databricks - Step by Step</title><link>https://stefanko.ch/p/databricks-cli-authentication/</link><guid isPermaLink="true">https://stefanko.ch/p/databricks-cli-authentication/</guid><description>I will show you different methods to authenticate with the Databricks CLI to Azure Databricks in this quick guide.</description><pubDate>Wed, 23 Oct 2024 18:29:51 GMT</pubDate><category>howto</category><category>databricks</category></item><item><title>How to load, process and visualize weather data from OpenWeather in the Databricks Lakehouse</title><link>https://stefanko.ch/p/openweather/</link><guid isPermaLink="true">https://stefanko.ch/p/openweather/</guid><description>In this HowTo I will show you how to load weather data from the OpenWeather API into the Databricks Lakehouse. The data is loaded via REST API from OpenWeather and then processed in the Medallion arch</description><pubDate>Thu, 21 Mar 2024 06:51:57 GMT</pubDate><category>howto</category><category>Databricks</category><category>ETL</category></item><item><title>How To: Copying all schemas and tables from one catalog to another catalog in Databricks</title><link>https://stefanko.ch/p/copy-schema-tables-databricks/</link><guid isPermaLink="true">https://stefanko.ch/p/copy-schema-tables-databricks/</guid><description>In this quick guide, I will show you how to automatically copy all schemas and tables from one catalog to another in Databricks.</description><pubDate>Sun, 17 Mar 2024 05:51:57 GMT</pubDate><category>howto</category><category>databricks</category></item><item><title>dbt core: Set up in VS Code and run seamlessly on Databricks – step by step</title><link>https://stefanko.ch/p/dbt-core-databricks/</link><guid isPermaLink="true">https://stefanko.ch/p/dbt-core-databricks/</guid><description>In this step-by-step guide, I will show you how to install dbt core locally in Visual Studio Code, set up the corresponding Visual Studio Code Extension and run dbt on Databricks.</description><pubDate>Sun, 10 Mar 2024 08:51:57 GMT</pubDate><category>howto</category><category>dbt</category><category>Databricks</category></item><item><title>How to create a self-hosted Airbyte environment in the Azure Cloud</title><link>https://stefanko.ch/p/airbyte-vm/</link><guid isPermaLink="true">https://stefanko.ch/p/airbyte-vm/</guid><description>In this quick guide, I will show you how to set up an Airbyte environment in a virtual machine in Azure.</description><pubDate>Sun, 03 Mar 2024 09:51:57 GMT</pubDate><category>howto</category><category>azure</category><category>ETL</category></item><item><title>How to create an Azure Databricks Lakehouse including Unity Catalog, DataLake and KeyVault – step by step</title><link>https://stefanko.ch/p/databricks_lakehouse/</link><guid isPermaLink="true">https://stefanko.ch/p/databricks_lakehouse/</guid><description>In this step-by-step guide, I describe how to create a Databricks Lakehouse environment in the Azure Cloud within 30 minutes. This guide is aimed at people who are new to Databricks and want to create</description><pubDate>Sun, 18 Feb 2024 09:51:57 GMT</pubDate><category>howto</category><category>databricks</category></item><item><title>I can now also be found on Medium</title><link>https://stefanko.ch/p/medium/</link><guid isPermaLink="true">https://stefanko.ch/p/medium/</guid><description>I have now decided in the old year to write on Medium as well as my personal blog.</description><pubDate>Tue, 12 Dec 2023 11:51:57 GMT</pubDate><category>general</category><category>blogging</category></item><item><title>First steps with the Azure Cloud</title><link>https://stefanko.ch/p/First_steps_with_the_Azure_Cloud/</link><guid isPermaLink="true">https://stefanko.ch/p/First_steps_with_the_Azure_Cloud/</guid><description>Azure is Microsoft’s cloud computing platform. It offers a comprehensive suite of cloud services that enable companies to develop, deploy and manage applications without having to have physical hardware on site. Azure enables users to use resources such as virtual machines, storage, databases, networks and much more in Microsoft’s globally distributed data centers. The Azure Cloud is one of the leading hyper-scalers on the market. In the following article, I would like to give beginners a few useful tips to make it easier to get started with the Azure portal.</description><pubDate>Mon, 11 Dec 2023 06:51:57 GMT</pubDate><category>howto</category><category>azure</category></item><item><title>Connect Databricks SQL Warehouse with DBeaver</title><link>https://stefanko.ch/p/DBeaver_SQL_Warehouse/</link><guid isPermaLink="true">https://stefanko.ch/p/DBeaver_SQL_Warehouse/</guid><description>In this quick guide I will show you how to connect to a Databricks SQL Warehouse Cluster with DBeaver.</description><pubDate>Tue, 28 Nov 2023 08:51:57 GMT</pubDate><category>howto</category><category>databricks</category></item></channel></rss>