Resources and insights
Our Blog
Explore insights and practical tips on mastering Databricks Data Intelligence Platform and the full spectrum of today's modern data ecosystem.
CI/CD Best Practices: Passing tests isn't enough
CI/CD pipelines can pass all jobs yet still deploy broken functionality. This blog covers smoke testing, regression testing, and critical validation strategies: especially useful for data projects where data quality is as important as code quality.
Recursive CTE: The beauty of SQL Self-Referencing Queries
Recursive CTEs in SQL: queries that can reference themselves to solve complex problems iteratively. From generating sequences to traversing network graphs and hierarchical data, learn how to eliminate manual looping with SQL solutions.
Managing Data Changes with SCDs in Databricks
Discover how to build trustworthy data systems with Slowly Changing Dimensions in Databricks. This comprehensive guide covers SCD Types 1, 2, and 6 implementations using Delta Lake's MERGE operations and LakeFlow Declarative Pipelines, with practical SQL and Python examples.
Add External Data Sources to Unity Catalog Lineage
Enhance your Unity Catalog lineage by incorporating external data sources such as Kafka streams and IoT devices. This blog covers both UI-based and programmatic methods for creating complete data lineage visibility in your Databricks environment.
AI_PARSE_DOCUMENT() Get PDF Invoices Into The Database
Learn how to automate invoice processing with Databricks' AI_PARSE_DOCUMENT() function. Step-by-step guide to convert PDF invoices into structured database records using SQL and Agent Bricks. Includes cost analysis and real examples.
Managed Iceberg Tables
Learn when to choose Apache Iceberg over Delta tables in Databricks. Complete guide covering manifest files, CDC limitations, liquid partitioning, and table properties with practical examples.
The Hidden Benefits of Databricks Serverless
Most Databricks cost comparisons focus only on compute pricing, missing two critical factors that can save thousands monthly. Learn how serverless waives private link transfer fees (up to $10/TB) and provides persistent remote caching that survives warehouse restarts - hidden benefits that often justify the serverless premium entirely.
Data Intelligence for All: 9 Ways Databricks Just Redefined the Future of Data
Discover how Databricks' 9 major announcements at Summit 2025 are democratizing AI with Agent Bricks, Lakebase, free edition, and more game-changing innovations.
End The Data Engineering Nightmare with Metrics.
Learn how Databricks metrics views simplify SQL analytics by centralizing business rules and eliminating repetitive code. Complete tutorial with examples.
Unity Catalog to Azure Key Vault: No more dbutils.secrets()
Learn how to securely connect Azure Databricks to Key Vault using Unity Catalog Service Credentials for enterprise-grade secret management and governance.
Oracle to Databricks Migration: The Complete Guide
Complete technical guide for Oracle to Databricks migration. Includes code conversion examples, performance optimization & proven methodologies. Download free.
Stop ELT Headaches: Why We Partner with Fivetran + Databricks
Discover how SunnyData overcame ELT challenges by partnering with Fivetran and Databricks, creating reliable data pipelines that eliminate late-night fixes and accelerate insights.
Bridge the Gap in Your Data Stack: Leverage Databricks BI/AI to Enhance Traditional BI
Bridge the gap in your data architecture by strategically combining Databricks BI/AI with traditional business intelligence tools. Learn how to reduce licensing costs, improve dashboard performance, and implement a hybrid approach that leverages the best of both worlds. This practical guide shows you when to use native Databricks capabilities versus tools like Power BI for optimal cost-efficiency and performance.
PostgreSQL to Databricks Migration: A Simpler Path to the Lakehouse
Discover a complete guide to PostgreSQL to Databricks migration, including analysis, design, development, and deployment phases. Learn how to transform stored procedures, manage data transfer, and leverage the new SQL Scripting feature for a simplified lakehouse journey.
The Chaos of Data: How Fragmentation is Stalling Innovation
Discover how data fragmentation across multiple platforms creates costly silos and slows decision-making. Learn why Databricks offers the unified solution modern enterprises need.
 
                         
            
              
            
            
          
             
  
  
    
    
     
  
  
    
    
     
  
  
    
    
     
 
 
 
 
 
 
 
 
 
 
 
 
