Resources and insights
Our Blog
Explore insights and practical tips on mastering Databricks Data Intelligence Platform and the full spectrum of today's modern data ecosystem.
Add External Data Sources to Unity Catalog Lineage
Enhance your Unity Catalog lineage by incorporating external data sources such as Kafka streams and IoT devices. This blog covers both UI-based and programmatic methods for creating complete data lineage visibility in your Databricks environment.
AI_PARSE_DOCUMENT() Get PDF Invoices Into The Database
Learn how to automate invoice processing with Databricks' AI_PARSE_DOCUMENT() function. Step-by-step guide to convert PDF invoices into structured database records using SQL and Agent Bricks. Includes cost analysis and real examples.
Managed Iceberg Tables
Learn when to choose Apache Iceberg over Delta tables in Databricks. Complete guide covering manifest files, CDC limitations, liquid partitioning, and table properties with practical examples.
The Hidden Benefits of Databricks Serverless
Most Databricks cost comparisons focus only on compute pricing, missing two critical factors that can save thousands monthly. Learn how serverless waives private link transfer fees (up to $10/TB) and provides persistent remote caching that survives warehouse restarts. These hidden benefits often justify the serverless premium entirely.
Data Intelligence for All: 9 Ways Databricks Just Redefined the Future of Data
Discover how Databricks' 9 major announcements at Summit 2025 are democratizing AI with Agent Bricks, Lakebase, free edition, and more game-changing innovations.
End The Data Engineering Nightmare with Metrics.
Learn how Databricks metrics views simplify SQL analytics by centralizing business rules and eliminating repetitive code. Complete tutorial with examples.
Bridge the Gap in Your Data Stack: Leverage Databricks BI/AI to Enhance Traditional BI
Bridge the gap in your data architecture by strategically combining Databricks BI/AI with traditional business intelligence tools. Learn how to reduce licensing costs, improve dashboard performance, and implement a hybrid approach that leverages the best of both worlds. This practical guide shows you when to use native Databricks capabilities versus tools like Power BI for optimal cost-efficiency and performance.
The Chaos of Data: How Fragmentation is Stalling Innovation
Discover how data fragmentation across multiple platforms creates costly silos and slows decision-making. Learn why Databricks offers the unified solution modern enterprises need.
Databricks: An Insider’s Perspective with Franco Patano
Josue Bogran interviews Databricks Product Owner Franco Patano to discuss Databricks' current state and future.
Databricks Compute Types: A Performance & Cost Analysis
Discover which Databricks compute type delivers the best value through real-world testing. Compare SQL Serverless, Classic, and Serverless performance across 19 intensive queries.
Seamless Data Integration: SAP to Databricks
Learn how to integrate SAP data into Databricks with this comprehensive blog. Discover the essential components of the SAP ecosystem, including SAP HANA, S/4HANA, and BTP, and explore proven integration methods using SparkJDBC and Azure Data Factory. Perfect for data engineers and architects looking to combine SAP's enterprise management capabilities with Databricks' advanced analytics.
5 Reasons Why We Recommend Databricks
Discover why Databricks stands out as the leading data platform in this insightful blog. From unified data management to cost efficiency, unmatched performance, and robust analytics, Josue Bogran explains the top 5 reasons Databricks excels in the competitive landscape. Learn how Databricks balances innovation, user-centric design, and industry versatility to deliver exceptional results.
Performance, Benchmarks, and Optimization Tips for Databricks Users
Josue Bogran interviews Jeremy Lewallen from Databricks’ Performance Team, exploring benchmarks, storage cost optimization, rightsizing SQL Serverless Compute, and common compute mistakes. Discover why Databricks continually enhances performance, tips for using the latest DBRs, and how their innovations provide a fast, efficient, and developer-friendly data platform.
Redshift to Databricks - Part 2: Technical Implementation Guide
This guide dives into the technical steps required to migrate from Amazon Redshift to Databricks. Covering everything from discovery and data evaluation to security protocols and cost estimation, it offers detailed, practical strategies for managing dependencies, optimizing queries, and planning for future scalability within Databricks’ robust ecosystem.
Databricks SQL in 5 Minutes
Databricks SQL is easy to get started with and highly performant. Don't just take our word for it: watch the video from our technical advisor Josue showing how simple it is to get started, along with his second video comparing Databricks and Snowflake query performance.