DM
  • Menu ▾
    • About
    • Writing
    • Research
    • Speaking
    • Books
  • About
  • Writing
  • Research
  • Speaking
  • Books
  • LinkedIn
  • X

Writing

Featured Technical Blogs

  • Apache Iceberg: Key Innovations So Far An exploration of the core architectural features that made Apache Iceberg an industry standard for open table formats, including snapshot isolation and partition evolution.

  • Apache Parquet vs. Newer File Formats A technical comparison between the established Parquet format and emerging high-performance alternatives like BtrBlocks and Lance for specialized data workloads.

  • What is Apache Arrow Flight & ADBC? Breaking down how Arrow Flight and the Arrow Database Connectivity (ADBC) standard are solving the “data bottleneck” in high-speed analytical systems.

  • Concurrency Control in Apache Hudi A deep dive into multi-writer scenarios in Apache Hudi, explaining how optimistic concurrency control and lock providers ensure data integrity.

  • ACID Transactions in a Lakehouse An explanation of how open table formats bring database-grade reliability to data lakes, enabling atomic commits and consistent reads across massive datasets.

  • How Z-Ordering in Apache Iceberg Helps Performance A guide to multi-dimensional data clustering and how Z-Ordering significantly reduces data skipped during query execution for faster performance.

  • Getting Started with Flink SQL and Apache Iceberg A hands-on tutorial on integrating Apache Flink with Iceberg to build robust, real-time streaming pipelines with full transactional support.

© 2026 Powered by Hugo :: Theme made by panr