To Nha Notes | Dec. 22, 2025, 4:58 p.m.
The data landscape is evolving. As AI workloads become mainstream, the traditional lakehouse architecture—built primarily for BI and analytics—is being stretched to its limits. A new pattern is emerging: dual-format lakehouses that leverage both Apache Iceberg and Lance.
Apache Iceberg has been the backbone of analytics lakehouses, providing transactional guarantees and schema evolution at scale. But AI/ML workloads bring different requirements: vector embeddings, multimodal data (images, audio, video), and fundamentally different access patterns.
This is where Lance enters the picture—a columnar format purpose-built for AI/ML workloads.
Iceberg excels at:
Lance shines for:
Companies like Netflix are now adopting both formats: Iceberg for BI workloads, Lance for AI and multimodal data. This dual-format strategy lets organizations leverage the strengths of each without compromise.
The key insight? It's not about choosing one over the other—it's about using the right tool for each workload while maintaining interoperability at the compute layer.
Want to dive deeper? Read the full technical analysis: From BI to AI: A Modern Lakehouse Stack with Lance and Iceberg