The data lake landscape is undergoing a fundamental transformation. Traditional Hive tables are giving way to a new generation of open table formats—Apache Iceberg, Apache Hudi, Delta Lake, and emerging contenders like DuckLake—each promising to solve the inherent challenges of managing massive datasets at scale.
But which format fits your architecture? This session cuts through the marketing noise to deliver practical insights for data architects and engineers navigating this critical decision. We’ll explore how these formats tackle schema evolution, time travel, ACID transactions, and metadata management differently, and what these differences mean for your data platform’s performance, reliability, and total cost of ownership.
Drawing from real-world implementations, you’ll discover the hidden complexities, unexpected benefits, and common pitfalls of each approach. Whether you’re modernizing legacy Hive infrastructure, building greenfield data lakes, or evaluating lakehouse architectures, you’ll leave with a clear framework for choosing and implementing the right open table format for your specific use case—and the confidence to justify that decision to stakeholders.
Highlights: