[Coinbase Lakehouse Architecture] Achieving Data Warehouse Performance on a Data Lakehouse
Join Sida Shen from CelerData and Eric Sun from Coinbase in this video as they dive into the latest advancements in data lakehouse querying and share tips to make the most out of your data lakehouse. They’ll cover:
🌟Why you shouldn’t rely on proprietary data warehouses just to speed up queries
🌟The latest cool stuff in query engines boosting lakehouse performance
🌟A close look at how Coinbase is using StarRocks, Delta Lake, and Unity Catalog
———————————————————————————————————————-
Timestamps
00:00 Intro
00:30 Data Lakehouse – Data Warehouse Features on Data Lake
04:46 Challenges of Fast Data Lake Queries on Data Lake
06:45 How to Accelerate Data Lake Query Performance
08:14 What Is StarRocks
08:54 How Fast Is a Purposely Built Lakehouse Engine
09:43 SSB Benchmark Test – StarRocks vs. ClickHouse vs. Apache Druid, Out-Of-Box
10:51 Benchmark: StarRocks as a Data Warehouse vs. StarRocks as a Lakehouse Query Engine – TPC-DS 1TB Benchmark
11:13 Comparing to Other Query Engines: StarRocks vs. Trino
13:11 Coinbase – Data Lake with Open Format, Unity Catalog, and Multiple Query Engines
14:29 Coinbase’ Data Stack
15:12 How Coinbase Uses StarRocks + Unity Catalog + Delta Lake
18:45 PuppyGraph
19:45 DuckDB
20:34 The True Benefit of an Open Lakehouse
21:40 Conclusion
———————————————————————————————————————-
Learn more at https://celerdata.com/
Connect with us:
LinkedIn: https://www.linkedin.com/company/celerdata/
Twitter: https://twitter.com/celerdata
StarRocks GitHub: https://github.com/StarRocks/StarRocks
StarRocks Website: https://www.starrocks.io/
Slack: https://try.starrocks.com/join-starrocks-on-slack
#DataAnalytics #DataEngineering #DataLakeAnalytics #OLAP #DataAnalyst #DataEngineer #DataInfrastructure #Database #AnalyticalDatabase #DataLake #DataLakeHouse #Trino #Presto #DataWarehouse #DataScience #ApacheIceberg
by CelerData
linux foundation
🌟Join StarRocks on Slack: https://try.starrocks.com/join-starrocks-on-slack