The Data Lakehouse
It would appear Microsoft has run out of words to use for new features/products, so they are just going to recycle existing words for a bit of fun at our expense. In this episode we explore the Azure Data Lakehouse—which is not a data lake, nor a traditional data warehouse and Books Online (BOL) refers to it as Databricks Lakehouse. Luke Moloney walks us through how combining the flexibility of data lake storage with some ACID transaction and data governance gives organizations looking to analyze their data a new option. The hope is the cost to build, store, and analyze data will be easier and more approachable for organizations who don’t want to go with the traditional data warehouse model. This still feels like an Enterprise feature to me but let me know if your organization would be interested in this approach. As always, special thanks to Luke and the folks at Microsoft for making themselves available to us.
Luke Moloney is a Senior Program Manager at Microsoft, where he focuses on Analytics technologies – particularly Azure Synapse Analytics.
He has previously been a Cloud Solution Architect with Microsoft. Prior to that, he worked in consulting building cloud data platforms.
He currently resides in Dublin, Ireland, and can be found on Twitter @lukemoloney.
Am I keeping every piece of data until the end of time, or alternatively, am I getting rid of data very quickly? But there’s generally a balance.
Meet the Hosts
With more than 10 years of working with SQL Server, Carlos helps businesses ensure their SQL Server environments meet their users’ expectations. He can provide insights on performance, migrations, and disaster recovery. He is also active in the SQL Server community and regularly speaks at user group meetings and conferences. He helps support the free database monitoring tool found at databasehealth.com and provides training through SQL Trail events.
Eugene works as an independent BI consultant and Pluralsight author, specializing in Power BI and the Azure Data Platform. He has been working with data for over 8 years and speaks regularly at user groups and conferences. He also helps run the GroupBy online conference.
Kevin is a Microsoft Data Platform MVP and proprietor of Catallaxy Services, LLC, where he specializes in T-SQL development, machine learning, and pulling rabbits out of hats on demand. He is the lead contributor to Curated SQL, president of the Triangle Area SQL Server Users Group, and author of the books PolyBase Revealed (Apress, 2020) and Finding Ghosts in Your Data: Anomaly Detection Techniques with Examples in Python (Apress, 2022). A resident of Durham, North Carolina, he can be found cycling the trails along the triangle whenever the weather's nice enough.