Actions for Apache Iceberg essentials : storing and managing data at scale
Apache Iceberg essentials : storing and managing data at scale
- Published
- [Sebastopol, California] : O'Reilly Media, Inc., [2025]
- Edition
- [First edition].
- Physical Description
- 1 online resource (1 video file (1 hr., 59 min.)) : sound, color
- Additional Creators
- Gancarski, Michal and O'Reilly (Firm)
Access Online
- Summary
- Apache Iceberg is a transactional table format designed for handling large analytical datasets and has become an industry standard for storing data in object stores and distributed file systems. Iceberg not only ensures data correctness but also allows data engineers to simplify existing architectures, enhance the efficiency of data processing jobs, and unlock new use cases for data lakes and data meshes. This on-demand course offers a comprehensive overview of Apache Iceberg. Through concise presentations and interactive assessments, the course explores how Iceberg functions and highlights its key features, including ACID transactions, time travel, and flexible schema evolution. Learners will discover how Iceberg can be used to streamline the design, implementation, and operation of data pipelines and storage systems, based on the principles of the data lakehouse. Additionally, the course demonstrates how to integrate Iceberg with popular query and processing engines such as Trino and Apache Spark.
- Subject(s)
- Genre(s)
- Duration
- ["01:59:00"]
- Sound Characteristics
- digital
- Digital File Characteristics
- video file
- Form of work
- Instructional films
- Participant/Performer Note
- Michal Gancarski, instructor.
View MARC record | catkey: 46750333