Resources
Learning Resources
Slide
Discover Apache Doris - Technical Overview

Discover Apache Doris - Technical Overview

Apache Doris is an open-source real-time data warehouse. It supports both batch import and stream writing, and is capable of reporting, ad-hoc analysis, federated queries, and log analysis, so it can be used for dashboarding, self-service BI, A/B testing, and user behavior analysis, etc. As a real-time OLAP engine, Apache Doris hasn a competitive edge in query speed. According to the TPC-H and SSB-Flat benchmarking results, Doris can deliver much faster performance than Presto, Greenplum, and ClickHouse. Behind the fast speed of Apache Doris is the architectural design, features, and mechanisms that contribute to the performance of Doris. For example, it has a cost-based optimizer, a fully vectorized execution engine, and a data-driven query engine. Apache Doris provides a wide range of data ingestion methods and data update operations. It allows cross-cluster replication, fine-grained privilege control, and workload isolation. It also has native support for semi-structured data, and enables tiered storage of hot and cold data to reduce storage costs for users.