Apache Kylin
From Wikipedia, the free encyclopedia
| Apache Kylin | |
|---|---|
| Developer | Apache Kylin Committee |
| Initial release | June 10, 2015[1] |
| Stable release | |
| Written in | Java |
| License | Apache License 2.0 |
| Website | kylin |
| Repository | Kylin Repository |
Apache Kylin is an open source distributed analytics engine designed to provide a SQL interface and multi-dimensional analysis (OLAP) on Hadoop and Alluxio supporting extremely large datasets.
It was originally developed by eBay, and is now a project of the Apache Software Foundation.[3]
The Kylin project was started in 2013, in eBay's R&D in Shanghai, China. In Oct 2014, Kylin v0.6 was open sourced on GitHub.com with the name "KylinOLAP".[4]
In November 2014, Kylin joined Apache Software Foundation incubator.
In December 2015, Apache Kylin graduated to be a Top Level Project.[3]
In March 2016, Kyligence, Inc. was founded by the creators of Apache Kylin.[5][6] Kyligence provides a commercial analytics platform based on Apache Kylin for on-premise and cloud-based datasets.[7]
Architecture
Apache Kylin is built on top of Apache Hadoop, Apache Hive, Apache HBase, Apache Parquet, Apache Calcite, Apache Spark and other technologies.[8] These technologies enable Kylin to easily scale to support massive data loads.[9]
Kylin has the following core components:[10][8]
- REST Server: Receive and response user or API requests
- Metadata: Persistent and manage system, especially the cube metadata;
- Query Engine: Parse SQL queries to execution plan, and then talk with storage engine;
- Storage Engine: Pushdown and scan underlying cube storage (default in HBase);
- Job Engine: Generate and execute MapReduce or Spark job to build source data into cube;