Apache Sedona

Data analysis software From Wikipedia, the free encyclopedia

Apache Sedona (formerly GeoSpark) is an open-source framework designed for processing and analyzing large-scale spatial data in a distributed computing environment.[1][2] It originated as GeoSpark in 2010 by researchers at Arizona State University[3] and later entered incubation with the Apache Software Foundation in 2020. It graduated as a top-level project in February 2023.[4]

Other namesGeoSpark
Original authorsJia Yu, Mohamed Sarwat
DeveloperApache Software Foundation
Quick facts Other names, Original authors ...
Apache Sedona
Other namesGeoSpark
Original authorsJia Yu, Mohamed Sarwat
DeveloperApache Software Foundation
Initial releaseDecember 10, 2017; 8 years ago (2017-12-10)
Available inScala, Java, SQL, Python, R,
LicenseApache 2.0
Websitesedona.apache.org
Repositoryhttps://github.com/apache/sedona
Close

Overview

Sedona is a framework that facilities distributed geospatial data processing. It integrates with Apache Spark, Apache Flink, Snowflake[5][6] and includes Spatial Datasets and Spatial SQL functions to loading, processing, and analyzing large-scale geospatial data across systems.[7] It supports spatial data formats, including GeoJSON, Well Known Text and Well-Known Binary[8][9] as well as multiple coding languages, including Java, Python, R, Scala, and SQL.[10][11]

History

The project was initiated as GeoSpark by Jia Yu and Mohamed "Mo" Sarwart at Arizona State University in 2010.[12] In 2020, the project was submitted to the Apache Software Foundation[13] and graduated in 2023.

See also

References

Related Articles

Wikiwand AI