Article · Wikipedia archive · Last revised May 28, 2026

Apache Samza

Apache Samza is an open-source, near-realtime, asynchronous computational framework for stream processing developed by the Apache Software Foundation in Scala and Java. It has been developed in conjunction with Apache Kafka. Both were originally developed by LinkedIn.

Last revised
May 28, 2026
Read time
≈ 1 min
Length
266 w
Citations
8
Source
Apache Samza
Original authorLinkedIn
DeveloperApache Software Foundation
Stable release
1.8.0 / 17 January 2023 (2023-01-17)1
Written inScala, Java
Operating systemCross-platform
TypeDistributed stream processing
LicenseApache License 2.0
Websitesamza.apache.org
RepositorySamza Repository

Apache Samza is an open-source, near-realtime, asynchronous computational framework for stream processing developed by the Apache Software Foundation in Scala and Java. It has been developed in conjunction with Apache Kafka. Both were originally developed by LinkedIn.2

Overview

Samza allows users to build stateful applications that process data in real-time from multiple sources including Apache Kafka.

Samza provides fault tolerance, isolation and stateful processing. Unlike batch systems such as Apache Hadoop or Apache Spark, it provides continuous computation and output, which result in sub-second3 response times.

There are many players in the field of real-time stream processing and Samza is one of the mature products.456 It was added to Apache in 2013.7

Samza is used by multiple companies.8 The biggest installation is in LinkedIn.

See also

See also

References

References

External links