Skip to content

Apache Spark Streaming

Apache Spark Streaming is Spark's streaming API for processing live data streams.

When To Use It

  • You already use Spark and want stream processing in the same ecosystem.
  • You need micro-batch processing rather than sub-second event handling.

Practical Notes

  • For newer Spark work, Structured Streaming is usually the better default.
  • Keep the streaming pipeline close to the batch pipeline when possible.