没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
Apache Pulsar™ vs. Apache Kafka
Ⓡ
2022 Benchmark
Table of Contents
Executive Summary 3
○ Figure 1: Pulsar vs. Kafka Monthly Active Contributors 3
I. Key Benchmark Findings 4
Benchmark Tests 5
I. What We Tested 5
A. Maximum Sustainable Throughput 5
B. Publish Latency at a Fixed Throughput 6
C. Catch-up Reads / Backlog Draining 6
II. How We Set up the Tests 6
III. Benchmark Tests & Results 8
A. Test #1: Maximum Throughput 8
1. Test #1 / Case #1: Maximum Throughput with 1 Partition 8
a. Case #1 Results: Maximum Throughput with 1 Partition 9
○ Figure 2: Single Partition Max Write Throughput 9
b. Case #1 Analysis 10
2. Test #1 / Case #2: Maximum Throughput with 100 Partitions 10
a. Case #2 Results: Maximum Throughput with 100 Partitions 11
○ Figure 3: 100 Partitions Max Write Throughput 11
b. Case #2 Analysis 12
B. Test #2: Publish Latency 12
a. Test #2 Results: Publish Latency 14
○ Figure 4: 500K Rate Publish Latency Percentiles 14
b. Test #2 Analysis 15
C. Test #3: Catch-up Reads 16
a. Test #3 Results: Catch-up Reads 16
○ Figure 5a: Catch-up Read Throughput 17
○ Figure 5b: Catch-up Read Chase Time 18
○ Figure 5c: Impact Publish Latency during Catchup Read 19
b. Test #3 Analysis 19
Conclusion 20
About StreamNative 21
References 22
Copyright © StreamNative, Inc. 2022 2
Apache Pulsar™ vs. Apache Kafka
Ⓡ
2022 Benchmark
Executive Summary
As we move into 2022, the Apache Pulsar
TM
versus Apache Kafka
Ⓡ
debate continues.
Organizations often make comparisons based on features, capabilities, size of the
community, and a number of other metrics of varying importance. This report focuses
purely on comparing the technical performance based on benchmark tests.
The last widely published Pulsar versus Kafka benchmark was performed in 2020, and a lot
has happened since then. In 2021, Pulsar ranked as a Top 5 Apache Software Foundation
project and surpassed Apache Kafka in monthly active contributors as shown in the chart
below. Pulsar also averaged more monthly active contributors than Kafka for most of the
past 18 months.
Figure 1: Pulsar vs. Kafka Monthly Active Contributors
These contributions led to major performance improvements for Pulsar. To measure the
impact of the improvements, the engineering team at StreamNative, led by Matteo Merli,
one of the original creators of Apache Pulsar, Apache Pulsar PMC Chairperson, performed
a benchmark study using the Linux Foundation Open Messaging benchmark.
Copyright © StreamNative, Inc. 2022 3
Apache Pulsar™ vs. Apache Kafka
Ⓡ
2022 Benchmark
The team measured Pulsar performance in terms of throughput and latency, and then
performed the same tests on Kafka. We’ve included the testing framework and details
below and encourage anyone who is interested in validating the tests to do so.
Let’s take a look at three key findings before jumping into the full results.
I. Key Benchmark Findings
2.5x
Maximum
throughput
compared to
Kafka
Pulsar is able to achieve 2.5 times the maximum
throughput compared to Kafka. This is a significant
advantage for use cases that ingest and process large
volumes of data, such as log analysis, cybersecurity, and
sensor data collection. Higher throughput means less
hardware, resulting in lower operational costs.
100x
Lower
single-digit
publish
latency than
Kafka
Pulsar provides consistent single-digit publish latency that
is 100x lower than Kafka at P99.99 (ms). Low publish
latency is important because it enables systems to hand
off messages to a message bus quickly. Once a message is
published, the data is safe and the "action" will be
executed.
1.5x
Faster
historical
read rate
than Kafka
With a historical read rate that is 1.5 times faster than
Kafka, applications using Pulsar as their messaging
system can catch-up after an unexpected interruption in
half the time. Read throughput is critically important for
use cases such as Database Migration/Replication where
you are feeding data into a system of record.
Below we provide details on how the benchmark was performed and its results.
Copyright © StreamNative, Inc. 2022 4
Apache Pulsar™ vs. Apache Kafka
Ⓡ
2022 Benchmark
Benchmark Tests
Using the Linux Foundation Open Messaging benchmark [1], we ran the latest versions of
Apache Pulsar (2.9.1) and Apache Kafka (3.0.0). To ensure an objective baseline
comparison, each test in this Benchmark Report compares Kafka to Pulsar in two
scenarios: Pulsar with Journaling and Pulsar without Journaling.
Pulsar’s default configuration includes Journaling, which offers a higher durability
guarantee than Kafka’s default configuration. Pulsar without Journaling provides the same
durability guarantees as the default Kafka configuration, which results in an
apples-to-apples comparison.
I. What We Tested
For this benchmark, we selected a handful of tests to represent common patterns in the
messaging and streaming domains and to test the limits of each system:
A. Maximum Sustainable Throughput
This test measures the maximum data throughput the system can deliver when consumers
are keeping up with the incoming trac.
We ran this test in two scenarios to test the upper boundary performance and to test the
cost profile for each system:
1. Topic with a single partition. This scenario tests the upper boundary performance
for a total-order use case or, in the worst case, where partition keys’ data is skewed.
At some scale, the design of a system that relies upon single ordering or handling
large amounts of skewed data will need to be reconsidered. Pulsar has the ability to
handle situations where total ordering is required at higher scale or large amounts
of skew arise.
2. Topic with 100 partitions. With more partitions to stress available resources, this
test illustrates how well a system scales horizontally (by adding more machines) and
its cost effectiveness. For example, by modeling the hardware cost per 1GB/s of
trac, it is easy to derive the cost profile for each system.
Copyright © StreamNative, Inc. 2022 5
剩余21页未读,继续阅读
资源评论
Julywhj
- 粉丝: 107
- 资源: 20
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功