windows下kafka_2.12-2.9.0.rar (Kafka for Windows, with single-machine pseudo-distributed configuration) - CSDN Library resource

629 files in total

jar: 99

leader-epoch-checkpoint: 95

metadata: 88

kafka

distributed

windows

zookeeper

Points required: 5 · 111 views · Uploaded 2022-04-12 16:45:28 · 67.91 MB · RAR

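The title "windows下kafka_2.12-2.9.0.rar" refers to an Apache Kafka distribution packaged for use on Windows. Kafka is an open-source stream-processing platform that was developed at LinkedIn and donated to the Apache Software Foundation. It is designed for building real-time data pipelines and streaming applications and can handle large volumes of real-time data. In a name of the form kafka_2.12-x.y.z, 2.12 is the Scala version the build was compiled against and x.y.z is the Kafka version. Although the archive name says 2.9.0, the bundled files (for example kafka_2.12-2.8.0.jar.asc) and the included KRaft README both point to Kafka 2.8.0, so the contents should be treated as a 2.8.0 release.

The phrase "with single-machine pseudo-distributed configuration" in the description means the archive includes configuration files that let Kafka simulate a distributed environment on one machine. A real Kafka cluster normally runs on several servers, but for testing or learning a pseudo-distributed setup on a single machine lets developers bring up a working instance locally without complex network configuration.

The tags "kafka", "distributed", "windows", and "zookeeper" name the key technologies involved. Kafka's distributed design lets it spread and store messages across multiple nodes. Deploying on Windows can take a few extra steps because Kafka's primary tooling targets Unix-like systems; on Windows the .bat counterparts in the archive (such as zookeeper-server-start.bat and kafka-server-start.bat) are used instead of the shell scripts. ZooKeeper is an open-source Apache project that provides distributed coordination. In ZooKeeper mode, Kafka uses it to store cluster metadata and to elect the controller.

Although "cloud native" does not appear in the title or description, the term generally refers to applications designed for cloud environments, with an emphasis on containerization, microservices, continuous delivery, and declarative APIs. For Kafka this means it can be deployed in containers (for example with Docker) on platforms such as AWS, Azure, or Google Cloud and integrated with other cloud-native tools and services.

The "kafka_2.12-2.8.0" directory inside the archive is most likely the installation directory, that is, the extracted file structure. It should contain everything needed to run Kafka: the broker configuration (server.properties), the logging configuration (log4j.properties), the ZooKeeper configuration (zookeeper.properties in a standard Kafka distribution), and the startup scripts. After extracting the archive, adjust these configuration files for the single-machine pseudo-distributed setup, start ZooKeeper first and then Kafka, and Kafka is ready to use on Windows.

In practice, users also need to know how to create topics, producers, and consumers, and how to use the Kafka command-line tools to manage and monitor the cluster. Familiarity with replication, partitioning strategy, and high-availability settings is equally important. Understanding how Kafka integrates with other tools in the big-data ecosystem, such as Hadoop, Spark, or Flink, helps in building more capable data-processing pipelines.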
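The naming convention described above (Scala version first, then Kafka version) can be checked mechanically. The following is a minimal sketch, not part of the archive or of Kafka itself: the function name `parse_kafka_artifact` and the regular expression are assumptions made for illustration, and the pattern only covers names of the form `kafka_<scala>-<kafka>`.

```python
import re

# Assumed pattern for names such as "kafka_2.12-2.8.0":
# Scala version (major.minor), then Kafka version (major.minor.patch).
ARTIFACT_RE = re.compile(r"kafka_(?P<scala>\d+\.\d+)-(?P<kafka>\d+\.\d+\.\d+)")


def parse_kafka_artifact(name: str) -> dict:
    """Return the Scala and Kafka versions embedded in an artifact name."""
    match = ARTIFACT_RE.search(name)
    if match is None:
        raise ValueError(f"not a Kafka artifact name: {name!r}")
    return {"scala": match.group("scala"), "kafka": match.group("kafka")}


if __name__ == "__main__":
    # The archive name and a bundled file disagree on the Kafka version.
    print(parse_kafka_artifact("windows下kafka_2.12-2.9.0.rar"))
    print(parse_kafka_artifact("kafka_2.12-2.8.0.jar.asc"))
```

Running it on the archive name and on one of the bundled files makes the mismatch between the two version strings visible at a glance.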


Archive contents: windows下kafka_2.12-2.9.0.rar (with single-machine pseudo-distributed configuration), 629 files

eclipse-public-license-2.0 14KB

eclipse-distribution-license-1.0 2KB

snapshot.0 424B

log.1 64MB

CDDL+GPL-1.1 38KB

server.log.2022-04-07-10 52KB

server.log.2022-04-07-14 2.61MB

server.log.2022-04-08-09 1.2MB

server.log.2022-04-08-13 458KB

snapshot.261 29KB

log.262 64MB

snapshot.35d 29KB

log.35e 64MB

snapshot.688 30KB

log.689 64MB

snapshot.83e 29KB

argparse-MIT 1KB

kafka_2.12-2.8.0-sources.jar.asc 821B

kafka_2.12-2.8.0-test.jar.asc 821B

kafka_2.12-2.8.0.jar.asc 821B

kafka_2.12-2.8.0-test-sources.jar.asc 821B

kafka_2.12-2.8.0-javadoc.jar.asc 821B

snapshot.b3 21KB

log.b4 64MB

kafka-run-class.bat 5KB

kafka-server-start.bat 1KB

connect-distributed.bat 1KB

connect-standalone.bat 1KB

zookeeper-server-start.bat 1KB

zookeeper-shell.bat 1KB

kafka-server-stop.bat 997B

kafka-streams-application-reset.bat 972B

kafka-producer-perf-test.bat 940B

kafka-consumer-perf-test.bat 938B

kafka-console-consumer.bat 925B

kafka-console-producer.bat 925B

zookeeper-server-stop.bat 905B

kafka-preferred-replica-election.bat 900B

kafka-reassign-partitions.bat 888B

kafka-replica-verification.bat 886B

kafka-delegation-tokens.bat 885B

kafka-broker-api-versions.bat 885B

kafka-leader-election.bat 884B

kafka-delete-records.bat 883B

kafka-consumer-groups.bat 883B

kafka-dump-log.bat 878B

kafka-log-dirs.bat 877B

kafka-configs.bat 876B

kafka-topics.bat 875B

kafka-mirror-maker.bat 874B

kafka-acls.bat 873B

cleaner-offset-checkpoint 6B

trogdor.conf 1KB

DWTFYWTPL 484B

00000000000000000000.index 10MB


KRaft (aka KIP-500) mode Early Access Release
=========================================================

# Introduction
It is now possible to run Apache Kafka without Apache ZooKeeper! We call this the [Kafka Raft metadata mode](https://cwiki.apache.org/confluence/display/KAFKA/KIP-500%3A+Replace+ZooKeeper+with+a+Self-Managed+Metadata+Quorum), typically shortened to `KRaft mode`. `KRaft` is intended to be pronounced like `craft` (as in `craftsmanship`). It is currently *EARLY ACCESS AND SHOULD NOT BE USED IN PRODUCTION*, but it is available for testing in the Kafka 2.8 release.

When the Kafka cluster is in KRaft mode, it does not store its metadata in ZooKeeper. In fact, you do not have to run ZooKeeper at all, because it stores its metadata in a KRaft quorum of controller nodes.

KRaft mode has many benefits -- some obvious, and some not so obvious. Clearly, it is nice to manage and configure one service rather than two services. In addition, you can now run a single process Kafka cluster. Most important of all, KRaft mode is more scalable. We expect to be able to [support many more topics and partitions](https://www.confluent.io/kafka-summit-san-francisco-2019/kafka-needs-no-keeper/) in this mode.

# Quickstart

## Warning
KRaft mode in Kafka 2.8 is provided for testing only, *NOT* for production. We do not yet support upgrading existing ZooKeeper-based Kafka clusters into this mode. In fact, when Kafka 3.0 is released, it will not be possible to upgrade your KRaft clusters from 2.8 to 3.0. There may be bugs, including serious ones. You should *assume that your data could be lost at any time* if you try the early access release of KRaft mode.

## Generate a cluster ID
The first step is to generate an ID for your new cluster, using the kafka-storage tool:

~~~~
$ ./bin/kafka-storage.sh random-uuid
xtzWWN4bTjitpL3kfd9s5g
~~~~

## Format Storage Directories
The next step is to format your storage directories.
If you are running in single-node mode, you can do this with one command:

~~~~
$ ./bin/kafka-storage.sh format -t <uuid> -c ./config/kraft/server.properties
Formatting /tmp/kraft-combined-logs
~~~~

If you are using multiple nodes, then you should run the format command on each node. Be sure to use the same cluster ID for each one.

## Start the Kafka Server
Finally, you are ready to start the Kafka server on each node.

~~~~
$ ./bin/kafka-server-start.sh ./config/kraft/server.properties
[2021-02-26 15:37:11,071] INFO Registered kafka:type=kafka.Log4jController MBean (kafka.utils.Log4jControllerRegistration$)
[2021-02-26 15:37:11,294] INFO Setting -D jdk.tls.rejectClientInitiatedRenegotiation=true to disable client-initiated TLS renegotiation (org.apache.zookeeper.common.X509Util)
[2021-02-26 15:37:11,466] INFO [Log partition=@metadata-0, dir=/tmp/kraft-combined-logs] Loading producer state till offset 0 with message format version 2 (kafka.log.Log)
[2021-02-26 15:37:11,509] INFO [raft-expiration-reaper]: Starting (kafka.raft.TimingWheelExpirationService$ExpiredOperationReaper)
[2021-02-26 15:37:11,640] INFO [RaftManager nodeId=1] Completed transition to Unattached(epoch=0, voters=[1], electionTimeoutMs=9037) (org.apache.kafka.raft.QuorumState)
...
~~~~

Just like with a ZooKeeper based broker, you can connect to port 9092 (or whatever port you configured) to perform administrative operations or produce or consume data.

~~~~
$ ./bin/kafka-topics.sh --create --topic foo --partitions 1 --replication-factor 1 --bootstrap-server localhost:9092
Created topic foo.
~~~~

# Deployment

## Controller Servers
In KRaft mode, only a small group of specially selected servers can act as controllers (unlike the ZooKeeper-based mode, where any server can become the Controller). The specially selected controller servers will participate in the metadata quorum. Each controller server is either active, or a hot standby for the current active controller server.
You will typically select 3 or 5 servers for this role, depending on factors like cost and the number of concurrent failures your system should withstand without availability impact. Just like with ZooKeeper, you must keep a majority of the controllers alive in order to maintain availability. So if you have 3 controllers, you can tolerate 1 failure; with 5 controllers, you can tolerate 2 failures.

## Process Roles
Each Kafka server now has a new configuration key called `process.roles` which can have the following values:

* If `process.roles` is set to `broker`, the server acts as a broker in KRaft mode.
* If `process.roles` is set to `controller`, the server acts as a controller in KRaft mode.
* If `process.roles` is set to `broker,controller`, the server acts as both a broker and a controller in KRaft mode.
* If `process.roles` is not set at all then we are assumed to be in ZooKeeper mode. As mentioned earlier, you can't currently transition back and forth between ZooKeeper mode and KRaft mode without reformatting.

Nodes that act as both brokers and controllers are referred to as "combined" nodes. Combined nodes are simpler to operate for simple use cases and allow you to avoid some fixed memory overheads associated with JVMs. The key disadvantage is that the controller will be less isolated from the rest of the system. For example, if activity on the broker causes an out of memory condition, the controller part of the server is not isolated from that OOM condition.

## Quorum Voters
All nodes in the system must set the `controller.quorum.voters` configuration. This identifies the quorum controller servers that should be used. All the controllers must be enumerated. This is similar to how, when using ZooKeeper, the `zookeeper.connect` configuration must contain all the ZooKeeper servers. Unlike with the ZooKeeper config, however, `controller.quorum.voters` also has IDs for each node. The format is `id1@host1:port1,id2@host2:port2`, etc.
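The `id@host:port` voter format and the majority rule described above can be illustrated with a small script. This is a sketch under stated assumptions rather than Kafka's own parsing logic: `parse_quorum_voters` and `tolerated_failures` are hypothetical helper names, and the `example.com` hostnames are placeholders.

```python
from typing import Dict, Tuple


def parse_quorum_voters(value: str) -> Dict[int, Tuple[str, int]]:
    """Parse 'id1@host1:port1,id2@host2:port2' into {id: (host, port)}.

    Hypothetical helper for illustration; Kafka performs its own validation.
    """
    voters: Dict[int, Tuple[str, int]] = {}
    for entry in value.split(","):
        node_id_text, _, endpoint = entry.strip().partition("@")
        host, _, port_text = endpoint.rpartition(":")
        if not node_id_text or not host or not port_text:
            raise ValueError(f"malformed voter entry: {entry!r}")
        node_id = int(node_id_text)
        if node_id in voters:
            raise ValueError(f"duplicate node id: {node_id}")
        voters[node_id] = (host, int(port_text))
    return voters


def tolerated_failures(controller_count: int) -> int:
    """A majority must stay alive, so (n - 1) // 2 controllers may fail."""
    return (controller_count - 1) // 2


if __name__ == "__main__":
    voters = parse_quorum_voters(
        "1@controller1.example.com:9093,"
        "2@controller2.example.com:9093,"
        "3@controller3.example.com:9093"
    )
    for node_id, (host, port) in sorted(voters.items()):
        print(node_id, host, port)
    print("tolerated failures:", tolerated_failures(len(voters)))
```

With three voters the script reports one tolerated failure, and `tolerated_failures(5)` gives two, matching the figures quoted in the Controller Servers section.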
So if you have 10 brokers and 3 controllers named controller1, controller2, controller3, you might have the following configuration on controller1:

```
process.roles=controller
node.id=1
listeners=CONTROLLER://controller1.example.com:9093
controller.quorum.voters=1@controller1.example.com:9093,2@controller2.example.com:9093,3@controller3.example.com:9093
```

Each broker and each controller must set `controller.quorum.voters`. Note that the node ID supplied in the `controller.quorum.voters` configuration must match that supplied to the server. So on controller1, `node.id` must be set to 1, and so forth. Note that there is no requirement for controller IDs to start at 0 or 1. However, the easiest and least confusing way to allocate node IDs is probably just to give each server a numeric ID, starting from 0.

Note that clients never need to configure `controller.quorum.voters`; only servers do.

## Kafka Storage Tool
As described above in the QuickStart section, you must use the `kafka-storage.sh` tool to generate a cluster ID for your new cluster, and then run the format command on each node before starting the node.

This is different from how Kafka has operated in the past. Previously, Kafka would format blank storage directories automatically, and also generate a new cluster UUID automatically. One reason for the change is that auto-formatting can sometimes obscure an error condition. For example, under UNIX, if a data directory can't be mounted, it may show up as blank. In this case, auto-formatting would be the wrong thing to do.

This is particularly important for the metadata log maintained by the controller servers. If two controllers out of three controllers were able to start with blank logs, a leader might be able to be elected with nothing in the log, which would cause all metadata to be lost.

# Missing Features
We do not yet support generating or loading KIP-630 metadata snapshots.
This means that after a while, the time required to restart a broker will become very large. This is a known issue and we are working on completi