Skip to content
Youngwoo Kim edited this page Mar 7, 2019 · 83 revisions

Apache Kafka

Consumers and Consumer Groups

Consumers read from any single partition, allowing you to scale throughput of message consumption in a similar fashion to message production. Consumers can also be organized into consumer groups for a given topic — each consumer within the group reads from a unique partition and the group as a whole consumes all messages from the entire topic. If you have more consumers than partitions then some consumers will be idle because they have no partitions to read from. If you have more partitions than consumers then consumers will receive messages from multiple partitions. If you have equal numbers of consumers and partitions, each consumer reads messages in order from exactly one partition.

Mirroring (replication)

Cluster topology (Hierarchy)

Docker

git clone https://github.com/youngwookim/kafka-stack-docker-compose.git

git tag -l
v3.2.0
v3.2.1
v3.3.0
v3.3.1
v4.0.0
v4.1.0
v4.1.2
v5.0.0
v5.1.0

git checkout tags/v4.1.2
docker-compose -f zk-single-kafka-single.yml up
docker-compose -f zk-single-kafka-single.yml down

or

docker-compose -f full-stack.yml up
docker-compose -f full-stack.yml down

Single Zookeeper: $DOCKER_HOST_IP:2181
Single Kafka: $DOCKER_HOST_IP:9092
Kafka Schema Registry: $DOCKER_HOST_IP:8081
Kafka Schema Registry UI: $DOCKER_HOST_IP:8001
Kafka Rest Proxy: $DOCKER_HOST_IP:8082
Kafka Topics UI: $DOCKER_HOST_IP:8000
Kafka Connect: $DOCKER_HOST_IP:8083
Kafka Connect UI: $DOCKER_HOST_IP:8003
Zoonavigator Web: $DOCKER_HOST_IP:8004

Benchmark

Kafka Streams (+ Avro)

Tuning Kafka

KSQL

Kafka connect

Kafka Consumer client

Kafka REST Proxy

Kafka, Spark and Avro Integration

Kafka Schema Registry

The compatibility checks value is one of the following:

NONE - don’t check for schema compatibility
FORWARD - check to make sure last schema version is forward compatible with new schemas
BACKWARDS (default) - make sure new schema is backwards compatible with latest
FULL - make sure new schema is forwards and backwards compatible from latest to new and from new to latest

Unit Test

https://github.com/confluentinc/kafka-streams-examples/blob/4710dbf7017666157ce01de4108b57da41399c28/src/test/java/io/confluent/examples/streams/kafka/EmbeddedSingleNodeKafkaCluster.java

<dependency>
			<groupId>io.confluent</groupId>
			<artifactId>kafka-streams-examples</artifactId>
			<version>4.1.0</version>
			<!-- Required for e.g. schema registry's RestApp -->
			<classifier>tests</classifier>
			<scope>test</scope>
		</dependency>

Kafka OPS

1. Alter configs for Kafka topics
# List topics
kafka-topics.sh --list --zookeeper localhost:2181

# Describe the topic
kafka-configs.sh --zookeeper localhost:2181 --entity-type topics --entity-name TOPIC_NAME

# The default retention time is 24 hours (86400000 millis)
kafka-configs.sh --zookeeper localhost:2181 --entity-type topics --entity-name TOPIC_NAME --alter --add-config retention.ms=5184000000


https://docs.confluent.io/current/kafka/operations.html

Monitoring

Tips

zookeeper:

zookeeper.connect=zk1:2181,zk2:2181,zk3:2181/kafka

Examples

Cheatsheet

Clone this wiki locally