Apache Kafka

A distributed messaging system: Apache Kafka is publish-subscribe messaging rethought as a distributed commit log.

Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system, but with a unique design.

• Kafka maintains feeds of messages in categories called topics.
• We'll call processes that publish messages to a Kafka topic producers.
• We'll call processes that subscribe to topics and process the feed of published messages consumers.
• Kafka is run as a cluster comprised of one or more servers, each of which is called a broker.
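
To make the topic/partition/replica vocabulary concrete, here is a minimal sketch that creates a topic programmatically. It uses the Kafka AdminClient, which comes from newer client versions than the ZooKeeper-era example later in this document; the broker address and topic name are assumptions for illustration.

import java.util.{Collections, Properties}
import org.apache.kafka.clients.admin.{AdminClient, AdminClientConfig, NewTopic}

object CreateTopicSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    // Assumed broker address; point this at any broker in the cluster.
    props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092")
    val admin = AdminClient.create(props)

    // A topic split into 3 partitions, each kept on 1 broker (replication factor 1).
    val topic = new NewTopic("my-topic", 3, 1.toShort)
    admin.createTopics(Collections.singleton(topic)).all().get()

    admin.close()
  }
}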
Producers
Producers publish data to the topics of their choice. The producer is responsible for choosing
which message to assign to which partition within the topic. This can be done in a round-robin
fashion simply to balance load or it can be done according to some semantic partition function
(say based on some key in the message). More on the use of partitioning in a second.
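
A minimal sketch of both partitioning strategies, assuming a broker at localhost:9092 and a topic named my-topic (both hypothetical):

import java.util.Properties
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerConfig, ProducerRecord}

object PartitioningSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092") // assumed broker
    props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringSerializer")
    props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringSerializer")
    val producer = new KafkaProducer[String, String](props)

    // Null key: the producer spreads records across partitions to balance load.
    producer.send(new ProducerRecord[String, String]("my-topic", "unkeyed record"))

    // Non-null key: records are hashed by key, so every record for "user-42"
    // lands in the same partition, a simple semantic partition function.
    producer.send(new ProducerRecord[String, String]("my-topic", "user-42", "keyed record"))

    producer.close()
  }
}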

Consumers
Messaging traditionally has two models: queuing and publish-subscribe. In a queue, a pool of consumers may read from a server and each message goes to one of them; in publish-subscribe, the message is broadcast to all consumers. Kafka offers a single consumer abstraction that generalizes both of these: the consumer group.
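
A minimal consumer sketch, assuming the newer Java client API (poll(Duration) needs clients 2.0+), a broker at localhost:9092, and a topic my-topic. Every instance started with the same group.id splits the topic's partitions among themselves (queuing); instances with different group.ids each see every message (publish-subscribe).

import java.time.Duration
import java.util.{Arrays, Properties}
import org.apache.kafka.clients.consumer.{ConsumerConfig, KafkaConsumer}

object ConsumerGroupSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092") // assumed broker
    props.put(ConsumerConfig.GROUP_ID_CONFIG, "my-consumer-group")       // assumed group name
    props.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringDeserializer")
    props.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringDeserializer")
    val consumer = new KafkaConsumer[String, String](props)
    consumer.subscribe(Arrays.asList("my-topic"))

    // Poll forever, printing where each record came from.
    while (true) {
      val records = consumer.poll(Duration.ofMillis(500)).iterator()
      while (records.hasNext) {
        val r = records.next()
        println(s"partition=${r.partition} offset=${r.offset} value=${r.value}")
      }
    }
  }
}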
Use Case: Integration with Spark Streaming
import java.util.HashMap
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerConfig, ProducerRecord}
import org.apache.spark.SparkConf
import org.apache.spark.streaming._
import org.apache.spark.streaming.kafka._
/**
 * Consumes messages from one or more topics in Kafka and does wordcount.
 * Usage: KafkaWordCount <zkQuorum> <group> <topics> <numThreads>
 *   <zkQuorum> is a list of one or more ZooKeeper servers that make up the quorum
 *   <group> is the name of the Kafka consumer group
 *   <topics> is a list of one or more Kafka topics to consume from
 *   <numThreads> is the number of threads the Kafka consumer should use
 *
 * Example:
 *   `$ bin/run-example \
 *     org.apache.spark.examples.streaming.KafkaWordCount zoo01,zoo02,zoo03 \
 *     my-consumer-group topic1,topic2 1`
 */
object KafkaWordCount {
  def main(args: Array[String]) {
    if (args.length < 4) {
      System.err.println("Usage: KafkaWordCount <zkQuorum> <group> <topics> <numThreads>")
      System.exit(1)
    }

    // StreamingExamples is a logging helper bundled with the Spark examples package.
    StreamingExamples.setStreamingLogLevels()

    val Array(zkQuorum, group, topics, numThreads) = args
    val sparkConf = new SparkConf().setAppName("KafkaWordCount")
    // 2-second batch interval; checkpointing is required by the stateful
    // reduceByKeyAndWindow below.
    val ssc = new StreamingContext(sparkConf, Seconds(2))
    ssc.checkpoint("checkpoint")

    // Map each topic to the number of consumer threads reading it.
    val topicMap = topics.split(",").map((_, numThreads.toInt)).toMap
    val lines = KafkaUtils.createStream(ssc, zkQuorum, group, topicMap).map(_._2)
    val words = lines.flatMap(_.split(" "))
    // Count words over a 10-minute sliding window, updated every 2 seconds.
    val wordCounts = words.map(x => (x, 1L))
      .reduceByKeyAndWindow(_ + _, _ - _, Minutes(10), Seconds(2), 2)
    wordCounts.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
// Produces messages of random digits (0 to 9), <wordsPerMessage> digits per message.
object KafkaWordCountProducer {
  def main(args: Array[String]) {
    if (args.length < 4) {
      System.err.println("Usage: KafkaWordCountProducer <metadataBrokerList> <topic> " +
        "<messagesPerSec> <wordsPerMessage>")
      System.exit(1)
    }

    val Array(brokers, topic, messagesPerSec, wordsPerMessage) = args

    // Kafka producer connection properties (broker list, not ZooKeeper)
    val props = new HashMap[String, Object]()
    props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, brokers)
    props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringSerializer")
    props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG,
      "org.apache.kafka.common.serialization.StringSerializer")
    val producer = new KafkaProducer[String, String](props)

    // Send messagesPerSec messages every second, forever.
    while (true) {
      (1 to messagesPerSec.toInt).foreach { messageNum =>
        val str = (1 to wordsPerMessage.toInt)
          .map(x => scala.util.Random.nextInt(10).toString)
          .mkString(" ")
        // Null key, so the producer balances records across partitions.
        val message = new ProducerRecord[String, String](topic, null, str)
        producer.send(message)
      }
      Thread.sleep(1000)
    }
  }
}
