Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.idea		.idea
__MACOSX		__MACOSX
screenshots		screenshots
.gitattributes		.gitattributes
README.md		README.md
data_stream.py		data_stream.py
kafka_server.py		kafka_server.py
producer_server.py		producer_server.py
radio_code.json		radio_code.json
requirements.txt		requirements.txt
start.sh		start.sh

Repository files navigation

Crime-Statistics-with-Spark-Streaming

Q. How did changing values on the SparkSession property parameters affect the throughput and latency of the data?

A. By checking processedRowsPerSecond

Q. What were the 2-3 most efficient SparkSession property key/value pairs? Through testing multiple variations on values, how can you tell these were the most optimal?

A.

spark.streaming.kafka.maxRatePerPartition
spark.sql.shuffle.partitions
spark.default.parallelism

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages