Compiling Yahoo Streaming Benchmark

Download the Yahoo Streaming Benchmark source package.

       
            wget https://github.com/dataArtisans/yahoo-streaming-benchmark/archive/master.zip

Decompress the source package.
1

unzip master.zip
Go to the source code directory yahoo-streaming-benchmark-master.
1

cd yahoo-streaming-benchmark-master

Modify the stream-bench.sh file.

Open the file.
1

vi stream-bench.sh

Press i to enter the insert mode and modify the file as follows:

         
              #Fetch Kafka
KAFKA_FILE="$KAFKA_DIR.tgz"
fetch_untar_file "$KAFKA_FILE" "https://archive.apache.org/dist/kafka/$KAFKA_VERSION/$KAFKA_FILE"
#Fetch Storm
STORM_FILE="$STORM_DIR.tar.gz"
fetch_untar_file "$STORM_FILE" "https://archive.apache.org/dist/storm/$STORM_DIR/$STORM_FILE"
#Fetch Flink
FLINK_FILE="$FLINK_DIR-bin-hadoop27-scala_${SCALA_BIN_VERSION}.tgz"
fetch_untar_file "$FLINK_FILE" "https://archive.apache.org/dist/flink/flink-$FLINK_VERSION/$FLINK_FILE"
#Fetch Spark
SPARK_FILE="$SPARK_DIR.tgz"
fetch_untar_file "$SPARK_FILE" "https://archive.apache.org/dist/spark/spark-$SPARK_VERSION/$SPARK_FILE"

Press Esc, type :wq!, and press Enter to save the file and exit.

Create a code repository and track all code.
1 2

git init git add *
Run the compilation script.
1

sh stream-bench.sh SETUP
Figure 1 Spark TAR file downloaded

Parent topic: Yahoo Streaming Benchmark Usage Guidance (for Flink)