Compiling Yahoo Streaming Benchmark
- Download the Yahoo Streaming Benchmark source package.
  wget https://github.com/dataArtisans/yahoo-streaming-benchmark/archive/master.zip
- Decompress the source package.
  unzip master.zip
- Go to the source code directory yahoo-streaming-benchmark-master.
  cd yahoo-streaming-benchmark-master
- Modify the stream-bench.sh file.
  - Open the file.
    vi stream-bench.sh
  - Press i to enter the insert mode and modify the file as follows:
    #Fetch Kafka
    KAFKA_FILE="$KAFKA_DIR.tgz"
    fetch_untar_file "$KAFKA_FILE" "https://archive.apache.org/dist/kafka/$KAFKA_VERSION/$KAFKA_FILE"
    #Fetch Storm
    STORM_FILE="$STORM_DIR.tar.gz"
    fetch_untar_file "$STORM_FILE" "https://archive.apache.org/dist/storm/$STORM_DIR/$STORM_FILE"
    #Fetch Flink
    FLINK_FILE="$FLINK_DIR-bin-hadoop27-scala_${SCALA_BIN_VERSION}.tgz"
    fetch_untar_file "$FLINK_FILE" "https://archive.apache.org/dist/flink/flink-$FLINK_VERSION/$FLINK_FILE"
    #Fetch Spark
    SPARK_FILE="$SPARK_DIR.tgz"
    fetch_untar_file "$SPARK_FILE" "https://archive.apache.org/dist/spark/spark-$SPARK_VERSION/$SPARK_FILE"
  - Press Esc, type :wq!, and press Enter to save the file and exit.
- Create a code repository and track all code.
  git init
  git add *
- Run the compilation script.
  sh stream-bench.sh SETUP
Figure 1 Spark TAR file downloaded
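To see what the modified download lines resolve to, the following is a minimal sketch that expands the same URL patterns and reports whether each archive is already on disk. The version numbers, the *_DIR naming patterns, the download-cache directory, and the report helper are all assumptions made for illustration; take the real values from the top of stream-bench.sh before relying on the output.

```shell
#!/bin/sh
# Illustrative values only (assumptions); the real ones are set in stream-bench.sh.
KAFKA_VERSION="0.8.2.1"
SCALA_BIN_VERSION="2.10"
STORM_VERSION="0.9.7"
FLINK_VERSION="1.1.3"
SPARK_VERSION="1.6.2"

# Assumed directory-name patterns (hypothetical; verify against the script).
KAFKA_DIR="kafka_${SCALA_BIN_VERSION}-${KAFKA_VERSION}"
STORM_DIR="apache-storm-${STORM_VERSION}"
FLINK_DIR="flink-${FLINK_VERSION}"
SPARK_DIR="spark-${SPARK_VERSION}-bin-hadoop2.6"

# Same file-name and URL patterns as the modified section of stream-bench.sh.
KAFKA_FILE="$KAFKA_DIR.tgz"
KAFKA_URL="https://archive.apache.org/dist/kafka/$KAFKA_VERSION/$KAFKA_FILE"
STORM_FILE="$STORM_DIR.tar.gz"
STORM_URL="https://archive.apache.org/dist/storm/$STORM_DIR/$STORM_FILE"
FLINK_FILE="$FLINK_DIR-bin-hadoop27-scala_${SCALA_BIN_VERSION}.tgz"
FLINK_URL="https://archive.apache.org/dist/flink/flink-$FLINK_VERSION/$FLINK_FILE"
SPARK_FILE="$SPARK_DIR.tgz"
SPARK_URL="https://archive.apache.org/dist/spark/spark-$SPARK_VERSION/$SPARK_FILE"

# Assumed location where the setup step stores downloaded archives.
CACHE_DIR="${CACHE_DIR:-download-cache}"

# Hypothetical helper: $1 = archive file name, $2 = download URL.
# Prints whether the archive exists in CACHE_DIR, with the URL if it does not.
report() {
  if [ -f "$CACHE_DIR/$1" ]; then
    echo "present  $1"
  else
    echo "missing  $1  ($2)"
  fi
}

report "$KAFKA_FILE" "$KAFKA_URL"
report "$STORM_FILE" "$STORM_URL"
report "$FLINK_FILE" "$FLINK_URL"
report "$SPARK_FILE" "$SPARK_URL"
```

Running the sketch from the source code directory after the SETUP step lists any archive that was not downloaded, together with the URL it would have been fetched from. Opening that URL in a browser is a quick way to check whether the version in the script still exists on archive.apache.org.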