Introduction

This document describes how to port Spark from the x86 platform to the Kunpeng Arm platform. It covers dependency installation, configuration adjustment, and troubleshooting.

This document uses Spark 2.3.0 as an example for illustration.

Spark Overview

Spark is a unified analytics engine for large-scale data processing. It features scalable architecture and in-memory computing capabilities. It supports multiple scenarios such as real-time streaming, machine learning, and interactive query. Spark has evolved into a unified, lightweight platform for fast big data processing, supporting deployment across various storage and operating systems.

For more information about Spark, visit the Spark official website.

Programming language: Scala

Brief description: engine for large-scale data computing

Background

During the migration of enterprise IT architectures from the x86 platform to the Kunpeng platform, Spark faces challenges in architecture adaptation, performance optimization, and ecosystem compatibility. The Kunpeng platform provides high-performance with minimal power consumption using its advanced processors. It excels in energy efficiency and handles multiple tasks simultaneously with ease.

Intended Audience

This document is designed to guide users in smoothly migrating Spark from the x86 platform to the Kunpeng platform, improving its performance, and supporting enterprise technology upgrades. It is intended for the audiences who possess the following qualifications:

A foundational understanding of big data, with the ability to develop, compile, and run open source Spark for big data scenarios, as well as experience in cluster performance tuning and troubleshooting.
Proficiency in Scala and functional programming, alongside hands-on experience with Linux systems and cluster management.
A solid grasp of Spark's core components and its execution principles.

Parent topic: Porting Guide (CentOS & openEuler)