Compare and Compute: High-Performance Computing with Alibaba Cloud

Weighing up the essentials of high-performance, compute-intensive, robust, and reliable technology stacks built on Alibaba Cloud’s cloud compute platforms and solutions.

With China poised to reach exascale supercomputing capability this year[1], it may come as no surprise that Alibaba Cloud offers an impressive range of high-performance, next-generation cloud compute products and services. Our compute products support enterprise-scale, resource-intensive systems that run artificial intelligence (AI) and machine learning (ML) software, video and audio processing, and big data computation, simulation, modelling, and analysis.

Even the compute behemoths of old are facing up to the stiff competition coming from the likes of Alibaba Cloud[2], and some are now teaming up with Alibaba Cloud to strengthen and enhance their own compute product offerings[3].

It’s All about the Hardware

In this blog we’ll take a look at two of Alibaba Cloud’s high-end compute products: the Super Computing Cluster (SCC) and Elastic-High Performance Computing (E-HPC), an all-in-one public cloud High Performance Computing as a Service (HPCaaS) product.

Super Computing Cluster (SCC)

ECS Bare Metal (EBM) instances offer the same elasticity as virtual servers together with the high-performance features of physical servers, including physical isolation. They communicate over a high-speed Remote Direct Memory Access (RDMA) over Converged Ethernet (RoCE)[6] network, which rivals the speed of InfiniBand[7] and provides high bandwidth and low latency for compute-intensive tasks. Optional plug-in GPU accelerators combine with the CPUs to meet extraordinary compute requirements, and the Nvidia GPUDirect RDMA option[8] is available for GPU workloads.

SCC eliminates network bottlenecks, significantly improving cluster acceleration. EBM cluster nodes can be deployed in minutes and scaled up or down just as quickly.

Alibaba Cloud’s Super Computing Cluster

Alibaba Cloud’s SCC has flexible Pay-As-You-Go billing and is also highly secure.

Elastic-High Performance Computing (E-HPC)

E-HPC is an end-to-end public cloud service, also known as a High Performance Compute as a Service (HPCaaS) cloud computing platform[9]. E-HPC includes the following features:

  1. Parallel scheduling with open source solutions PBS Pro and Slurm.
  2. Load-balanced cluster auto-scaling.
  3. Parallel communication based on the VPC and RoCE network architectures.
  4. Alibaba Cloud CloudMetrics for monitoring.[10]
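To make the scheduling item above a little more concrete, here is a minimal sketch of how a job for a Slurm-managed cluster might be described. It only builds the text of a batch script using standard Slurm directives (`--job-name`, `--nodes`, `--ntasks-per-node`, `--time`) and the `srun` launcher. The function name, the job name, and the `./solver` command are hypothetical, and nothing here is specific to E-HPC or taken from its documentation.

```python
def slurm_script(job_name, nodes, tasks_per_node, walltime, command):
    """Return the text of a Slurm batch script for a parallel job.

    All argument values are illustrative assumptions; adjust them to
    whatever the target cluster actually provides.
    """
    if nodes < 1 or tasks_per_node < 1:
        raise ValueError("nodes and tasks_per_node must be positive")
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --nodes={nodes}",                      # number of cluster nodes
        f"#SBATCH --ntasks-per-node={tasks_per_node}",   # parallel tasks per node
        f"#SBATCH --time={walltime}",                    # wall-clock limit, HH:MM:SS
        "",
        f"srun {command}",  # srun starts the command once per allocated task
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical job: 4 nodes with 16 tasks each, 2-hour limit.
    print(slurm_script("wind-tunnel", 4, 16, "02:00:00", "./solver input.cfg"))
```

The resulting text would typically be saved to a file and submitted with `sbatch`; the scheduler then decides when and where the tasks run.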

Alibaba Cloud’s Super Computing Cluster and E-HPC solutions have similarly cutting-edge specifications, although E-HPC misses out on the advantages of EBM instance isolation. Even so, E-HPC delivers comparable performance and is a good choice if you don’t need the extra assurance of isolated physical instances.

Like SCC, E-HPC is built from high-performance elastic instances (Intel Skylake CPU), the RoCE v2 RDMA network, and Nvidia P100/V100 GPUDirect options. Together, these deliver high network and compute performance, low latency, availability, and reliability.

Like SCC, E-HPC integrates with a range of Alibaba Cloud products and services to boost and expand the technology stack. You can also combine SCC with E-HPC to get the most out of both. To demonstrate this, Alibaba Cloud engineers and developers compared a physical automotive wind tunnel test with a digitally simulated wind tunnel running on SCC and E-HPC at the 2018 Computing Conference held in Hangzhou.[11]

How do SCC and E-HPC Stack Up against Regular Physical and Virtual Servers?

Comparison of ECS Bare Metal Instances, physical machines, and virtual machines[12]

Exploring Manual Options for Building an E-HPC or SCC Solution

Building a solution to match SCC or E-HPC by hand from individual Alibaba Cloud products and services would not be without complications. Remember, Alibaba Cloud’s plug-and-play products have eliminated much of the pain of building and managing infrastructure, such as upgrade concerns, security, and software license management.

Safe, Secure, Supported

You also have the choice of enhanced plug-in security defenses from the Alibaba Cloud security product range, as well as a wide range of support options.

Documentation and Community

Amongst Friends

With many regional discounts and special offers, is now the time to move to Alibaba Cloud?

References

  1. https://www.forbes.com/sites/paulteich/2019/11/27/dell-hpe-ibm-and-lenovo-face-competition-from-cloud-based-supercomputing/
  2. https://www.computerweekly.com/news/252449146/Alibaba-Cloud-and-Intel-team-up-on-IoT
  3. https://www.chinadaily.com.cn/a/201909/25/WS5d8b5150a310cf3e3556d75b.html
  4. https://www.alibabacloud.com/product/scc
  5. https://en.wikipedia.org/wiki/RDMA_over_Converged_Ethernet
  6. https://en.wikipedia.org/wiki/InfiniBand
  7. https://docs.nvidia.com/cuda/gpudirect-rdma/index.html
  8. https://www.alibabacloud.com/help/doc-detail/57680.htm
  9. https://www.alibabacloud.com/blog/improving-automotive-simulation-efficiency-with-alibaba-cloud-e-hpc_594144
  10. https://www.alibabacloud.com/the-computing-conference-2018
  11. https://www.alibabacloud.com/help/doc-detail/60576.htm
