Best Practices of ECS Container Network Multi-NIC Solution

Container-based virtualization is a type of virtualization technology. Compared with a virtual machine (VM), a container is lighter and easier to deploy. Docker is currently the mainstream container engine; it supports platforms such as Linux and Windows and works with mainstream orchestration systems such as Kubernetes (K8S) and Swarm, while rkt (Rocket) is an alternative container engine. Common container networks support multiple models such as bridge, overlay, host, and user-defined networks. Systems such as K8S rely on Container Network Interface (CNI) plug-ins for network management; commonly used CNI plug-ins include Calico and Flannel.

This article introduces the basics of container networking and presents an ECS container network solution based on Alibaba Cloud's Elastic Network Interface (ENI) technology, which offers high performance, easy deployment and maintenance, strong isolation, and high security.

Traditional Container Network Solution

CNI is an open source project managed by the Cloud Native Computing Foundation (CNCF). It defines the standard and provides the source code libraries for major vendors to develop plug-ins for Linux container network management. Well-known CNI plug-ins include Calico and Flannel. Calico implements routing protocols such as BGP through Felix and BIRD and stores network state in a distributed key-value store (etcd), building a large Layer 3 network that enables containers on different hosts, even on different subnets, to communicate without relying on ARP.
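For reference, CNI plug-ins are driven by JSON network configuration files. A minimal, illustrative configuration for the community bridge plug-in (the file name, network name, bridge name, and subnet are assumptions of this sketch) could look like this:

     ~# cat > /etc/cni/net.d/10-mynet.conf <<'EOF'
     {
         "cniVersion": "0.3.1",
         "name": "mynet",
         "type": "bridge",
         "bridge": "cni0",
         "isGateway": true,
         "ipMasq": true,
         "ipam": {
             "type": "host-local",
             "subnet": "10.22.0.0/16"
         }
     }
     EOF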

Flannel implements a container overlay network based on tunneling technologies such as VXLAN. CNI plug-ins such as Calico and Flannel use veth pairs to configure the container network: a pair of veth devices is created, with one end bound to the container and the other end to the VM. The VM then forwards container traffic through mechanisms such as the kernel network protocol stack (overlay networks), iptables rules (the Calico plug-in), or a Linux bridge. (When the container network is connected to the vSwitch through a bridge in the ECS instance, the VPC can only reach the ECS level, and the container network remains a private network behind the bridge.)
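To make the veth mechanism concrete, the following is a minimal sketch of wiring a network namespace (standing in for a container) to a Linux bridge inside the VM; the names ns1, veth0, veth1, and br0 are illustrative:

     ~# ip netns add ns1                              # namespace playing the container's role
     ~# ip link add veth0 type veth peer name veth1   # create the veth pair
     ~# ip link set veth1 netns ns1                   # one end goes into the "container"
     ~# ip link add name br0 type bridge              # bridge on the VM
     ~# ip link set veth0 master br0                  # other end attaches to the bridge
     ~# ip link set veth0 up
     ~# ip link set br0 up
     ~# ip netns exec ns1 ip link set veth1 up

Anything sent from veth1 inside the namespace appears on veth0 and is forwarded by the bridge, which is the path that bridge-based CNI plug-ins automate.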

The following figure shows the workflow of the currently mainstream container network, which differs from the multi-NIC container network in the following aspects:

  1. The message sent by a container on host 1 is transmitted through the veth pair to the Linux bridge on the VM; the Linux bridge runs its forwarding logic and sends the message through the VM's NIC to the vSwitch located on the host.
  2. The VM on host 2 receives the message from the vSwitch and, using the Linux bridge's forwarding logic, delivers it to the container through the veth pair.

In this network system, the VM internally needs the CNI plug-ins of orchestration systems such as K8S for network configuration, while the vSwitch supports communication protocols such as OpenFlow and NETCONF and is managed and configured through a Software-Defined Networking (SDN) controller. Mainstream ToR switches use the NETCONF protocol for remote configuration, and SDN physical switches that support OpenFlow are also available on the market.

As a result, two different control systems are needed to manage the entire network. The configuration is relatively complicated, certain performance bottlenecks exist due to the implementation mechanism, and security policies on the host cannot be applied to container applications.

Multi-NIC Container Network

Solution Overview

In the multi-NIC solution, each container is bound directly to an ENI, and rules such as ACLs, QoS, and session policies are applied in the vSwitch to forward traffic. When a container running on a VM on host 1 accesses a container running on a VM on host 2, the traffic generally goes through the following process (a quick way to observe the first step is sketched after the list):

  1. A network message passes through the container's core network protocol stack. After the route is looked up, the message is sent out through the container's eth0 NIC.
  2. The vSwitch on the host receives the message from the container through the virtual port and runs its forwarding logic to send the packet to the Top of Rack (ToR) switch through the physical network port. If a virtual private cloud (VPC) is established for the container or VM network, the message is encapsulated using a tunneling technology such as VXLAN.
  3. The ToR switch looks up the route and forwards the message through the physical port connected to host 2.
  4. The vSwitch on host 2 receives the message from the physical port and, through its forwarding logic, sends it to the virtual port connected to the container.
  5. The eth0 NIC in the container receives the message, which is then processed by the container's network protocol stack.
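Assuming the namespace binding shown in the implementation walkthrough below (namespace 2017), the first step of this flow can be observed from inside the container:

     ~# ip netns exec 2017 ip route show        # the route lookup that selects eth0
     ~# ip netns exec 2017 tcpdump -i eth0 -n   # watch messages leave through eth0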

Solution Features

Direct Connection to the VPC

Cross-VPC

High Performance

| Solution | Single-thread (Mbps) | Single-thread (pps) | Multi-thread (pps) | TBase test, 1 KB (QPS) |
|---|---|---|---|---|
| Linux Bridge | 32.867 | 295,980 | 2,341,669 | 363,300 |
| Multi-NIC solution | 51.389 | 469,865 | 3,851,922 | 470,900 |
| Performance improvement | 56.35% | 58.7% | 64.49% | 29.6% |

Strong Isolation

Easy Management

The multi-NIC solution also facilitates container migration. Take migrating a container to another VM on the same host as an example: K8S's kubelet module migrates the application, reconfigures the network through CNI plug-ins, manages the container IP and VIP, and reconfigures access to the container application. The whole process is complicated, but the multi-NIC solution makes it easy. After the container is scheduled to a VM, the NIC bound to the old container is unplugged from the old VM and plugged into the VM where the new container is located, where it is bound to the container's network namespace. The new container can then communicate normally with no further network reconfiguration.
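As a sketch of that migration flow, assuming the Alibaba Cloud CLI (aliyun) is configured and using placeholder ENI and instance IDs, the detach/attach steps map to the ECS DetachNetworkInterface and AttachNetworkInterface API actions:

     $ aliyun ecs DetachNetworkInterface --RegionId cn-hangzhou --NetworkInterfaceId eni-xxxxxxxx --InstanceId i-old-vm
     $ aliyun ecs AttachNetworkInterface --RegionId cn-hangzhou --NetworkInterfaceId eni-xxxxxxxx --InstanceId i-new-vm

On the VM where the new container runs, the NIC is then bound to the container's network namespace exactly as in the implementation steps below (the namespace name is a placeholder):

     ~# ip link set dev eth2 netns <container-netns>
     ~# ip netns exec <container-netns> ip link set eth2 name eth0
     ~# ip netns exec <container-netns> ip link set eth0 up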

DPDK Support

VM Multi-NIC

In a virtualized environment, the low cost and flexibility of virtual NICs greatly improve the usability of VMs. Users can dynamically allocate or release NICs as required, and hot-plug NICs into or out of a VM without affecting its normal operation. The way libvirt/qemu emulates virtual devices offers the following advantages that physical hosts cannot match:

Resource Limits

Dynamic Hot Swapping
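As an illustration, on a libvirt/qemu host a virtio NIC can be hot-plugged into and unplugged from a running VM with virsh; the domain name, source bridge, and MAC address here are illustrative:

     ~# virsh attach-interface demo-vm --type bridge --source br0 --model virtio --live
     ~# virsh detach-interface demo-vm --type bridge --mac 52:54:00:12:34:56 --live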

Container Network Implementation

  1. Create a VM instance on the Alibaba Cloud console and select multiple NICs when creating the instance. Multiple NICs are then displayed on the VM.
  2. Deploy the container application on the VM, specifying the container's network type as none when starting Docker:

     ~# docker run -itd --net none ubuntu:16.04

  3. Log on to the VM and bind one of the NICs to the container's network namespace. In the following example, the newly hot-plugged NIC is eth2, and the container's network namespace is 2017 (for clarity, the PID shown by docker inspect is used as the network namespace name):

     ~# mkdir /var/run/netns
     ~# ln -sf /proc/2017/ns/net /var/run/netns/2017
     ~# ip link set dev eth2 netns 2017
     ~# ip netns exec 2017 ip link set eth2 name eth0
     ~# ip netns exec 2017 ip link set eth0 up
     ~# ip netns exec 2017 dhclient eth0

     Note: Depending on the release version, users may not need to "create" the container's network namespace by manually creating the symbolic link. After binding eth2 to the container's network namespace, rename it to eth0.
  4. View the NIC configuration status on the VM and in the container.

     Check whether the NIC still exists on the VM:

     ~# ifconfig -a

     Check whether the newly configured NIC appears in the container:

     /# ifconfig -a

     It can be seen that eth2 has been removed from the VM and is now in use in the container.
  5. Repeat steps 1 to 4 to start another VM and container.
  6. Use tools such as sockperf for performance testing and comparison:

     $ cat server.sh
     #!/bin/bash
     for i in $(seq 1 $1)
     do
         sockperf server --port 123`printf "%02d" $i` &
     done

     $ sh server.sh 10

     $ cat client.sh
     #!/bin/bash
     for i in $(seq 1 $1)
     do
         sockperf tp -i 192.168.2.35 --pps max --port 123`printf "%02d" $i` -t 300 &
     done

     $ sh client.sh 10

Ant Financial Use Case

Traditional Linux Bridge test

Server: 16C60G x 1 (half A8)

Client: 4C8G x 8

TBase server deployment: 7G x 7 instances

TBase client deployment: 8 x (16 threads + 1 client) => 128 threads + 8 clients

Testing report

| Operation | Packet size | Clients | NIC | load1 | CPU | QPS | AVG rt | 99th rt |
|---|---|---|---|---|---|---|---|---|
| set | 1 KB | 8 | 424 MB | 7.15 | 44% | 363,300 | 0.39 ms | < 1 ms |
| get | 1 KB | 8 | 421 MB | 7.06 | 45% | 357,000 | 0.39 ms | < 1 ms |
| set | 64 KB | 1 | 1,884 MB | 2.3 | 17% | 29,000 | 0.55 ms | < 5 ms |
| set | 128 KB | 1 | 2,252 MB | 2.53 | 18% | 18,200 | 0.87 ms | < 6 ms |
| set | 256 KB | 1 | 2,804 MB | 2.36 | 20% | 11,100 | 1.43 ms | < 5 ms |
| set | 512 KB | 1 | 3,104 MB | 2.61 | 20% | 6,000 | 2.62 ms | < 10 ms |

ENI multi-NIC test

Server: 16C60G x 1 (half A8)

Client: 4C8G x 8

TBase server deployment: 7G x 7 instances

TBase client deployment: 16 x (16 threads + 1 client) => 256 threads + 16 clients

Testing report

| Operation | Packet size | Clients | NIC | load1 | CPU | QPS | AVG rt | 99th rt |
|---|---|---|---|---|---|---|---|---|
| set/get | 1 KB | 16 | 570 MB | 6.97 | 45% | 470,900 | 0.30 ms | < 1 ms |

Test Conclusion

For the VPC route table solution (Flannel/Canal), there is no substantial loss in bandwidth or throughput, and latency increases by about 0.1 ms relative to the host; when Nginx is used to test QPS, the loss is about 10% for small pages. For the ENI solution, there is no substantial loss in bandwidth or throughput relative to the host, and latency is slightly lower than on the host. In the application test, performance is about 10% better than on the host network, because traffic from the pod does not pass through iptables. For the default Flannel VXLAN configuration, the bandwidth and throughput loss is about 5%, and the maximum QPS in the Nginx small-page test drops by about 30% relative to the host.

Summary

Alibaba Cloud Elastic Network Interface (ENI) is a virtual network interface that can be attached to an ECS instance in a VPC. By using ENIs, you can build high-availability clusters, implement failover at a lower cost, and achieve refined network management. The ENI feature is available in all regions. To learn more about ENI, visit the following pages:

  1. Features and Introduction: https://www.alibabacloud.com/help/doc-detail/58496.htm
  2. User guide: https://www.alibabacloud.com/help/doc-detail/58503.htm
  3. Developer guide: https://www.alibabacloud.com/help/doc-detail/25485.htm

Reference:

https://www.alibabacloud.com/blog/best-practices-of-ecs-container-network-multi-nic-solution_593997
