Open vSwitch (OvS), an open source tool for creating virtual Layer 2 networks, relies in some use cases on connection tracking. The recent 3.0.0 release of OvS included this patch series to improve multithreaded scalability, which makes connection tracking more efficient when OvS runs on multiple CPUs. This article shows how to measure the performance of connection tracking with OvS.

What is connection tracking and why is it critical?

Connection tracking, or conntrack, maintains an internal table of logical network connections (also called flows). The table identifies all packets that make up each flow so that they can be handled consistently.

Conntrack is a requirement for network address translation (NAT)—in IP address masquerading, for example (described in detail in RFC 3022). Conntrack is also required for stateful firewalls, load balancers, intrusion detection and prevention systems, and deep packet inspection. More specifically, OvS conntrack rules are used to isolate different OpenStack virtual networks (aka security groups).

Connection tracking is usually implemented by storing known connection entries in a table, indexed by a bidirectional 5-tuple consisting of a protocol, source address, destination address, source port, and destination port. Each entry also has a state as seen from the connection tracking system. The state (new, established, closed, etc.) is updated every time a packet matching its 5-tuple is processed. If a received packet does not match any existing conntrack entry, a new one is created and inserted into the table.
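To make the description concrete, here is a toy model of such a table in Python. This is purely illustrative (not how OvS implements conntrack); the key normalization is what makes the 5-tuple bidirectional, so a request and its reply hit the same entry:

```python
# Toy connection tracking table (illustrative only, not OvS's implementation).
table = {}

def conn_key(proto, src, dst, sport, dport):
    """Normalize the 5-tuple so both directions of a flow map to the same key."""
    a, b = (src, sport), (dst, dport)
    return (proto, a, b) if a <= b else (proto, b, a)

def track(proto, src, dst, sport, dport):
    """Look up (or create) the entry for a packet and update its state."""
    key = conn_key(proto, src, dst, sport, dport)
    if key not in table:
        table[key] = "new"          # no match: insert a fresh entry
    elif table[key] == "new":
        table[key] = "established"  # a second packet promotes the state
    return table[key]
```

With this sketch, `track("tcp", "10.0.0.1", "10.0.0.2", 1234, 80)` creates a `new` entry, and the reverse-direction call for the reply finds the same entry and promotes it to `established`.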

Performance aspects

There are two aspects to consider when measuring conntrack performance.

  • How many new connections can be handled per second? The answer depends on the following details:

    • What is the cost of looking up an existing connection entry for each received packet?
    • Can multiple threads insert and destroy conntrack entries concurrently?
    • What is the cost of creating a conntrack entry for a new connection?
    • How many packets are exchanged per connection?
  • How many concurrent connections can the system support? The answer depends on the following details:

    • What is the size of the conntrack table?
    • What is the duration of each individual connection?
    • After a connection has been closed, how long does the conntrack entry linger in the table until it is expunged to make room for new connections? What if the connection is not closed but no longer exchanges traffic (because the client or server crashed or disconnected)?
    • What happens when the conntrack table is full?

These two aspects of performance are linked: even a modest rate of new connections will eventually fill the conntrack table if the connections are long-lived.

In order to properly size the connection tracking table, one needs to know the average number of new connections per second and their average duration. Testing also requires tuning the timeout values of the conntrack engine.
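The sizing intuition is Little's law: the number of entries in flight is roughly the new-connection rate multiplied by how long each entry lives (connection duration plus the time a closed entry lingers before being expunged). A back-of-the-envelope calculation, with made-up numbers, looks like this:

```python
# Back-of-the-envelope conntrack table sizing (illustrative numbers).
new_conns_per_sec = 50_000   # average rate of new connections
avg_duration_s = 30.0        # average connection lifetime
linger_timeout_s = 1.0       # how long a closed entry stays in the table

# Little's law: entries in flight = arrival rate x time spent in the table.
expected_entries = new_conns_per_sec * (avg_duration_s + linger_timeout_s)
print(f"{expected_entries:,.0f} entries")  # 1,550,000 entries
```

This is why both the average rate and the average duration matter, and why aggressive timeouts shrink the table a high connection rate would otherwise fill.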

Benchmarking process

To take the measurements necessary to answer the questions in the previous section, you need a way to simulate clients and servers. Such a system must specify how many clients and servers to test, how many connections per second they are creating, how long the connections are, and how much data is exchanged in each connection.

A few commercial traffic generators have these capabilities, more or less refined. This article describes how to carry out the simulation with TRex—an open source traffic generator based on the Data Plane Development Kit (DPDK).

TRex has multiple modes of operation. This article uses the advanced stateful (ASTF) mode, which allows TRex to simulate TCP and UDP endpoints. I have tailored a script using the TRex Python API to perform benchmarks in a manner like RFC 2544, but focusing on how many new connections can be created per second.

Basically, this script connects to a running TRex server started in ASTF mode and creates TCP and UDP connection profiles. These profiles are state machines representing clients and servers with dynamic IP addresses and ports. You can define the number of data exchanges and their sizes, add some arbitrary wait time to simulate network latency, etc. TRex takes care of translating your specifications into real traffic.

Here is a stripped-down example, in Python, of a TCP connection profile:

client = ASTFProgram(stream=True)
server = ASTFProgram(stream=True)
for _ in range(num_messages):
    client.send(message_size * b"x")
    server.recv(message_size)
    if server_wait > 0:
        server.delay(server_wait * 1000)  # trex wants microseconds
    server.send(message_size * b"y")
    client.recv(message_size)

tcp_profile = ASTFTemplate(
    client_template=ASTFTCPClientTemplate(
        program=client,
        port=8080,
        cps=99, # base value which is changed during the binary search
        cont=True,
    ),
    server_template=ASTFTCPServerTemplate(
        program=server, assoc=ASTFAssociationRule(port=8080)
    ),
)
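The RFC 2544-style search that cps_ndr.py performs (visible in the lower/current/upper values of the runs shown later) can be sketched as a plain binary search. In this simplified sketch, measure() is a hypothetical stand-in for running the TRex profile at a given connection rate and returning the observed drop ratio:

```python
def find_ndr(measure, lo=1_000, hi=100_000, error_threshold=0.02,
             max_iterations=8):
    """Binary-search the highest conn/s rate whose drop ratio stays
    under error_threshold (a simplified sketch of cps_ndr.py)."""
    best = lo
    for _ in range(max_iterations):
        current = (lo + hi) // 2
        if measure(current) <= error_threshold:
            best = lo = current   # under the threshold: search higher
        else:
            hi = current          # too many drops: search lower
    return best

# Fake device under test that starts dropping packets above 42K conn/s:
ndr = find_ndr(lambda cps: 0.0 if cps <= 42_000 else 0.05)
```

Each iteration halves the search interval, so eight iterations narrow a 1K-100K range down to within a few hundred connections per second of the true non-drop rate.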

Setup

The device under test (DUT) runs the ovs-vswitchd Open vSwitch daemon with the user-space DPDK datapath, but the same setup can be used to benchmark any connection-tracking device. The procedure is deliberately simple and does not represent an actual production workload; however, it lets you stress the connection tracking code path without worrying about external details.

Figure 1 illustrates the relationship between the DUT and the traffic generator. Traffic simulating the clients travels from port0 to port1 on the traffic generator, through the DUT. Server traffic travels from port1 to port0 on the traffic generator. Conntrack flows are programmed on br0 to allow new connections to be established only from port0 to port1 (from "clients" to "servers"), and to let reply packets on established connections pass from port1 to port0 (from "servers" to "clients").

Network topology diagram
Figure 1: Network topology.

Base system

Both the OvS user-space datapath and TRex use DPDK. The settings shown in this section are common to both machines.

DPDK requires compatible network interfaces. The example in this article runs on the last two ports of an Intel X710 PCI network interface. The following commands show the hardware in use:

[root@* ~]# lscpu | grep -e "^Model name:" -e "^NUMA" -e MHz
NUMA node(s):        1
Model name:          Intel(R) Xeon(R) Gold 5118 CPU @ 2.30GHz
CPU MHz:             2700.087
NUMA node0 CPU(s):   0-23
[root@* ~]# grep ^MemTotal /proc/meminfo
MemTotal:       65373528 kB
[root@* ~]# lspci | grep X710 | tail -n2
18:00.2 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 02)
18:00.3 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 02)

Note: To make things simpler, all commands in this article are executed as the root user.

The CPUs used by TRex and OvS need to be isolated in order to minimize disturbance from the other tasks running on Linux. Therefore, the following commands isolate CPUs from the NUMA node where the PCI NIC is connected. CPUs 0 and 12 are left to Linux:

dnf install -y tuned tuned-profiles-cpu-partitioning
cat > /etc/tuned/cpu-partitioning-variables.conf <<EOF
isolated_cores=1-11,13-23
no_balance_cores=1-11,13-23
EOF
tuned-adm profile cpu-partitioning

Finally, DPDK applications require huge pages. It is best to allocate them at boot time to ensure that they are all mapped to contiguous chunks of memory:

cat >> /etc/default/grub <<EOF
GRUB_CMDLINE_LINUX="\$GRUB_CMDLINE_LINUX intel_iommu=on iommu=pt"
GRUB_CMDLINE_LINUX="\$GRUB_CMDLINE_LINUX hugepagesz=1G hugepages=32"
EOF
grub2-mkconfig -o /etc/grub2.cfg
dnf install -y driverctl
driverctl set-override 0000:18:00.2 vfio-pci
driverctl set-override 0000:18:00.3 vfio-pci
# reboot is required to apply isolcpus and allocate hugepages on boot
systemctl reboot

TRex and the traffic generator

TRex needs to be compiled from source. The following commands download and build the program:

dnf install -y python3 git numactl-devel zlib-devel gcc-c++ gcc
git clone https://github.com/cisco-system-traffic-generator/trex-core ~/trex
cd ~/trex/linux_dpdk
./b configure
taskset 0xffffffffff ./b build

We use the following configuration in /etc/trex_cfg.yaml:

- version: 2
  interfaces:
    - "18:00.2"
    - "18:00.3"
  rx_desc: 4096
  tx_desc: 4096
  port_info:
    - dest_mac: "04:3f:72:f2:8f:33"
      src_mac:  "04:3f:72:f2:8f:32"
    - dest_mac: "04:3f:72:f2:8f:32"
      src_mac:  "04:3f:72:f2:8f:33"

  c: 22
  memory:
    mbuf_64: 30000
    mbuf_128: 500000
    mbuf_256: 30717
    mbuf_512: 30720
    mbuf_1024: 30720
    mbuf_2048: 4096

  platform:
    master_thread_id: 0
    latency_thread_id: 12
    dual_if:
      - socket: 0
        threads: [
           1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11,
          13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23,
        ]

Finally, we can start TRex:

cd ~/trex/scripts
./t-rex-64 -i --astf

The TRex daemon runs in the foreground. In a separate terminal, the cps_ndr.py script connects to the daemon via the JSON-RPC API.

The device under test

First, let's compile and install DPDK:

dnf install -y git meson ninja-build gcc python3-pyelftools
git clone -b v21.11 https://github.com/DPDK/dpdk ~/dpdk
cd ~/dpdk
meson build
taskset 0xffffff ninja -C ~/dpdk/build install

Then compile and install OVS. In the following console excerpt, I explicitly check out version 2.17.2. Version 3.0.0 will be recompiled before running all tests again:

dnf install -y gcc-c++ make libtool autoconf automake
git clone -b v2.17.2 https://github.com/openvswitch/ovs ~/ovs
cd ~/ovs
./boot.sh
PKG_CONFIG_PATH="/usr/local/lib64/pkgconfig" ./configure --with-dpdk=static
taskset 0xffffff make install -j24
/usr/local/share/openvswitch/scripts/ovs-ctl start

Here I enable the DPDK user-space datapath and configure a bridge with two ports. For now, there is only one receive (RX) queue per port, and one CPU is assigned to poll them. I will increase these parameters along the way.

I set the conntrack table size to a relatively large value (5 million entries) to reduce the risk of it getting full during tests. Also, I configure the various timeout policies to match the traffic profiles I am about to send. These aggressive timeouts help prevent the table from getting full. The default timeout values are very conservative—they're too long to achieve high numbers of connections per second without filling the conntrack table:

ovs-vsctl set open_vswitch . other_config:dpdk-init=true
ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x4"
/usr/local/share/openvswitch/scripts/ovs-ctl restart
ovs-vsctl add-br br0 -- set bridge br0 datapath_type=netdev
ovs-vsctl add-port br0 port0 -- \
    set Interface port0 type=dpdk options:dpdk-devargs=0000:18:00.2
ovs-vsctl add-port br0 port1 -- \
    set Interface port1 type=dpdk options:dpdk-devargs=0000:18:00.3

ovs-appctl dpctl/ct-set-maxconns 5000000
# creating an empty datapath record is required to add a zone timeout policy
ovs-vsctl -- --id=@m create Datapath datapath_version=0 -- \
    set Open_vSwitch . datapaths:"netdev"=@m
ovs-vsctl add-zone-tp netdev zone=0 \
    udp_first=1 udp_single=1 udp_multiple=30 tcp_syn_sent=1 \
    tcp_syn_recv=1 tcp_fin_wait=1 tcp_time_wait=1 tcp_close=1 \
    tcp_established=30

cat > ~/ct-flows.txt << EOF
priority=1 ip ct_state=-trk                   actions=ct(table=0)
priority=1 ip ct_state=+trk+new in_port=port0 actions=ct(commit),normal
priority=1 ip ct_state=+trk+est               actions=normal
priority=0 actions=drop
EOF

Test procedure

The cps_ndr.py script that I have written has multiple parameters to control the nature of the generated connections:

  • Ratio of TCP connections to UDP connections
  • Number of data messages (request + response) exchanged per connection (excluding protocol overhead)
  • Size of data messages in bytes (to emulate the TCP maximum segment size)
  • Time in milliseconds that the simulated servers wait before sending a response to a request

In the context of this benchmark, I intentionally keep the size of data messages fixed to 20 bytes, to avoid being limited by the 10Gbit bandwidth.
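A quick sanity check shows why 20-byte messages are safe even at the highest rate TRex reaches back to back in this setup (about 1.8M conn/s, as shown in the baseline results later). The payload bandwidth stays far below the 10 Gbit/s link; most of what actually goes on the wire is protocol overhead:

```python
# Payload bandwidth of the short-lived profile at TRex's maximum rate.
conn_rate = 1_800_000        # conn/s, back-to-back maximum of this setup
payload_bytes = 2 * 20       # one 20-byte request + one 20-byte reply
payload_bits_per_sec = conn_rate * payload_bytes * 8

print(payload_bits_per_sec / 1e9)  # ~0.58 Gbit/s on a 10 Gbit/s link
```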

I run two types of test: one with short-lived connections and one with long-lived connections. Both profiles are tested against OVS versions 2.17.2 and 3.0.0, in several configurations, to check whether performance scales with the number of CPUs and receive queues.

Short-lived connections

The parameters of this test consist of sending 40 data bytes per connection (1 request + 1 reply of 20 bytes each), with no wait by the server before sending the replies. These parameters stress the conntrack creation and destruction code path.

An example run follows:

[root@tgen scripts]# ./cps_ndr.py --sample-time 30 --max-iterations 8 \
>    --error-threshold 0.02 --udp-percent 1 --num-messages 1 \
>    --message-size 20 --server-wait 0 -m 1k -M 100k
... iteration #1: lower=1.0K current=50.5K upper=100K
▼▼▼ Flows: active 26.8K (50.1K/s) TX: 215Mb/s (345Kp/s) RX: 215Mb/s (345Kp/s) Size: ~4.5B
err dropped: 1.6K pkts (1.6K/s) ~ 0.4746%
... iteration #2: lower=1.0K current=25.8K upper=50.5K
▲▲▲ Flows: active 12.9K (25.7K/s) TX: 112Mb/s (179Kp/s) RX: 112Mb/s (179Kp/s) Size: ~4.5B
... iteration #3: lower=25.8K current=38.1K upper=50.5K
▲▲▲ Flows: active 19.1K (38.1K/s) TX: 166Mb/s (266Kp/s) RX: 166Mb/s (266Kp/s) Size: ~4.5B
... iteration #4: lower=38.1K current=44.3K upper=50.5K
▼▼▼ Flows: active 22.2K (44.2K/s) TX: 192Mb/s (307Kp/s) RX: 191Mb/s (307Kp/s) Size: ~4.5B
err dropped: 1.3K pkts (125/s) ~ 0.0408%
... iteration #5: lower=38.1K current=41.2K upper=44.3K
▲▲▲ Flows: active 20.7K (41.2K/s) TX: 178Mb/s (286Kp/s) RX: 178Mb/s (286Kp/s) Size: ~4.5B
... iteration #6: lower=41.2K current=42.8K upper=44.3K
▼▼▼ Flows: active 21.5K (42.6K/s) TX: 185Mb/s (296Kp/s) RX: 185Mb/s (296Kp/s) Size: ~4.5B
err dropped: 994 pkts (99/s) ~ 0.0335%
... iteration #7: lower=41.2K current=42.0K upper=42.8K
▼▼▼ Flows: active 21.0K (41.8K/s) TX: 181Mb/s (290Kp/s) RX: 181Mb/s (290Kp/s) Size: ~4.5B
err dropped: 877 pkts (87/s) ~ 0.0301%
... iteration #8: lower=41.2K current=41.6K upper=42.0K
▲▲▲ Flows: active 20.9K (41.4K/s) TX: 180Mb/s (289Kp/s) RX: 180Mb/s (289Kp/s) Size: ~4.5B

Long-lived connections

The parameters of this test consist of sending 20K data bytes per connection (500 requests + 500 replies of 20 bytes each) over 25 seconds. These parameters stress the conntrack lookup code path.
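These parameters also determine the connection duration and, through it, the number of concurrent flows: 500 request/response exchanges with a 50 ms server wait take about 25 seconds per connection, so at a rate near 1K conn/s roughly 25K flows are active at any time, which matches the sample run shown next:

```python
# Expected connection duration and concurrency for the long-lived profile.
num_messages = 500        # request/response exchanges per connection
server_wait_s = 0.050     # --server-wait 50 (milliseconds)
conn_rate = 1_000         # conn/s, near the measured non-drop rate

duration_s = num_messages * server_wait_s   # 25.0 s per connection
active_flows = conn_rate * duration_s       # ~25,000 concurrent flows
```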

An example run follows:

[root@tgen scripts]# ./cps_ndr.py --sample-time 120 --max-iterations 8 \
>    --error-threshold 0.02 --udp-percent 1 --num-messages 500 \
>    --message-size 20 --server-wait 50 -m 500 -M 2k
... iteration #1: lower=500 current=1.2K upper=2.0K
▼▼▼ Flows: active 48.5K (1.2K/s) TX: 991Mb/s (1.5Mp/s) RX: 940Mb/s (1.4Mp/s) Size: ~13.3B
err dropped: 1.8M pkts (30.6K/s) ~ 2.4615%
... iteration #2: lower=500 current=875 upper=1.2K
▲▲▲ Flows: active 22.5K (871/s) TX: 871Mb/s (1.3Mp/s) RX: 871Mb/s (1.3Mp/s) Size: ~13.3B
... iteration #3: lower=875 current=1.1K upper=1.2K
▼▼▼ Flows: active 33.8K (1.1K/s) TX: 967Mb/s (1.4Mp/s) RX: 950Mb/s (1.4Mp/s) Size: ~13.3B
err dropped: 621K pkts (10.3K/s) ~ 0.7174%
... iteration #4: lower=875 current=968 upper=1.1K
▲▲▲ Flows: active 24.9K (965/s) TX: 961Mb/s (1.4Mp/s) RX: 962Mb/s (1.4Mp/s) Size: ~13.3B
... iteration #5: lower=968 current=1.0K upper=1.1K
▼▼▼ Flows: active 29.8K (1.0K/s) TX: 965Mb/s (1.4Mp/s) RX: 957Mb/s (1.4Mp/s) Size: ~13.3B
err dropped: 334K pkts (5.6K/s) ~ 0.3830%
... iteration #6: lower=968 current=992 upper=1.0K
▼▼▼ Flows: active 25.5K (989/s) TX: 964Mb/s (1.4Mp/s) RX: 964Mb/s (1.4Mp/s) Size: ~13.3B
err dropped: 460 pkts (460/s) ~ 0.0314%
... iteration #7: lower=968 current=980 upper=992
▼▼▼ Flows: active 25.3K (977/s) TX: 962Mb/s (1.4Mp/s) RX: 962Mb/s (1.4Mp/s) Size: ~13.3B
err dropped: 397 pkts (397/s) ~ 0.0272%
... iteration #8: lower=968 current=974 upper=980
▲▲▲ Flows: active 25.1K (971/s) TX: 969Mb/s (1.5Mp/s) RX: 969Mb/s (1.5Mp/s) Size: ~13.3B

Performance statistics

This section presents results of runs with varying numbers of CPUs and queues on my test system. The numbers that I measured should be taken with a grain of salt. Connection tracking performance is highly dependent on hardware, traffic profile, and overall system load. I provide the statistics here just to give a general idea of the improvement brought by OVS 3.0.0.

Baseline results for comparison

For reference, the tests were executed with a cable connecting port0 and port1 of the traffic generator machine. This is the maximum performance TRex is able to achieve with this configuration and hardware.

Table 1: Maximum traffic generator performance.
Type Connection rate Active flows Bandwidth Packet rate
Short-lived 1.8M conn/s 1.7M 8.4G bit/s 12.7M pkt/s
Long-lived 11.1K conn/s 898K 8.0G bit/s 11.4M pkt/s

1 CPU, 1 queue per port, without connection tracking

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x4"
ovs-vsctl set Interface port0 options:n_rxq=1
ovs-vsctl set Interface port1 options:n_rxq=1
ovs-ofctl del-flows br0
ovs-ofctl add-flow br0 action=normal
Table 2: Short-lived connections with 1 CPU, 1 queue per port, without connection tracking.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 1.0M conn/s 524.8K 4.5G bit/s 7.3M pkt/s  
3.0.0 1.0M conn/s 513.1K 4.5G bit/s 7.1M pkt/s -1.74%
Table 3: Long-lived connections with 1 CPU, 1 queue per port, without connection tracking.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 3.1K conn/s 79.9K 3.1G bit/s 4.7M pkt/s  
3.0.0 2.8K conn/s 71.9K 2.8G bit/s 4.2M pkt/s -9.82%

There is a drop in performance, without connection tracking enabled, between versions 2.17.2 and 3.0.0. This drop is completely unrelated to the conntrack optimization patch series I am focusing on. It might be caused by some discrepancies in the test procedure, but it might also have been introduced by another patch series between the two tested versions.

1 CPU, 1 queue per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x4"
ovs-vsctl set Interface port0 options:n_rxq=1
ovs-vsctl set Interface port1 options:n_rxq=1
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 4: Short-lived connections with 1 CPU, 1 queue per port, with connection tracking.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 39.7K conn/s 20.0K 172.0M bit/s 275.8K pkt/s  
3.0.0 48.2K conn/s 24.3K 208.9M bit/s 334.9K pkt/s +21.36%
Table 5: Long-lived connections with 1 CPU, 1 queue per port, with connection tracking.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 959 conn/s 24.7K 956.6M bit/s 1.4M pkt/s  
3.0.0 1.2K conn/s 31.5K 1.2G bit/s 1.8M pkt/s +28.15%

Already here, we can see that the patch series improves the single-threaded performance of connection tracking in the creation, destruction, and lookup code paths. Keep these results in mind when looking at the multithreaded improvements.

2 CPUs, 1 queue per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x2002"
ovs-vsctl set Interface port0 options:n_rxq=1
ovs-vsctl set Interface port1 options:n_rxq=1
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 6: Short-lived connections with 2 CPUs, 1 queue per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 39.9K conn/s 20.0K 172.8M bit/s 277.0K pkt/s  
3.0.0 46.8K conn/s 23.5K 202.7M bit/s 325.0K pkt/s +17.28%
Table 7: Long-lived connections with 2 CPUs, 1 queue per port.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 885 conn/s 22.7K 883.1M bit/s 1.3M pkt/s  
3.0.0 1.1K conn/s 28.6K 1.1G bit/s 1.7M pkt/s +25.19%

It is worth noting that assigning twice as many CPUs to packet processing does not double the performance. Far from it: the numbers are essentially the same as (if not lower than) with only one CPU.

This surprising result is probably explained by the fact that there is only one RX queue per port, so each CPU ends up polling a single port.

2 CPUs, 2 queues per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x2002"
ovs-vsctl set Interface port0 options:n_rxq=2
ovs-vsctl set Interface port1 options:n_rxq=2
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 8: Short-lived connections with 2 CPUs, 2 queues per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 48.3K conn/s 24.3K 208.8M bit/s 334.8K pkt/s  
3.0.0 65.9K conn/s 33.2K 286.8M bit/s 459.9K pkt/s +36.41%

For short-lived connections, we begin to see improvement beyond the single-threaded performance gain. Lock contention was reduced during the insertion and deletion of conntrack entries.

Table 9: Long-lived connections with 2 CPUs, 2 queues per port.

Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 1.1K conn/s 29.1K 1.1G bit/s 1.7M pkt/s  
3.0.0 1.4K conn/s 37.0K 1.4G bit/s 2.2M pkt/s +26.77%

With two CPUs and two queues, once the single-threaded gain is factored out, there seems to be no additional improvement in conntrack lookup for long-lived connections.

4 CPUs, 2 queues per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x6006"
ovs-vsctl set Interface port0 options:n_rxq=2
ovs-vsctl set Interface port1 options:n_rxq=2
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 10: Short-lived connections with 4 CPUs, 2 queues per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 47.4K conn/s 23.9K 206.2M bit/s 330.6K pkt/s  
3.0.0 49.1K conn/s 24.7K 212.1M bit/s 340.1K pkt/s +3.53%

Compared to the previous configuration (2 CPUs, 2 queues per port), the short-lived connection rate has dropped in 3.0.0. This is not a fluke: the numbers are consistent across multiple runs. The drop warrants some scrutiny, but it does not invalidate the work that has been done.

Table 11: Long-lived connections with 4 CPUs, 2 queues per port.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 981 conn/s 25.2K 977.7M bit/s 1.5M pkt/s  
3.0.0 2.0K conn/s 52.4K 2.0G bit/s 3.1M pkt/s +108.31%

With four CPUs and two queues per port, long-lived connection tracking starts to scale up.

4 CPUs, 4 queues per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x6006"
ovs-vsctl set Interface port0 options:n_rxq=4
ovs-vsctl set Interface port1 options:n_rxq=4
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 12: Short-lived connections with 4 CPUs, 4 queues per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 66.1K conn/s 33.2K 286.4M bit/s 459.2K pkt/s  
3.0.0 100.8K conn/s 50.6K 437.0M bit/s 700.6K pkt/s +52.55%
Table 13: Long-lived connections with 4 CPUs, 4 queues per port.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 996 conn/s 25.9K 994.2M bit/s 1.5M pkt/s  
3.0.0 2.6K conn/s 67.0K 2.6G bit/s 3.9M pkt/s +162.89%

8 CPUs, 4 queues per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x1e01e"
ovs-vsctl set Interface port0 options:n_rxq=4
ovs-vsctl set Interface port1 options:n_rxq=4
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 14: Short-lived connections with 8 CPUs, 4 queues per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 62.2K conn/s 31.3K 269.8M bit/s 432.5K pkt/s  
3.0.0 90.1K conn/s 45.2K 390.9M bit/s 626.7K pkt/s +44.89%
Table 15: Long-lived connections with 8 CPUs, 4 queues per port.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 576 conn/s 17.1K 567.2M bit/s 852.5K pkt/s  
3.0.0 3.8K conn/s 97.8K 3.8G bit/s 5.7M pkt/s +562.76%

8 CPUs, 8 queues per port

The results in this section were achieved with the following DUT configuration:

ovs-vsctl set open_vswitch . other_config:pmd-cpu-mask="0x1e01e"
ovs-vsctl set Interface port0 options:n_rxq=8
ovs-vsctl set Interface port1 options:n_rxq=8
ovs-ofctl del-flows br0
ovs-ofctl add-flows br0 ~/ct-flows.txt
Table 16: Short-lived connections with 8 CPUs, 8 queues per port.
Version Short-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 50.6K conn/s 25.5K 219.5M bit/s 351.9K pkt/s  
3.0.0 100.9K conn/s 50.7K 436.0M bit/s 698.9K pkt/s +99.36%
Table 17: Long-lived connections with 8 CPUs, 8 queues per port.
Version Long-lived connections Active flows Bandwidth Packet rate Difference
2.17.2 541 conn/s 14.0K 539.2M bit/s 810.3K pkt/s  
3.0.0 4.8K conn/s 124.1K 4.8G bit/s 7.2M pkt/s +792.83%

Performance improvements in version 3.0.0 of Open vSwitch

Using the tools in this article, I have been able to record advances made in version 3.0.0 in scaling and in handling long-lived connections.

Scaling

Figure 2 shows how many insertions and deletions per second were achieved on different system configurations for short-lived connections.

Chart showing improvements in scaling of short-lived connections tracking in version 3.0.0
Figure 2: Improvements in scaling of short-lived connections tracking in version 3.0.0.

Apart from the small blip with 4 CPUs and 2 queues per port, the conntrack insertion and deletion code path has improved consistently in OvS 3.0.0. Multithreaded lock contention remains, but it is less pronounced than in OvS 2.17.2.

Figure 3 shows how many insertions and deletions per second were achieved on different system configurations for long-lived connections.

Chart showing improvements in scaling of long-lived connections tracking in version 3.0.0.
Figure 3: Improvements in scaling of long-lived connections tracking in version 3.0.0.

Long-lived connection tracking is where the optimizations in OvS 3.0.0 really shine. The reduced lock contention in conntrack lookups makes performance scale significantly better with the number of CPUs.

Performance during high traffic

The following commands generate profiling reports using the Linux perf tool. I measured both version 2.17.2 and version 3.0.0 with 8 CPUs and 8 RX queues under maximum load for long-lived connections, with conntrack flows enabled. Only the events of a single CPU were captured:

perf record -g -C 1 sleep 60
perf report -U --no-children | grep '\[[\.k]\]' | head -15 > profile-$version.txt

In the subsections that follow, I have manually annotated lines that are directly related to acquiring mutexes so that they start with a * character. When a CPU is waiting for a mutex acquisition, it is not processing any network traffic, but waiting for another CPU to release the lock.

Performance in version 2.17.2

The profiled CPU spends almost 40% of its cycles acquiring locks and waiting for other CPUs to release locks:

* 30.99%  pmd-c01/id:5  libc.so.6          [.] pthread_mutex_lock@@GLIBC_2.2.5
  12.27%  pmd-c01/id:5  ovs-vswitchd       [.] dp_netdev_process_rxq_port
   5.18%  pmd-c01/id:5  ovs-vswitchd       [.] netdev_dpdk_rxq_recv
   4.24%  pmd-c01/id:5  ovs-vswitchd       [.] pmd_thread_main
   3.93%  pmd-c01/id:5  ovs-vswitchd       [.] pmd_perf_end_iteration
*  3.63%  pmd-c01/id:5  libc.so.6          [.] __GI___pthread_mutex_unlock_usercnt
   3.62%  pmd-c01/id:5  ovs-vswitchd       [.] i40e_recv_pkts_vec_avx2
*  2.76%  pmd-c01/id:5  [kernel.kallsyms]  [k] syscall_exit_to_user_mode
*  0.91%  pmd-c01/id:5  libc.so.6          [.] __GI___lll_lock_wait
*  0.18%  pmd-c01/id:5  [kernel.kallsyms]  [k] __x64_sys_futex
*  0.17%  pmd-c01/id:5  [kernel.kallsyms]  [k] futex_wait
*  0.12%  pmd-c01/id:5  [kernel.kallsyms]  [k] entry_SYSCALL_64_after_hwframe
*  0.11%  pmd-c01/id:5  libc.so.6          [.] __GI___lll_lock_wake
*  0.08%  pmd-c01/id:5  [kernel.kallsyms]  [k] do_syscall_64
*  0.06%  pmd-c01/id:5  [kernel.kallsyms]  [k] do_futex
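A quick way to verify the "almost 40%" figure is to add up the starred percentages. A small sketch, with the lock-related values from the report above pasted in by hand:

```python
# Sum the percentages of the lock-related (starred) lines in the
# 2.17.2 profile: mutex lock/unlock, lll_lock_wait, and the futex
# syscall path underneath them.
lock_percentages = [30.99, 3.63, 2.76, 0.91, 0.18, 0.17, 0.12, 0.11, 0.08, 0.06]
total = round(sum(lock_percentages), 2)
print(total)  # 39.01 -> almost 40% of the profiled CPU's cycles
```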

Performance in version 3.0.0

It is obvious that 3.0.0 has much less lock contention and therefore scales better with the number of CPUs:

  15.30%  pmd-c01/id:5  ovs-vswitchd       [.] dp_netdev_input__
   8.62%  pmd-c01/id:5  ovs-vswitchd       [.] conn_key_lookup
   7.88%  pmd-c01/id:5  ovs-vswitchd       [.] miniflow_extract
   7.75%  pmd-c01/id:5  ovs-vswitchd       [.] cmap_find
*  6.92%  pmd-c01/id:5  libc.so.6          [.] pthread_mutex_lock@@GLIBC_2.2.5
   5.15%  pmd-c01/id:5  ovs-vswitchd       [.] dpcls_subtable_lookup_mf_u0w4_u1w1
   4.16%  pmd-c01/id:5  ovs-vswitchd       [.] cmap_find_batch
   4.10%  pmd-c01/id:5  ovs-vswitchd       [.] tcp_conn_update
   3.86%  pmd-c01/id:5  ovs-vswitchd       [.] dpcls_subtable_lookup_mf_u0w5_u1w1
   3.51%  pmd-c01/id:5  ovs-vswitchd       [.] conntrack_execute
   3.42%  pmd-c01/id:5  ovs-vswitchd       [.] i40e_xmit_fixed_burst_vec_avx2
   0.77%  pmd-c01/id:5  ovs-vswitchd       [.] dp_execute_cb
   0.72%  pmd-c01/id:5  ovs-vswitchd       [.] netdev_dpdk_rxq_recv
   0.07%  pmd-c01/id:5  ovs-vswitchd       [.] i40e_xmit_pkts_vec_avx2
   0.04%  pmd-c01/id:5  ovs-vswitchd       [.] dp_netdev_input

Final words

I hope this article gave you some ideas for benchmarking and profiling connection tracking with TRex and perf. Please leave any questions in the comments on this article.

Kudos to Paolo Valerio and Gaëtan Rivet for their work on optimizing the user space OvS conntrack implementation.
