We can do a bit more than that!
These are in a lab with server and client connected via 10G SFPs.
The router is running OpenWrt main/master on kernel 6.1 using the recently merged armsr target.
Sadly, [affordable] residential FTTH plans in Australia only do 50Mbps upload (1G down) so I don’t think a live test will be very interesting. I’ll keep this test in mind if I visit someone with a big link.
rrul:
TCP upload sum : 1956.52 N/A N/A Mbits/s 1400
rrrul_be:
TCP upload sum : 1613.04 N/A N/A Mbits/s 1400
tcp_nup:
TCP upload sum : 1376.97 N/A N/A Mbits/s 1400
Full result set here:
Here is what the CPU cores look like during the tests. You can see saturation on core 1 here:
If there were an SQM implementation that hooks onto XDP rather than tc then I think there would be a significant uplift in performance.
Results without SQM for comparison:
rrul:
TCP upload sum : 2728.44 N/A N/A Mbits/s 1400
I can email you the flent result files, if you would like to see them.