On Tue, Jul 29, 2025 at 2:33 AM David Gibson wrote:
On Mon, Jul 28, 2025 at 07:03:12PM +0200, Eugenio Perez Martin wrote:
On Thu, Jul 24, 2025 at 3:21 AM David Gibson wrote:
On Wed, Jul 09, 2025 at 07:47:47PM +0200, Eugenio Pérez wrote:
From ~13Gbit/s to ~11.5Gbit/s.
Again, I really don't know what you're comparing to what here.
When the buffer is full, I'm using poll() to wait until vhost frees some buffers, instead of actively checking the used index. The slowdown is the cost of that syscall.
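Something along these lines, just as a rough sketch rather than the literal patch code (call_fd here stands for the vhost call eventfd, and the names are illustrative):

	#include <errno.h>
	#include <poll.h>
	#include <stdint.h>
	#include <stdio.h>
	#include <unistd.h>

	/* Sleep until vhost signals the call eventfd, i.e. until it has
	 * used (freed) at least one buffer, instead of spinning on the
	 * used index.
	 */
	static void wait_for_used(int call_fd)
	{
		struct pollfd pfd = { .fd = call_fd, .events = POLLIN };
		uint64_t count;

		while (poll(&pfd, 1, -1) < 0 && errno == EINTR)
			;

		/* Drain the eventfd counter so the next completion wakes
		 * us up again.
		 */
		if (read(call_fd, &count, sizeof(count)) < 0)
			perror("read call eventfd");
	}

The poll() (plus the eventfd read) is the extra syscall cost I was referring to, compared to just re-reading the used index in a loop.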
Ah, right. So.. I'm not sure if it's so much the cost of the syscall itself, as the fact that you're actively waiting for free buffers, rather than returning to the main epoll loop so you can maybe make progress on something else before returning to the Tx path.
The previous patch also waits for free buffers, but it does so by burning a CPU. The next patch is the one that allows progress to continue as long as there are enough free buffers, instead of always waiting until the whole buffer has been sent. But there are situations where this conversion needs other code changes. In particular, all the calls to tcp_payload_flush() after checking that we have enough buffers, like:

	if (tcp_payload_sock_used > TCP_FRAMES_MEM - 2) {
		tcp_buf_free_old_tap_xmit(c, 2);
		tcp_payload_flush(c);
		...
	}

Coroutines seem like they would be a good fix here, but maybe there are simpler ways to go back to the main loop while keeping the TCP socket "ready to read" from epoll's point of view.

Out of curiosity, what do you think about setjmp()? :)
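To make the setjmp() idea concrete, here is a toy example, nothing from the actual patches and with made-up names: the "main loop" saves a jump point, the tx path bails out to it when there are not enough free buffers, and with a level-triggered socket epoll would simply report it again on the next iteration:

	#include <setjmp.h>
	#include <stdio.h>

	static jmp_buf tx_full;
	static int free_buffers = 2;

	/* Pretend tx path: give up and jump back to the main loop if the
	 * ring does not have enough free buffers for this batch.
	 */
	static void send_frames(int frames)
	{
		if (frames > free_buffers)
			longjmp(tx_full, 1);

		free_buffers -= frames;
		printf("sent %d frames, %d buffers left\n",
		       frames, free_buffers);
	}

	int main(void)
	{
		volatile int i;

		for (i = 0; i < 3; i++) {
			if (setjmp(tx_full)) {
				/* Ring was full: pretend vhost completed
				 * some buffers; a level-triggered socket
				 * would be retried on the next iteration.
				 */
				free_buffers = 4;
				printf("tx ring full, back in main loop\n");
				continue;
			}
			send_frames(3);
		}
		return 0;
	}

The ugly part is that anything the tx path did before the longjmp() (partially queued frames, updated counters) has to be either completed or rolled back explicitly, which is why proper coroutines look cleaner.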