netdev
[Top] [All Lists]

Re: Csum and csum copyroutines benchmark

To: vda@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Subject: Re: Csum and csum copyroutines benchmark
From: Alan Cox <alan@xxxxxxxxxxxxxxxxxxx>
Date: 25 Oct 2002 11:19:51 +0100
Cc: Momchil Velikov <velco@xxxxxxxxx>, Russell King <rmk@xxxxxxxxxxxxxxxx>, Roy Sigurd Karlsbakk <roy@xxxxxxxxxxxxx>, netdev@xxxxxxxxxxx, Linux Kernel Mailing List <linux-kernel@xxxxxxxxxxxxxxx>
In-reply-to: <200210250906.g9P96Yp14775@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
References: <200210231218.18733.roy@xxxxxxxxxxxxx> <200210250643.g9P6hop13980@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> <87n0p3x8lh.fsf@xxxxxxxxx> <200210250906.g9P96Yp14775@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
Sender: netdev-bounce@xxxxxxxxxxx
On Fri, 2002-10-25 at 14:59, Denis Vlasenko wrote:
> Well, that makes it run entirely in L0 cache. This is unrealistic
> for actual use. movntq is x3 faster when you hit RAM instead of L0.
> 
> You need to be more clever than that - generate pseudo-random
> offsets in large buffer and run on ~1K pieces of that buffer.

In a lot of cases its extremely realistic to assume the network buffers
are in cache. The copy/csum path is often touching just generated data,
or data we just accessed via read(). The csum RX path from a card with
DMA is probably somewhat different.


<Prev in Thread] Current Thread [Next in Thread>