| To: | vda@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx |
|---|---|
| Subject: | Re: Csum and csum copyroutines benchmark |
| From: | Alan Cox <alan@xxxxxxxxxxxxxxxxxxx> |
| Date: | 25 Oct 2002 11:19:51 +0100 |
| Cc: | Momchil Velikov <velco@xxxxxxxxx>, Russell King <rmk@xxxxxxxxxxxxxxxx>, Roy Sigurd Karlsbakk <roy@xxxxxxxxxxxxx>, netdev@xxxxxxxxxxx, Linux Kernel Mailing List <linux-kernel@xxxxxxxxxxxxxxx> |
| In-reply-to: | <200210250906.g9P96Yp14775@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> |
| References: | <200210231218.18733.roy@xxxxxxxxxxxxx> <200210250643.g9P6hop13980@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> <87n0p3x8lh.fsf@xxxxxxxxx> <200210250906.g9P96Yp14775@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> |
| Sender: | netdev-bounce@xxxxxxxxxxx |
On Fri, 2002-10-25 at 14:59, Denis Vlasenko wrote: > Well, that makes it run entirely in L0 cache. This is unrealistic > for actual use. movntq is x3 faster when you hit RAM instead of L0. > > You need to be more clever than that - generate pseudo-random > offsets in large buffer and run on ~1K pieces of that buffer. In a lot of cases its extremely realistic to assume the network buffers are in cache. The copy/csum path is often touching just generated data, or data we just accessed via read(). The csum RX path from a card with DMA is probably somewhat different. |
| Previous by Date: | Re: Csum and csum copyroutines benchmark, Momchil Velikov |
|---|---|
| Next by Date: | Re: Csum and csum copyroutines benchmark, Denis Vlasenko |
| Previous by Thread: | Re: Csum and csum copyroutines benchmark, Momchil Velikov |
| Next by Thread: | Re: Csum and csum copyroutines benchmark, Denis Vlasenko |
| Indexes: | [Date] [Thread] [Top] [All Lists] |