Re: 4x4 single-precision matrix product with SSE
Hello Nicolas,

Yes, it's the right place :) Could you please paste your code as well as
your benchmark context?

Fred

2011/3/11 Nicolas Bock <nicolasbock@xxxxxxxxx>
>
> Hello list,
>
> I am writing an assembly function that multiplies 2 4x4 single precision
> matrices. I wrote 2 versions, one using SSE the other using SSE4.1. What
> surprised me is that the SSE4.1 version fails to beat the SSE version,
> it is in fact slightly slower.
>
> Is this the right place to ask for help? If anyone is interested I can
> post some code which would maybe clarify the situation a bit.
>
> If this is not the right place, please ignore me...
>
> nick
>
--
To unsubscribe from this list: send the line "unsubscribe linux-assembly" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html